About me
Keep Learning & Be Positive!
Lijun Wu is a Researcher in Shanghai AI Laboratory. Previously, he was a Research Scientist in ByteDance, a Senior Researcher in Microsoft Research. He got the Ph.D. degree from Sun Yat-sen University (SYSU), and was a member of joint Ph.D. program between SYSU and MSRA, advised by Dr. Tie-Yan Liu and Prof. Jianhuang Lai.
His research interests are on AI/LLMs (e.g., data-centric intelligence, SFT/RL), AI4Science (e.g., LLM4Science, scientific reasoning). His research works are published in top conferences and journals, such as Nature Communications, Nature Machine Intelligence, TPAMI, NeurIPS, ICML, ICLR, ACL, KDD and so on. He has served as AC/SPC in top conferences, e.g., ICLR, NeurIPS, ACL, EMNLP, NAACL, AAAI, IJCAI and so on.
We are hiring AI researchers working on LLM/MLLM and AI4Science, contact me if you are interested!
News
2025.9
Caco is accepted by NeurIPS-2025, which aims to scaling the reasoning data by code-assisted verfications.
2025.8
3 papers are accepted by EMNLP-2025,topics cover math reasoning and advanced data synthesis. Check CFT, MetaLadder, Middo.
2025.8
Invited to serve as Area Chair for ICLR-2026.
2025.7
μFormer is accepted by Nature Machine Intelligence!
2025.7
Invited to serve as Area Chair for NeurIPS-2025 workshop AI4Science and SEA.
2025.7
Invited to serve as Area Chair for AAAI-2026.
2025.6
CovDocker is accepted by KDD-2025.
2025.5
6 papers are accepted by ACL-2025, topics cover math reasoning, data synthesis and LLM benchmarks. Check Mathfusion, GRA, Lemma, CipherBank.
2025.3
Invited to serve as Area Chair for NeurIPS-2025.
2025.3
NatureLM, a large scientific foundation model, is released.
2025.1
3 papers are accepted by ICLR-2025, including FABFlex, the extension of FABind/FABind+ to the flexible docking scenario; 3D-MolT5, the extension of BioT5/BioT5+ in 3D molecular space.
Highlights
- MSRA Ph.D. Fellowship in 2018 (12 in Asia-Pacific Universities)
- First human parity achieved in Chinese-English machine translation system in 2018
- WMT 2019 champion in 8 translation directions
- OGB-LSC@KDD cup 2021 Runner up
- 1st/2nd in Language+Molecule@ACL2024 shared tasks!
- (Tech Transfer) R-Drop successfully applied over 20+ language translations in Microsoft Translator, Meituan products
- (Tech Transfer) CT4Rec successfully applied in Tencent news recommendation
Selected Research
- LLM
- AI/NMT
- AI/LLM4Science
Surveys/Reports
🔥2024.3
We have released a comprehensive survey about Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey. Check it!
🔥2023.11
We have released a comprehensive report on Large Language Models (GPT-4) on Scienctific Discovery. Check it!
🔥2022.4
We have released a comprehensive survey about Non-Autoregressive Generation for Neural Machine Translation and Beyond. Check it!