shlab

About me

Keep Learning & Be Positive!

Lijun Wu is a Researcher in Shanghai AI Laboratory. Previously, he was a Research Scientist in ByteDance, a Senior Researcher in Microsoft Research. He got the Ph.D. degree from Sun Yat-sen University (SYSU), and was a member of joint Ph.D. program between SYSU and MSRA, advised by Dr. Tie-Yan Liu and Prof. Jianhuang Lai.

His research interests are on AI/LLMs (e.g., data-centric intelligence, SFT/RL), AI4Science (e.g., LLM4Science, scientific reasoning). His research works are published in top conferences and journals, such as Nature Communications, Nature Machine Intelligence, TPAMI, NeurIPS, ICML, ICLR, ACL, KDD and so on. He has served as AC/SPC in top conferences, e.g., ICLR, NeurIPS, ACL, EMNLP, NAACL, AAAI, IJCAI and so on.

We are hiring AI researchers working on LLM/MLLM and AI4Science, contact me if you are interested!

News

🔥2025.7 Our μFormer is accepted by Nature Machine Intelligence!
🔥2024.11 Our TamGen is accepted by Nature Communications!
🔥2024.7 Super excited that our BioT5+ achieves 1st/2nd in Language+Molecule@ACL2024 shared tasks!
2025.8 3 papers are accepted by EMNLP-2025,topics cover math reasoning and advanced data synthesis. Check CFT, MetaLadder, Middo.
2025.8 I am honered to serve as Area Chair for ICLR-2026.
2025.7 I am honered to serve as Area Chair for NeurIPS-2025 workshop AI4Science and workhsop SEA.
2025.7 I am honered to serve as Area Chair for AAAI-2026.
2025.6 Our CovDocker is accepted by KDD-2025.
2025.5 6 papers are accepted by ACL-2025, topics cover math reasoning, data synthesis and LLM benchmarks. Check Mathfusion, GRA, Lemma, CipherBank.
2025.3 I am honered to serve as Area Chair for NeurIPS-2025.
2025.3 Our NatureLM, a large scientific foundation model, is released.
2025.1 3 papers are accepted by ICLR-2025, including FABFlex, the extension of FABind/FABind+ to the flexible docking scenario; 3D-MolT5, the extension of BioT5/BioT5+ in 3D molecular space.

Highlights

Selected Research

Surveys/Reports

🔥2024.3 We have released a comprehensive survey about Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey. Check it!
🔥2023.11 We have released a comprehensive report on Large Language Models (GPT-4) on Scienctific Discovery. Check it!
🔥2022.4 We have released a comprehensive survey about Non-Autoregressive Generation for Neural Machine Translation and Beyond. Check it!

Awesome Repos