About me

Keep Learning & Be Positive!

Lijun Wu is an AI Researcher. Previously, he was a Research Scientist at ByteDance and a Senior Researcher at Microsoft Research. He received his Ph.D. degree from the School of Data and Computer Science, Sun Yat-sen University (SYSU), and was a member of the joint Ph.D. program between SYSU and MSRA, advised by Dr. Tie-Yan Liu and Prof. Jianhuang Lai. He was honored to receive the MSRA Ph.D. Fellowship. His team won 8 championships in the WMT19 machine translation competition.

His research focuses on Large Language Models (e.g., RLHF, post-training), AI4Science (e.g., LLM4Science, Drug Discovery), and Multimodal Learning. He has extensive experience in sequence learning, such as neural machine translation. He has published many papers in top conferences and journals, such as ICLR, NeurIPS, ACL, and TPAMI. He has served as AC/SPC for top-tier conferences, e.g., ACL, EMNLP, NAACL, COLING, AAAI, and IJCAI.

I am hiring AI researchers working on LLMs. Drop me an email for more information if you are interested!

Highlights

News

🔥2024.7 Super excited that our BioT5+ achieves 1st in molecule generation and 2nd in molecular captioning in the Language+Molecule@ACL2024 shared task!
🔥2024.3 We have released an AI4Science Research Project page covering multiple research projects. Check it out if you are interested!
2024.10 I am honored to serve as Area Chair for the AAAI-2025 AI4Science workshop.
2024.9 One paper about protein sequence representation learning is accepted by EMNLP-2024.
2024.9 One paper about quantum Hamiltonian prediction is accepted by NeurIPS-2024.
2024.9 One paper about federated learning is accepted as an oral presentation at the NeurIPS-2024 workshop on federated learning.
2024.7 Our champion solution for the Language+Molecule@ACL2024 shared task is accepted as an oral presentation!
2024.7 Our kNN-DTA for drug-target affinity prediction is accepted by CIKM-2024.
2024.6 I am honored to serve as SPC for AAAI-2025.
2024.6 The slides of my talk on LLM4Science at YSSNLP 2024 are released.
2024.5 Our BioT5+ is accepted by ACL-2024 Findings.
2024.5 I recently moved to ByteDance to start a new position as a Research Scientist for LLM. Drop me an email if you are interested in an internship.
2024.5 I’ll attend YSSNLP 2024 on 06.16 to give a talk about LLM4Science.
2024.5 I am honored to serve as Area Chair for the ICML-2024 AI4Science workshop.
2024.5 I am honored to serve as Area Chair for the ACL-2024 Language and Molecules workshop.
2024.5 I am honored to serve as Area Chair for EMNLP-2024.
2024.4 Our FABind+, a much stronger extension of FABind, is released.
2024.3 Our BioT5+, a much stronger extension of BioT5, is released.
2024.2 I am honored to give a talk about LLM in Science Discovery at AGI Leap Summit 2024.
2024.2 One paper is accepted by TPAMI-2024.
2024.1 I am serving as Area Chair for IEEE-CAI-2024.

Surveys/Reports

🔥2024.3 We have released a comprehensive survey, Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey. Check it!
🔥2023.11 We have released a report on Large Language Models (GPT-4) on Scientific Discovery. Check it!
🔥2022.4 We have released a comprehensive survey about Non-Autoregressive Generation for Neural Machine Translation and Beyond. Check it!

Awesome Repos

Selected Research

  • Consistency Training and Dropout
  • LLM for Science/Drug Discovery
    • BioT5, BioT5+ (pre-trained large language model for bio-chemistry)
    • FABind, FABind+ (Fast and Accurate Protein-Ligand Binding)
    • AbGNN (pre-training for antibody design)
  • Neural Machine Translation