科学研究 |
田博学, PhD

田博学

研究员,博士生导师

田博学,研究员,博士生导师,于2020年6月加入清华大学药学院,以及清华-北大生命科学联合中心。田博学2008年本科毕业于华东理工大学,2009年硕士毕业于瑞典厄勒布鲁大学。2012年10月博士毕业于爱尔兰国立大学-高威,师从Leif A. Eriksson。2013年2月进入美国加州大学旧金山分校 Matthew P. Jacobson实验室从事博士后研究。田博学博士主要从事计算化学和计算生物学相关研究。


研究方向

1)AI底层技术:通过在模型中注入知识/规则,提升分子性质预测模型的泛化能力;

2)智能化学合成:利用深度学习、专家规则、量子化学构建化学合成智能体;

3)智能生物合成:利用AI设计、优化萜类合成酶和P450酶,辅助类药分子合成;

4)蛋白词表技术:从大模型中提取具有功能的蛋白语义单元(蛋白质的“官能团”),辅助药物、酶的设计。

实验室主页:https://www.tianlab-tsinghua.com/


学术贡献

田博学实验室近5年在化学信息学与AI制药领域取得系统性的研究成果,以通讯作者(含共同)身份发表系列研究论文,具体总结如下: 

1)发展了蛋白词表技术,利用无监督学习分析大模型的注意力矩阵,解析蛋白语义单元,并利用监督学习映射到功能,有望成为下一代蛋白模拟、蛋白设计、药物设计、细胞模拟技术的基础 (B.Tian* BioRxiv 2025)。

2)发展了物理启发式蛋白语言模型,利用远端突变提升了萜类合成酶的产物产量(70倍以上),辅助类药分子合成 (J. Qiao* B.Tian* Research Square 2024)。

3)发展了基于蛋白质语言模型的蛋白-DNA、蛋白-小分子的结合位点预测方法(B.Tian* Breif. Bioinform. 2024,B.Tian* J. Cheminform. 2024),抗体CDR结构预测 (F. Qian*, B.Tian* eLife 2024)、P450底物、产物预测方法 (B.Tian* J. Chem. Inf. Model. 2024,B.Tian* J. Chem. Inf. Model. 2025),辅助药物发现。

4)揭示了深度学习方法在分子性质预测中泛化能力不足的问题,训练获得了在MoleculeNet和MolData数据集上当前最佳模型(B.Tian* J. Chem. Inf. Model. 2025, B.Tian* J. Chem. Theory Comput. 2024),为化学信息学和AI制药领域提供了新思路。

5)在膜蛋白生物学机制研究方面,开展了转运蛋白(B.Tian, Y. Yuan*, C. Yan* Nature 2024)、离子通道(B.Tian*, X.Li*, B. Xiao* Neuron 2025)等体系的计算模拟。

6)利用量子化学方法研究了复杂环化、偶联、重排反应机制(B.Tian*, W.Deng* ACS Catal. 2021,W. Yang*,B.Tian*, W.Deng* ACS Catal. 2021, B.Tian*, X.Liao* Cell Rep. Phys. Sci. 2023, B.Tian*, Y.Tang* Org. Chem. Front. 2022)。


近5年论文发表情况:

1. Automatically Defining Protein Words for Diverse Functional Predictions Based on Attention Analysis of a Protein Language Model. Hedi Chen, Jingrui Zhong, Xiaochun Zhang, Jingke Chen, Lin Guo, Xiaoliang Xiong, Xiaonan Zhang, Xiangyu Liu, Bailong Xiao, Boxue Tian* bioRxiv 2025, https://doi.org/10.1101/2025.01.20.633699

2. GLMCyp: A Deep Learning-Based Method for CYP450-Mediated Reaction Site Prediction. Xuhai Huang, Jiamin Chang, Boxue Tian* Journal of Chemical Information and Modeling 2025, 65(5):2322-2335

3. SynthMol: A Drug Safety Prediction Framework Integrating Graph Attention and Molecular Descriptors into Pre-Trained Geometric Models. Zidong Su, Rong Zhang, Xiaoyu Fan, Boxue Tian* Journal of Chemical Information and Modeling 2025, 65:2256−2267

4. An intermediate open structure reveals the gating transition of the mechanically activated PIEZO1 channel. Sijia Liu ,Xuzhong Yang, Xudong Chen, Xiaochun Zhang, Jinghui Jiang, Jingyi Yuan, Wenhao Liu, Li Wang, Heng Zhou, Kun Wu, Boxue Tian*, Xueming Li*, Bailong Xiao* Neuron 2025,113,1-15

5. Enhancing Enzyme Activity with Mutation Combinations Guided by Few-shot Learning and Causal Inference. Research Square 2024, https://www.researchsquare.com/article/rs-5354708/v1

6. Protein–DNA binding sites prediction based on pre-trained protein language model and contrastive learning. Yufan Liu, Boxue Tian* Briefings in Bioinformatics 2024, 25(1):1-12

7. DeepP450: Predicting human P450 activities of small molecules by integrating pretrained protein language model and molecular representation. Jiamin Chang, Xiaoyu Fan, Boxue Tian* Journal of Chemical Information and Modeling 2024, 64(18):3149-3160

8. PointGAT: A quantum chemical property prediction model integrating graph attention and 3D geometry. Rong Zhang, Rongqing Yuan, Boxue Tian* Journal of Chemical Theory and Computation 2024, 20(10):4115-4128

9. Accurate prediction of CDR-H3 loop structures of antibodies with deep learning Hedi Chen,Xiaoyu Fan,Shuqian Zhu,Yuchan Pei,Xiaochun Zhang,Xiaonan Zhang,Lihang Liu,Feng Qian*,Boxue Tian* eLife 2024 12:RP91512

10. Molecular basis of human noradrenaline transporter reuptake and inhibition Jiaxin Tan,Yuan Xiao,Fang Kong,Xiaochun Zhang,Hanwen Xu,Angqi Zhu,Yiming Liu,Jianlin Lei,Boxue Tian,Yafei Yuan*,Chuangye Yan* Nature 2024, 632:921-929

11. Protein-small molecule binding site prediction based on a pre-trained protein language model with contrastive learning. Jue Wang, Yufan Liu & Boxue Tian* Journal of Cheminformatics 2024 16(1):125

12. Coupling of alkynes and aryl halides with nickel-catalyzed Sonogashira reactions. Hui Chen,Licheng Yao,Lin Guo,Yahu A. Liu,Boxue Tian,Xuebin Liao* Cell Reports Physical Science 2023, 5(1):102900

13. Elucidating the selectivity of dyotropic rearrangements of β-lactones: a computational survey. Jingyang Zhang, Yumiao Ma, Ke Qiu, Bo Li, Zhengwen Xue, Boxue Tian*,Yefeng Tang* Organic Chemistry Frontiers 2022, 9:329-341

14. Nickel-catalyzed enantioselective domino Heck/Sonogashira coupling for construction of C(sp)-C(sp3) bond-substituted quaternary carbon centers. Hui Chen, Zhenkang Ai, Lingmei Guo,Licheng Yao, Yaopeng Li, Buming Gu, Yixuan Zhang, Yahu A. Liu, Boxue Tian*, Xuebin Liao* Tetrahedron Chem 2022, 2:100021

15. Cooperative N-heterocyclic Carbene and Iridium Catalysis Enables Stereoselective and Regiodivergent [3 + 2] and [3 + 3] Annulation Reactions. Jian Zhang, Yanshan Gao, Buming Gu, Wulin Yang*, Boxue Tian*,Weiping Deng* ACS Catalysis 2021 11(7):3810-3821

16. Diastereo- and Enantioselective Synthesis of Eight-Membered Heterocycles via an Allylation/Ring Expansion Sequence Enabled by Multiple Catalysis. Wulin Yang*,Yuanlin Wang, Wen Li, Buming Gu, Siwen Wang, Xiaoyan Luo, Boxue Tian*, Weiping Deng* ACS Catalysis 2021 11(20):12557-12564