Author: Xiaoke Robot (小柯机器人)  Published: 2024/5/16 16:11:05

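A team including Mohammed AlQuraishi of Columbia University (USA) reports that retraining AlphaFold2 yields new insights into its learning mechanisms and its capacity for generalization. The study was published online on May 14, 2024, in the international journal Nature Methods.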




Title: OpenFold: retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization

Author: Ahdritz, Gustaf; Bouatta, Nazim; Floristean, Christina; Kadyan, Sachin; Xia, Qinghui; Gerecke, William; O'Donnell, Timothy J.; Berenberg, Daniel; Fisk, Ian; Zanichelli, Niccolò; Zhang, Bo; Nowaczynski, Arkadiusz; Wang, Bei; Stepniewska-Dziubinska, Marta M.; Zhang, Shang; Ojewole, Adegoke; Guney, Murat Efe; Biderman, Stella; Watkins, Andrew M.; Ra, Stephen; Lorenzo, Pablo Ribalta; Nivon, Lucas; Weitzner, Brian; Ban, Yih-En Andrew; Chen, Shiyang; Zhang, Minjia; Li, Conglong; Song, Shuaiwen Leon; He, Yuxiong; Sorger, Peter K.; Mostaque, Emad; Zhang, Zhao; Bonneau, Richard; AlQuraishi, Mohammed

Issue&Volume: 2024-05-14

Abstract: AlphaFold2 revolutionized structural biology with the ability to predict protein structures with exceptionally high accuracy. Its implementation, however, lacks the code and data required to train new models. These are necessary to (1) tackle new tasks, like protein–ligand complex structure prediction, (2) investigate the process by which the model learns and (3) assess the model’s capacity to generalize to unseen regions of fold space. Here we report OpenFold, a fast, memory efficient and trainable implementation of AlphaFold2. We train OpenFold from scratch, matching the accuracy of AlphaFold2. Having established parity, we find that OpenFold is remarkably robust at generalizing even when the size and diversity of its training set is deliberately limited, including near-complete elisions of classes of secondary structure elements. By analyzing intermediate structures produced during training, we also gain insights into the hierarchical manner in which OpenFold learns to fold. In sum, our studies demonstrate the power and utility of OpenFold, which we believe will prove to be a crucial resource for the protein modeling community.

DOI: 10.1038/s41592-024-02272-z

Source: https://www.nature.com/articles/s41592-024-02272-z


Nature Methods: founded in 2004. Published by the Springer Nature group. Latest impact factor (IF): 47.99