Large-scale design and refinement of stable proteins using sequence-only models.

Singer, Jedediah M; Novotney, Scott; Strickland, Devin; Haddox, Hugh K; Leiby, Nicholas; Rocklin, Gabriel J; Chow, Cameron M; Roy, Anindya; Bera, Asim K; Motta, Francis C; Cao, Longxing; Strauch, Eva-Maria; Chidyausiku, Tamuka M; Ford, Alex; Ho, Ethan; Zaitzeff, Alexander; Mackenzie, Craig O; Eramian, Hamed; DiMaio, Frank; Grigoryan, Gevorg; Vaughn, Matthew; Stewart, Lance J; Baker, David; Klavins, Eric

Singer, Jedediah M; Novotney, Scott; Strickland, Devin; Haddox, Hugh K; Leiby, Nicholas; Rocklin, Gabriel J; Chow, Cameron M; Roy, Anindya; Bera, Asim K; Motta, Francis C; Cao, Longxing; Strauch, Eva-Maria; Chidyausiku, Tamuka M; Ford, Alex; Ho, Ethan; Zaitzeff, Alexander; Mackenzie, Craig O; Eramian, Hamed; DiMaio, Frank; Grigoryan, Gevorg; Vaughn, Matthew; Stewart, Lance J; Baker, David; Klavins, Eric.

Afiliación

Singer JM; Two Six Technologies, Arlington, Virginia, United States of America.
Novotney S; Two Six Technologies, Arlington, Virginia, United States of America.
Strickland D; Department of Electrical and Computer Engineering, University of Washington, Seattle, Washington, United States of America.
Haddox HK; Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, Washington, United States of America.
Leiby N; Two Six Technologies, Arlington, Virginia, United States of America.
Rocklin GJ; Department of Pharmacology and Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, Illinois, United States of America.
Chow CM; Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, Washington, United States of America.
Roy A; Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, Washington, United States of America.
Bera AK; Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, Washington, United States of America.
Motta FC; Department of Mathematical Sciences, Florida Atlantic University, Boca Raton, Florida, United States of America.
Cao L; Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, Washington, United States of America.
Strauch EM; Department of Pharmaceutical and Biomedical Sciences, University of Georgia, Athens, Georgia, United States of America.
Chidyausiku TM; Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, Washington, United States of America.
Ford A; Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, Washington, United States of America.
Ho E; Texas Advanced Computing Center, Austin, Texas, United States of America.
Zaitzeff A; Two Six Technologies, Arlington, Virginia, United States of America.
Mackenzie CO; Quantitative Biomedical Sciences Graduate Program, Dartmouth College, Hanover, New Hampshire, United States of America.
Eramian H; Netrias, Cambridge, Massachusetts, United States of America.
DiMaio F; Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, Washington, United States of America.
Grigoryan G; Departments of Computer Science and Biological Sciences, Dartmouth College, Hanover, New Hampshire, United States of America.
Vaughn M; Texas Advanced Computing Center, Austin, Texas, United States of America.
Stewart LJ; Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, Washington, United States of America.
Baker D; Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, Washington, United States of America.
Klavins E; Department of Electrical and Computer Engineering, University of Washington, Seattle, Washington, United States of America.

PLoS One ; 17(3): e0265020, 2022.

Article en En | MEDLINE | ID: mdl-35286324

RESUMEN

Engineered proteins generally must possess a stable structure in order to achieve their designed function. Stable designs, however, are astronomically rare within the space of all possible amino acid sequences. As a consequence, many designs must be tested computationally and experimentally in order to find stable ones, which is expensive in terms of time and resources. Here we use a high-throughput, low-fidelity assay to experimentally evaluate the stability of approximately 200,000 novel proteins. These include a wide range of sequence perturbations, providing a baseline for future work in the field. We build a neural network model that predicts protein stability given only sequences of amino acids, and compare its performance to the assayed values. We also report another network model that is able to generate the amino acid sequences of novel stable proteins given requested secondary sequences. Finally, we show that the predictive model-despite weaknesses including a noisy data set-can be used to substantially increase the stability of both expert-designed and model-generated proteins.

Asunto(s)

Redes Neurales de la Computación; Proteínas; Secuencia de Aminoácidos; Aminoácidos; Estabilidad Proteica; Proteínas/química

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Proteínas / Redes Neurales de la Computación Tipo de estudio: Prognostic_studies Idioma: En Revista: PLoS One Asunto de la revista: CIENCIA / MEDICINA Año: 2022 Tipo del documento: Article País de afiliación: Estados Unidos Pais de publicación: Estados Unidos

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google