Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Más filtros











Base de datos
Intervalo de año de publicación
1.
Genetics ; 208(4): 1387-1395, 2018 04.
Artículo en Inglés | MEDLINE | ID: mdl-29382650

RESUMEN

Biological evolution generates a surprising amount of site-specific variability in protein sequences. Yet, attempts at modeling this process have been only moderately successful, and current models based on protein structural metrics explain, at best, 60% of the observed variation. Surprisingly, simple measures of protein structure, such as solvent accessibility, are often better predictors of site-specific variability than more complex models employing all-atom energy functions and detailed structural modeling. We suggest here that these more complex models perform poorly because they lack consideration of the evolutionary process, which is, in part, captured by the simpler metrics. We compare protein sequences that are computationally designed to sequences that are computationally evolved using the same protein-design energy function and to homologous natural sequences. We find that, by a wide variety of metrics, evolved sequences are much more similar to natural sequences than are designed sequences. In particular, designed sequences are too conserved on the protein surface relative to natural sequences, whereas evolved sequences are not. Our results suggest that evolutionary simulation produces a realistic sampling of sequence space. By contrast, protein design-at least as currently implemented-does not. Existing energy functions seem to be sufficiently accurate to correctly describe the key thermodynamic constraints acting on protein sequences, but they need to be paired with realistic sampling schemes to generate realistic sequence alignments.


Asunto(s)
Sustitución de Aminoácidos , Evolución Molecular , Variación Genética , Proteínas/química , Proteínas/genética , Termodinámica , Algoritmos , Simulación por Computador , Conformación Proteica , Proteínas/metabolismo , Alineación de Secuencia , Relación Estructura-Actividad
2.
PLoS One ; 12(4): e0164905, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-28369116

RESUMEN

Proteins evolve through two primary mechanisms: substitution, where mutations alter a protein's amino-acid sequence, and insertions and deletions (indels), where amino acids are either added to or removed from the sequence. Protein structure has been shown to influence the rate at which substitutions accumulate across sites in proteins, but whether structure similarly constrains the occurrence of indels has not been rigorously studied. Here, we investigate the extent to which structural properties known to covary with protein evolutionary rates might also predict protein tolerance to indels. Specifically, we analyze a publicly available dataset of single-amino-acid deletion mutations in enhanced green fluorescent protein (eGFP) to assess how well the functional effect of deletions can be predicted from protein structure. We find that weighted contact number (WCN), which measures how densely packed a residue is within the protein's three-dimensional structure, provides the best single predictor for whether eGFP will tolerate a given deletion. We additionally find that using protein design to explicitly model deletions results in improved predictions of functional status when combined with other structural predictors. Our work suggests that structure plays fundamental role in constraining deletions at sites in proteins, and further that similar biophysical constraints influence both substitutions and deletions. This study therefore provides a solid foundation for future work to examine how protein structure influences tolerance of more complex indel events, such as insertions or large deletions.


Asunto(s)
Proteínas Fluorescentes Verdes/química , Proteínas Fluorescentes Verdes/genética , Secuencia de Aminoácidos , Evolución Molecular Dirigida , Fluorescencia , Mutación INDEL , Modelos Logísticos , Modelos Moleculares , Estructura Secundaria de Proteína , Eliminación de Secuencia , Máquina de Vectores de Soporte
3.
Protein Sci ; 25(7): 1341-53, 2016 07.
Artículo en Inglés | MEDLINE | ID: mdl-26971720

RESUMEN

Structural properties such as solvent accessibility and contact number predict site-specific sequence variability in many proteins. However, the strength and significance of these structure-sequence relationships vary widely among different proteins, with absolute correlation strengths ranging from 0 to 0.8. In particular, two recent works have made contradictory observations. Yeh et al. (Mol. Biol. Evol. 31:135-139, 2014) found that both relative solvent accessibility (RSA) and weighted contact number (WCN) are good predictors of sitewise evolutionary rate in enzymes, with WCN clearly out-performing RSA. Shahmoradi et al. (J. Mol. Evol. 79:130-142, 2014) considered these same predictors (as well as others) in viral proteins and found much weaker correlations and no clear advantage of WCN over RSA. Because these two studies had substantial methodological differences, however, a direct comparison of their results is not possible. Here, we reanalyze the datasets of the two studies with one uniform analysis pipeline, and we find that many apparent discrepancies between the two analyses can be attributed to the extent of sequence divergence in individual alignments. Specifically, the alignments of the enzyme dataset are much more diverged than those of the virus dataset, and proteins with higher divergence exhibit, on average, stronger structure-sequence correlations. However, the highest structure-sequence correlations are observed at intermediate divergence levels, where both highly conserved and highly variable sites are present in the same alignment.


Asunto(s)
Secuencia de Aminoácidos , Enzimas/química , Proteínas Virales/química , Biología Computacional/métodos , Enzimas/genética , Evolución Molecular , Modelos Moleculares , Conformación Proteica , Alineación de Secuencia , Solventes/química , Proteínas Virales/genética
4.
J Virol ; 89(22): 11643-53, 2015 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-26355089

RESUMEN

UNLABELLED: Animal viruses frequently cause zoonotic disease in humans. As these viruses are highly diverse, evaluating the threat that they pose remains a major challenge, and efficient approaches are needed to rapidly predict virus-host compatibility. Here, we develop a combined computational and experimental approach to assess the compatibility of New World arenaviruses, endemic in rodents, with the host TfR1 entry receptors of different potential new host species. Using signatures of positive selection, we identify a small motif on rodent TfR1 that conveys species specificity to the entry of viruses into cells. However, we show that mutations in this region affect the entry of each arenavirus differently. For example, a human single nucleotide polymorphism (SNP) in this region, L212V, makes human TfR1 a weaker receptor for one arenavirus, Machupo virus, but a stronger receptor for two other arenaviruses, Junin and Sabia viruses. Collectively, these findings set the stage for potential evolutionary trade-offs, where natural selection for resistance to one virus may make humans or rodents susceptible to other arenavirus species. Given the complexity of this host-virus interplay, we propose a computational method to predict these interactions, based on homology modeling and computational docking of the virus-receptor protein-protein interaction. We demonstrate the utility of this model for Machupo virus, for which a suitable cocrystal structural template exists. Our model effectively predicts whether the TfR1 receptors of different species will be functional receptors for Machupo virus entry. Approaches such at this could provide a first step toward computationally predicting the "host jumping" potential of a virus into a new host species. IMPORTANCE: We demonstrate how evolutionary trade-offs may exist in the dynamic evolutionary interplay between viruses and their hosts, where natural selection for resistance to one virus could make humans or rodents susceptible to other virus species. We present an algorithm that predicts which species have cell surface receptors that make them susceptible to Machupo virus, based on computational docking of protein structures. Few molecular models exist for predicting the risk of spillover of a particular animal virus into humans or new animal populations. Our results suggest that a combination of evolutionary analysis, structural modeling, and experimental verification may provide an efficient approach for screening and assessing the potential spillover risks of viruses circulating in animal populations.


Asunto(s)
Antígenos CD/genética , Arenavirus del Nuevo Mundo/fisiología , Especificidad del Huésped , Receptores de Transferrina/genética , Receptores Virales/metabolismo , Acoplamiento Viral , Algoritmos , Animales , Línea Celular Tumoral , Biología Computacional/métodos , Resistencia a la Enfermedad/genética , Perros , Células HEK293 , Humanos , Simulación del Acoplamiento Molecular , Receptores de Transferrina/metabolismo , Receptores Virales/ultraestructura , Internalización del Virus
5.
Phys Biol ; 12(2): 025002, 2015 Mar 19.
Artículo en Inglés | MEDLINE | ID: mdl-25787027

RESUMEN

Evolutionary-rate variation among sites within proteins depends on functional and biophysical properties that constrain protein evolution. It is generally accepted that proteins must be able to fold stably in order to function. However, the relationship between stability constraints and among-sites rate variation is not well understood. Here, we present a biophysical model that links the thermodynamic stability changes due to mutations at sites in proteins ([Formula: see text]) to the rate at which mutations accumulate at those sites over evolutionary time. We find that such a 'stability model' generally performs well, displaying correlations between predicted and empirically observed rates of up to 0.75 for some proteins. We further find that our model has comparable predictive power as does an alternative, recently proposed 'stress model' that explains evolutionary-rate variation among sites in terms of the excess energy needed for mutants to adopt the correct active structure ([Formula: see text]). The two models make distinct predictions, though, and for some proteins the stability model outperforms the stress model and vice versa. We conclude that both stability and stress constrain site-specific sequence evolution in proteins.


Asunto(s)
Secuencia de Aminoácidos , Evolución Molecular , Mutación , Modelos Genéticos , Termodinámica
6.
J Mol Evol ; 79(3-4): 130-42, 2014 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-25217382

RESUMEN

Several recent works have shown that protein structure can predict site-specific evolutionary sequence variation. In particular, sites that are buried and/or have many contacts with other sites in a structure have been shown to evolve more slowly, on average, than surface sites with few contacts. Here, we present a comprehensive study of the extent to which numerous structural properties can predict sequence variation. The quantities we considered include buriedness (as measured by relative solvent accessibility), packing density (as measured by contact number), structural flexibility (as measured by B factors, root-mean-square fluctuations, and variation in dihedral angles), and variability in designed structures. We obtained structural flexibility measures both from molecular dynamics simulations performed on nine non-homologous viral protein structures and from variation in homologous variants of those proteins, where they were available. We obtained measures of variability in designed structures from flexible-backbone design in the Rosetta software. We found that most of the structural properties correlate with site variation in the majority of structures, though the correlations are generally weak (correlation coefficients of 0.1-0.4). Moreover, we found that buriedness and packing density were better predictors of evolutionary variation than structural flexibility. Finally, variability in designed structures was a weaker predictor of evolutionary variability than buriedness or packing density, but it was comparable in its predictive power to the best structural flexibility measures. We conclude that simple measures of buriedness and packing density are better predictors of evolutionary variation than the more complicated predictors obtained from dynamic simulations, ensembles of homologous structures, or computational protein design.


Asunto(s)
Evolución Molecular , Proteínas Virales/química , Secuencia de Aminoácidos , Entropía , Simulación de Dinámica Molecular , Conformación Proteica
7.
PeerJ ; 1: e211, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-24255821

RESUMEN

Computational protein design attempts to create protein sequences that fold stably into pre-specified structures. Here we compare alignments of designed proteins to alignments of natural proteins and assess how closely designed sequences recapitulate patterns of sequence variation found in natural protein sequences. We design proteins using RosettaDesign, and we evaluate both fixed-backbone designs and variable-backbone designs with different amounts of backbone flexibility. We find that proteins designed with a fixed backbone tend to underestimate the amount of site variability observed in natural proteins while proteins designed with an intermediate amount of backbone flexibility result in more realistic site variability. Further, the correlation between solvent exposure and site variability in designed proteins is lower than that in natural proteins. This finding suggests that site variability is too uniform across different solvent exposure states (i.e., buried residues are too variable or exposed residues too conserved). When comparing the amino acid frequencies in the designed proteins with those in natural proteins we find that in the designed proteins hydrophobic residues are underrepresented in the core. From these results we conclude that intermediate backbone flexibility during design results in more accurate protein design and that either scoring functions or backbone sampling methods require further improvement to accurately replicate structural constraints on site variability.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA