Results 1 - 20 of 81
1.
Genome Biol Evol ; 16(8)2024 Aug 05.
Article in English | MEDLINE | ID: mdl-39004885

ABSTRACT

New protein-coding genes can evolve from previously noncoding genomic regions through a process known as de novo gene emergence. Evidence suggests that this process has likely occurred throughout evolution and across the tree of life. Yet, confidently identifying de novo emerged genes remains challenging. Ancestral sequence reconstruction is a promising approach for inferring whether a gene has emerged de novo or not, as it allows us to inspect whether a given genomic locus ancestrally harbored protein-coding capacity. However, the use of ancestral sequence reconstruction in the context of de novo emergence is still in its infancy and its capabilities, limitations, and overall potential are largely unknown. Notably, it is difficult to formally evaluate the protein-coding capacity of ancestral sequences, particularly when new gene candidates are short. How well-suited is ancestral sequence reconstruction as a tool for the detection and study of de novo genes? Here, we address this question by designing an ancestral sequence reconstruction workflow incorporating different tools and sets of parameters and by introducing a formal criterion that allows us to estimate, within a desired level of confidence, when protein-coding capacity originated at a particular locus. Applying this workflow to ∼2,600 short, annotated budding yeast genes (<1,000 nucleotides), we found that ancestral sequence reconstruction robustly predicts an ancient origin for the most widely conserved genes, which constitute "easy" cases. For less robust cases, we calculated a randomization-based empirical P-value estimating whether the observed conservation between the extant and ancestral reading frame could be attributed to chance. This formal criterion allowed us to pinpoint a branch of origin for most of the less robust cases, identifying 49 genes that can unequivocally be considered de novo originated since the split of the Saccharomyces genus, including 37 Saccharomyces cerevisiae-specific genes.
We find that for the remaining equivocal cases we cannot rule out different evolutionary scenarios including rapid evolution, multiple gene losses, or a recent de novo origin. Overall, our findings suggest that ancestral sequence reconstruction is a valuable tool to study de novo gene emergence but should be applied with caution and awareness of its limitations.
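
The randomization-based empirical P-value mentioned in this abstract can be illustrated with a generic permutation test. The sketch below is not the authors' workflow: the abstract specifies neither the conservation statistic nor the randomization scheme, so the identity score, the residue-shuffling null, and the function names `identity`, `shuffled_null`, and `empirical_p_value` are all assumptions chosen for illustration.

```python
import random


def identity(a, b):
    """Fraction of positions at which two equal-length sequences match."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def shuffled_null(extant, ancestral, n, seed=0):
    """Null scores obtained by shuffling the ancestral residues n times
    (an assumed randomization scheme, used here only as an example)."""
    rng = random.Random(seed)
    scores = []
    for _ in range(n):
        residues = list(ancestral)
        rng.shuffle(residues)
        scores.append(identity(extant, "".join(residues)))
    return scores


def empirical_p_value(observed, null_scores):
    """Share of null scores at least as large as the observed score.
    The +1 in numerator and denominator keeps the estimate above zero."""
    hits = sum(s >= observed for s in null_scores)
    return (hits + 1) / (len(null_scores) + 1)


if __name__ == "__main__":
    extant, ancestral = "MKTLLVAG", "MKTLIVAG"  # hypothetical toy peptides
    observed = identity(extant, ancestral)
    null = shuffled_null(extant, ancestral, n=999)
    print(observed, empirical_p_value(observed, null))
```

A small P-value here would mean that the observed similarity is rarely matched by shuffled sequences; with short sequences the null distribution is coarse, which is in line with the abstract's caution about short gene candidates.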


Subject(s)
Molecular Evolution , Saccharomyces cerevisiae/genetics , Phylogeny , Fungal Genome , Fungal Genes
2.
J Integr Plant Biol ; 2024 Jul 11.
Article in English | MEDLINE | ID: mdl-38990113

ABSTRACT

Domestication has shaped the population structure and agronomic traits of tea plants, yet the complexity of tea population structure and genetic variation that determines these traits remains unclear. We here investigated the resequencing data of 363 diverse tea accessions collected extensively from almost all tea distributions and found that the population structure of tea plants was divided into eight subgroups, which were basically consistent with their geographical distributions. The genetic diversity of tea plants in China decreased from southwest to east as latitude increased. Results also indicated that Camellia sinensis var. assamica (CSA) illustrated divergent selection signatures with Camellia sinensis var. sinensis (CSS). The domesticated genes of CSA were mainly involved in leaf development, flavonoid and alkaloid biosynthesis, while the domesticated genes in CSS mainly participated in amino acid metabolism, aroma compounds biosynthesis, and cold stress. Comparative population genomics further identified ~730 Mb novel sequences, generating 6,058 full-length protein-encoding genes, significantly expanding the gene pool of tea plants. We also discovered 217,376 large-scale structural variations and 56,583 presence and absence variations (PAVs) across diverse tea accessions, some of which were associated with tea quality and stress resistance. Functional experiments demonstrated that two PAV genes (CSS0049975 and CSS0006599) were likely to drive trait diversification in cold tolerance between CSA and CSS tea plants. The overall findings not only revealed the genetic diversity and domestication of tea plants, but also underscored the vital role of structural variations in the diversification of tea plant traits.

4.
J Anim Sci Biotechnol ; 15(1): 64, 2024 May 06.
Article in English | MEDLINE | ID: mdl-38706000

ABSTRACT

BACKGROUND: The genetic diversity of yak, a key domestic animal on the Qinghai-Tibetan Plateau (QTP), is a vital resource for domestication and breeding efforts. This study presents the first yak pangenome obtained through the de novo assembly of 16 yak genomes. RESULTS: We discovered 290 Mb of nonreference sequences and 504 new genes. Our pangenome-wide presence and absence variation (PAV) analysis revealed 5,120 PAV-related genes, highlighting a wide range of variety-specific genes and genes with varying frequencies across yak populations. Principal component analysis (PCA) based on binary gene PAV data classified yaks into three new groups: wild, domestic, and Jinchuan. Moreover, we proposed a 'two-haplotype genomic hybridization model' for understanding the hybridization patterns among breeds by integrating gene frequency, heterozygosity, and gene PAV data. A gene PAV-GWAS identified a novel gene (BosGru3G009179) that may be associated with the multirib trait in Jinchuan yaks. Furthermore, an integrated transcriptome and pangenome analysis highlighted the significant differences in the expression of core genes and the mutational burden of differentially expressed genes between yaks from high and low altitudes. Transcriptome analysis across multiple species revealed that yaks have the most unique differentially expressed mRNAs and lncRNAs (between high- and low-altitude regions), especially in the heart and lungs, when comparing high- and low-altitude adaptations. CONCLUSIONS: The yak pangenome offers a comprehensive resource and new insights for functional genomic studies, supporting future biological research and breeding strategies.

5.
Trends Microbiol ; 32(8): 746-755, 2024 Aug.
Article in English | MEDLINE | ID: mdl-38307786

ABSTRACT

The evolutionary history of cells has been marked by drastic increases in complexity. Some hypothesize that such cellular complexification requires a massive energy flux as the origin of new features is hypothetically more energetically costly than their evolutionary maintenance. However, it remains unclear how increases in cellular complexity demand more energy. I propose that the early evolution of new genes with weak functions imposes higher energetic costs by overexpression before their functions are evolutionarily refined. In the long term, the accumulation of new genes deviates resources away from growth and reproduction. Accrued cellular complexity further requires additional infrastructure for its maintenance. Altogether, this suggests that larger and more complex cells are defined by increased survival but lower reproductive capacity.


Subject(s)
Biological Evolution , Energy Metabolism , Molecular Evolution
6.
Plant Cell Environ ; 47(6): 1997-2010, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38379450

ABSTRACT

Gummy stem blight (GSB), a widespread disease causing great loss to cucurbit production, has become a major threat to melon cultivation. However, the melon-GSB interaction remains largely unknown. Here, full-length transcriptome and widely targeted metabolome were used to investigate the defence responses of resistant (PI511890) and susceptible (Payzawat) melon accessions to GSB pathogen infection at 24 h. The biosynthesis of secondary metabolites and MAPK signalling pathway were specifically enriched for differentially expressed genes in PI511890, while carbohydrate metabolism and amino acid metabolism were specifically enriched in Payzawat. More than 1000 novel genes were identified and MAPK signalling pathway was specifically enriched for them in PI511890. There were 11 793 alternative splicing events involved in the defence response to GSB. In total, 910 metabolites were identified in Payzawat and PI511890, and flavonoids were the dominant metabolites. Integrated full-length transcriptome and metabolome analysis showed eriodictyol and oxalic acid were the potential marker metabolites for GSB resistance in melon. Moreover, posttranscriptional regulation was widely involved in the defence response of melon to GSB pathogen infection. These results not only improve our understanding of the interaction between melon and GSB, but also facilitate the genetic improvement of melon with GSB resistance.


Subject(s)
Cucurbitaceae , Disease Resistance , Plant Gene Expression Regulation , Metabolome , Plant Diseases , Transcriptome , Plant Diseases/microbiology , Plant Diseases/genetics , Plant Diseases/immunology , Disease Resistance/genetics , Cucurbitaceae/microbiology , Cucurbitaceae/genetics , Cucurbitaceae/metabolism , Gene Expression Profiling
7.
Genomics ; 116(2): 110782, 2024 03.
Article in English | MEDLINE | ID: mdl-38176574

ABSTRACT

There is an increasing understanding that a reference genome representing an individual cannot capture the full gene repertoire of a species. Here, we conduct a population-scale detection of missing sequences in Chinese domestic pigs using whole-genome sequencing data from 534 individuals. We identify 132.41 Mb of sequences absent in the reference assembly, including eight novel genes. In particular, the breeds spread in Chinese high-altitude regions show significantly different frequencies of new sequences in promoters than other breeds. Furthermore, we dissect the role of non-coding variants and identify a novel sequence inserted in the 3'UTR of the FMO3 gene, which may be associated with the intramuscular fat phenotype. This novel sequence could be a candidate marker for meat quality. Our study provides a comprehensive overview of the missing sequences in Chinese domestic pigs and indicates that this dataset is a valuable resource for understanding the diversity and biology of pigs.


Subject(s)
Genome , Sus scrofa , Animals , Breeding , China , Phenotype , Sus scrofa/genetics , Swine/genetics
8.
Genetics ; 225(4)2023 Dec 06.
Article in English | MEDLINE | ID: mdl-37933691

ABSTRACT

Genes that have been identified in the genome but remain uncharacterized with regards to function offer an opportunity to uncover novel biological information. Novelty is exciting but can also be a barrier. If nothing is known, how does one start planning and executing experiments? Here, we provide a recommended information-mining workflow and a corresponding guide to accessing information about uncharacterized Drosophila melanogaster genes, such as those assigned only a systematic coding gene identifier. The available information can provide insights into where and when the gene is expressed, what the function of the gene might be, whether there are similar genes in other species, whether there are known relationships to other genes, and whether any other features have already been determined. In addition, available information about relevant reagents can inspire and facilitate experimental studies. Altogether, mining available information can help prioritize genes for further study, as well as provide starting points for experimental assays and other analyses.


Subject(s)
Drosophila melanogaster , Genome , Animals , Drosophila melanogaster/genetics
9.
J Agric Food Chem ; 71(49): 19445-19456, 2023 Dec 13.
Article in English | MEDLINE | ID: mdl-38033160

ABSTRACT

The ability of the model organism, Caenorhabditis elegans, to distinguish and escape from pathogenic bacteria has been extensively studied; however, studies on the repulsive response of Meloidogyne incognita are still in their infancy. We have recently demonstrated that biocontrol bacteria induce a repulsive response in M. incognita via two classical signaling pathways. The present study aimed to identify the novel genes and signaling molecules of M. incognita that potentially contribute to its defense reaction. Analysis of the transcriptome data of M. incognita with and without a repulsive response against Bacillus nematocida B16 obtained 15 candidate genes, of which the novel genes Minc3s01748g26034 and Minc3s02548g30585 were found to regulate the aversive behavior of M. incognita, and their functions were further validated. To further confirm the neuronal localization of the two novel genes in M. incognita, in situ hybridization was conducted using the digoxin-labeled probes of ten tag genes, and preferentially profiled the localization of amphid sensory neurons of M. incognita. Analysis of the overviewed neuronal map suggested that Minc3s01748g26034 and Minc3s02548g30585 functioned in ASK/ASI and CEPD/V neurons, respectively. During their interactions, the volatile compounds 3-methyl-butyric acid and 2-methyl-butyric acid produced by the biocontrol bacteria were predicted as the primary signaling molecules that promoted the repulsive behavior of M. incognita against biocontrol bacteria. The findings provided novel insights into the mechanisms underlying the repulsive response of M. incognita that are different from the canonical molecular pathways previously found in C. elegans and can aid in developing novel strategies for controlling root-knot nematodes.


Subject(s)
Tylenchoidea , Animals , Tylenchoidea/physiology , Caenorhabditis elegans/genetics , Butyric Acid/metabolism , Bacteria/genetics , Transcriptome
10.
BMC Plant Biol ; 23(1): 440, 2023 Sep 19.
Article in English | MEDLINE | ID: mdl-37726703

ABSTRACT

BACKGROUND: Dioecious plants have male and female flowers on separate plants. Jojoba is a dioecious plant that is drought-tolerant and native to arid areas. The genome sequence of male and female plants was recently reported and revealed an X and Y chromosome system, with two large male-specific insertions in the Y chromosome. RESULTS: A total of 16,923 differentially expressed genes (DEG) were identified between the flowers of the male and female jojoba plants. This represented 40% of the annotated genes in the genome. Many genes, including those responsible for plant environmental responses and those encoding transcription factors (TFs), were specific to male or female reproductive organs. Genes involved in plant hormone metabolism were also found to be associated with flower and pollen development. A total of 8938 up-regulated and 7985 down-regulated genes were identified in comparison between male and female flowers, including many novel genes specific to the jojoba plant. The most differentially expressed genes were associated with reproductive organ development. The highest number of DEG were linked with the Y chromosome in male plants. The male specific parts of the Y chromosome encoded 12 very highly expressed genes including 9 novel genes and 3 known genes associated with TFs and a plant hormone which may play an important role in flower development. CONCLUSION: Many genes, largely with unknown functions, may explain the sexual dimorphisms in jojoba plants and the differentiation of male and female flowers.
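
The counts reported in this abstract can be cross-checked with simple arithmetic. The sketch below uses only figures stated above. The implied size of the annotated gene set is a back-calculation from the rounded "40%" figure, so it is an approximation rather than a reported value.

```python
# Figures taken from the abstract.
up_regulated = 8938
down_regulated = 7985
reported_total = 16923
deg_percent = 40  # "40% of the annotated genes" (rounded in the abstract)

# Up- and down-regulated genes should add up to the reported DEG total.
total = up_regulated + down_regulated

# Back-calculated size of the annotated gene set (integer division,
# approximate because the percentage is rounded).
implied_annotated = reported_total * 100 // deg_percent

print(total == reported_total)
print(implied_annotated)
```

The two directional counts are consistent with the reported total, and the percentage implies an annotation of roughly 42,300 genes.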


Subject(s)
Caryophyllales , Plant Growth Regulators , Animals , Droughts , Flowers/genetics , Gene Expression
11.
Infect Agent Cancer ; 18(1): 47, 2023 Aug 28.
Article in English | MEDLINE | ID: mdl-37641095

ABSTRACT

BACKGROUND: Esophageal squamous cell carcinoma (ESCC) has a poor prognosis and is one of the deadliest gastrointestinal malignancies. Despite numerous transcriptomics studies to understand its molecular basis, the impact of population-specific differences on this disease remains unexplored. AIMS: This study aimed to investigate the population-specific differences in gene expression patterns among ESCC samples obtained from six distinct global populations, identify differentially expressed genes (DEGs) and their associated pathways, and identify potential biomarkers for ESCC diagnosis and prognosis. In addition, this study deciphers population-specific microbial and chemical risk factors in ESCC. METHODS: We compared the gene expression patterns of ESCC samples from six different global populations by analyzing microarray datasets. To identify DEGs, we conducted stringent quality control and employed linear modeling. We cross-compared the resulting DEG lists of each population along with ESCC ATLAS to identify known and novel DEGs. We performed a survival analysis using The Cancer Genome Atlas Program (TCGA) data to identify potential biomarkers for ESCC diagnosis and prognosis among the novel DEGs. Finally, we performed comparative functional enrichment and toxicogenomic analysis. RESULTS: Here we report 19 genes with distinct expression patterns among populations, indicating population-specific variations in ESCC. Additionally, we discovered 166 novel DEGs, including ENDOU, SLCO1B3, KCNS3, and IFI35. The survival analysis identified three novel genes (CHRM3, CREG2, H2AC6) critical for ESCC survival. Notably, our findings showed that ECM-related gene ontology terms and pathways were significantly enriched among the DEGs in ESCC. We also found population-specific variations in immune response and microbial infection-related pathways which included genes enriched for HPV, Amoebiasis, Leishmaniasis, and Human Cytomegaloviruses.
Our toxicogenomic analysis identified tobacco smoking as the primary risk factor and cisplatin as the main drug chemical interacting with the maximum number of DEGs across populations. CONCLUSION: This study provides new insights into population-specific differences in gene expression patterns and their associated pathways in ESCC. Our findings suggest that changes in extracellular matrix (ECM) organization may be crucial to the development and progression of this cancer, and that environmental and genetic factors play important roles in the disease. The novel DEGs identified may serve as potential biomarkers for diagnosis, prognosis and treatment.

12.
BMC Genomics ; 24(1): 340, 2023 Jun 20.
Article in English | MEDLINE | ID: mdl-37340366

ABSTRACT

BACKGROUND: The oriental river prawn (Macrobrachium nipponense) is one of the dominant species in shrimp farming in China; it is a rich source of protein and contributes significantly to the quality of human life. Thus, more complete and accurate annotation of gene models is important for breeding research on the oriental river prawn. RESULTS: A full-length transcriptome of oriental river prawn muscle was obtained using the PacBio Sequel platform. Then, 37.99 Gb of subreads were sequenced, including 584,498 circular consensus sequences, among which 512,216 were full length non-chimeric sequences. After Illumina-based correction of long PacBio reads, 6,599 error-corrected isoforms were identified. Transcriptome structural analysis revealed 2,263 and 2,555 alternative splicing (AS) events and alternative polyadenylation (APA) sites, respectively. In total, 620 novel genes (NGs), 197 putative transcription factors (TFs), and 291 novel long non-coding RNAs (lncRNAs) were identified. CONCLUSIONS: In summary, this study offers novel insights into the transcriptome complexity and diversity of this prawn species, and provides valuable information for understanding the genomic structure and improving the draft genome annotation of oriental river prawn.


Subject(s)
Palaemonidae , Animals , Humans , Palaemonidae/genetics , Gene Expression Profiling , Transcriptome , Alternative Splicing , Protein Isoforms/genetics
13.
BMC Genomics ; 24(1): 254, 2023 May 11.
Article in English | MEDLINE | ID: mdl-37170194

ABSTRACT

BACKGROUND: Genomic complexity is a growing field of evolution, with case studies for comparative evolutionary analyses in model and emerging non-model systems. Understanding complexity and the functional components of the genome is an untapped wealth of knowledge ripe for exploration. With the "remarkable lack of correspondence" between genome size and complexity, there needs to be a way to quantify complexity across organisms. In this study, we use a set of complexity metrics that allow for evaluating changes in complexity using TranD. RESULTS: We ascertain if complexity is increasing or decreasing across transcriptomes and at what structural level, as complexity varies. In this study, we define three metrics (TpG, EpT, and EpG) to quantify the transcriptome's complexity that encapsulates the dynamics of alternative splicing. Here we compare complexity metrics across 1) whole genome annotations, 2) a filtered subset of orthologs, and 3) novel genes to elucidate the impacts of orthologs and novel genes in transcript model analysis. Effective Exon Number (EEN) is used to compare the distribution of exon sizes within transcripts against random expectations of uniform exon placement. EEN accounts for differences in exon size, which is important because novel gene differences in complexity for orthologs and whole-transcriptome analyses are biased towards low-complexity genes with few exons and few alternative transcripts. CONCLUSIONS: With our metric analyses, we are able to quantify changes in complexity across diverse lineages with greater precision and accuracy than previous cross-species comparisons under ortholog conditioning. These analyses represent a step toward whole-transcriptome analysis in the emerging field of non-model evolutionary genomics, with key insights for evolutionary inference of complexity changes on deep timescales across the tree of life.
We suggest a means to quantify biases generated in ortholog calling and correct complexity analysis for lineage-specific effects. With these metrics, we directly assay the quantitative properties of newly formed lineage-specific genes as they lower complexity.
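
To make the three metrics named in this abstract concrete, the sketch below computes them for a toy annotation. The abstract does not define the abbreviations, so the readings used here (TpG as transcripts per gene, EpT as exons per transcript, EpG as unique exons per gene) are assumptions, as are the input layout and the function name `complexity_metrics`. This is not TranD's interface.

```python
def complexity_metrics(annotation):
    """Mean TpG, EpT and EpG for an annotation given as
    {gene: {transcript: [(exon_start, exon_end), ...]}}.
    The metric definitions are assumed, not taken from the abstract."""
    tpg, ept, epg = [], [], []
    for transcripts in annotation.values():
        tpg.append(len(transcripts))          # transcripts in this gene
        unique_exons = set()
        for exons in transcripts.values():
            ept.append(len(exons))            # exons in this transcript
            unique_exons.update(exons)
        epg.append(len(unique_exons))         # distinct exons in this gene

    def mean(values):
        return sum(values) / len(values)

    return {"TpG": mean(tpg), "EpT": mean(ept), "EpG": mean(epg)}


# Hypothetical two-gene annotation used only for illustration.
toy = {
    "geneA": {
        "A.1": [(1, 100), (200, 300)],
        "A.2": [(1, 100), (400, 500)],
    },
    "geneB": {
        "B.1": [(10, 50)],
    },
}

if __name__ == "__main__":
    print(complexity_metrics(toy))
```

In this toy case geneA contributes two transcripts sharing one exon, so it counts three unique exons, while the single-exon geneB lowers every average. That matches the abstract's point that novel genes with few exons and few transcripts pull complexity estimates down.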


Subject(s)
Eukaryota , Transcriptome , Eukaryota/genetics , Genomics , Gene Expression Profiling , Genome , Alternative Splicing , Molecular Evolution
14.
Int J Mol Sci ; 24(10)2023 May 11.
Article in English | MEDLINE | ID: mdl-37239953

ABSTRACT

A theory of the evolutionary role of hereditary tumors, or the carcino-evo-devo theory, is being developed. The main hypothesis of the theory, the hypothesis of evolution by tumor neofunctionalization, posits that hereditary tumors provided additional cell masses during the evolution of multicellular organisms for the expression of evolutionarily novel genes. The carcino-evo-devo theory has formulated several nontrivial predictions that have been confirmed in the laboratory of the author. It also suggests several nontrivial explanations of biological phenomena previously unexplained by the existing theories or incompletely understood. By considering three major types of biological development-individual, evolutionary, and neoplastic development-within one theoretical framework, the carcino-evo-devo theory has the potential to become a unifying biological theory.


Subject(s)
Biological Evolution , Developmental Biology
15.
Genes (Basel) ; 14(5)2023 05 17.
Article in English | MEDLINE | ID: mdl-37239458

ABSTRACT

The magnitude of the childhood obesity epidemic and its effects on public health has accelerated the pursuit of practical preventative measures. Epigenetics is one subject that holds a lot of promise, despite being relatively new. The study of potentially heritable variations in gene expression that do not require modifications to the underlying DNA sequence is known as epigenetics. Here, we used Illumina MethylationEPIC BeadChip Array to identify differentially methylated regions in DNA isolated from saliva between normal weight (NW) and overweight/obese (OW/OB) children and between European American (EA) and African American (AA) children. A total of 3133 target IDs (associated with 2313 genes) were differentially methylated (p < 0.05) between NW and OW/OB children. In OW/OB children, 792 target IDs were hypermethylated and 2341 were hypomethylated compared to NW. Similarly, in the racial groups EA and AA, a total of 1239 target IDs corresponding to 739 genes were significantly differentially methylated in which 643 target IDs were hypermethylated and 596 were hypomethylated in the AA compared to EA participants. Along with this, the study identified novel genes that could contribute to the epigenetic regulation of childhood obesity.


Subject(s)
Health Status Disparities , Pediatric Obesity , Child , Humans , Black or African American/genetics , DNA Methylation , Genetic Epigenesis , Genome-Wide Association Study , Pediatric Obesity/ethnology , Pediatric Obesity/genetics , White
16.
Mol Biol Evol ; 40(3)2023 03 04.
Article in English | MEDLINE | ID: mdl-36917489

ABSTRACT

Intergenic genomic regions have essential regulatory and structural roles that impose constraints on their sequences. But regions that do not currently encode proteins also carry the potential to do so in the future. De novo gene emergence, the evolution of novel genes out of previously noncoding sequences has now been established as a potent force for genomic novelty. Recently, it was shown that intergenic regions in the genome of Saccharomyces cerevisiae harbor pervasive cryptic potential to, if theoretically translated, form transmembrane domains (TM domains) more frequently than expected by chance given their nucleotide composition, a property that we refer to as TM-forming enrichment. The source and biological relevance of this property is unknown. Here, we expand the investigation into the TM-forming potential of intergenic regions to the entire Saccharomycotina budding yeast subphylum, in an effort to explain this property and understand its importance. We find pervasive but variable enrichment in TM-forming potential across the subphylum regardless of the composition and average size of intergenic regions. This cryptic property is evenly spread across the genome, cannot be explained by the hydrophobic content of the sequence, and does not appear to localize to regions containing regulatory motifs. This TM-forming enrichment specifically, and not the actual TM-forming potential, is associated, across genomes, with more TM domains in evolutionarily young genes. Our findings shed light on this newly discovered feature of yeast genomes and constitute a first step toward understanding its evolutionary importance.


Subject(s)
Saccharomycetales , Yeasts , Intergenic DNA/genetics , Yeasts/genetics , Saccharomyces cerevisiae/genetics , Genomics , Genome , Saccharomycetales/genetics
17.
Int J Mol Sci ; 23(24)2022 Dec 15.
Article in English | MEDLINE | ID: mdl-36555647

ABSTRACT

Klebsiella pneumoniae is not only a human and animal opportunistic pathogen, but also a food-borne pathogen. Cross-kingdom infection has received attention since K. pneumoniae was identified as a pathogen of maize, banana, and pomegranate. Although the pathogenicity of K. pneumoniae strains (from ditch water, maize, and human) in plants and mice has been confirmed, there are no reports explaining the molecular mechanisms of the pathogen. This study analyzed K. pneumoniae KpC4, isolated from maize top rot, to determine its virulence genes and resistance genes. At least thirteen plant disease-causing genes are found to be involved in the disruption of plant defense. Among them, rcsB is responsible for causing disease in both plants and animals. The novel sequence types provide solid evidence that the pathogen invades plants and has robust ecological adaptability. It is imperative to perform further studies on the verification of these KpC4 genes' functions to understand the molecular mechanisms involved in plant-pathogen interactions.


Subject(s)
Cross Infection , Klebsiella Infections , Animals , Mice , Humans , Klebsiella pneumoniae , Virulence Factors/genetics , Zea mays , Virulence/genetics
18.
Cell Rep ; 41(12): 111808, 2022 12 20.
Article in English | MEDLINE | ID: mdl-36543139

ABSTRACT

Small open reading frames (sORFs) can encode functional "microproteins" that perform crucial biological tasks. However, their size makes them less amenable to genomic analysis, and their origins and conservation are poorly understood. Given their short length, it is plausible that some of these functional microproteins have recently originated entirely de novo from noncoding sequences. Here we sought to identify such cases in the human lineage by reconstructing the evolutionary origins of human microproteins previously found to have measurable, statistically significant fitness effects. By tracing the formation of each ORF and its transcriptional activation, we show that novel microproteins with significant phenotypic effects have emerged de novo throughout animal evolution, including two after the human-chimpanzee split. Notably, traditional methods for assessing coding potential would miss most of these cases. This evidence demonstrates that the functional potential intrinsic to sORFs can be relatively rapidly and frequently realized through de novo gene emergence.


Subject(s)
Molecular Evolution , Hominidae , Animals , Humans , Hominidae/genetics , Genome , Open Reading Frames/genetics , Pan troglodytes , Micropeptides
19.
Curr Biol ; 32(12): 2632-2639.e2, 2022 06 20.
Article in English | MEDLINE | ID: mdl-35588743

ABSTRACT

Comparisons of genomes of different species are used to identify lineage-specific genes, those genes that appear unique to one species or clade. Lineage-specific genes are often thought to represent genetic novelty that underlies unique adaptations. Identification of these genes depends not only on genome sequences, but also on inferred gene annotations. Comparative analyses typically use available genomes that have been annotated using different methods, increasing the risk that orthologous DNA sequences may be erroneously annotated as a gene in one species but not another, appearing lineage specific as a result. To evaluate the impact of such "annotation heterogeneity," we identified four clades of species with sequenced genomes with more than one publicly available gene annotation, allowing us to compare the number of lineage-specific genes inferred when differing annotation methods are used to those resulting when annotation method is uniform across the clade. In these case studies, annotation heterogeneity increases the apparent number of lineage-specific genes by up to 15-fold, suggesting that annotation heterogeneity is a substantial source of potential artifact.


Subject(s)
Genome , Base Sequence , Genome/genetics , Molecular Sequence Annotation
20.
Genes (Basel) ; 13(4)2022 03 27.
Article in English | MEDLINE | ID: mdl-35456404

ABSTRACT

Advances in sequencing technologies and bioinformatics tools have fueled a renewed interest in whole genome sequencing efforts in many organisms. The growing availability of multiple genome sequences has advanced our understanding of the within-species diversity, in the form of a pangenome. Pangenomics has opened new avenues for future research such as allowing dissection of complex molecular mechanisms and increased confidence in genome mapping. To comprehensively capture the genetic diversity for improving plant performance, the pangenome concept is further extended from species to genus level by the inclusion of wild species, constituting a super-pangenome. Characterization of pangenome has implications for both basic and applied research. The concept of pangenome has transformed the way biological questions are addressed. From understanding evolution and adaptation to elucidating host-pathogen interactions, finding novel genes or breeding targets to aid crop improvement to design effective vaccines for human prophylaxis, the increasing availability of the pangenome has revolutionized several aspects of biological research. The future availability of high-resolution pangenomes based on reference-level near-complete genome assemblies would greatly improve our ability to address complex biological problems.


Subject(s)
Plant Breeding , Plants , Chromosome Mapping , Humans , Plants/genetics