Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Mais filtros











Base de dados
Intervalo de ano de publicação
1.
Genes (Basel) ; 13(5)2022 05 19.
Artigo em Inglês | MEDLINE | ID: mdl-35627292

RESUMO

Many living organisms have DNA in their cells that is responsible for their biological features. DNA is an organic molecule of two complementary strands of four different nucleotides wound up in a double helix. These nucleotides are adenine (A), thymine (T), guanine (G), and cytosine (C). Genes are DNA sequences containing the information to synthesize proteins. The genes of higher eukaryotic organisms contain coding sequences, known as exons and non-coding sequences, known as introns, which are removed on splice sites after the DNA is transcribed into RNA. Genome annotation is the process of identifying the location of coding regions and determining their function. This process is fundamental for understanding gene structure; however, it is time-consuming and expensive when done by biochemical methods. With technological advances, splice site detection can be done computationally. Although various software tools have been developed to predict splice sites, they need to improve accuracy and reduce false-positive rates. The main goal of this research was to generate Deep Splicer, a deep learning model to identify splice sites in the genomes of humans and other species. This model has good performance metrics and a lower false-positive rate than the currently existing tools. Deep Splicer achieved an accuracy between 93.55% and 99.66% on the genetic sequences of different organisms, while Splice2Deep, another splice site detection tool, had an accuracy between 90.52% and 98.08%. Splice2Deep surpassed Deep Splicer on the accuracy obtained after evaluating C. elegans genomic sequences (97.88% vs. 93.62%) and A. thaliana (95.40% vs. 94.93%); however, Deep Splicer's accuracy was better for H. sapiens (98.94% vs. 97.15%) and D. melanogaster (97.14% vs. 92.30%). The rate of false positives was 0.11% for human genetic sequences and 0.25% for other species' genetic sequences. Another splice prediction tool, Splice Finder, had between 1% and 3% of false positives for human sequences, while other species' sequences had around 4% and 10%.


Assuntos
Caenorhabditis elegans , Drosophila melanogaster , Animais , Caenorhabditis elegans/genética , DNA/genética , Drosophila melanogaster/genética , Humanos , Nucleotídeos , Software
2.
RNA ; 26(7): 784-793, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32241834

RESUMO

Long noncoding RNAs (lncRNAs) have recently emerged as prominent regulators of gene expression in eukaryotes. LncRNAs often drive the modification and maintenance of gene activation or gene silencing states via chromatin conformation rearrangements. In plants, lncRNAs have been shown to participate in gene regulation, and are essential to processes such as vernalization and photomorphogenesis. Despite their prominent functions, only over a dozen lncRNAs have been experimentally and functionally characterized. Similar to its animal counterparts, the rates of sequence divergence are much higher in plant lncRNAs than in protein coding mRNAs, making it difficult to identify lncRNA conservation using traditional sequence comparison methods. Beyond this, little is known about the evolutionary patterns of lncRNAs in plants. Here, we characterized the splicing conservation of lncRNAs in Brassicaceae. We generated a whole-genome alignment of 16 Brassica species and used it to identify synthenic lncRNA orthologs. Using a scoring system trained on transcriptomes from A. thaliana and B. oleracea, we identified splice sites across the whole alignment and measured their conservation. Our analysis revealed that 17.9% (112/627) of all intergenic lncRNAs display splicing conservation in at least one exon, an estimate that is substantially higher than previous estimates of lncRNA conservation in this group. Our findings agree with similar studies in vertebrates, demonstrating that splicing conservation can be evidence of stabilizing selection. We provide conclusive evidence for the existence of evolutionary deeply conserved lncRNAs in plants and describe a generally applicable computational workflow to identify functional lncRNAs in plants.


Assuntos
Sequência Conservada/genética , Splicing de RNA/genética , RNA Longo não Codificante/genética , RNA de Plantas/genética , Arabidopsis/genética , Brassica/genética , Evolução Molecular , Genoma de Planta/genética , RNA Mensageiro/genética
3.
Artigo em Inglês | MEDLINE | ID: mdl-30038900

RESUMO

Syf1 is a tetratricopeptide repeat (TPR) protein implicated in transcription elongation, spliceosome conformation, mRNA nuclear-cytoplasmic export and transcription-coupled DNA repair. Recently, we identified the spliceosomal components of the human parasite Entamoeba histolytica, among them is EhSyf. Molecular predictions confirmed that EhSyf contains 15 type 1 TPR tandem α-antiparallel array motifs. Amoeba transformants carrying plasmids overexpressing HA-tagged or EhSyf silencing plasmids were established to monitor the impact of EhSyf on the splicing of several test Entamoeba transcripts. EhSyf Entamoeba transformants efficiently silenced or overexpressed the proteins in the nucleus. The overexpression or absence of EhSyf notably enhanced or blocked splicing of transcripts irrespective of the strength of their 3' splice site. Finally, the absence of EhSyf negatively affected the transcription of an intron-less transcript. Altogether our data suggest that EhSyf is a bona fide Syf1 ortholog involved in transcription and splicing.


Assuntos
Entamoeba histolytica/enzimologia , Entamoeba histolytica/metabolismo , Proteínas de Protozoários/metabolismo , Splicing de RNA , RNA Mensageiro/metabolismo , Motivos de Aminoácidos , Entamoeba histolytica/genética , Regulação da Expressão Gênica , Conformação Proteica , Proteínas de Protozoários/genética , Sequências Repetitivas de Aminoácidos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA