RESUMEN
Leishmania parasite infections, termed the leishmaniases, cause significant global infectious disease burden. The lifecycle of the parasite embodies three main stages that require precise coordination of gene regulation to survive environmental shifts between sandfly and mammalian hosts. Constitutive transcription in kinetoplastid parasites means that gene regulation is overwhelmingly reliant on post-transcriptional mechanisms, yet strikingly few Leishmania trans-regulators are known. Using optimized crosslinking and deep, quantified mass spectrometry, we present a comprehensive analysis of 1400 mRNA binding proteins (mRBPs) and whole cell proteomes from the three main Leishmania lifecycle stages. Supporting the validity, although the crosslinked RBPome is magnitudes more enriched, the protein identities of the crosslinked and non-crosslinked RBPomes were nearly identical. Moreover, multiple candidate RBPs were endogenously tagged and found to associate with discrete mRNA target pools in a stage-specific manner. Results indicate that in L. mexicana parasites, mRNA levels are not a strong predictor of the whole cell expression or RNA binding potential of encoded proteins. Evidence includes a low correlation between transcript and corresponding protein expression and stage-specific variation in protein expression versus RNA binding potential. Unsurprisingly, RNA binding protein enrichment correlates strongly with relative replication efficiency of the specific lifecycle stage. Our study is the first to quantitatively define and compare the mRBPome of multiple stages in kinetoplastid parasites. It provides novel, in-depth insight into the trans-regulatory mRNA:Protein (mRNP) complexes that drive Leishmania parasite lifecycle progression.
Asunto(s)
Leishmania mexicana/genética , Parásitos/genética , Proteoma/metabolismo , Animales , Ontología de Genes , Estadios del Ciclo de Vida , Ratones Endogámicos BALB C , Análisis de Componente Principal , Proteómica , Proteínas Protozoarias/metabolismo , ARN Mensajero/genética , ARN Mensajero/metabolismo , Proteínas de Unión al ARN/metabolismo , Reproducibilidad de los Resultados , Transcriptoma/genéticaRESUMEN
RNA-binding proteins (RBPs) are at the core of post-transcriptional regulation and thus of gene expression control at the RNA level. One of the principal challenges in the field of gene expression regulation is to understand RBPs mechanism of action. As a result of recent evolution of experimental techniques, it is now possible to obtain the RNA regions recognized by RBPs on a transcriptome-wide scale. In fact, CLIP-seq protocols use the joint action of CLIP, crosslinking immunoprecipitation, and high-throughput sequencing to recover the transcriptome-wide set of interaction regions for a particular protein. Nevertheless, computational methods are necessary to process CLIP-seq experimental data and are a key to advancement in the understanding of gene regulatory mechanisms. Considering the importance of computational methods in this area, we present a review of the current status of computational approaches used and proposed for CLIP-seq data.