RESUMO
SUMMARY: Growth of genome data and analysis possibilities have brought new levels of difficulty for scientists to understand, integrate and deal with all this ever-increasing information. In this scenario, GARSA has been conceived aiming to facilitate the tasks of integrating, analyzing and presenting genomic information from several bioinformatics tools and genomic databases, in a flexible way. GARSA is a user-friendly web-based system designed to analyze genomic data in the context of a pipeline. EST and GGS data can be analyzed using the system since it accepts (1) chromatograms, (2) download of sequences from GenBank, (3) Fasta files stored locally or (4) a combination of all three. Quality evaluation of chromatograms, vector removing and clusterization are easily performed as part of the pipeline. A number of local and customizable Blast and CDD analyses can be performed as well as Interpro, complemented with phylogeny analyses. GARSA is being used for the analyses of Trypanosoma vivax (GSS and EST), Trypanosoma rangeli (GSS, EST and ORESTES), Bothrops jararaca (EST), Piaractus mesopotamicus (EST) and Lutzomyia longipalpis (EST). AVAILABILITY: The GARSA system is freely available under GPL license (http://www.biowebdb.org/garsa/). For download requests visit http://www.biowebdb.org/garsa/ or contact Dr Alberto Dávila.
Assuntos
Biologia Computacional/métodos , Animais , Bothrops/metabolismo , Cromatografia , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Bases de Dados de Ácidos Nucleicos , Bases de Dados de Proteínas , Etiquetas de Sequências Expressas , Genoma , Genômica , Armazenamento e Recuperação da Informação , Internet , Filogenia , Alinhamento de Sequência , Análise de Sequência de DNA , Software , Trypanosoma/metabolismo , Trypanosoma vivax/genética , Interface Usuário-ComputadorRESUMO
A survey of the Trypanosoma vivax genome was carried out by the genome sequence survey (GSS) approach resulting in 1,086 genomic sequences. A total of 455 high-quality GSS sequences were generated, consisting of 331 non-redundant sequences distributed in 264 singlets and 67 clusters in a total of 135.5 Kb of the T. vivax genome. The estimation of the overall G+C content, and the prediction of the presence of ORFs and putative genes were carried out using the Glimmer and Jemboss packages. Analysis of the obtained sequences was carried out by BLAST programs against 12 different databases and also using the Conserved Domain Database, InterProScan, and tRNAscan-SE. Along with the existing 23 T. vivax entries in the GenBank, the 32 putative genes predicted and the 331 non-redundant GSS sequences reported herein represent new potential markers for the development of PCRbased assays for specific diagnosis and typing of Trypanosoma vivax.