A legume specific protein database (LegProt) improves the number of identified peptides, confidence scores and overall protein identification success rates for legume proteomics.

Lei, Zhentian; Dai, Xinbin; Watson, Bonnie S; Zhao, Patrick X; Sumner, Lloyd W

Lei, Zhentian; Dai, Xinbin; Watson, Bonnie S; Zhao, Patrick X; Sumner, Lloyd W.

Afiliación

Lei Z; Plant Biology Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73401, USA.

Phytochemistry ; 72(10): 1020-7, 2011 Jul.

Article en En | MEDLINE | ID: mdl-21353266

RESUMEN

A legume specific protein database (LegProt) has been created containing sequences from seven legume species, i.e., Glycine max, Lotus japonicus, Medicago sativa, Medicago truncatula, Lupinusalbus, Phaseolus vulgaris, and Pisum sativum. The database consists of amino acid sequences translated from predicted gene models and 6-frame translations of tentative consensus (TC) sequences assembled from expressed sequence tags (ESTs) and singleton ESTs. This database was queried using mass spectral data for protein identification and identification success rates were compared to the NCBI nr database. Specifically, Mascot MS/MS ion searches of tandem nano-LC Q-TOFMS/MS mass spectral data showed that relative to the NCBI nr protein database, the LegProt database yielded a 54% increase in the average protein score (i.e., from NCBI nr 480 to LegProt 739) and a 50% increase in the average number of matched peptides (i.e., from NCBI nr 8 to LegProt 12). The overall identification success rate also increased from 88% (NCBI nr) to 93% (LegProt). Mascot peptide mass fingerprinting (PMF) searches of the LegProt database using MALDI-TOFMS data yielded a significant increase in the identification success rate from 19% (NCBI nr) to 34% (LegProt) while the average scores and average number of matched peptides showed insignificant changes. The results demonstrate that the LegProt database significantly increases legume protein identification success rates and the confidence levels compared to the commonly used NCBI nr. These improvements are primarily due to the presence of a large number of legume specific TC sequences in the LegProt database that were not found in NCBI nr. The LegProt database is freely available for download (http://bioinfo.noble.org/manuscript-support/legumedb) and will serve as a valuable resource for legume proteomics.

Asunto(s)

Péptidos/análisis; Proteínas de Plantas/análisis; Proteómica; Bases de Datos de Proteínas; Espectrometría de Masas en Tándem

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Péptidos / Proteínas de Plantas / Proteómica Tipo de estudio: Diagnostic_studies / Prognostic_studies Idioma: En Revista: Phytochemistry Año: 2011 Tipo del documento: Article País de afiliación: Estados Unidos Pais de publicación: Reino Unido

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google