Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 2 de 2
Filtrar
Más filtros











Base de datos
Intervalo de año de publicación
1.
J Comput Biol ; 19(3): 279-92, 2012 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-22175250

RESUMEN

Unchained base reads on self-assembling DNA nanoarrays have recently emerged as a promising approach to low-cost, high-quality resequencing of human genomes. Because of unique characteristics of these mated pair reads, existing computational methods for resequencing assembly, such as those based on map-consensus calling, are not adequate for accurate variant calling. We describe novel computational methods developed for accurate calling of SNPs and short substitutions and indels (<100 bp); the same methods apply to evaluation of hypothesized larger, structural variations. We use an optimization process that iteratively adjusts the genome sequence to maximize its a posteriori probability given the observed reads. For each candidate sequence, this probability is computed using Bayesian statistics with a simple read generation model and simplifying assumptions that make the problem computationally tractable. The optimization process iteratively applies one-base substitutions, insertions, and deletions until convergence is achieved to an optimum diploid sequence. A local de novo assembly procedure that generalizes approaches based on De Bruijn graphs is used to seed the optimization process in order to reduce the chance of converging to local optima. Finally, a correlation-based filter is applied to reduce the false positive rate caused by the presence of repetitive regions in the reference genome.


Asunto(s)
Mapeo Contig/métodos , Genoma Humano , Análisis de Secuencia de ADN/métodos , Algoritmos , Alelos , Secuencia de Bases , Teorema de Bayes , Mapeo Cromosómico , Simulación por Computador , Interpretación Estadística de Datos , Humanos , Modelos Genéticos
2.
Science ; 327(5961): 78-81, 2010 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-19892942

RESUMEN

Genome sequencing of large numbers of individuals promises to advance the understanding, treatment, and prevention of human diseases, among other applications. We describe a genome sequencing platform that achieves efficient imaging and low reagent consumption with combinatorial probe anchor ligation chemistry to independently assay each base from patterned nanoarrays of self-assembling DNA nanoballs. We sequenced three human genomes with this platform, generating an average of 45- to 87-fold coverage per genome and identifying 3.2 to 4.5 million sequence variants per genome. Validation of one genome data set demonstrates a sequence accuracy of about 1 false variant per 100 kilobases. The high accuracy, affordable cost of $4400 for sequencing consumables, and scalability of this platform enable complete human genome sequencing for the detection of rare variants in large-scale genetic studies.


Asunto(s)
ADN/química , Genoma Humano , Análisis por Micromatrices , Análisis de Secuencia de ADN/métodos , Secuencia de Bases , Biología Computacional , Costos y Análisis de Costo , ADN/genética , Bases de Datos de Ácidos Nucleicos , Biblioteca Genómica , Genotipo , Haplotipos , Proyecto Genoma Humano , Humanos , Masculino , Nanoestructuras , Nanotecnología , Técnicas de Amplificación de Ácido Nucleico , Polimorfismo de Nucleótido Simple , Análisis de Secuencia de ADN/economía , Análisis de Secuencia de ADN/instrumentación , Análisis de Secuencia de ADN/normas , Programas Informáticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA