Your browser doesn't support javascript.
loading
Measuring rank robustness in scored protein interaction networks.
Bozhilova, Lyuba V; Whitmore, Alan V; Wray, Jonny; Reinert, Gesine; Deane, Charlotte M.
Afiliación
  • Bozhilova LV; Department of Statistics, University of Oxford, 24-29 St Giles', Oxford, OX1 3LB, UK.
  • Whitmore AV; e-Therapeutics Plc, 17 Fenlock Rd, Long Hanborough, OX29 8LN, UK.
  • Wray J; e-Therapeutics Plc, 17 Fenlock Rd, Long Hanborough, OX29 8LN, UK.
  • Reinert G; Department of Statistics, University of Oxford, 24-29 St Giles', Oxford, OX1 3LB, UK.
  • Deane CM; Department of Statistics, University of Oxford, 24-29 St Giles', Oxford, OX1 3LB, UK. deane@stats.ox.ac.uk.
BMC Bioinformatics ; 20(1): 446, 2019 Aug 28.
Article en En | MEDLINE | ID: mdl-31462221
BACKGROUND: Protein interaction databases often provide confidence scores for each recorded interaction based on the available experimental evidence. Protein interaction networks (PINs) are then built by thresholding on these scores, so that only interactions of sufficiently high quality are included. These networks are used to identify biologically relevant motifs or nodes using metrics such as degree or betweenness centrality. This type of analysis can be sensitive to the choice of threshold. If a node metric is to be useful for extracting biological signal, it should induce similar node rankings across PINs obtained at different reasonable confidence score thresholds. RESULTS: We propose three measures-rank continuity, identifiability, and instability-to evaluate how robust a node metric is to changes in the score threshold. We apply our measures to twenty-five metrics and identify four as the most robust: the number of edges in the step-1 ego network, as well as the leave-one-out differences in average redundancy, average number of edges in the step-1 ego network, and natural connectivity. Our measures show good agreement across PINs from different species and data sources. Analysis of synthetically generated scored networks shows that robustness results are context-specific, and depend both on network topology and on how scores are placed across network edges. CONCLUSION: Due to the uncertainty associated with protein interaction detection, and therefore network structure, for PIN analysis to be reproducible, it should yield similar results across different confidence score thresholds. We demonstrate that while certain node metrics are robust with respect to threshold choice, this is not always the case. Promisingly, our results suggest that there are some metrics that are robust across networks constructed from different databases, and different scoring procedures.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Proteínas / Biología Computacional / Bases de Datos de Proteínas / Mapas de Interacción de Proteínas Tipo de estudio: Prognostic_studies Límite: Humans Idioma: En Revista: BMC Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2019 Tipo del documento: Article Pais de publicación: Reino Unido

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Proteínas / Biología Computacional / Bases de Datos de Proteínas / Mapas de Interacción de Proteínas Tipo de estudio: Prognostic_studies Límite: Humans Idioma: En Revista: BMC Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2019 Tipo del documento: Article Pais de publicación: Reino Unido