Your browser doesn't support javascript.
loading
Identifying clinically important COPD sub-types using data-driven approaches in primary care population based electronic health records.
Pikoula, Maria; Quint, Jennifer Kathleen; Nissen, Francis; Hemingway, Harry; Smeeth, Liam; Denaxas, Spiros.
Afiliación
  • Pikoula M; Institute of Health Informatics, University College London, 222 Euston Road, London, NW1 2DA, UK. m.pikoula@ucl.ac.uk.
  • Quint JK; Health Data Research UK London, University College London, 222 Euston Road, London, NW1 2DA, UK. m.pikoula@ucl.ac.uk.
  • Nissen F; Health Data Research UK London, University College London, 222 Euston Road, London, NW1 2DA, UK.
  • Hemingway H; Respiratory Epidemiology, Occupational Medicine and Public Health, National Heart and Lung Institute, Imperial College London, London, UK.
  • Smeeth L; EHR Research Group, School of Hygiene and Tropical Medicine, London, UK.
  • Denaxas S; Health Data Research UK London, University College London, 222 Euston Road, London, NW1 2DA, UK.
BMC Med Inform Decis Mak ; 19(1): 86, 2019 04 18.
Article en En | MEDLINE | ID: mdl-30999919
BACKGROUND: COPD is a highly heterogeneous disease composed of different phenotypes with different aetiological and prognostic profiles and current classification systems do not fully capture this heterogeneity. In this study we sought to discover, describe and validate COPD subtypes using cluster analysis on data derived from electronic health records. METHODS: We applied two unsupervised learning algorithms (k-means and hierarchical clustering) in 30,961 current and former smokers diagnosed with COPD, using linked national structured electronic health records in England available through the CALIBER resource. We used 15 clinical features, including risk factors and comorbidities and performed dimensionality reduction using multiple correspondence analysis. We compared the association between cluster membership and COPD exacerbations and respiratory and cardiovascular death with 10,736 deaths recorded over 146,466 person-years of follow-up. We also implemented and tested a process to assign unseen patients into clusters using a decision tree classifier. RESULTS: We identified and characterized five COPD patient clusters with distinct patient characteristics with respect to demographics, comorbidities, risk of death and exacerbations. The four subgroups were associated with 1) anxiety/depression; 2) severe airflow obstruction and frailty; 3) cardiovascular disease and diabetes and 4) obesity/atopy. A fifth cluster was associated with low prevalence of most comorbid conditions. CONCLUSIONS: COPD patients can be sub-classified into groups with differing risk factors, comorbidities, and prognosis, based on data included in their primary care records. The identified clusters confirm findings of previous clustering studies and draw attention to anxiety and depression as important drivers of the disease in young, female patients.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Atención Primaria de Salud / Enfermedad Pulmonar Obstructiva Crónica / Registros Electrónicos de Salud Tipo de estudio: Etiology_studies / Prevalence_studies / Prognostic_studies / Risk_factors_studies Límite: Aged / Female / Humans / Male / Middle aged País/Región como asunto: Europa Idioma: En Revista: BMC Med Inform Decis Mak Asunto de la revista: INFORMATICA MEDICA Año: 2019 Tipo del documento: Article Pais de publicación: Reino Unido

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Atención Primaria de Salud / Enfermedad Pulmonar Obstructiva Crónica / Registros Electrónicos de Salud Tipo de estudio: Etiology_studies / Prevalence_studies / Prognostic_studies / Risk_factors_studies Límite: Aged / Female / Humans / Male / Middle aged País/Región como asunto: Europa Idioma: En Revista: BMC Med Inform Decis Mak Asunto de la revista: INFORMATICA MEDICA Año: 2019 Tipo del documento: Article Pais de publicación: Reino Unido