Your browser doesn't support javascript.
loading
Semantic Feature Extraction Using SBERT for Dementia Detection.
Santander-Cruz, Yamanki; Salazar-Colores, Sebastián; Paredes-García, Wilfrido Jacobo; Guendulain-Arenas, Humberto; Tovar-Arriaga, Saúl.
Afiliação
  • Santander-Cruz Y; Facultad de Ingeniería, Universidad Autónoma de Querétaro, Queretaro C.P. 76010, Mexico.
  • Salazar-Colores S; Centro de Investigaciones en Óptica, Leon C.P. 37150, Mexico.
  • Paredes-García WJ; Facultad de Ingeniería, Universidad Autónoma de Querétaro, Queretaro C.P. 76010, Mexico.
  • Guendulain-Arenas H; Departamento de Geriatría, Instituto Mexicano del Seguro Social, San Juan del Rio C.P. 76800, Mexico.
  • Tovar-Arriaga S; Facultad de Ingeniería, Universidad Autónoma de Querétaro, Queretaro C.P. 76010, Mexico.
Brain Sci ; 12(2)2022 Feb 15.
Article em En | MEDLINE | ID: mdl-35204032
Dementia is a neurodegenerative disease that leads to the development of cognitive deficits, such as aphasia, apraxia, and agnosia. It is currently considered one of the most significant major medical problems worldwide, primarily affecting the elderly. This condition gradually impairs the patient's cognition, eventually leading to the inability to perform everyday tasks without assistance. Since dementia is an incurable disease, early detection plays an important role in delaying its progression. Because of this, tools and methods have been developed to help accurately diagnose patients in their early stages. State-of-the-art methods have shown that the use of syntactic-type linguistic features provides a sensitive and noninvasive tool for detecting dementia in its early stages. However, these methods lack relevant semantic information. In this work, we propose a novel methodology, based on the semantic features approach, by using sentence embeddings computed by Siamese BERT networks (SBERT), along with support vector machine (SVM), K-nearest neighbors (KNN), random forest, and an artificial neural network (ANN) as classifiers. Our methodology extracted 17 features that provide demographic, lexical, syntactic, and semantic information from 550 oral production samples of elderly controls and people with Alzheimer's disease, provided by the DementiaBank Pitt Corpus database. To quantify the relevance of the extracted features for the dementia classification task, we calculated the mutual information score, which demonstrates a dependence between our features and the MMSE score. The experimental classification performance metrics, such as the accuracy, precision, recall, and F1 score (77, 80, 80, and 80%, respectively), validate that our methodology performs better than syntax-based methods and the BERT approach when only the linguistic features are used.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies / Screening_studies Idioma: En Revista: Brain Sci Ano de publicação: 2022 Tipo de documento: Article País de afiliação: México País de publicação: Suíça

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies / Screening_studies Idioma: En Revista: Brain Sci Ano de publicação: 2022 Tipo de documento: Article País de afiliação: México País de publicação: Suíça