Búsqueda | Portal Regional de la BVS

Speaker-turn aware diarization for speech-based cognitive assessments.

Xu, Sean Shensheng; Ke, Xiaoquan; Mak, Man-Wai; Wong, Ka Ho; Meng, Helen; Kwok, Timothy C Y; Gu, Jason; Zhang, Jian; Tao, Wei; Chang, Chunqi.

Front Neurosci ; 17: 1351848, 2023.

Artículo en Inglés | MEDLINE | ID: mdl-38292896

RESUMEN

Introduction: Speaker diarization is an essential preprocessing step for diagnosing cognitive impairments from speech-based Montreal cognitive assessments (MoCA). Methods: This paper proposes three enhancements to the conventional speaker diarization methods for such assessments. The enhancements tackle the challenges of diarizing MoCA recordings on two fronts. First, multi-scale channel interdependence speaker embedding is used as the front-end speaker representation for overcoming the acoustic mismatch caused by far-field microphones. Specifically, a squeeze-and-excitation (SE) unit and channel-dependent attention are added to Res2Net blocks for multi-scale feature aggregation. Second, a sequence comparison approach with a holistic view of the whole conversation is applied to measure the similarity of short speech segments in the conversation, which results in a speaker-turn aware scoring matrix for the subsequent clustering step. Third, to further enhance the diarization performance, we propose incorporating a pairwise similarity measure so that the speaker-turn aware scoring matrix contains both local and global information across the segments. Results: Evaluations on an interactive MoCA dataset show that the proposed enhancements lead to a diarization system that outperforms the conventional x-vector/PLDA systems under language-, age-, and microphone-mismatch scenarios. Discussion: The results also show that the proposed enhancements can help hypothesize the speaker-turn timestamps, making the diarization method amendable to datasets without timestamp information.

I-Vector-Based Patient Adaptation of Deep Neural Networks for Automatic Heartbeat Classification.

Xu, Sean Shensheng; Mak, Man-Wai; Cheung, Chi-Chung.

IEEE J Biomed Health Inform ; 24(3): 717-727, 2020 03.

Artículo en Inglés | MEDLINE | ID: mdl-31150349

RESUMEN

Automatic classification of electrocardiogram (ECG) signals is important for diagnosing heart arrhythmias. A big challenge in automatic ECG classification is the variation in the waveforms and characteristics of ECG signals among different patients. To address this issue, this paper proposes adapting a patient-independent deep neural network (DNN) using the information in the patient-dependent identity vectors (i-vectors). The adapted networks, namely i-vector adapted patient-specific DNNs (iAP-DNNs), are tuned toward the ECG characteristics of individual patients. For each patient, his/her ECG waveforms are compressed into an i-vector using a factor analysis model. Then, this i-vector is injected into the middle hidden layer of the patient-independent DNN. Stochastic gradient descent is then applied to fine-tune the whole network to form a patient-specific classifier. As a result, the adaptation makes use of not only the raw ECG waveforms from the specific patient but also the compact representation of his/her ECG characteristics through the i-vector. Analysis on the hidden-layer activations shows that by leveraging the information in the i-vectors, the iAP-DNNs are more capable of discriminating normal heartbeats against arrhythmic heartbeats than the networks that use the patient-specific ECG only for the adaptation. Experimental results based on the MIT-BIH database suggest that the iAP-DNNs perform better than existing patient-specific classifiers in terms of various performance measures. In particular, the sensitivity and specificity of the existing methods are all under the receiver operating characteristic curves of the iAP-DNNs.

Asunto(s)

Arritmias Cardíacas/diagnóstico , Electrocardiografía/métodos , Frecuencia Cardíaca/fisiología , Redes Neurales de la Computación , Algoritmos , Electrocardiografía/clasificación , Humanos , Procesamiento de Señales Asistido por Computador

Towards End-to-End ECG Classification With Raw Signal Extraction and Deep Neural Networks.

Xu, Sean Shensheng; Mak, Man-Wai; Cheung, Chi-Chung.

IEEE J Biomed Health Inform ; 23(4): 1574-1584, 2019 07.

Artículo en Inglés | MEDLINE | ID: mdl-30235153

RESUMEN

This paper proposes deep learning methods with signal alignment that facilitate the end-to-end classification of raw electrocardiogram (ECG) signals into heartbeat types, i.e., normal beat or different types of arrhythmias. Time-domain sample points are extracted from raw ECG signals, and consecutive vectors are extracted from a sliding time-window covering these sample points. Each of these vectors comprises the consecutive sample points of a complete heartbeat cycle, which includes not only the QRS complex but also the P and T waves. Unlike existing heartbeat classification methods in which medical doctors extract handcrafted features from raw ECG signals, the proposed end-to-end method leverages a deep neural network for both feature extraction and classification based on aligned heartbeats. This strategy not only obviates the need to handcraft the features but also produces optimized ECG representation for heartbeat classification. Evaluations on the MIT-BIH arrhythmia database show that at the same specificity, the proposed patient-independent classifier can detect supraventricular- and ventricular-ectopic beats at a sensitivity that is at least 10% higher than current state-of-the-art methods. More importantly, there is a wide range of operating points in which both the sensitivity and specificity of the proposed classifier are higher than those achieved by state-of-the-art classifiers. The proposed classifier can also perform comparable to patient-specific classifiers, but at the same time enjoys the advantage of patient independence.

Asunto(s)

Aprendizaje Profundo , Electrocardiografía/métodos , Procesamiento de Señales Asistido por Computador , Adulto , Anciano , Anciano de 80 o más Años , Arritmias Cardíacas/diagnóstico , Femenino , Frecuencia Cardíaca/fisiología , Humanos , Masculino , Persona de Mediana Edad , Adulto Joven

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA