DepressionEmo: A novel dataset for multilabel classification of depression emotions.

Rahman, Abu Bakar Siddiqur; Ta, Hoang-Thang; Najjar, Lotfollah; Azadmanesh, Azad; Gönul, Ali Saffet

Rahman, Abu Bakar Siddiqur; Ta, Hoang-Thang; Najjar, Lotfollah; Azadmanesh, Azad; Gönul, Ali Saffet.

Afiliación

Rahman ABS; College of Information Science and Technology, University of Nebraska Omaha, USA. Electronic address: abubakarsiddiqurra@unomaha.edu.
Ta HT; Department of Information Technology, Dalat University, Da Lat, Lam Dong, Vietnam. Electronic address: thangth@dlu.edu.vn.
Najjar L; College of Information Science and Technology, University of Nebraska Omaha, USA. Electronic address: lnajjar@unomaha.edu.
Azadmanesh A; College of Information Science and Technology, University of Nebraska Omaha, USA. Electronic address: azad@unomaha.edu.
Gönul AS; Department of Psychiatry, Ege University, Bornova, Izmir, Turkey. Electronic address: ali.saffet.gonul@ege.edu.tr.

J Affect Disord ; 366: 445-458, 2024 Dec 01.

Article en En | MEDLINE | ID: mdl-39214375

ABSTRACT

ABSTRACT

Emotions are integral to human social interactions, with diverse responses elicited by various situational contexts. Particularly, the prevalence of negative emotional states has been correlated with negative outcomes for mental health, necessitating a comprehensive analysis of their occurrence and impact on individuals. In this paper, we introduce a novel dataset named DepressionEmo designed to detect 8 emotions associated with depression by 6037 examples of long Reddit user posts. This dataset was created through a majority vote over inputs by zero-shot classifications from pre-trained models and validating the quality by annotators and ChatGPT, exhibiting an acceptable level of inter-rater reliability between annotators. The correlation between emotions, and linguistic analysis are conducted on DepressionEmo. Besides, we provide several text classification methods classified into two groups machine learning methods such as SVM, XGBoost, and LightGBM; and deep learning methods such as BERT, BART, GAN-BERT, and T5. Despite achieving the same F1 Macro score of 0.76 as BART, the pretrained BERT model, bert-base-uncased, stands out as the most efficient model in our experiments due to its lower number of parameters. Across all emotions, the highest F1 Macro value is achieved by suicide intent, indicating a certain value of our dataset in identifying emotions in individuals with depression symptoms through text analysis. The curated dataset is publicly available at https//github.com/abuBakarSiddiqurRahman/DepressionEmo.

Asunto(s)

Depresión; Emociones; Aprendizaje Automático; Humanos; Depresión/clasificación; Depresión/psicología; Reproducibilidad de los Resultados; Aprendizaje Profundo; Conjuntos de Datos como Asunto

Palabras clave

Dataset; Depression identification; Emotion analysis; Psycholinguistic analysis; Text classification

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Depresión / Emociones / Aprendizaje Automático Límite: Humans Idioma: En Revista: J Affect Disord Año: 2024 Tipo del documento: Article Pais de publicación: Países Bajos

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google