TY - GEN
T1 - Establishing control corpora for depression detection in Modern Greek
T2 - 5th RaPID Workshop on Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, RAPID 2024
AU - Stamou, Vivian
AU - Mikros, George
AU - Markopoulos, George
AU - Varlokosta, Spyridoula
N1 - Publisher Copyright: © 2024 ELRA Language Resource Association: CC BY-NC 4.0.
PY - 2024
Y1 - 2024
N2 - This paper presents a methodological approach for establishing control corpora in the context of depression detection in the Modern Greek language. We discuss various methods used to create control corpora, focusing on the challenge of selecting representative samples from the general population when the target reference is the depressed population. Our approach includes traditional random selection among Twitter users, as well as an innovative method for creating topic-oriented control corpora. Through this study, we provide insights into the development of control corpora, offering valuable considerations for researchers working on similar projects in linguistic analysis and mental health studies. In addition, we identify several dominant topics in the depressed population such as religion, sentiments, health, sleep and digestion, which seem to align with findings consistently reported in the literature.
KW - control corpora
KW - depression detection
KW - topic modeling
UR - http://www.scopus.com/inward/record.url?scp=85195201420&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85195201420
T3 - 5th RaPID Workshop: Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, RAPID 2024 at LREC-COLING 2024 - Workshop Proceedings
SP - 68
EP - 76
BT - 5th RaPID Workshop
A2 - Kokkinakis, Dimitrios
A2 - Fraser, Kathleen C.
A2 - Themistocleous, Charalambos K.
A2 - Lundholm Fors, Kristina
A2 - Tsanas, Athanasios
A2 - Öhman, Fredrik
PB - European Language Resources Association (ELRA)
Y2 - 21 May 2024
ER -