Establishing control corpora for depression detection in Modern Greek: Methodological insights

Vivian Stamou, George Mikros, George Markopoulos, Spyridoula Varlokosta

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

This paper presents a methodological approach for establishing control corpora in the context of depression detection in the Modern Greek language. We discuss various methods used to create control corpora, focusing on the challenge of selecting representative samples from the general population when the target reference is the depressed population. Our approach includes traditional random selection among Twitter users, as well as an innovative method for creating topic-oriented control corpora. Through this study, we provide insights into the development of control corpora, offering valuable considerations for researchers working on similar projects in linguistic analysis and mental health studies. In addition, we identify several dominant topics in the depressed population such as religion, sentiments, health, sleep and digestion, which seem to align with findings consistently reported in the literature.

Original language: English
Title of host publication: 5th RaPID Workshop
Subtitle of host publication: Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, RAPID 2024 at LREC-COLING 2024 - Workshop Proceedings
Editors: Dimitrios Kokkinakis, Kathleen C. Fraser, Charalambos K. Themistocleous, Kristina Lundholm Fors, Athanasios Tsanas, Fredrik Ohman
Publisher: European Language Resources Association (ELRA)
Pages: 68-76
Number of pages: 9
ISBN (Electronic): 9782493814111
Publication status: Published - 2024
Event: 5th RaPID Workshop on Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, RAPID 2024 - Torino, Italy
Duration: 21 May 2024 → …

Publication series

Name: 5th RaPID Workshop: Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, RAPID 2024 at LREC-COLING 2024 - Workshop Proceedings

Conference

Conference: 5th RaPID Workshop on Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, RAPID 2024
Country/Territory: Italy
City: Torino
Period: 21/05/24 → …

Keywords

  • control corpora
  • depression detection
  • topic modeling
