Authors
Lidan Liu, F Tydeman, Wanqing Xie, Y Wang
Publication date
2024/10/29
Description
Current assessments for depressive disorder are often influenced by cognitive function, which makes them susceptible to bias. Deep learning could provide more objective diagnoses with fewer access barriers for individuals who are unable to complete traditional assessments. In this study, we explore the relationships among speech, language, and depression to demonstrate the feasibility of multilingual speech-based depression detection, and then build deep learning models on multilingual speech samples to support depression diagnosis. We first used a newly collected Chinese speech depression dataset to build a convolutional neural network (CNN) for depression detection, which reached an accuracy of 0.85 on the test set. We then applied the same CNN model to the English depression speech dataset DAIC-WOZ, where the test-set accuracy was 0.73. When the model was trained on both Chinese and English speech samples and tested on mixed-language speech, the accuracy was 0.74. We found that the CNN model can be applied across languages with relatively stable depression detection performance. This provides evidence that it is possible to develop a language-independent depression detection tool to support depression diagnosis and enable worldwide long-term mental health monitoring.


