Speech-Based Depression Detection for Bahasa Malaysia Female Speakers Using Deep Learning

Mugahed Al-Ezzi Ahmed Ezzi; Nik Nur Wahidah Nik Hashim; Nadzirah Ahmad Basri; Siti Fauziah Toha

Authors

Mugahed Al-Ezzi Ahmed Ezzi, Department of Mechatronics, Faculty of Engineering, International Islamic University Malaysia, Selangor, Malaysia
Nik Nur Wahidah Nik Hashim, Department of Mechatronics, Faculty of Engineering, International Islamic University Malaysia, Selangor, Malaysia
Nadzirah Ahmad Basri, Department of Psychiatry, Kulliyyah of Medicine, International Islamic University Malaysia, Kuantan, Malaysia
Siti Fauziah Toha, Department of Mechatronics, Faculty of Engineering, International Islamic University Malaysia, Selangor, Malaysia

Abstract

Depression is a highly prevalent mental disorder that negatively affects individuals, society, and the economy. Traditional clinical diagnosis methods are subjective and require extensive expert involvement. Furthermore, the severe shortage of psychiatrists relative to the population in Malaysia delays patients in seeking treatment and contributes to poor compliance with follow-up. The social stigma of visiting psychiatric clinics also prevents patients from seeking early treatment. Automatic depression detection from speech signals is promising because it is fast, convenient, and non-invasive. This research aims to develop an end-to-end deep learning model to classify depression from female Bahasa Malaysia speech using our own dataset. Depression status was identified by the Patient Health Questionnaire 9, the Malay Beck Depression Inventory-II, and subjects’ declaration of a Major Depressive Disorder diagnosis by a trained clinician. The dataset consists of 110 female participants. We provide a detailed implementation of deep learning models that take raw audio as input. Multiple combinations of speech types were analyzed using various deep neural network models. After hyperparameter tuning, the AttCRNN model applied to raw audio from the combination of female read and spontaneous speech achieved an accuracy of 91%.
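
The abstract does not expand "AttCRNN" or describe its layers. Assuming it denotes a convolutional recurrent network followed by an attention layer, the attention step can be illustrated by the generic sketch below, which pools per-frame feature vectors into a single utterance-level vector using softmax weights. The function name `attention_pool`, the scoring vector, and the toy inputs are all hypothetical and are not taken from the paper.

```python
import math

def attention_pool(frames, score_weights):
    """Collapse a sequence of frame vectors into one utterance vector.

    frames: list of T feature vectors (each a list of D floats), standing in
            for the per-frame outputs of a recurrent layer (assumption).
    score_weights: list of D floats, a hypothetical learned scoring vector.
    """
    # One scalar relevance score per frame (dot product with the scoring vector).
    scores = [sum(f * w for f, w in zip(frame, score_weights)) for frame in frames]
    # Numerically stable softmax over time, so the weights sum to 1.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Weighted sum of frames: higher-scoring frames contribute more.
    dim = len(frames[0])
    pooled = [sum(w * frame[d] for w, frame in zip(weights, frames)) for d in range(dim)]
    return pooled, weights

# Toy input: three frames with two features each (illustrative values only).
frames = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
pooled, weights = attention_pool(frames, [0.5, 0.5])
```

In a full model the pooled vector would typically feed a small classifier that outputs the depressed or non-depressed label. Whether the authors' network follows this exact form is not stated on this page.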

Published

2021-10-15

How to Cite

Ahmed Ezzi, M. A.-E., Nik Hashim, N. N. W., Ahmad Basri, N., & Toha, S. F. (2021). Speech-Based Depression Detection for Bahasa Malaysia Female Speakers Using Deep Learning. ELEKTRIKA- Journal of Electrical Engineering, 20(2-3), 1–6. Retrieved from https://elektrika.utm.my/index.php/ELEKTRIKA_Journal/article/view/318

Issue

Vol. 20 No. 2-3 (2021): Special Issue on Instrumentation & Robotics

Section

Articles

License

Copyright of articles that appear in Elektrika belongs exclusively to Penerbit Universiti Teknologi Malaysia (Penerbit UTM Press). This copyright covers the rights to reproduce the article, including reprints, electronic reproductions, or any other reproductions of similar nature.