Depression Level Prediction Using Deep Spatiotemporal Features and Multilayer Bi-LTSM

Md Azher Uddin, Joolekha Bibi Joolee, Young Koo Lee*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

59 Citations (Scopus)

Abstract

Depression is a serious psychiatric disorder that restricts an individuals ability to work properly in both their daily and professional lives. Usually, the diagnosis of depression often needs a thorough assessment by an expert. Recently, significant consideration has been given to automatic depression prediction for more reliable and efficient depression investigation. In this article, we propose a novel framework to estimate the depression level from video data by employing a two-stream deep spatiotemporal network. Our approach extracts spatial information using the Inception-ResNet-v2 network. In contrast, we introduce a volume local directional number (VLDN) based dynamic feature descriptor to capture facial motions. Then, the feature map obtained from the VLDN is fed into a convolutional neural network (CNN) to obtain more discriminative features. Additionally, we designed a multilayer bidirectional long short-term memory (Bi-LSTM) model to obtain temporal information by integrating the temporal median pooling (TMP) approach into the model. The TMP approach is employed on the temporal fragments of spatial and temporal features. Finally, extensive experimental analysis of two challenging datasets, AVEC2013 and AVEC2014, demonstrates that the proposed approach shows promising performance compared to the existing approaches for depression level prediction.

Original languageEnglish
Pages (from-to)864-870
Number of pages7
JournalIEEE Transactions on Affective Computing
Volume13
Issue number2
Early online date30 Jan 2020
DOIs
Publication statusPublished - 2022

Keywords

  • bidirectional long short-term memory
  • deep spatiotemporal network
  • Depression
  • Inception-ResNet-v2 network
  • temporal median pooling
  • volume local directional number

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction

Fingerprint

Dive into the research topics of 'Depression Level Prediction Using Deep Spatiotemporal Features and Multilayer Bi-LTSM'. Together they form a unique fingerprint.

Cite this