Deep Spatiotemporal Network Based Indian Sign Language Recognition from Videos

Md Azher Uddin*, Ryan Denny, Joolekha Bibi Joolee

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The deaf community has substantial obstacles because of the communication barrier with hearing individuals. The traditional method of relying on sign language interpreters is not a cost-effective solution to address this issue. Existing systems for dynamic sign language recognition employ the CNN-LSTM framework, which has achieved reasonable performance. However, relying solely on spatial features extracted through CNN is inadequate for accurate recognition of sign language words. In this study, we propose a novel end-to-end deep spatiotemporal network for recognizing Indian sign language from videos. Our framework combines the extraction of deep spatial features using Inception-ResNet-V2 and the utilization of handcrafted spatiotemporal features obtained from the application of Volume Local Directional Number (VLDN). Furthermore, we introduce a new encoder-decoder network based on Long Short-Term Memory (LSTM) to effectively learn the spatiotemporal features. Lastly, we conduct a comprehensive experiment to demonstrate the performance of our proposed method.

Original languageEnglish
Title of host publicationProceedings of International Conference on Information Technology and Applications
EditorsAbrar Ullah, Sajid Anwar, Davide Calandra, Raffaele Di Fuccio
PublisherSpringer
Pages171-181
Number of pages11
ISBN (Electronic)9789819983247
ISBN (Print)9789819983230
DOIs
Publication statusPublished - 18 Mar 2024
Event16th International Conference on Information Technology and Applications 2022 - Lisbon, Portugal
Duration: 20 Oct 202222 Oct 2022

Publication series

NameLecture Notes in Networks and Systems
Volume839
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

Conference16th International Conference on Information Technology and Applications 2022
Abbreviated titleICITA 2022
Country/TerritoryPortugal
CityLisbon
Period20/10/2222/10/22

Keywords

  • Inception-ResNet-V2
  • Indian sign language recognition
  • LSTM-based encoder-decoder network
  • Volume local directional number

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Signal Processing
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Deep Spatiotemporal Network Based Indian Sign Language Recognition from Videos'. Together they form a unique fingerprint.

Cite this