Abstract
This paper presents the implementation of a practical voice recognition system using MATLAB (R2014b) to secure a given user's system so that only the user may access it. Voice recognition systems have two phases, training and testing. During the training phase, the characteristic features of the speaker are extracted from the speech signal and stored in a database. In the testing phase, the stored audio features of the test voice sample are compared with the voice samples in the database and determined if a match exists. For this research, Mel Frequency Cepstral Coefficients (MFCCs) were chosen to represent the feature vectors of the user's voice as it accurately simulates the behavior of the human ear. This characteristic of the MFCCs makes them an excellent measure of speaker characteristics. The feature matching process is then performed by subjecting the MFCCs to vector quantization using the LBG (Linde-Buzo-Gray) algorithm. In practical scenarios, noise is a major factor that adversely influences a voice recognition system. The paper addresses this issue by utilizing spectral subtraction to remove environmental noise affecting the speech signal thereby increasing the robustness of the system.
Original language | English |
---|---|
Title of host publication | 2017 International conference of Electronics, Communication and Aerospace Technology (ICECA) |
Publisher | IEEE |
Pages | 130-136 |
Number of pages | 7 |
ISBN (Electronic) | 9781509056866 |
ISBN (Print) | 9781509056859 |
DOIs | |
Publication status | Published - 18 Dec 2017 |
Keywords
- Mel Frequency Cepstral Coefficients (MFCCs)
- Speaker Identification
- Spectral Subtraction
- Vector Quantization (VQ)
- Voice Recognition
ASJC Scopus subject areas
- Computer Networks and Communications
- Aerospace Engineering
- Electrical and Electronic Engineering
- Instrumentation
- Safety, Risk, Reliability and Quality