Hidden Markov models (HMMs) isolated word recognizer with the optimization of acoustical analysis and modeling techniques

Mondher Frikha; Ahmed Ben Hamida; Mongi Lahiani

doi:10.5897/IJPS.9000291

International Journal of
Physical Sciences

Abbreviation: Int. J. Phys. Sci.
Language: English
ISSN: 1992-1950
DOI: 10.5897/IJPS
Start Year: 2006
Published Articles: 2572

Full Length Research Paper

Hidden Markov models (HMMs) isolated word recognizer with the optimization of acoustical analysis and modeling techniques

Mondher Frikha1*, Ahmed Ben Hamida1 and Mongi Lahiani2

1Advanced Technologies for Medical and Signals (ATMS) Research Unit, National School of Engineering of Sfax, Route, B. P. W, Sfax, Tunisia 2Laboratory of Electronics and Information Technologies (LETI), National School of Engineering of Sfax, B.P.W, Sfax, Tunisia.
Email: [email protected]

Article Number - 53DB8A819300
Vol.6(22), pp. 5064-5074 , October 2011
https://doi.org/10.5897/IJPS.9000291

Published: 02 October 2011

Copyright © 2024 Author(s) retain the copyright of this article.
This article is published under the terms of the Creative Commons Attribution License 4.0.

Abstract

Most state of the art automatic speech recognition (ASR) systems are typically based on continuous Hidden Markov Models (HMMs) as acoustic modeling technique. It has been shown that the performance of HMM speech recognizers may be affected by a bad choice of the type of acoustic feature parameters in the acoustic front end module. For these reasons, we propose in this paper a dedicated isolated word recognition system based on HMMs which was carefully optimized specifically at the acoustic analysis and HMM acoustical modeling levels. Such conception was tested and valued on Hidden Markov model toolkit platform (HTK). Systems performances were evaluated using the TIMIT database. One comparative study was carried out using two types of speech analysis: The cepstral method referred to as Mel frequency cepstral coefficients (MFCC) and the perceptual linear predictive (PLP) coding are used for different tests so as to evaluate and reinforce our conception. The frame shift duration effect of the acoustic analysis as well as the addition of the dynamic coefficients of the acoustic parameters (MFCC and PLP) were carefully tested in order to look for high accuracy for our optimized isolated word recognition (IWR) system. Finally, various experiments related to the HMM topology have been carried out in order to get better recognition accuracies. In fact, the effect of some modeling parameters of HMM on the recognition accuracy of the IWR system such as the number of states as well as the number of Gaussian mixtures were analyzed in order to get the optimal HMM topology.

Key words: Isolated word recognition, perceptual linear predictive (PLP) coding, Mel frequency cepstral coefficients (MFCC) PLP, HMM, Hidden Markov model toolkit platform (HTK).

Abbreviation

Abbreviations: ASR, automatic speech recognition; HMM, hidden Markov models;MFCC, mel frequency cepstral coefficient; PLP, perceptual linear predictive; IWR, Isolated word recognition; FIR, finite impulse response; P(W/O), posteriori probability.

This article is published under the terms of the Creative Commons Attribution License 4.0

Back to Vol. 6 No. 22

Back to articles

Views: 0
Downloads: 0

Related Articles:
On Google
On Google Scholar

Articles on Google by:

International Journal of Physical Sciences

Hidden Markov models (HMMs) isolated word recognizer with the optimization of acoustical analysis and modeling techniques

Mondher Frikha1*, Ahmed Ben Hamida1 and Mongi Lahiani2

International Journal of
Physical Sciences