MLFNN algorithm for speech classification using mmfc feature extraction in a smart wheelchair
Corressponding author's email:
haint@hcmute.edu.vnKeywords:
MMFC feature extraction, Speech classification, Hamming low-pass filter and Multiplayer Neural NetworksAbstract
This paper proposes a Multilayer Feed forward Neural Network (MLFNN) for speech classification in a smart electric wheelchair, in which with extraction of speech commands is performed using a Mel Frequency Cepstral Coefficients (MMFC) method. Speech commands recognized here are Left, Right, Forward, Backward and Stop. In addition, a Hamming low- pass filter is applied to reduce noise before feature extraction. Therefore, the recognized signals will be used to control the electric wheelchair. Results of this study possibly support disabled people using the wheelchair to move easily and more convenient in everyday life and also show to illustrate the effectiveness of the proposed approach.
Downloads: 0
References
Bhupinder Singh, Neha Kapur, Puneet Kaur, “Speech Recognition with Hidden Markov Model: A Review”, International Journal of Advanced in Computer Science and Soft Engineering, Vol. 2, pp. 400-403, 2012.
Dipmoy Gupta, Radha Mounima C. Navya Manjunath, Manoj PB, “Isolated Word Speech Recognition Using Vector Quantization”,International Journal of Advanced in Computer Science and Soft Engineering, Vol. 2, pp. 164-168, 2012.
Talal Bin Amin, Iftekhar Mahmood, “Speech Recognition Using Dynamic Time Warping”,ICAST, 2008.
A.Revathi, R.Ganapathy and Y.Venkataramani, “Text Independent Speaker Recognition and Speaker Independent Speech Recognition UsingIterative Clustering Approach”, International Journal of Computer science & Information Technology, Vol. 1, No 2, pp. 30-42, 2009.
Lindasalwa Muda, Mumtaj Begam and I. Elamvazuthi, “Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques”, Journal Of Computing, Vol. 2, Issue 3, pp. 138 143, 2010.
Kohei Arai, Ronny Mardiyanto, Eyes Based Eletric Wheel Chair Control System, International Journal of Advanced Computer Science and Applications, 2011.
S. M. Azam, Z.A. Mansoor, M. Shahzad Mughal, S. Mohsin, “Urdu Spoken Digits Recognition Using Classified MFCC and Backpropgation Neural Network”, Computer Graphics, Imaging and Visualisation IEEE, Vol.1, pp. 7695-2928, 2007.
Chin Luh Tan and Adznan Jantan, “Digit Recognition Using Neural Networks”, Malaysian Journal of Computer Science, Vol. 17 No. 2,pp. 40-54, 2004.
Chin Luh Tan and Adznan Jantan, “Digit Recognition Using Neural Networks”, Malaysian Journal of Computer Science, Vol. 17 No. 2, pp2012. 40-54, 2004.
S. M. Azam, Z.A. Mansoor, M. Shahzad Mughal, S. Mohsin, “Urdu Spoken Digits Recognition Using Classified MFCC and Backpropgation Neural Network”, Computer Graphics, Imaging and Visualisation IEEE, Vol.1, pp. 0-7695-2928, 2007.
R. Rojas, Neural Networks. Berlin: Springer-Verlag, 1996.
Fu-Hua Liu; Richard M. Stern; Xuedong Huang; Alejandro Acero, “Efficient cepstral normalization for robust speech recognition, human language technology”, Proceedings of a Workshop Held at Plainsboro, New Jersey, pp. 21-24, 1993.
Hachkar et al., “A Comparison of DHMM and DTW for Isolated Digits Recognition System of Arabic Language”, International J.on Computer Science and Enginerring, 2011.
Downloads
Published
How to Cite
Issue
Section
Categories
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Copyright © JTE.


