Indonesian Automatic Speech Recognition For Command Speech Controller Multimedia Player
The purpose of multimedia devices development is controlling through voice. Nowdays voice that can be recognized only in English. To overcome the issue, then recognition using Indonesian language model and accousticc model and dictionary. Automatic Speech Recognizier is build using engine CMU Sphinx with modified english language to Indonesian Language database and XBMC used as the multimedia player. The experiment is using 10 volunteers testing items based on 7 commands. The volunteers is classifiedd by the genders, 5 Male & 5 female. 10 samples is taken in each command, continue with each volunteer perform 10 testing command. Each volunteer also have to try all 7 command that already provided. Based on percentage clarification table, the word â€œKananâ€ had the most recognize with percentage 83% while â€œpilihâ€ is the lowest one. The word which had the most wrong clarification is â€œkembaliâ€ with percentagee 67%, while the word â€œkananâ€ is the lowest one. From the result of Recognition Rate by male there are several command such as â€œKembaliâ€, â€œUtamaâ€, â€œAtas â€œ and â€œBawahâ€ has the low Recognition Rate. Especially for â€œkembaliâ€ cannot be recognized as the command in the female voices but in male voice that command has 4% of RR this is because the command doesnâ€™t have similar word in english near to â€œkembaliâ€ so the system unrecognize the command. Also for the command â€œPilihâ€ using the female voice has 80% of RR but for the male voice has only 4% of RR. This problem is mostly because of the different voice characteristic between adult male and female which male has lower voice frequencies (from 85 to 180 Hz) than woman (165 to 255 Hz).The result of the experiment showed that each man had different number of recognition rate caused by the difference tone, pronunciation, and speed of speech. For further work needs to be done in order to improving the accouracy of the Indonesian Automatic Speech Recognition system.
Keywords: Automatic Speech Recognizer, Indonesian Acoustic Model, CMU Sphinx, indonesian Language Model, Recognition Rate, XBMC.
Ehsani Farzad and Knodt Sehda, Speech Technology in Computerâ€Aided Language Leraning: Strengtgs and Limitation of new Call Paradigm, LLT Journal: Speech Technology in Computerâ€Aided Language Learning, Vol 2, No 1. Pp 54â€73. 1998.
IkaNovitaDewi, FahriFirdausillah, CaturSupriyanto, Sphinxâ€4 Indonesian Isolated Digit Speech Recognition, Journal of Theoretical and Applied Information Technology, Vol. 53 No.1, Eâ€ISSN: 1817â€3195, July 2013.
V. Ferdiansyah, and A. Purwarianti, Indonesian automatic speech recognition system using Englishâ€based acoustic model, American Journal of Signal Processing 2(4): 60â€63, 2012.
Huang Xuedong, Acero alex, Hon Hsiaoâ€Wuen. Spoken Language Processing:a guide to theoty, algorithm, and system development. Prentice Hall (New Jersey), Ed , pp 05â€06, 2001.
Raza, Agha Ali. Design and Development of an Automatic Speech Recognition System for Urdu. Thesis, FASTâ€National University of Computer and Emerging Sciences, Lahore Pakistan, 2009.
The copyright to this article is transferred to Politeknik Elektronika Negeri Surabaya(PENS) if and when the article is accepted for publication. The undersigned hereby transfers any and all rights in and to the paper including without limitation all copyrights to PENS. The undersigned hereby represents and warrants that the paper is original and that he/she is the author of the paper, except for material that is clearly identified as to its original source, with permission notices from the copyright owners where required. The undersigned represents that he/she has the power and authority to make and execute this assignment. The copyright transfer form can be downloaded here .
The corresponding author signs for and accepts responsibility for releasing this material on behalf of any and all co-authors. This agreement is to be signed by at least one of the authors who have obtained the assent of the co-author(s) where applicable. After submission of this agreement signed by the corresponding author, changes of authorship or in the order of the authors listed will not be accepted.
Retained Rights/Terms and Conditions
- Authors retain all proprietary rights in any process, procedure, or article of manufacture described in the Work.
- Authors may reproduce or authorize others to reproduce the work or derivative works for the author’s personal use or company use, provided that the source and the copyright notice of Politeknik Elektronika Negeri Surabaya (PENS) publisher are indicated.
- Authors are allowed to use and reuse their articles under the same CC-BY-NC-SA license as third parties.
- Third-parties are allowed to share and adapt the publication work for all non-commercial purposes and if they remix, transform, or build upon the material, they must distribute under the same license as the original.
Plagiarism screening will be conducted by EMITTER Journal Editorial Board using iThenticate Plagiarism Checker and CrossCheck plagiarism screening service. The author should download and sign the declaration of plagiarism form here and resubmit it with the copyright transfer form via online submission.