Enhanced Running Spectrum Analysis for Robust Speech Recognition Under Adverse Conditions: A Case Study on Japanese Speech

Publisher:
ECTI
Publication Type:
Journal Article
Citation:
ECTI Transactions on Computer and Information Technology (ECTI-CIT), 2017, 11, (1), pp. 82-90
Issue Date:
2017-07-02
Filename Description Size
document (1).pdfPublished version667.05 kB
Adobe PDF
Full metadata record
In any real environment, noises degrade the performance of Automatic Speech Recognition (ASR) systems. Additionally, in the case of similar pronunciations, it is not easy to realize a high accuracy of recognition. From  this point of view, our work envisions an enhanced algorithm processing a speech modulation spectrum, such as Running Spectrum Analysis (RSA). It was also adequately applied to observed speech data. In the envisioned method, a modulation spectrum filtering (MSF) method directly modified the observed cepstral modulation spectrum by a Fourier transform of the cepstral time frequency. The method and experiments carried out for various passbands had favorable results that showed an improvement of about 1-4 % in recognition accuracycompared to conventional methods.
Please use this identifier to cite or link to this item: