Acoustically discriminative training for language models
Abstract
This paper introduces a discriminative training method for language models (LMs) that leverages phoneme similarities estimated from an acoustic model. To train an LM discriminatively, we need correct word sequences together with the recognition results that an Automatic Speech Recognition (ASR) system produces when processing utterances of those correct word sequences. However, sufficient utterances are not always available. We propose to generate the probable N-best lists that the ASR system may produce directly from the correct word sequences by leveraging the phoneme similarities. We call this process "Pseudo-ASR". We train the LM discriminatively by comparing the correct word sequences with the corresponding N-best lists from the Pseudo-ASR. Experiments with real-life data from a Japanese call center showed that an LM trained with the proposed method improved the accuracy of the ASR.
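The sketch below illustrates the Pseudo-ASR idea summarized in the abstract: corrupting the phoneme sequence of a correct transcript according to acoustically estimated phoneme-confusion probabilities and mapping it back to words, so as to simulate the errorful hypotheses an ASR system might output. The lexicon, confusion table, and sampling scheme are hypothetical placeholders for illustration only, not the paper's actual models or data.

```python
# Minimal Pseudo-ASR sketch (assumed toy lexicon and confusion matrix, not the
# paper's real resources): corrupt phonemes of a correct transcript, then decode
# back to words to form plausible "recognized" hypotheses for an N-best list.
import random

# Hypothetical pronunciation lexicon: word -> phoneme sequence.
LEXICON = {
    "kaigi": ["k", "a", "i", "g", "i"],
    "kaiki": ["k", "a", "i", "k", "i"],
    "denwa": ["d", "e", "N", "w", "a"],
}

# Hypothetical phoneme-confusion probabilities, as might be estimated from an
# acoustic model: P(recognized phoneme | spoken phoneme). Missing pairs keep
# the phoneme unchanged.
CONFUSION = {
    "g": {"g": 0.7, "k": 0.3},
    "k": {"k": 0.8, "g": 0.2},
}


def corrupt_phonemes(phonemes, rng):
    """Replace each phoneme by sampling from its confusion distribution."""
    out = []
    for p in phonemes:
        dist = CONFUSION.get(p, {p: 1.0})
        choices, weights = zip(*dist.items())
        out.append(rng.choices(choices, weights=weights)[0])
    return out


def decode_word(phonemes):
    """Map a phoneme sequence back to a lexicon word (exact match only here)."""
    for word, pron in LEXICON.items():
        if pron == phonemes:
            return word
    return None  # out-of-lexicon corruption; a real decoder would back off


def pseudo_asr_nbest(words, n=5, seed=0):
    """Generate up to n distinct pseudo-recognition results for a transcript."""
    rng = random.Random(seed)
    hypotheses = set()
    for _ in range(n * 10):  # oversample, keep the first n distinct hypotheses
        hyp = [decode_word(corrupt_phonemes(LEXICON[w], rng)) or w for w in words]
        hypotheses.add(" ".join(hyp))
        if len(hypotheses) >= n:
            break
    return sorted(hypotheses)


if __name__ == "__main__":
    # The correct transcript "kaigi denwa" may yield the confusable "kaiki denwa".
    print(pseudo_asr_nbest(["kaigi", "denwa"], n=3))
```

In a discriminative training setup of the kind the abstract describes, such pseudo N-best lists would stand in for real ASR output, letting the LM be trained to separate the correct word sequence from its acoustically confusable competitors without requiring recorded utterances.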