Audio-visual speech enhancement with AVCDCN (audio-visual codebook dependent cepstral normalization)

Sabine Deligne; Gerasimos Potamianos; Chalapathy Neti

Publication

ICSLP 2002

Conference paper

Audio-visual speech enhancement with AVCDCN (audio-visual codebook dependent cepstral normalization)

ICSLP 2002

Abstract

In this paper, we introduce a non-linear enhancement technique called Audio-Visual Codebook Dependent Cepstral Normalization (AVCDCN) and we consider its use with both audio-only and audio-visual speech recognition. AVCDCN is inspired from CDCN [1] [2], an audio-only enhancement technique that approximates the non-linear effect of noise on speech with a piece-wise constant function. Our experiments show that the use of visual information in AVCDCN allows significant performance gains over CDCN.

Date

16 Sep 2002

Publication

ICSLP 2002

Authors

IBM-affiliated at time of publication

Abstract

Date

Publication

Authors

Share