Bi-directional graph search strategies for speech recognition
Authors: Li Z.; Boulianne G.; Labute P.; Barszcz M.; Garudadri H.; Kenny P.
Source: Computer Speech & Language, Volume 10, Number 4, October 1996 , pp. 295-321(27)
Publisher: Academic Press
We describe a new search algorithm for speech recognition which applies the monotone graph search procedure to the problem of building a word graph. A first backward pass provides a method for estimating the word boundary times and phone segment boundary times needed to build the word graph using either the 1-phone or 2-phone lookahead assumptions. It also provides a heuristic for the search which satisfies the monotonicity condition. A second backward pass applies forward-backward pruning to the word graph. We show how the search can be made to run very quickly if the 1-phone lookahead assumption holds. We present the results of experiments performed on the 5000-word speaker-independent Wall Street Journal task under both the 1-phone and 2-phone lookahead assumptions. These results show that the 1-phone lookahead assumption leads to unacceptably large error rates for speaker-independent recognition using current acoustic phonetic modelling techniques. Finally, we give an account of the methods we have developed to process speech data in successive blocks so as to address the real-time issue and to control the memory requirements of the search.
Document Type: Miscellaneous
Affiliations: INRS-Telecommunications, Ile-des-Soeurs, Quebec, H3E 1H6, Canada
Publication date: 1996-10-01