4aSC37. Automatic acquisition of speech units for recognition and very low bit coding.

Session: Thursday Morning, December 5

Time:

Author: Minoru Saito
Location: Dept. of Information and Comput. Sci., Toyohashi Univ. of Tech., 1-1 Hibarigaoka Tenpaku-cho, Toyohashi-shi, Aichi-ken, 441 Japan
Author: Mikio Masukata
Location: Dept. of Information and Comput. Sci., Toyohashi Univ. of Tech., 1-1 Hibarigaoka Tenpaku-cho, Toyohashi-shi, Aichi-ken, 441 Japan
Author: Seiichi Nakagawa
Location: Dept. of Information and Comput. Sci., Toyohashi Univ. of Tech., 1-1 Hibarigaoka Tenpaku-cho, Toyohashi-shi, Aichi-ken, 441 Japan

Abstract:

This paper describes a method for automatically acquiring speech units whose duration is approximately that of a syllable (or a phoneme). The number of units is fixed in advance, and the unit patterns are then updated iteratively under a minimum distortion criterion to obtain optimal units. To investigate whether units obtained in this way correspond to syllables, units were acquired automatically under the condition that both their number and their length match those of syllables. About half of the acquired units corresponded to syllables. When these units were used in 216-word recognition experiments, the recognition rate was higher than that of the syllable-based method, and increasing the number of units raised the word recognition rate to about 98%. At that setting the bit rate was still below 60 bps, so the method may also serve as a model for very low bit rate coding.
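
The iterative update described above can be illustrated with a minimal sketch. The abstract does not specify the segment representation, the distortion measure, or the initialization, so everything here is an assumption: segments are fixed-length feature vectors, distortion is squared Euclidean distance, and the update is a plain k-means-style loop (a swapped-in stand-in, not the authors' procedure). The names `acquire_units` and `bit_rate`, and the figures of 256 units and 7 units per second, are hypothetical and chosen only to make the arithmetic concrete.

```python
import math
import random


def sq_dist(a, b):
    """Squared Euclidean distance, used here as the (assumed) distortion measure."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(segment, units):
    """Index of the unit pattern with minimum distortion to the segment."""
    return min(range(len(units)), key=lambda k: sq_dist(segment, units[k]))


def acquire_units(segments, num_units, iterations=20, seed=0):
    """k-means-style sketch: the number of units is fixed beforehand and the
    unit patterns are updated repeatedly to reduce average distortion."""
    rng = random.Random(seed)
    # Assumption: initialize the unit patterns from randomly chosen segments.
    units = [list(s) for s in rng.sample(segments, num_units)]
    for _ in range(iterations):
        labels = [nearest(s, units) for s in segments]
        for k in range(num_units):
            members = [s for s, lab in zip(segments, labels) if lab == k]
            if members:  # keep the old pattern if no segment was assigned
                units[k] = [sum(col) / len(members) for col in zip(*members)]
    labels = [nearest(s, units) for s in segments]
    distortion = sum(sq_dist(s, units[lab])
                     for s, lab in zip(segments, labels)) / len(segments)
    return units, labels, distortion


def bit_rate(num_units, units_per_second):
    """Bits per second when each unit index is sent as a fixed-length code."""
    return math.ceil(math.log2(num_units)) * units_per_second


if __name__ == "__main__":
    # Toy two-dimensional "segments" forming two obvious groups.
    toy = [[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]]
    units, labels, distortion = acquire_units(toy, num_units=2)
    print(labels, distortion)
    # Hypothetical figures: 256 units at 7 units per second.
    print(bit_rate(256, 7))
```

Under these assumptions, coding each unit as an index into the unit inventory costs about log2 of the inventory size in bits, which is why an inventory of a few hundred syllable-length units can plausibly stay below 60 bps.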

ASA 132nd meeting - Hawaii, December 1996