Automatic phone segmentation and labeling of continuous speech
SCIE
SCOPUS
- Title
- Automatic phone segmentation and labeling of continuous speech
- Authors
- Chagyun Jeong; Jeong, H
- Date Issued
- 1996-01
- Publisher
- ELSEVIER SCIENCE BV
- Abstract
- To obtain an accurate phone sequence from a continuous speech signal, we suggest a novel approach consisting of tightly coupled bottom-up and top-down processing. The bottom-up path consists of segmentation, recognition and labeling. Also the top-down path consists of labeling, speech generation and segmentation. In this manner, the four processes form a closed feedback loop achieving an optimal interpretation efficiently for a given noisy observation of speech signal and a priori knowledge. The major goal of this paper is to identify the system model using both the stochastic estimation theory and the mean field theory. Experimental results are obtained in terms of the TIMIT database. It is shown that introducing the top-down path to the traditional bottom-up path can improve the recognition rate by 19.7%, and reduce the error (substitution, deletion and insertion) rate by 16.1%. As a result, the overall system can transform the incoming continuous signal into one of the 61 phone classes at the rate of 73.7%.
- URI
- https://oasis.postech.ac.kr/handle/2014.oak/32845
- DOI
- 10.1016/S0167-6393(96)00064-7
- ISSN
- 0167-6393
- Article Type
- Article
- Citation
- SPEECH COMMUNICATION, vol. 20, page. 291 - 311, 1996-01
- Files in This Item:
- There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.