Please use this identifier to cite or link to this item:
https://hdl.handle.net/10316/102726
Title: | Broad phonetic class definition driven by phone confusions | Authors: | Lopes, Carla Perdigão, Fernando |
Keywords: | Confusion Matrix; Conditional Random Field; Frame Error Rate; Discriminative Training; Context Window | Issue Date: | 2012 | Project: | FCT - PhD Grant (SFRH/BD/27966/2006) | Serial title, monograph or event: | Eurasip Journal on Advances in Signal Processing | Volume: | 2012 | Issue: | 1 | Abstract: | Intermediate representations between the speech signal and phones may be used to improve discrimination among phones that are often confused. These representations are usually found according to broad phonetic classes, which are defined by a phonetician. This article proposes an alternative data-driven method to generate these classes. Phone confusion information from the analysis of the output of a phone recognition system is used to find clusters at high risk of mutual confusion. A metric is defined to compute the distance between phones. The results, using TIMIT data, show that the proposed confusion-driven phone clustering method is an attractive alternative to the approaches based on human knowledge. A hierarchical classification structure to improve phone recognition is also proposed using a discriminative weight training method. Experiments show improvements in phone recognition on the TIMIT database compared to a baseline system. | URI: | https://hdl.handle.net/10316/102726 | ISSN: | 1687-6180 | DOI: | 10.1186/1687-6180-2012-158 | Rights: | openAccess |
Appears in Collections: | I&D IT - Artigos em Revistas Internacionais FCTUC Eng.Electrotécnica - Artigos em Revistas Internacionais |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Broad-phonetic-class-definition-driven-by-phone-confusionsEurasip-Journal-on-Advances-in-Signal-Processing.pdf | 1.19 MB | Adobe PDF | View/Open |
SCOPUSTM
Citations
11
checked on Sep 23, 2024
WEB OF SCIENCETM
Citations
17
checked on Oct 2, 2024
Page view(s)
81
checked on Oct 1, 2024
Download(s)
29
checked on Oct 1, 2024
Google ScholarTM
Check
Altmetric
Altmetric
This item is licensed under a Creative Commons License