Bucar Shigemori, Lia Saki; Reichel, Uwe D. and Schiel, Florian (2013): Predictability of the effects of phoneme merging on speech recognition performance by quantifying phoneme relations. Elektronische Sprachsignalverarbeitung ESSV 2013, Bielefeld, 26-28 March 2013. In: Wagner, Petra (Ed.): Elektronische Sprachsignalverarbeitung 2013, Dresden: TUDpress. pp. 247-253

Abstract

To investigate whether the impact of phoneme merging on recognition rate can be predicted, different measures to quantify the relationship between two phonemes a and b were compared: (1) the functional load of their opposition, (2) the bigram type preservation, (3) their information radius, (4) their distance within an information gain tree induced from a distinctive feature matrix, and (5) the symmetric Kullback-Leibler divergence. For each of 25 phoneme pairs we trained a speech recognizer on data in which the respective pair was merged. Based on correlation analyses and predictor selection in stepwise regression modelling we found that the impact of phoneme merging on accuracy can tentatively be captured in terms of functional load and tree distance between the merged phonemes.
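
Two of the measures named above, the information radius (3) and the symmetric Kullback-Leibler divergence (5), have standard textbook definitions over discrete probability distributions. The sketch below illustrates those textbook forms only. The abstract does not say which distributions the authors compared or how they were estimated, so the function names and the toy distributions `p_a` and `p_b` are hypothetical and serve purely as illustration.

```python
import math

def kl(p, q):
    """Kullback-Leibler divergence D(p || q) in bits.

    Assumes p and q are discrete distributions over the same outcomes
    and that q[i] > 0 wherever p[i] > 0.
    """
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def symmetric_kl(p, q):
    """Symmetric KL divergence: D(p || q) + D(q || p)."""
    return kl(p, q) + kl(q, p)

def information_radius(p, q):
    """Information radius: D(p || m) + D(q || m), with m the mean of p and q.

    Always finite, bounded by 2 bits, and equal to twice the
    Jensen-Shannon divergence.
    """
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return kl(p, m) + kl(q, m)

# Hypothetical toy distributions standing in for two phonemes a and b,
# e.g. over three made-up context classes (not data from the paper).
p_a = [0.5, 0.3, 0.2]
p_b = [0.2, 0.3, 0.5]

skl = symmetric_kl(p_a, p_b)
irad = information_radius(p_a, p_b)
print(f"symmetric KL: {skl:.4f} bits, information radius: {irad:.4f} bits")
```

Both quantities are zero for identical distributions and grow as the distributions diverge. Unlike the symmetric KL divergence, the information radius stays finite when the two distributions have different support, which is one practical reason to consider both.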
