TY - JOUR
T1 - Using conditional random fields to predict pitch accents in conversational speech
AU - Gregory, Michelle L.
AU - Altun, Yasemin
N1 - Publisher Copyright:
© 2021 Proceedings of the Annual Meeting of the Association for Computational Linguistics. All Rights Reserved.
PY - 2004
Y1 - 2004
N2 - The detection of prosodic characteristics is an important aspect of both speech synthesis and speech recognition. Correct placement of pitch accents aids in more natural sounding speech, while automatic detection of accents can contribute to better word-level recognition and better textual understanding. In this paper we investigate probabilistic, contextual, and phonological factors that influence pitch accent placement in natural, conversational speech in a sequence labeling setting. We introduce Conditional Random Fields (CRFs) to pitch accent prediction task in order to incorporate these factors efficiently in a sequence model. We demonstrate the usefulness and the incremental effect of these factors in a sequence model by performing experiments on hand labeled data from the Switchboard Corpus. Our model outperforms the baseline and previous models of pitch accent prediction on the Switchboard Corpus.
AB - The detection of prosodic characteristics is an important aspect of both speech synthesis and speech recognition. Correct placement of pitch accents aids in more natural sounding speech, while automatic detection of accents can contribute to better word-level recognition and better textual understanding. In this paper we investigate probabilistic, contextual, and phonological factors that influence pitch accent placement in natural, conversational speech in a sequence labeling setting. We introduce Conditional Random Fields (CRFs) to pitch accent prediction task in order to incorporate these factors efficiently in a sequence model. We demonstrate the usefulness and the incremental effect of these factors in a sequence model by performing experiments on hand labeled data from the Switchboard Corpus. Our model outperforms the baseline and previous models of pitch accent prediction on the Switchboard Corpus.
UR - http://www.scopus.com/inward/record.url?scp=85149102760&partnerID=8YFLogxK
M3 - Artículo de la conferencia
AN - SCOPUS:85149102760
SN - 0736-587X
SP - 677
EP - 683
JO - Proceedings of the Annual Meeting of the Association for Computational Linguistics
JF - Proceedings of the Annual Meeting of the Association for Computational Linguistics
T2 - 42nd Annual Meeting of the Association for Computational Linguistics, ACL 2004
Y2 - 21 July 2004 through 26 July 2004
ER -