TY - JOUR
T1 - Effects of disfluencies, predictability, and utterance position on word form variation in English conversation
AU - Bell, Alan
AU - Jurafsky, Daniel
AU - Fosler-Lussier, Eric
AU - Girand, Cynthia
AU - Gregory, Michelle
AU - Gildea, Daniel
PY - 2003/2/1
Y1 - 2003/2/1
AB - Function words, especially frequently occurring ones such as the, that, and, and of, vary widely in pronunciation. Understanding this variation is essential both for cognitive modeling of lexical production and for computer speech recognition and synthesis. This study investigates which factors affect the forms of function words, especially whether they have a fuller pronunciation (e.g., ði, ðæt, ænd, ʌv) or a more reduced or lenited pronunciation (e.g., ðə, ðɨt, n, ə). It is based on over 8000 occurrences of the ten most frequent English function words in a 4-h sample from conversations from the Switchboard corpus. Ordinary linear and logistic regression models were used to examine variation in the length of the words, in the form of their vowel (basic, full, or reduced), and whether final obstruents were present or not. For all these measures, after controlling for segmental context, rate of speech, and other important factors, there are strong independent effects that made high-frequency monosyllabic function words more likely to be longer or have a fuller form (1) when neighboring disfluencies (such as filled pauses uh and um) indicate that the speaker was encountering problems in planning the utterance; (2) when the word is unexpected, i.e., less predictable in context; (3) when the word is either utterance initial or utterance final. Looking at the phenomenon in a different way, frequent function words are more likely to be shorter and to have less-full forms in fluent speech, in predictable positions or multiword collocations, and utterance internally. Also considered are other factors such as sex (women are more likely to use fuller forms, even after controlling for rate of speech, for example), and some of the differences among the ten function words in their response to the factors.
UR - http://www.scopus.com/inward/record.url?scp=0037324538&partnerID=8YFLogxK
U2 - 10.1121/1.1534836
DO - 10.1121/1.1534836
M3 - Article
C2 - 12597194
AN - SCOPUS:0037324538
SN - 0001-4966
VL - 113
SP - 1001
EP - 1024
JO - Journal of the Acoustical Society of America
JF - Journal of the Acoustical Society of America
IS - 2
ER -