AUC PHILOLOGICA
AUC PHILOLOGICA

AUC Philologica (Acta Universitatis Carolinae Philologica) je akademický časopis publikující jak lingvistické, tak literárně historické a teoretické studie. Nedílnou součástí časopisu jsou i recenze odborných knih a zprávy z akademického prostředí.

Časopis je indexován v databázích CEEOL, DOAJ, EBSCO a ERIH PLUS.

AUC PHILOLOGICA, Vol 2017 No 3 (2017), 35–44

Temporal variability of fundamental frequency contours

Robin Hruška, Tomáš Bořil

DOI: https://doi.org/10.14712/24646830.2017.31
zveřejněno: 01. 09. 2017

Abstract

Intonation is one of the means of performing a speech style. Thus, observing pitch variation in an utterance may be a clue to identifying speech style. We design a cumulative slope (CS) index based upon the amount of pitch variation in a measured F0 contour and the duration of that contour. The more pitch changes there are and the greater their frequency range is, the greater the CS index is. This is confirmed by an experiment we conduct: the CS index of utterances with expressive intonation is higher than that of utterances with neutral intonation, and for utterances with neutral intonation the CS index is higher than for utterances with monotonous or flat intonation. However, as there is a great variability between speakers, the CS index as defined currently, cannot be used to universally differentiate between the styles. Results obtained using automatic voice activity detection (VAD) are close to those obtained with manual VAD and thus the extraction of CS index can be reliably automatized.

klíčová slova: fundamental frequency; melody of speech; stylization; variability of pitch contours

reference (14)

1. Bates, D., Maechler, M., Bolker, B. & Walker, S. (2015). Fitting Linear Mixed-Effects Models Using lme4. Journal of Statistical Software, 67(1), 1–48. CrossRef

2. Boersma, P. & Weenink, D. (2016). Praat: doing phonetics by computer [Computer program]. Version 6.0.14, retrieved 11 February 2016 from http://www.praat.org.

3. Bořil, T. & Skarnitzl, R. (2016). Tools rPraat and mPraat. In: P. Sojka, A. Horák, I. Kopeček & K. Pala (Eds.), Text, Speech, and Dialogue, 367–374. Berlin: Springer International Publishing. CrossRef

4. Fujisaki, H. & Ohno, S. (1997). Comparison and assessment of models in the study of fundamental frequency contours of speech. In: INT – 1997, 131–134.

5. Hermes, D.J. (2006). Stylization of pitch contours. In: S. Sudhoff et al. (Eds.), Methods in empirical prosody research, 29–61. Berlin: Walter de Gruyter. CrossRef

6. Hirst, D. & Espesser, R. (1993). Automatic modelling of fundamental frequency using a quadratic spline function. In: Travaux de l'Institut de Phonétique d'Aix, Vol. 15, 75–85.

7. Hruška, R. (2016). Properties of fundamental frequency contours in segmental contexts. Unpublished bachelor thesis. Prague: Institute of Phonetics, Faculty of Arts, Charles University.

8. Laan, G. P. M. (1996). The Contribution of Intonation, Segmental Durations, and Spectral Features to the Perception of a Spontaneous and a Read Speaking Style. Amsterdam: Institute of Phonetic Sciences, University of Amsterdam.

9. Mertens, P. & d'Alessandro, C. (1995). Pitch contour stylization using a tonal perception model. In: Proceedings of 13th International Congress of Phonetic Sciences, Vol. 4, 228–231.

10. Mertens, P. (2004). The Prosogram: Semi-Automatic Transcription of Prosody based on a Tonal Perception Model. In: B. Bel & I. Marlien (Eds.), Proceedings of Speech Prosody 2004, Nara (Japan): ISCA.

11. Palková, Z. (1997). Modelling intonation in Czech: Neutral vs. marked TTS F0-patterns. In: Intonation: Theory, Models, and Applications. Athens, Greece.

12. R Core Team (2017). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org.

13. Sohn, J., Kim, N. S. & Sung, W. (1999). A statistical model-based voice activity detection. IEEE Signal Processing Lett., 6 (1), 1–3. CrossRef

14. Volín, J. & Bořil, T. (2013). General and speaker-specific properties of F0 contours in short utterances. AUC Philologia 1/2014, Phonetica Pragensia XIII, 9–20.

Creative Commons License
Temporal variability of fundamental frequency contours is licensed under a Creative Commons Attribution 4.0 International License.

230 x 157 mm
vychází: 3 x ročně
cena tištěného čísla: 150 Kč
ISSN: 0567-8269
E-ISSN: 2464-6830

Ke stažení