Simon Leglaive
CentraleSupélec
Can you guess to what “speech sound” each bloc corresponds?
Elementary speech sounds are called phonemes.
Image credit: https://www.englishclub.com/pronunciation/phonemic-chart.htm
We consider as articulator any mobile part of the vocal tract on which we can act voluntarily and which is used in the production of speech sounds.
Real-time MRI scan of a person talking.
What are the three main speech articulators?
Tongue
Jaw
Lips
We distinguish 3 types of sound sources, which can be combined or occur individually:
Quasi-periodic source resulting from the vibration of the vocal folds.
We say that the sound is voiced.
It can be arbitrarily long (in the limits of an exhalation).
Fricative noise source produced by a turbulent airflow with a constriction in the vocal tract.
It can also be arbitrarily long.
Plosive noise source produced by quick occlusions of the vocal tract and generating an acoustic impulse.
Here the duration is short.
Credits: Joe Wolfe, https://vimeo.com/234805962, more info at https://www.animations.physics.unsw.edu.au/waves-sound/human-sound/index.html
Average pitch (Hz) | Pitch range (Hz) | |
Male | 100 - 130 | 90 - 270 |
Female | 150 - 300 | 120 - 360 |
Child | 350 - 400 | 200 - 600 |
Credits: Joe Wolfe, https://vimeo.com/128430263, more info at https://www.animations.physics.unsw.edu.au/waves-sound/human-sound/index.html
Credits: Joe Wolfe, https://vimeo.com/128430264, more info at https://www.animations.physics.unsw.edu.au/waves-sound/human-sound/index.html
Opening of the mouth
“Frontness” of the tongue
Rounding of the lips
Nasalization: sound comes out of the mouth only, or out of the mouth and nose.
Vowel chart with audio: https://en.wikipedia.org/wiki/IPA_vowel_chart_with_audio
We can distinguish between vowels using the position of the first formants
By moving articulators, the shape of the vocal tract varies, formants move in frequency, and vowels change.
Image credit: L.R. Rabiner and R.W. Shafer, “Digital Processing of Speech Signals”, Prentice-Hall, 1978
male speakers
male and children speakers
Fricatives
Plosives
Nasal
Liquids
Consonant chart with audio: https://en.wikipedia.org/wiki/IPA_pulmonic_consonant_chart_with_audio
Fricatives
Plosives
Nasal
Liquids
Consonant chart with audio: https://en.wikipedia.org/wiki/IPA_pulmonic_consonant_chart_with_audio
Go to https://app.wooclap.com/CXIOJL
Prosodic variables:
Different combinations of these variables are exploited for intonation and accentuation.
Prosody may reflect various features of the speaker or the utterance:
Male or female speaker?
Go to https://app.wooclap.com/CXIOJL and find the vowel that corresponds to each spectrum, using the above French vocal triangle.
We could have done the same from a spectrogram representation.
With a bit of practice you could be able to decode this mystery spectrogram.
1 bonus point if you decode the message 😉.
Introduction to voice acoustics by Joe Wolfe, Emeritus Professor at the University of New South Wales (Syndney, Australia):
https://newt.phys.unsw.edu.au/jw/voice.html
Analysis, transformation and synthesis of speech signals with the source-filter model and linear predictive coding
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
Esc | Back to slideshow |