This is a university-level introduction to Prosody, covering both fundamentals and applications, totalling 4 hours of videos across 29 lectures. This is a updated version of a tutorial presented at the 2021 Meeting of the Association for Computational Linguistics, Prosody: Models, Methods, and Applications.
Nigel G. Ward, Gina-Anne Levow
Watch     Download Powerpoint
Topics Include: • Human Speech beyond Words • Core Prosody: The Musical Aspects of Speech • Other Prosodic Properties • Overview of the Tutorial Series • Relevance for Students, Teachers, Engineers, Scientists, Clinicians ... • About the Instructors
Watch     Download Powerpoint
Topics Include: • Sincerity and Positive Assessment • The Power of Prosody, for You, your Students, and your Technical Creations • Call Centers: Prosody and the Human Touch • Why this Tutorial: Recent Advances and Ongoing Challenges
Watch     Download Powerpoint
Topics Include: • Speech To Text, Text To Speech • Recognizing Individuals and their Mental States • Dialog Systems, Tutoring, etc. • Big Data and Deep Learning Approaches to Prosody • The Continuing need for Knowledge of Prosody
Watch     Download Powerpoint
Topics Include: • Airflow and the Larynx • States of the Glottis: Breathing and Speaking • Vocal Fold Vibration
See Also:
Watch     Download Powerpoint
Topics Include: • Waveforms and Pitch Periods • The Fundamental Frequency (F0) • Pitch Perception can be Hard • F0 Contours and How to Read Them • Divergences between Pitch Percepts and Computed F0 • Microprosody • The Articulatory-Perceptual-Acoustic Troika
Watch     Download Powerpoint
Topics Include: • Loudness • Rate and Timing • Reduction • Nasalization
Watch     Download Powerpoint
Topics Include: • Introduction • Creaky Voice, aka Vocal Fry • Breathy Voice • Modal Voice • Highly Harmonic Voice • Falsetto • The Periodicity Dimension • Summary: Prosody is more than Intonation
Watch     Download Powerpoint
Topics Include: • Perception Exercises • Production Exercises • Imitation Exercise
Watch     Download Powerpoint
Topics Include: • Psychoacoustics • Loudness and Intensity • Pitch Perception: Ear and Brain • Pitch Scales in Music and Speech • An Example that can be Perceived as High or Low • An Example that can be Perceived as Rising or Falling • Pitch Perception Myths • The Elusiveness of Percepts • Limitations of Human Audio Perception • Ways to Compensate
Watch     Download Powerpoint
Topics Include: • Introduction • Prosodic Features in Phonemic Contrasts • The Four Tones of Mandarin • Tone Realization and Tone Notations • Tone Serving to Convey Grammatical Functions • Stress and its Realization • Stress Differences Across Languages • Stress Patterns and Rhythm
Watch     Download Powerpoint
Topics Include: • Sequences of Units • Co-articulation • Tone Sequences and Tone Sandhi • Stress Shift • Semantic Relations • Summary
Watch     Download Powerpoint
Topics Include: • Introduction • Prosodic Words • Disambiguation through Prosody • Phrasing and Boundaries • Prominence
Watch     Download Powerpoint
Topics Include: • Why Visualize? • Pitch Contours • Smoothing • Pitch Levels • Pitch Turning Points • Two Strategies: Tidying and Extracting • Tadpole Notation • ToBI Notation • Cautions Regarding Visualizations
Watch     Download Powerpoint
Topics Include: • Features are Measurements • Low-Level, Mid-Level, and Meaningful Features • Technical Pitfalls • Computed Features at best Correlate with Perceptions • The Irreducible Subjectivity of Perceptions
Watch     Download Powerpoint
Topics Include: • Ideal and Real Pitch Contours • Pitch as a Frame-Level (Low-Level) Feature • Popular Pitch Trackers • Stages of Pitch Inference • Setting the Pitch Range • Microprosody • Smoothing • Setting the Voicing Threshold • Output Scales: Linear, Logarithmic, and Percentile • Pitch, No Pitch, and Voicing Probability Estimates
Watch     Download Powerpoint
Topics Include: • Compensating for Speaker Differences • Pitch Value Distributions • Subtracting Minimums • Subtracting Averages • Standard Deviations and Z-Normalization • Percentiles • Normalizing in General • Normalizing Intensity • General Cautions
Watch     Download Powerpoint
Topics Include: • Midlevel Features are Computed by Aggregation • Unit-Aligned Features • Unaligned Features • Multistream Features, Illustrated with Relative Peak Locations • Robustness Issues, Illustrated with Pitch Height • Pitch Range Issues • Potentially Thousands of Midlevel Features • Open-Source Feature Sets and Toolkits • Some Less-Known Features
Watch     Download Powerpoint
Topics Include: • Speech Recognition and Prosody • The Concept of an Independent Prosody Module • Unit-Linked Prosody is Less Independent than it Once Seemed • Modeling Prosodic Effects on Sound-Phoneme Mappings • Summary of Lessons Learned • Speech Recognition Today, and Unmet Needs
Watch     Download Powerpoint
Topics Include: • Machine Learning needs Features • Different Types of Features • Using Meaningful Features • Using Midlevel Features • Using Frame-Level (Low-Level) Features • Using Filterbank and other Generic Features • Using Features from Pretrained Models • Feature Set Choices for Common Tasks • Summary
Watch     Download Powerpoint
Topics Include: • Prosody as a Direct Reflection of Mental States • Indicators of Emotions • Speech Production Mechanisms • Markers of Depression • Speaker Identification • Other States and Traits • Methods for Paralinguistic Inference • Differences between Phonological and Paralinguistic Prosody
Watch     Download Powerpoint
Topics Include: • Conveying Pragmatic Functions as a Third Realm of Prosody • The Prosody of Positive Assessment • Correlations and Beyond • A Prosodic Configuration: The Positive Assessment Construction • The Late Peak Construction: Implying, Questioning, and Being Polite • The Minor Third Construction: Cueing Action • Summary: Multistream Temporal Configurations
See Also:
Watch     Download Powerpoint
Topics Include: • Prosodic Constructions for Pragmatic Functions • Their Gradient Nature • Their Flexibility of Alignment • Their Nature as Direct Form-Function Mappings • A Continuum from General to Specific • Direct versus Mediated (Symbolic) Mappings • Joint Constructions, such as Backchanneling • Superposition • The Many Pragmatic Functions of Prosody
Watch     Download Powerpoint
Topics Include: • The Frequency Code • Cross-Realm Entanglements with Creaky Voice • How Prosody Can do so Much • Differences Among the Realms • On the Independence of Prosody • Sarcasm through Prosodic-Lexical Meaning Mismatches
Watch     Download Powerpoint
Topics Include: • Prosody for Intelligibility • Rule-Based Synthesizers • Statistical Synthesis with Hidden Markov Models • End-to-End Speech Synthesizers • Machine Learning Process Overview • Loss Functions and Synthesis Quality • Variant Architectures • Limitations of the State of the Art • Beyond Text-to-Speech • Toward Synthesis for Dialog Applications • Toward Better Models through Disentanglement
Watch     Download Powerpoint
Topics Include: • Towards Truly Interactive Dialog Systems • Limitations of Current Systems • Typical Architecture of a Dialog System • Speech Synthesis and Speech Recognition, revisited • Turn-Taking and other Reactive Behaviors • Sensitively Tracking the User's State • Clearly Conveying the System's State • Challenges in Scaling Up • Societal Risks
Watch     Download Powerpoint
Topics Include: • People Vary in Prosodic Behavior and Abilities • Overview of the Mental Procesess involved in Prosody in Conversation • Low-level Pitch Perception: Genetic and Neural Factors • Knowing and Recognizing Prosodic Constructions • Some Learning Processes and Difficulties • Interpreting Prosodic Meanings • Assembling Prosodic Constructions into Plans • Executing Prosodic Plans with Proper Control • Turn-Taking • Summary
See Also:
Watch     Download Powerpoint
Topics Include: • Helping People Master Prosody • What Learners Need • General Teaching Methods • Charisma through Prosody in Public Speaking • Teaching Prosodic Effectiveness in Dialog • Intelligibility through Prosody for Non-Native Speakers • Social Dimensions of Prosody and the Potential for Misinterpretation • Challenges faced by Non-Native Speakers • Teaching Prosodic Constructions • Prosody and the Perfection of Human Society
See Also:
Watch     Download Powerpoint
Topics Include: • The Persistence of Prosody Myths • Poetry and Music • The Noble Speakers, Social Climbing, and Prescriptive Rules • Why People Believe that Questions have a Final Pitch Rise • Broadcast Speech as an Outlier Artform • Summary: Archaic Perspectives Still Linger
Watch     Download Powerpoint
Topics Include: • Improving Human Society through Advances in Prosody • Hopes for Technology • Growing Knowledge, both Applied and Fundamental • Maintaining Synergy across the Field • Infrastructure Needs • Summary of our Hopes for the Field • Bibliography