« Crossroads of Speech and Language »

Papers NEW! Surveys Special Sessions & Challenges Tutorials Show & Tell Satellite Events Areas and Topics Important Dates

Areas and Topics

1. Speech Perception, Production and Acquisition

Models of speech production
Physiology and neurophysiology of speech production and perception
Models of speech perception
Acoustic and articulatory cues in speech perception
Interaction speech production-speech perception
Multimodal speech perception
Cognition and brain studies on speech
Code switching and multilingual studies
L1 acquisition
Bilingual and L2 acquisition and processing
Speech and voice disorders
Hearing disorders
Combining speech and other biosignals
Adverse listening conditions
Other topics in Speech Perception, Production and Acquisition

2. Phonetics, Phonology, and Prosody

Phonetics and phonology
Language descriptions
Acoustic phonetics
Phonation and voice quality
Articulatory and acoustic features of prosody
Perception of prosody
Laboratory phonology
Sound changes
Sociophonetics
Phonetics of L1-L2 interaction
Forensic phonetics
Acoustic manifestations of social characteristics
Other topics in Phonetics, Phonology, and Prosody

3. Analysis of Paralinguistics in Speech and Language

Analysis of speaker states
Analysis of speaker traits
Automatic analysis of speaker states
Automatic analysis of speaker traits
Pathological speech and language
Social signal processing
Sentiment analysis and opinion mining
Perception of paralinguistic phenomena
Multimodal paralinguistics
Phonetic and linguistic aspects of paralinguistics
Other topics in Analysis of Paralinguistics in Speech and Language

4. Speaker and Language Identification

Language identification and verification, language diarization
Dialect and accent recognition
Speaker verification and identification
Features for speaker and language recognition
Speaker diarization
Higher-level knowledge in speaker and language recognition
Evaluation of speaker and language identification systems
Multimodal speaker recognition and diarization
Other topics in Speaker and Language Identification

5. Analysis of Speech and Audio Signals

Speech acoustics
Speech analysis and representation
Audio signal analysis and representation
Speech and audio segmentation
Speech and audio classification
Voice activity detection
Pitch and harmonic analysis
Source separation and computational auditory scene analysis
Speaker spatial localization
Speech analysis in the presence of music
Singing analysis
Other topics in Analysis of Speech and Audio Signals

6. Speech Coding and Enhancement

Speech coding and transmission
Perceptual audio coding of speech signals
Noise reduction for speech signals
Speech enhancement: single-channel
Speech enhancement: multi-channel
Speech intelligibility
Speech enhancement in hearing aids
Dereverberation for speech signals
Echo cancelation for speech signals
Evaluation of speech transmission, coding and enhancement
Bandwidth expansion
Other topics in Speech Coding and Enhancement

7. Speech Synthesis and Spoken Language Generation

Grapheme-to-phoneme conversion for synthesis
Text processing for speech synthesis
Signal processing methods for synthesis
Speech synthesis paradigms and methods
Towards end-to-end speech synthesis
Articulatory speech synthesis
Unit selection and concatenative speech synthesis
Statistical parametric speech synthesis
Prosody modeling and generation
Expression, emotion and personality generation
Synthesis of singing voices
Voice modification, conversion and morphing
Concept-to-speech conversion
Cross-lingual and multilingual aspects in speech synthesis, code switching
Multimodal synthesis for avatars and talking heads
Tools and data for speech synthesis
Evaluation of speech synthesis
Other topics in Speech Synthesis

8. Speech Recognition: Signal Processing, Acoustic Modeling, Robustness, Adaptation

Feature extraction and low-level feature modeling for ASR
Prosodic features and models
Robustness against noise or reverberation
Far field and microphone array speech recognition
Novel neural network architectures (e.g. sequence models, LSTM variants)
Neural network training methods (including new objective functions)
Discriminative acoustic training methods for ASR
Acoustic model adaptation (e.g. bandwidth, emotion, accent)
Speaker adaptation and normalisation
Pronunciation variants and modeling for speech recognition
Acoustic confidence measures
Cross-lingual and multilingual/accent aspects, and code-switching
Acoustic modeling for conversational speech (dialog, interaction)
Other topics in Speech Recognition: Signal Processing, Acoustic Modeling, Robustness, Adaptation

9. Speech Recognition: Architecture, Search, and Linguistic Components

Lexical modeling and access: units and morphological models
Automatic lexicon learning
Language model adaptation (domain, diachronic adaptation)
Neural networks for language modeling
Search methods, decoding algorithms, lattices, multipass strategies
New computational strategies, data-structures for ASR
Computational resource constrained speech recognition
Confidence measures
Cross-lingual and multilingual components for speech recognition
Other topics in Speech Recognition -Architecture, Search, and Linguistic Components

10. Speech Recognition: Technologies and Systems for New Applications

Multimodal systems
Applications in education and learning (incl. CALL, assessment of fluency)
Applications in medical practice (CIS, voice assessment, etc.)
Speech science in end-user applications
Rich transcription
Innovative products and services based on speech technologies
New paradigms (e.g. artic. models, silent speech interfaces, topic models)
Zero-resource speech recognition
Other topics in Speech Recognition -Technologies and Systems for New Applications

11. Spoken dialog systems and conversational analysis

Spoken dialog systems
Discourse and dialog structures
Multimodal interaction and interfaces
Conversation, communication and interaction
Analysis of verbal, co-verbal and nonverbal behavior
Language modeling for conversational speech (dialog, interaction)
Interactive systems for speech/language training, therapy, communication aids
Stochastic modeling for dialog
Question-answering from speech
Systems for spoken language understanding
Evaluation of speech and multimodal dialog systems
Other topics in Spoken dialog systems and conversational analysis

12. Spoken Language Processing: Translation, Information Retrieval, Summarization, Resources and Evaluation

Spoken machine translation
Speech-to-speech translation systems
Voice search
Spoken term detection
Indexing, mining and retrieval of speech and audio documents
Speech and multimodal resources
Evaluation of speech technology systems
Metadata descriptions of speech, audio and text resources
Metadata for semantic or content markup
Metadata for ling./discourse structure (disfluencies, boundaries, speech acts)
Methodologies and tools for language resource construction and annotation
Automatic segmentation and labeling of resources
Evaluation and quality insurance of language resources
Evaluation of spoken language technology
Spoken document summarization
Semantic analysis and classification
Entity extraction from speech
Topic spotting and classification
Other topics in Spoken Language Processing: Translation, Information Retrieval, Summarization, Resources and Evaluation