Areas and Topics
1. Speech Perception, Production and Acquisition
- Models of speech production
- Physiology and neurophysiology of speech production and perception
- Models of speech perception
- Acoustic and articulatory cues in speech perception
- Interaction speech production-speech perception
- Multimodal speech perception
- Cognition and brain studies on speech
- Code switching and multilingual studies
- L1 acquisition
- Bilingual and L2 acquisition and processing
- Speech and voice disorders
- Hearing disorders
- Combining speech and other biosignals
- Adverse listening conditions
- Other topics in Speech Perception, Production and Acquisition
2. Phonetics, Phonology, and Prosody
- Phonetics and phonology
- Language descriptions
- Acoustic phonetics
- Phonation and voice quality
- Articulatory and acoustic features of prosody
- Perception of prosody
- Laboratory phonology
- Sound changes
- Sociophonetics
- Phonetics of L1-L2 interaction
- Forensic phonetics
- Acoustic manifestations of social characteristics
- Other topics in Phonetics, Phonology, and Prosody
3. Analysis of Paralinguistics in Speech and Language
- Analysis of speaker states
- Analysis of speaker traits
- Automatic analysis of speaker states
- Automatic analysis of speaker traits
- Pathological speech and language
- Social signal processing
- Sentiment analysis and opinion mining
- Perception of paralinguistic phenomena
- Multimodal paralinguistics
- Phonetic and linguistic aspects of paralinguistics
- Other topics in Analysis of Paralinguistics in Speech and Language
4. Speaker and Language Identification
- Language identification and verification, language diarization
- Dialect and accent recognition
- Speaker verification and identification
- Features for speaker and language recognition
- Speaker diarization
- Higher-level knowledge in speaker and language recognition
- Evaluation of speaker and language identification systems
- Multimodal speaker recognition and diarization
- Other topics in Speaker and Language Identification
5. Analysis of Speech and Audio Signals
- Speech acoustics
- Speech analysis and representation
- Audio signal analysis and representation
- Speech and audio segmentation
- Speech and audio classification
- Voice activity detection
- Pitch and harmonic analysis
- Source separation and computational auditory scene analysis
- Speaker spatial localization
- Speech analysis in the presence of music
- Singing analysis
- Other topics in Analysis of Speech and Audio Signals
6. Speech Coding and Enhancement
- Speech coding and transmission
- Perceptual audio coding of speech signals
- Noise reduction for speech signals
- Speech enhancement: single-channel
- Speech enhancement: multi-channel
- Speech intelligibility
- Speech enhancement in hearing aids
- Dereverberation for speech signals
- Echo cancelation for speech signals
- Evaluation of speech transmission, coding and enhancement
- Bandwidth expansion
- Other topics in Speech Coding and Enhancement
7. Speech Synthesis and Spoken Language Generation
- Grapheme-to-phoneme conversion for synthesis
- Text processing for speech synthesis
- Signal processing methods for synthesis
- Speech synthesis paradigms and methods
- Towards end-to-end speech synthesis
- Articulatory speech synthesis
- Unit selection and concatenative speech synthesis
- Statistical parametric speech synthesis
- Prosody modeling and generation
- Expression, emotion and personality generation
- Synthesis of singing voices
- Voice modification, conversion and morphing
- Concept-to-speech conversion
- Cross-lingual and multilingual aspects in speech synthesis, code switching
- Multimodal synthesis for avatars and talking heads
- Tools and data for speech synthesis
- Evaluation of speech synthesis
- Other topics in Speech Synthesis
8. Speech Recognition: Signal Processing, Acoustic Modeling, Robustness, Adaptation
- Feature extraction and low-level feature modeling for ASR
- Prosodic features and models
- Robustness against noise or reverberation
- Far field and microphone array speech recognition
- Novel neural network architectures (e.g. sequence models, LSTM variants)
- Neural network training methods (including new objective functions)
- Discriminative acoustic training methods for ASR
- Acoustic model adaptation (e.g. bandwidth, emotion, accent)
- Speaker adaptation and normalisation
- Pronunciation variants and modeling for speech recognition
- Acoustic confidence measures
- Cross-lingual and multilingual/accent aspects, and code-switching
- Acoustic modeling for conversational speech (dialog, interaction)
- Other topics in Speech Recognition: Signal Processing, Acoustic Modeling, Robustness, Adaptation
9. Speech Recognition: Architecture, Search, and Linguistic Components
- Lexical modeling and access: units and morphological models
- Automatic lexicon learning
- Language model adaptation (domain, diachronic adaptation)
- Neural networks for language modeling
- Search methods, decoding algorithms, lattices, multipass strategies
- New computational strategies, data-structures for ASR
- Computational resource constrained speech recognition
- Confidence measures
- Cross-lingual and multilingual components for speech recognition
- Other topics in Speech Recognition -Architecture, Search, and Linguistic Components
10. Speech Recognition: Technologies and Systems for New Applications
- Multimodal systems
- Applications in education and learning (incl. CALL, assessment of fluency)
- Applications in medical practice (CIS, voice assessment, etc.)
- Speech science in end-user applications
- Rich transcription
- Innovative products and services based on speech technologies
- New paradigms (e.g. artic. models, silent speech interfaces, topic models)
- Zero-resource speech recognition
- Other topics in Speech Recognition -Technologies and Systems for New Applications
11. Spoken dialog systems and conversational analysis
- Spoken dialog systems
- Discourse and dialog structures
- Multimodal interaction and interfaces
- Conversation, communication and interaction
- Analysis of verbal, co-verbal and nonverbal behavior
- Language modeling for conversational speech (dialog, interaction)
- Interactive systems for speech/language training, therapy, communication aids
- Stochastic modeling for dialog
- Question-answering from speech
- Systems for spoken language understanding
- Evaluation of speech and multimodal dialog systems
- Other topics in Spoken dialog systems and conversational analysis
- Spoken machine translation
- Speech-to-speech translation systems
- Voice search
- Spoken term detection
- Indexing, mining and retrieval of speech and audio documents
- Speech and multimodal resources
- Evaluation of speech technology systems
- Metadata descriptions of speech, audio and text resources
- Metadata for semantic or content markup
- Metadata for ling./discourse structure (disfluencies, boundaries, speech acts)
- Methodologies and tools for language resource construction and annotation
- Automatic segmentation and labeling of resources
- Evaluation and quality insurance of language resources
- Evaluation of spoken language technology
- Spoken document summarization
- Semantic analysis and classification
- Entity extraction from speech
- Topic spotting and classification
- Other topics in Spoken Language Processing: Translation, Information Retrieval, Summarization, Resources and Evaluation