Machine Learning Research
All research
Positional Description for Numerical Normalization
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2024
Authors: Deepanshu Gupta, Javier Latorre
Can You Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features?
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2024
Authors: Zak Aldeneh, Takuya Higuchi, Jee-weon Jung, Skyler Seto, Tatiana Likhomanenko, Stephen Shum, Ahmed Hussen Abdelaziz, Shinji Watanabe
Novel-View Acoustic Synthesis From 3D Reconstructed Rooms
Content type: paper | Research areas: Computer Vision, Speech and Natural Language Processing | Conference: Interspeech | Published year: 2024
Authors: Byeongjoo Ahn, Karren Yang, Brian Hamilton, Jonathan Sheaffer, Anurag Ranjan, Miguel Sarabia del Castillo, Oncel Tuzel, Rick Chang
RepCNN: Micro-Sized, Mighty Models for Wakeword Detection
Content type: paper | Research areas: Methods and Algorithms, Speech and Natural Language Processing | Conference: Interspeech | Published year: 2024
Authors: Arnav Kundu, Prateeth Nayak, Priyanka Padmanabhan, Devang Naik
Enhancing CTC-based Speech Recognition with Diverse Modeling Units
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2024
Authors: Michael Han, Zhihong Lei, Mingbin Xu, Xingyu Na, Zhen Huang
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness
Content type: paper | Research areas: Human-Computer Interaction, Speech and Natural Language Processing | Conference: Interspeech | Published year: 2024
Authors: Satyam Kumar, Sai Srujana Buddi, Utkarsh Oggy Sarawgi, Vineet Garg, Shivesh Ranjan, Ognjen (Oggi) Rudovic, Ahmed Hussen Abdelaziz, Saurabh Adya
Multimodal Large Language Models with Fusion Low Rank Adaptation for Device Directed Speech Detection
Content type: paper | Research areas: Human-Computer Interaction, Speech and Natural Language Processing | Conference: Interspeech | Published year: 2024
Authors: Shruti Palaskar, Oggi Rudovic, Sameer Dharur, Florian Pesce, Gautam Krishna, Aswin Sivaraman, Jack Berkowitz, Ahmed Hussen Abdelaziz, Saurabh Adya, Ahmed Tewfik
ESPnet-SPK: Full Pipeline Speaker Verification Toolkit with Multiple Reproducible Recipes, Self-Supervised Front-Ends, and Off-the-Shelf Models
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2024
Authors: Jee-weon Jung, Wangyou Zhang, Jiatong Shi, Zak Aldeneh, Takuya Higuchi, Barry Theobald, Ahmed Hussen Abdelaziz, Shinji Watanabe
Transformer-based Model for ASR N-Best Rescoring and Rewriting
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2024
Authors: Edwin Kang, Christophe Van Gysel, Man-Hung Siu
5IDER: Unified Query Rewriting for Steering, Intent Carryover, Disfluencies, Entity Carryover and Repair
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2023
Authors: Jiarui Lu*, Bo-Hsiang Tseng*, Joel Ruben Antony Moniz*, Site Li, Xueyun Zhu, Hong Yu, Murat Akbacak
Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2023
Authors: Miguel Sarabia, Elena Menyaylenko, Alessandro Toso, Zak Aldeneh, Shadi Pirhosseinloo, Luca Zappella, Barry Theobald, Nick Apostoloff, Jonathan Sheaffer
Approximate Nearest Neighbour Phrase Mining for Contextual Speech Recognition
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2023
Authors: Maurits Bleeker, Pawel Swietojanski, Stefan Braun, Xiaodan Zhuang
Latent Phrase Matching for Dysarthric Speech
Content type: paper | Research areas: Accessibility, Speech and Natural Language Processing | Conference: Interspeech | Published year: 2023
Authors: Colin Lea*, Dianna Yee*, Jaya Narain, Zifang Huang, Lauren Tooley, Jeffrey P. Bigham, Leah Findlater
Matching Latent Encoding for Audio-Text based Keyword Spotting
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2023
Authors: Kumari Nishu, Minsik Cho, Devang Naik
Efficient Multimodal Neural Networks for Trigger-less Voice Assistants
Content type: paper | Research areas: Human-Computer Interaction, Speech and Natural Language Processing | Conference: Interspeech | Published year: 2023
Authors: Sai Srujana Buddi, Utkarsh Oggy Sarawgi, Tashweena Heeramun, Karan Sawnhey, Ed Yanosik, Saravana Rathinam, Saurabh Adya
Emphasis Control for Parallel Neural TTS
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2022
Authors: Shreyas Seshadri, Tuomo Raitio, Dan Castellani, Jiangchuan Li
Speech Emotion: Investigating Model Representations, Multi-Task Learning and Knowledge Distillation
Content type: paper | Research areas: Methods and Algorithms, Speech and Natural Language Processing | Conference: Interspeech | Published year: 2022
Authors: Vikramjit Mitra, Hsiang-Yun Sherry Chien, Vasudha Kowtha, Joseph Yitan Cheng, Erdrin Azemi
Improving Voice Trigger Detection with Metric Learning
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2022
Authors: Prateeth Nayak, Takuya Higuchi, Anmol Gupta, Shivesh Ranjan, Stephen Shum, Siddharth Sigtia, Erik Marchi, Varun Lakshminarasimhan, Minsik Cho, Saurabh Adya, Chandra Dhir, Ahmed Tewfik
Space-Efficient Representation of Entity-centric Query Language Models
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2022
Authors: Christophe Van Gysel, Mirko Hannemann, Ernest Pusateri, Youssef Oualil, Ilya Oparin
Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models
Content type: paper | Research areas: Methods and Algorithms, Speech and Natural Language Processing | Conference: Interspeech | Published year: 2022
Authors: Vineet Garg*, Ognjen (Oggi) Rudovic*, Pranay Dighe*, Ahmed H. Abdelaziz, Erik Marchi, Saurabh Adya, Chandra Dhir, Ahmed Tewfik
Vocal Effort Modeling in Neural TTS for Improving the Intelligibility of Synthetic Speech in Noise
Content type: paper | Research areas: Human-Computer Interaction, Speech and Natural Language Processing | Conference: Interspeech | Published year: 2022
Authors: Tuomo Raitio, Petko Petkov, Jiangchuan Li, Muhammed Shifas, Andrea Davis, Yannis Stylianou
User-Initiated Repetition-Based Recovery in Multi-Utterance Dialogue Systems
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2021
Authors: Hoang Long Nguyen, Vincent Renkens, Joris Pelemans, Srividya Pranavi Potharaju, Anil Kumar Nalamalapu, Murat Akbacak
DEXTER: Deep Encoding of External Knowledge for Named Entity Recognition in Virtual Assistants
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2021
Authors: Deepak Muralidharan*, Joel Ruben Antony Moniz*, Weicheng Zhang, Stephen Pulman, Lin Li, Megan Barnes, Jingjing Pan, Jason Williams, Alex Acero
A Discriminative Entity Aware Language Model for Virtual Assistants
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2021
Authors: Mandana Saebi, Ernie Pusateri, Aaksha Meghawat, Christophe Van Gysel
Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2021
Authors: Vikramjit Mitra, Zifang Huang, Colin Lea, Lauren Tooley, Panayiotis Georgiou, Sachin Kajarekar, Jeffrey Bigham
Class LM and Word Mapping for Contextual Biasing in End-to-End ASR
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2020
Authors: Rongqing Huang, Ossama Abdel-hamid, Xinwei Li, Gunnar Evermann
Complementary Language Model and Parallel Bi-LRNN for False Trigger Mitigation
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2020
Authors: Rishika Agarwal, Xiaochuan Niu, Pranay Dighe, Srikanth Vishnubhotla, Sameer Badaskar, Devang Naik
Controllable Neural Text-To-Speech Synthesis Using Intuitive Prosodic Features
Content type: paper | Research areas: Human-Computer Interaction, Speech and Natural Language Processing | Conference: Interspeech | Published year: 2020
Authors: Tuomo Raitio, Ramya Rasipuram, Dan Castellani
Hybrid Transformer and CTC Networks for Hardware Efficient Voice Triggering
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2020
Authors: Saurabh Adya, Vineet Garg, Siddharth Sigtia, Pramod Simha, Chandra Dhir
Improving On-Device Speaker Verification Using Federated Learning With Privacy
Content type: paper | Research areas: Privacy, Speech and Natural Language Processing | Conference: Interspeech | Published year: 2020
Authors: Filip Granqvist, Matt Seigel, Rogier van Dalen, Áine Cahill, Stephen Shum, Matthias Paulik
Stacked 1D Convolutional Networks for End-To-End Small Footprint Voice Trigger Detection
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2020
Authors: Takuya Higuchi, Mohammad Ghasemzadeh, Kisun You, Chandra Dhir
Reverse Transfer Learning: Can Word Embeddings Trained for Different NLP Tasks Improve Neural Language Models?
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2019
In collaboration with KU Leuven
Authors: Lyan Verwimp, Jerome R. Bellegarda
Connecting and Comparing Language Model Interpolation Techniques
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2019
Authors: Ernest Pusateri, Christophe Van Gysel, Rami Botros, Sameer Badaskar, Mirko Hannemann, Youssef Oualil, Ilya Oparin
Active Learning for Domain Classification in a Commercial Spoken Personal Assistant
Content type: paper | Research areas: Data Science and Annotation, Speech and Natural Language Processing | Conference: Interspeech | Published year: 2019
Authors: Xi C. Chen, Adithya Sagar, Justine T. Kao, Tony Y. Li, Christopher Klein, Stephen Pulman, Ashish Garg, Jason D. Williams
Coarse-to-fine Optimization for Speech Enhancement
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2019
Authors: Jian Yao, Ahmad Al-Dahle
Neural Network-Based Modeling of Phonetic Durations
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2019
Authors: Xizi Wei, Melvyn Hunt, Adrian Skilling
Mirroring to Build Trust in Digital Assistants
Content type: paper | Research area: Human-Computer Interaction | Conference: Interspeech | Published year: 2019
Authors: Katherine Metcalf, Barry-John Theobald, Garrett Weinberg, Robert Lee, Ing-Marie Jonsson, Russ Webb, Nicholas Apostoloff
Leveraging Acoustic Cues and Paralinguistic Embeddings to Detect Expression from Voice
Content type: paper | Research area: Speech and Natural Language Processing | Conference: Interspeech | Published year: 2019
Authors: Vikramjit Mitra, Sue Booker, Erik Marchi, David Scott Farrar, Ute Dorothea Peitz, Bridget Cheng, Ermine Teves, Anuj Mehta, Devang Naik
Bandwidth Embeddings for Mixed-Bandwidth Speech Recognition
Content type: paper | Research areas: Privacy, Speech and Natural Language Processing | Conference: Interspeech | Published year: 2019
Authors: Gautam Mantena, Ozlem Kalinli, Ossama Abdel-Hamid, Don McAllaster
Research - Apple Machine Learning Research