eventJune 1, 2021

ICASSP 2021

Apple sponsored the 46th International Conference on Acoustics, Speech, and Signal Processing (ICASSP). The conference focuses on signal processing and its applications and takes place virtually from June 6 to 11.

Accepted Papers

Conference Accepted Papers

Dynamic curriculum learning via data parameters for noise robust keyword spotting

Takuya Higuchi, Shreyas Saxena, Mehrez Souden, Tien Dung Tran, Masood Delfarah, Chandra Dhir

Error-driven Pruning of Language Models for Virtual Assistants

Sashank Gondala, Lyan Verwimp, Ernie Pusateri, Manos Tsagkias, Christophe Van Gysel

Generating Natural Questions from Images for Multimodal Assistants

Alkesh Patel, Akanksha Bindal, Hadas Kotek, Christopher Klein, Jason Williams

Knowledge Transfer for Efficient On-device False Trigger Mitigation

Pranay Dighe, Erik Marchi, Srikanth Vishnubhotla, Sachin Kajarekar, Devang Naik

Multimodal Punctuation Prediction with Contextual Dropout

Andrew Silva, Barry Theobald, Nick Apostoloff

Optimize what matters: Training DNN-HMM Keyword Spotting Model Using End Metric

Ashish Shrivastava, Arnav Kundu, Chandra Dhir, Devang Naik, Oncel Tuzel

Progressive Voice Trigger Detection: Accuracy vs Latency

Siddharth Sigtia, John Bridle, Hywel Richards, Pascal Clark, Vineet Garg, Erik Marchi

SapAugment: Learning A Sample Adaptive Policy for Data Augmentation

Ting-Yao Hu, Ashish Shrivastava, Jen-Hao Rick Chang, Hema Koppula, Stefan Braun, Kyuyeon Hwang, Ozlem Kalinli, Oncel Tuzel

Sep-28k: A Dataset for Stuttering Event Detection from Podcasts with People Who Stutter

Colin Lea, Vikram Mitra, Aparna Joshi, Sachin Kajarekar, Jeffrey Bigham

Special Session Accepted Paper

On the role of visual cues in audiovisual speech enhancement

Zakaria Aldeneh, Anushree Prasanna Kumar, Barry-John Theobald, Erik Marchi, Sachin Kajarekar, Devang Naik, Ahmed Hussen Abdelaziz

Conference Talks and Workshops

Apple organized a special session, Recent Advances in Multichannel and Multimodal Machine Learning for Speech Applications on June 10 at 4:30 PDT. We had an accepted paper at this session, On the Role of Visual Cues in Audiovisual Speech Enhancement.

Apple sponsored the Women in Signal Processing virtual panel which was held on June 10 at 9:30 am PDT.

Let's innovate together. Build amazing machine-learned experiences with Apple. Discover opportunities for researchers, students, and developers by visiting our Work With Us page.

ICASSP 2021

Accepted Papers

Conference Accepted Papers

Dynamic curriculum learning via data parameters for noise robust keyword spotting

Error-driven Pruning of Language Models for Virtual Assistants

Generating Natural Questions from Images for Multimodal Assistants

Knowledge Transfer for Efficient On-device False Trigger Mitigation

Multimodal Punctuation Prediction with Contextual Dropout

Optimize what matters: Training DNN-HMM Keyword Spotting Model Using End Metric

Progressive Voice Trigger Detection: Accuracy vs Latency

SapAugment: Learning A Sample Adaptive Policy for Data Augmentation

Sep-28k: A Dataset for Stuttering Event Detection from Podcasts with People Who Stutter

Special Session Accepted Paper

On the role of visual cues in audiovisual speech enhancement

Conference Talks and Workshops

Related readings and updates.

ICASSP 2022

On the Role of Visual Cues in Audiovisual Speech Enhancement

Discover opportunities in Machine Learning.