Apple is sponsoring the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), which takes place in person from April 6 to 11 in Hyderabad, India. ICASSP is the IEEE Signal Processing Society's flagship conference on signal processing and its applications. Below is the schedule of Apple-sponsored workshops and events at ICASSP 2025.

Schedule

Stop by the Apple booth (Booth C3) in the Hyderabad International Convention Center, open from 09:00 to 17:00 on April 6 to 11. All times listed are in GMT+5:30.

Monday, April 7

Wednesday, April 9

Thursday, April 10

Accepted Papers

An Efficient and Streaming Audio Visual Active Speaker Detection System

Arnav Kundu, Yanzi Jin, Max Horton, Mohammad Sekhavat, Danny Tormoen, Devang Naik

Compact Neural TTS Voices for Accessibility

Kunal Jain, Eoin Murphy, Deepanshu Gupta, Jonathan Dyke, Saumya Hiren Shah, Vasileios Tsiaras, Petko Petkov, Alistair Conkie

Contextualization of ASR with LLM Using Phonetic Retrieval-Based Augmentation

Zhihong Lei, Xingyu Na, Mingbin Xu, Ernest Pusateri, Christophe Van Gysel, Yuanyuan Zhang, Shiyi Han (Character AI), Zhen Huang

Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition

Takaaki Hori, Martin Kocour (Brno University of Technology), Adnan Haider, Erik McDermott, Xiaodan Zhuang

Exploring Prediction Targets in Masked Pre-training for Speech Foundation Models

Li-Wei Chen (Carnegie Mellon University), Zak Aldeneh, Takuya Higuchi, Tatiana Likhomanenko, Richard Bai, Ahmed Hussen Abdelaziz, Barry Theobald

ImmerseDiffusion: Generative Spatial Audio Latent Diffusion Model

Moji Heydari (University of Rochester), Mehrez Souden, Josh Atkins, Bruno Conejo

Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector Based Pseudo-Labels

Shinji Watanabe (Carnegie Mellon University), Jeeweon Jung (Carnegie Mellon University), Ahmed Hussen Abdelaziz, Takuya Higuchi, Zak Aldeneh, Li-Wei Chen (Carnegie Mellon University), Stephen Shum, Tatiana Likhomanenko, Barry-John Theobald

Retrieval-Augmented Correction of Named Entity Speech Recognition Errors

Ernest Pusateri, Anmol Walia, Anirudh Kashi, Bortik Bandyopadhyay, Nadia Hyder, Sayantan Mahinder, Raviteja Anantha, Daben Liu (Capital One), Sashank Gondala (Further AI)

SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions

Dominik Wagner (Friedrich-Alexander-Universitaet Erlangen-Nuernberg), Alex Churchill, Siddharth Sigtia, Erik Marchi

SLiCK: Exploiting Subsequences for Length-Constrained Keyword Spotting

Kumari Nishu, Minsik Cho, Devang Naik

Towards Automatic Assessment of Self-Supervised Speech Models Using Rank

Zak Aldeneh, Vimal Thilak, Takuya Higuchi, Tatiana Likhomanenko, Barry Theobald

Acknowledgements

Tatiana Likhomanenko is an Area Chair and Meta Reviewer for ICASSP 2025.

Arnav Kundu, Aswin Sivaraman, Kunal Jain, Kuan-Lin Chen, Kumari Nishu, Nimish Venkat Marigo, Parnia Bahar, Sameer Badaskar, Takaaki Hori, Tatiana Likhomanenko, Venki Nagesha, and Zak Aldeneh are reviewers for ICASSP 2025.

Venkataramanan Subramanian is presenting at the ICASSP 2025 Industry Forum.

Related readings and updates.

Neural Information Processing Systems (NeurIPS) 2024

Apple is presenting new research at the annual conference on Neural Information Processing Systems (NeurIPS), which takes place in person in Vancouver, Canada, from December 10 to 15. We are proud to again sponsor the multi-track interdisciplinary conference, which brings together the scientific and industrial research communities surrounding machine learning. Below is an overview of Apple’s participation at NeurIPS 2024.

International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024

Apple sponsored the International Conference on Acoustics, Speech and Signal Processing (ICASSP), which took place in person from April 14 to 19 in Seoul, South Korea. ICASSP is the IEEE Signal Processing Society's flagship conference on signal processing and its applications.
