Speech Foundation Models Generalize to Time Series Tasks from Wearable Sensor Data

AuthorsJaya Narain, Zakaria Aldeneh, Shirley Ren

This paper was accepted at the Learning from Time Series for Health workshop at NeurIPS 2025.

Both speech and sensor time series data encode information in both the time- and frequency- domains, like spectral powers and waveform shapelets. We show that speech foundation models learn representations that generalize beyond the speech domain and achieve state-of-the-art performance on diverse time-series tasks from wearable sensors. Probes trained on features extracted from HuBERT and wav2vec 2.0 outperform those extracted from self-supervised models trained directly on modality-specific datasets for mood classification, arrhythmia detection, and activity classification tasks. We find that the convolutional feature encoders of speech models are particularly relevant for wearable sensor applications. The proposed approach enhances performance on data scarce time-series tasks using simple probing methods. This work takes a step toward developing generalized time-series models that unify speech and sensor modalities.

Related readings and updates.

Towards Time-Series Reasoning with LLMs

December 3, 2024research area Methods and Algorithms, research area Speech and Natural Language Processingconference NeurIPS

Multi-modal large language models (MLLMs) have enabled numerous advances in understanding and reasoning in domains like vision, but we have not yet seen this broad success for time-series. Although prior works on time-series MLLMs have shown promising performance in time-series forecasting, very few works show how an LLM could be used for time-series reasoning in natural language. We propose a novel multi-modal time-series LLM approach that…

Generalizable Autoregressive Modeling of Time Series Through Functional Narratives

October 15, 2024research area Methods and Algorithms, research area Tools, Platforms, Frameworks

Time series data are inherently functions of time, yet current transformers often learn time series by modeling them as mere concatenations of time periods, overlooking their functional properties. In this work, we propose a novel objective for transformers that learn time series by re-interpreting them as temporal functions. We build an alternative sequence of time series by constructing degradation operators of different intensity in the…

Speech Foundation Models Generalize to Time Series Tasks from Wearable Sensor Data

Related readings and updates.

Towards Time-Series Reasoning with LLMs

Generalizable Autoregressive Modeling of Time Series Through Functional Narratives

Discover opportunities in Machine Learning.