Model Compression in Practice: Lessons Learned from Practitioners Creating On-device Machine Learning Experiences
AuthorsFred Hohman, Mary Beth Kery, Donghao Ren, Dominik Moritz
Model Compression in Practice: Lessons Learned from Practitioners Creating On-device Machine Learning Experiences
AuthorsFred Hohman, Mary Beth Kery, Donghao Ren, Dominik Moritz
On-device machine learning (ML) promises to improve the privacy, responsiveness, and proliferation of new, intelligent user experiences by moving ML computation onto everyday personal devices. However, today’s large ML models must be drastically compressed to run efficiently on-device, a hurtle that requires deep, yet currently niche expertise. To engage the broader human-centered ML community in on-device ML experiences, we present the results from an interview study with 30 experts at Apple that specialize in producing efficient models. We compile tacit knowledge that experts have developed through practical experience with model compression across different hardware platforms. Our findings offer pragmatic considerations missing from prior work, covering the design process, trade-offs, and technical strategies that go into creating efficient models. Finally, we distill design recommendations for tooling to help ease the difficulty of this work and bring on-device ML into to more widespread practice.
Co-ML: Collaborative Machine Learning Model Building for Developing Dataset Design Practices
January 29, 2024research area Human-Computer Interaction, research area Tools, Platforms, Frameworksconference ACM TOCE
Machine learning (ML) models are fundamentally shaped by data, and building inclusive ML systems requires significant considerations around how to design representative datasets. Yet, few novice-oriented ML modeling tools are designed to foster hands-on learning of dataset design practices, including how to design for data diversity and inspect for data quality.
To this end, we outline a set of four data design practices (DDPs) for designing…
Data Platform for Machine Learning
June 1, 2019research area Data Science and Annotationconference SIGMOD
In this paper, we present a purpose-built data management system, MLdp, for all machine learning (ML) datasets. ML applications pose some unique requirements different from common conventional data processing applications, including but not limited to: data lineage and provenance tracking, rich data semantics and formats, integration with diverse ML frameworks and access patterns, trial-and-error driven data exploration and evolution, rapid…