Towards a Better Evaluation of 3D CVML Algorithms: Immersive Debugging of a Localization Model
AuthorsT. Lin†, J. Yuan, K. Miao, T. Katolikyan, I. Walker, M. Cavallo
Towards a Better Evaluation of 3D CVML Algorithms: Immersive Debugging of a Localization Model
AuthorsT. Lin†, J. Yuan, K. Miao, T. Katolikyan, I. Walker, M. Cavallo
As advancements in robotics, autonomous driving, and spatial computing continue to unfold, a growing number of Computer Vision and Machine Learning (CVML) algorithms are incorporating three-dimensional data into their frameworks. Debugging these 3D CVML models often requires going beyond traditional performance evaluation methods, necessitating a deeper understanding of an algorithm’s behavior within its spatio-temporal context. However, the lack of appropriate visualization tools presents a significant obstacle to effectively exploring 3D data and spatial features in relation to key performance indicators (KPIs). To address this challenge, we explore the application of Immersive Analytics (IA) methodologies to enhance the debugging process of 3D CVML models. Through in-depth interviews with eight CVML engineers, we identify common tasks and challenges faced during the development of spatial algorithms, and establish a set of design principles for creating tools tailored to spatial model evaluation. Building on these insights, we propose a novel immersive analytics system for debugging an indoor localization algorithm. The system is built using web technologies and integrates WebXR to enable fluid transitions across the reality-virtuality continuum. We conduct a qualitative study with six CVML engineers using our system on Apple Vision Pro, observing their analytical workflow as they debug an indoor localization sequence. We discuss the advantages of employing immersive analytics in the model evaluation workflow, emphasizing the role of seamlessly integrating 2D and 3D visualizations across varying levels of immersion to facilitate more effective model assessment. Finally, we reflect on the implementation trade-offs and discuss the generalizability of our findings for future efforts in immersive 3D CVML model debugging.
VibE: A Visual Analytics Workflow for Semantic Error Analysis of CVML Models at Subgroup Level
March 31, 2025research area Human-Computer Interaction, research area Tools, Platforms, Frameworksconference IUI
Effective error analysis is critical for the successful development and deployment of CVML models. One approach to understanding model errors is to summarize the common characteristics of error samples. This can be particularly challenging in tasks that utilize unstructured, complex data such as images, where patterns are not always obvious. Another method is to analyze error distributions across pre-defined categories, which requires analysts to…
GAUDI: A Neural Architect for Immersive 3D Scene Generation
September 27, 2022research area Computer Vision, research area Methods and Algorithmsconference NeurIPS
We introduce GAUDI, a generative model capable of capturing the distribution of complex and realistic 3D scenes that can be rendered immersively from a moving camera. We tackle this challenging problem with a scalable yet powerful approach, where we first optimize a latent representation that disentangles radiance fields and camera poses. This latent representation is then used to learn a generative model that enables both unconditional and…