paperMay 2024

Poly-View Contrastive Learning

AuthorsAmitis Shidani, Devon Hjelm, Jason Ramapuram, Russ Webb, Eeshan Gunesh Dhekane, Dan Busbridge

Contrastive learning typically matches pairs of related views among a number of unrelated negative views.

Views can be generated (e.g. by augmentations) or be observed. We investigate matching when there are more than two related views which we call poly-view tasks, and derive new representation learning objectives using information maximization and sufficient statistics.

We show that with unlimited computation, one should maximize the number of related views, and with a fixed compute budget, it is beneficial to decrease the number of unique samples whilst increasing the number of views of those samples.

In particular, poly-view contrastive models trained for 128 epochs with batch size 256 outperform SimCLR trained for 1024 epochs at batch size 4096 on ImageNet1k, challenging the belief that contrastive models require large batch sizes and many training epochs.

Figure 1: An Overview of View Multiplicity in Contrastive Learning.

Figure 2: At fixed compute, Poly-View methods outperform other contrastive methods by exchanging unique samples for additional views.

Related readings and updates.

February 2, 2022research area Computer Vision, research area Methods and Algorithmsconference WACV

We study the problem of novel view synthesis from sparse source observations of a scene comprised of 3D objects. We propose a simple yet effective approach that is neither continuous nor implicit, challenging recent trends on view synthesis. Our approach explicitly encodes observations into a volumetric representation that enables amortized rendering. We demonstrate that although continuous radiance field representations have gained a lot of...

November 29, 2021research area Computer Visionconference 3DV

Multi-view triangulation is the gold standard for 3D reconstruction from 2D correspondences, given known calibration and sufficient views. However in practice expensive multi-view setups — involving tens sometimes hundreds of cameras — are required to obtain the high fidelity 3D reconstructions necessary for modern applications. In this work we present a novel approach that leverages recent advances in 2D-3D lifting using neural shape priors...

Poly-View Contrastive Learning

Related readings and updates.

Fast and Explicit Neural View Synthesis

High Fidelity 3D Reconstructions with Limited Physical Views

Discover opportunities in Machine Learning.