CVPR 2025 - Seattle, United States

C3DV: 3rd Workshop on Compositional 3D Vision

Second workshop on compositional 3D vision and VSIC and 3DCoMPaT++ dataset challenges, hosted by #CVPR2025.

000

Days

00

Hours

00

Minutes

00

Secondes

Program

Workshop Program

Join us for a day of exciting talks and discussions on the latest advances in computer vision and AI.

08:50

Opening Remarks08:50 AM - 09:00 AM

#

Opening Remarks

  • Organizers

09:00

Invited Talk 109:00 AM - 09:35 AM

Bio

Pr. Vedaldi is a Professor of Computer Vision and Machine Learning and a co-lead of the VGG group at the Engineering Science department of the University of Oxford. His research mainly covers using computer vision and machine learning methods to understand the content of images and videos automatically, with little to no manual supervision, in terms of semantics and 3D geometry. He is also the leading author of the VLFeat and MatConvNet computer vision and deep learning libraries.

09:35

Invited Talk 209:35 AM - 10:10 AM

Vladimir Kim

3D Vision and Geometry Processing

  • Vladimir (Vova) Kim
Bio

Vladimir (Vova) Kim is a Senior Research Scientist at Adobe Research, where his work focuses on computer vision, machine learning, and 3D geometry processing. His research spans a wide range of topics, including 3D reconstruction, shape analysis, and generative models for 3D content creation. Vova is well-known for his contributions to developing advanced algorithms that bridge the gap between visual understanding and 3D modeling, helping to shape the future of creative technologies. He holds a Ph.D. from Princeton University, and his work has been widely published in top-tier conferences such as CVPR and SIGGRAPH.

10:10

Coffee Break10:10 AM - 10:40 AM

10:40

Invited Talk 310:40 AM - 11:15 AM

Bio

Pr. Birdal is an Assistant Professor (Lecturer) in the Department of Computing of Imperial College London. Previously, he was a senior Postdoctoral Research Fellow at Stanford University within the Geometric Computing Group of Pr. Leonidas Guibas. His current foci of interest involve geometric machine learning and 3D computer vision. More theoretical work is aimed at investigating and interrogating limits in geometric computing and non-Euclidean inference as well as principles of deep learning.

11:15

Invited Talk 411:15 AM - 11:50 AM

Bio

Pr. Su is an Associate Professor in the Department of Computer Science and Engineering at the University of California, San Diego. His research focuses on artificial intelligence, 3D vision, and robotics, with a particular emphasis on deep learning for 3D understanding, 3D reconstruction, and robot learning. He has made significant contributions to the development of neural representations for 3D data, advancing fields such as 3D shape analysis and scene understanding.

11:50

Lunch Break11:50 AM - 13:20 PM

13:20

Invited Talk 513:20 PM - 13:55 PM

Bio

Pr. Chang is an Associate Professor in the School of Computing Science at Simon Fraser University. Prior to this, she was a visiting research scientist at Facebook AI Research and a research scientist at Eloquent Labs. Her research focuses on bridging the gap between language and 3D representations of shapes and scenes, grounding language for embodied agents, and synthesizing 3D environments from natural language.

13:20

Invited Talk 514:00 PM - 14:30 PM

Bio

Pr. Fang is an Associate Professor of Electrical and Computer Engineering at the NYU Abu Dhabi and NYU Tandon. He directs the NYU Multimedia and Visual Computing Lab. His research focuses on 3D Computer Vision and Machine Learning with applications to robotics and autonomous driving. He is currently working on the development of 3D deep learning technologies in large-scale visual computing, cross-domain and cross-modality models, and their various industrial applications.

Challenges

3DCoMPaT Challenge

3DCoMPaT dataset++

🔍 Challenge overview


The Grounded CoMPaT Recognition (GCR) is a compositional 3D vision task that aims to collectively recognize and ground compositions of materials on parts of 3D objects. This task is based on the 3DCoMPaT-200 dataset, a large-scale multimodal dataset composed of stylized 3D objects, associated 2D renderings, and textual descriptions.
We propose two variations of this task: GCR-Coarse and GCR-Fine, which are based on coarse-grained and fine-grained 3D segmentations of the 3DCoMPaT-200 models.
We highly encourage participants of the challenge to enter and submit to both tracks of the challenge.

📊 Dataset


The 3DCoMPaT-200 dataset for both challenge tracks is available through our download page.

📨 Submission


Submission will be made through the eval.ai platform.

📜 Rules


Here are the rules for the challenge:

  • Submission Limit: Each participant is allowed to submit their solution a maximum of three times per day.
  • Data Usage: Participants are not permitted to use any data other than the 3DCoMPaT-200 data for training their models.
  • Technical Report: Each participant must submit a technical report detailing their methods, which will be made public, in order to be eligible for any prizes or rewards.


🏆 Awards


Total prize pool: ***. Teams are encouraged to particpate to both challenge tracks.
Fine track:
  • 1st: ***
  • 2nd: ***
Coarse track:
  • 1st: ***
  • 2nd: ***

These prizes are designed to motivate participants to put their best effort into the challenge and to reward those who perform exceptionally well. The challenge organizers hope that these prizes will encourage a high level of participation and help to drive innovation in the field of 3D computer vision. It should be noted that eligibility for these prizes is contingent on participants adhering to the rules of the challenge. Therefore, participants must submit their solutions in accordance with the rules and provide a technical report detailing their methods to be considered for any prizes or rewards.

💬 Q&A


If you encounter any technical issue related to the challenge, or if you're missing critical information, please open a ticket on our GitHub repository.




Challenges

3DCoMPaT-200 Language-Based Part Grounding Challenge

This challenge aims to explore how modern machine learning techniques can be applied to improve the ability to refer to specific parts of a shape based on the given text prompt. Building on the principles of accessing various levels of difficulty, participants will be provided with different numbers of parts per shape with grounding prompts and will be tasked with designing a model able to segment the mentioned parts in the shape's point cloud. Please stay tuned for more information.

Speakers

Invited Speakers

Hao Su

Associate Professor UC San Diego

Tolga Birdal

Assistant Professor Imperial College London

Vladimir Kim

Senior Research Scientist Adobe Research

Georgia Gkioxari

Assistant Professor University of Southern California

Andrea Vedaldi

Professor University of Oxford

Angel Chang

Associate Professor Simon Fraser University

Yi Fang

Associate Professor NYU Abu Dhabi & NYU Tandon

Organizers

Workshop Organizers

Habib Slim

Ph.D. Student KAUST

Abdulwahab Felemban

Ph.D. Student KAUST

Mahmoud Ahmed

Research Student KAUST

Wolfgang Heidrich

Professor KAUST

Peter Vajda

Researcher and Engineering Manager Meta AI

Natalia Neverova

Research Lead Meta AI

Mohamed Elhoseiny

Assistant Professor KAUST

Challenge Organizers

Aditya Ganeshan

Ph.D. Student Brown University

Kenny Jones

Ph.D. Student Brown University

Habib Slim

Ph.D. Student KAUST

Mahmoud Ahmed

Research Student KAUST

Xiang Li

Postdoctoral Researcher KAUST

Daniel Ritchie

Assistant Professor Brown University

Peter Wonka

Professor KAUST

Mohamed Elhoseiny

Assistant Professor KAUST

For any question or support, please reach @Habib.S.