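Beyond the international competitions, MMLab is also hosting six cutting-edge workshops and tutorials at CVPR 2025, covering hot topics such as autonomous driving, multimodality, world models, collaborative perception, and data empowerment.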
- Embodied Intelligence for Autonomous Systems on the Horizon
- Workshop on Autonomous Driving
- Distillation of Foundation Models for Autonomous Driving
- Multi-Agent Embodied Intelligent Systems Meet Generative-AI Era: Opportunities, Challenges and Futures
- Robotics 101: An Odyssey from A Vision Perspective
- The 1st Workshop on Benchmarking World Models
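With generative intelligence and multimodal perception advancing rapidly, the research results listed here show progress in cross-modal understanding, scene generation, human-computer interaction, and robot intelligence. Techniques such as text-driven video synthesis, image safety assessment, high-precision 3D Gaussian modeling, and robot manipulation policy learning are all improving the generality, efficiency, and real-world adaptability of models. Whether you care about safer and more trustworthy generative systems, smarter robot brains, or higher-quality visual generation models, these projects represent the frontier of technical innovation. Stay tuned!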
- TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization [Oral]
- Parallelized Autoregressive Visual Generation [Highlight]
- RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins [Highlight]
- HMAR: Efficient Hierarchical Masked AutoRegressive Image Generation
- MBQ: Modality-Balanced Quantization for Large Vision-Language Models
- MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation