
Nvidia unveiled Cosmos 3 on Sunday at GTC Taipei, releasing what it calls the world’s first fully open omnimodel for physical AI — a single system that combines vision reasoning, world generation, and action prediction to help robots and autonomous vehicles perceive and act in the real world.
Built on a mixture-of-transformers architecture, Cosmos 3 pairs a reasoning transformer with an expert generation transformer, allowing the model to understand object interactions, motion, and spatial-temporal relationships before generating video and action trajectories. The system can natively process and generate text, images, video, ambient sound, and actions, eliminating the need for developers to juggle separate models for different capabilities.
“The big bang of physical AI is just around the corner thanks to breakthroughs in multimodal reasoning language, vision and world models,” said Jensen Huang, Nvidia’s founder and CEO, during his keynote. “The Cosmos 3 family of open, frontier omnimodels gives developers a generational leap in ability to build robots, autonomous vehicles and vision AI that perceive, reason, plan and act in the physical world.”
The release includes two model sizes: Cosmos 3 Nano, an 8-billion-parameter version designed to run on workstation-grade hardware like the RTX PRO 6000 GPU, and Cosmos 3 Super, a 32-billion-parameter model built for large-scale synthetic data generation on Hopper and Blackwell GPUs. A third variant, Cosmos 3 Edge, is coming soon for real-time inference at the edge.
Nvidia is open-sourcing the models, post-training scripts, and synthetic data generation datasets, making them available on Hugging Face and GitHub. Developers can also deploy the models as Nvidia NIM microservices or access them through cloud partners including Microsoft Azure, CoreWeave, and Nebius.
Alongside the launch, Nvidia announced the Cosmos Coalition, a collaboration with Agile Robots, Black Forest Labs, Generalist, LTX, Runway, and Skild AI to advance open world models. Physical AI developers already building on the platform include Samsung, LG Electronics, Doosan Robotics, and Li Auto.
Among open models, Cosmos 3 ranks first across multiple physical AI benchmarks, including Physics-IQ and PAI-Bench for world generation accuracy, RoboLab and RoboArena for action policy, and the VANTAGE-Bench and TAR leaderboards for vision understanding. The model is designed to reduce physical AI training cycles from months to days by providing a pretrained foundation that requires less data and lowers training costs.