AI
Molmo2: Open Video-Language AI with Pixel-Level Grounding
Most top video AIs are locked up. Molmo2 opens the door: open weights and open datasets, built to understand videos and ground that understanding by pointing to and tracking objects in the pixels. * Data you can build on: 7 new video datasets and 2 multi-image sets, including rich video captions,