DINO v2
PulseAugur coverage of DINO v2 — every cluster mentioning DINO v2 across labs, papers, and developer communities, ranked by signal.
- New frameworks and benchmarks advance audio-visual generation
Researchers have introduced OmniCustom, a framework for customizing both video identity and audio timbre simultaneously from reference images and audio. This DiT-based model uses separate LoRA modules for identity and t…
- New optical flow method skips test-time scaling using foundation models
Researchers have developed a new method for estimating dense optical flow that bypasses the need for computationally intensive test-time scaling. This approach leverages pretrained foundation models, specifically DINO-v…
- SAM 3: The Eyes for AI — Nikhila & Pengchuan (Meta Superintelligence), ft. Joseph Nelson (Roboflow)
Meta AI has released SAM 3, a significant advancement in their Segment Anything project, capable of concept segmentation, detection, and tracking in images and video using natural language prompts. This new model achiev…