CLIP
PulseAugur coverage of CLIP — every cluster mentioning CLIP across labs, papers, and developer communities, ranked by signal.
-
New framework enhances farmland change detection using large-small model collaboration
Researchers have developed a new framework for farmland semantic change detection, addressing limitations in existing benchmarks and models. The proposed method, called Fine-grained Difference-aware Mamba (FD-Mamba) int…
-
New 4D wire framework enables unified 3D geometric abstraction
Researchers have developed a novel framework for 3D geometric abstraction by utilizing a single, continuous 4D wire. This approach, parameterized as a B-spline with spatial coordinates and variable width, represents com…
-
ClipSum framework uses CLIP for better instructional video summaries
Researchers have developed ClipSum, a new framework for summarizing instructional videos by leveraging CLIP's vision-language features. This approach uses semantically aligned visual features from CLIP, trained on a vas…
-
LLVMs applied to SAR imagery for military target recognition
Researchers have developed a new benchmark and training methodology for applying large language-vision models (LLVMs) to automatic target recognition (ATR) using synthetic aperture radar (SAR) imagery. The study leverag…
-
DRAPE framework generates instance-specific prompts for multimodal LLMs
Researchers have developed DRAPE, a novel framework for Multimodal Continual Instruction Tuning (MCIT) that generates instance-specific soft prompts for multimodal large language models. Unlike existing methods that rel…