Large Vision-Language Models
PulseAugur coverage of Large Vision-Language Models: every cluster mentioning Large Vision-Language Models across labs, papers, and developer communities, ranked by signal.
- New framework estimates LVLM confidence by contrasting image-based predictions
  Researchers have developed a new framework called BICR (Blind-Image Contrastive Ranking) to assess the confidence of Large Vision-Language Models (LVLMs). This method helps distinguish between predictions genuinely info…
- Composer framework advances aesthetic image generation via composition transfer
  Researchers have developed Composer, a new framework designed to improve the aesthetic quality of generated images by explicitly modeling composition. This approach separates composition from semantics, allowing for com…
- New VIDA dataset tackles ambiguity in multimodal machine translation
  Researchers have introduced VIDA, a new dataset designed to tackle ambiguity in multimodal machine translation. The dataset contains 2,500 instances where visual context is crucial for resolving ambiguous expressions. E…