HAC adapts CLIP to hyperbolic space for zero-shot VQA tasks

By PulseAugur Editorial · [1 sources] · 2026-04-28 04:00

Researchers have introduced HAC, a novel framework that adapts pre-trained CLIP models to hyperbolic geometry for improved zero-shot Visual Question Answering (VQA). This parameter-efficient approach allows existing CLIP models to transition to hyperbolic space through minimal fine-tuning, avoiding the need for training from scratch. HAC demonstrated superior performance across various VQA benchmarks, including reasoning-intensive tasks, by achieving up to a 1.9-point improvement over standard CLIP models. AI

IMPACT Offers a more efficient method for adapting large vision-language models to new tasks, potentially improving zero-shot capabilities.

RANK_REASON Academic paper introducing a new method for adapting existing models to hyperbolic geometry for VQA tasks.

Read on arXiv cs.CV →

arXiv

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.CV TIER_1 English(EN) · Francesco Dibitonto, Cigdem Beyan, Vittorio Murino · 2026-04-28 04:00

HAC: Parameter-Efficient Hyperbolic Adaptation of CLIP for Zero-Shot VQA

arXiv:2604.23665v1 Announce Type: new Abstract: Recent advances in representation learning have shown that hyperbolic geometry can offer a more expressive alternative to the Euclidean embeddings used in CLIP models, capturing hierarchical structures and leading to better-organize…

COVERAGE [1]

HAC: Parameter-Efficient Hyperbolic Adaptation of CLIP for Zero-Shot VQA

RELATED ENTITIES

RELATED TOPICS