ENTITY Vision-Language-Action (VLA) models

Vision-Language-Action (VLA) models

PulseAugur coverage of Vision-Language-Action (VLA) models — every cluster mentioning Vision-Language-Action (VLA) models across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

16 over 90d

Releases · 30d

0 over 90d

Papers · 30d

16 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

11 day(s) with sentiment data

RECENT · PAGE 1/1 · 16 TOTAL

RESEARCH · CL_79104 · Jun 7 · 09:23

GEAR-VLA framework enhances robotic manipulation generalization

Researchers have developed GEAR-VLA, a new framework designed to improve the generalizability of Vision-Language-Action (VLA) models in robotic manipulation tasks. This approach addresses limitations in current VLA mode…
RESEARCH · CL_76937 · Jun 5 · 04:42

ActionMap improves robot policy learning with voxel heatmap

Researchers have developed ActionMap, a novel voxel heatmap action head designed to improve robot policy learning in vision-language-action (VLA) models. This new head replaces the traditional action decoder, predicting…
TOOL · CL_78002 · Jun 4 · 00:00

Hugging Face paper: Robots need better data interfaces, not just bigger models

A new position paper from Hugging Face argues that advancing robot intelligence requires more than just scaling existing Vision-Language-Action (VLA) models. The paper highlights the need for specialized interfaces to p…
RESEARCH · CL_70319 · Jun 3 · 10:38

VISTA framework improves robot training with validated data

Researchers have developed VISTA, a framework designed to improve the training of Vision-Language-Action (VLA) models using real-world robot data. The framework addresses challenges such as distorted camera views and ph…
TOOL · CL_68309 · Jun 3 · 04:00

New S2 framework boosts VLA model generalization with evidence budgets

Researchers have developed a new framework called S2 (See Less, Specify More) to enhance the generalization capabilities of vision-language-action (VLA) models. S2 refines the executor's training by preserving high-leve…
TOOL · CL_68285 · Jun 3 · 04:00

New TRAP attack hijacks VLA models via adversarial patches

Researchers have developed a novel attack method called TRAP that exploits the Chain-of-Thought (CoT) reasoning in Vision-Language-Action (VLA) models. This attack uses adversarial patches, such as a tablecloth, to mani…
RESEARCH · CL_62767 · May 29 · 09:18

New research probes VLM susceptibility to visual persuasion and influence

Researchers are developing new frameworks to evaluate the susceptibility of Vision-Language Models (VLMs) to multimodal persuasion and visual influences. One study introduces MMPersuade to test agent-to-agent persuasion…
RESEARCH · CL_62173 · May 29 · 00:00

New framework detects robot execution failures using trajectory data

Researchers have developed a new framework called Hide-and-Seek to improve the reliability of robots using Vision-Language-Action (VLA) models. This method detects execution failures by identifying specific actions that…
TOOL · CL_51592 · May 26 · 04:00

New X-Foresight model enhances VLA systems with predictive world modeling

Researchers have developed X-Foresight, a new predictive world model integrated into Vision-Language-Action (VLA) models. This model aims to equip VLA systems with physical world knowledge by predicting future video seq…
TOOL · CL_51142 · May 26 · 04:00

VLA-Pruner enhances embodied AI efficiency by optimizing visual token pruning

Researchers have developed VLA-Pruner, a new method to make Vision-Language-Action (VLA) models more efficient for embodied AI tasks. Existing visual token pruning techniques, designed for Vision-Language Models, degrad…
TOOL · CL_44730 · May 22 · 04:00

New RAW-Dream paradigm enables zero-shot VLA model adaptation

Researchers have introduced RAW-Dream, a new paradigm for adapting Vision-Language-Action (VLA) models without task-specific data. This approach leverages a pre-trained, task-agnostic world model for predicting future t…
RESEARCH · CL_42457 · May 20 · 17:34

Driving AI models show reasoning fragility under sensor perturbations

A new research paper titled "Lost in Fog" investigates the reasoning fragility of Vision-Language-Action (VLA) models in autonomous driving. The study subjected the Alpamayo R1 model to various sensor perturbations, inc…
TOOL · CL_32722 · May 14 · 17:51

HandITL method improves robotic hand manipulation via seamless intervention

Researchers have developed a new method called Hand-in-the-Loop (HandITL) to improve the performance of Vision-Language-Action (VLA) models in complex robotic manipulation tasks. This technique addresses the issue of "g…
TOOL · CL_29381 · May 12 · 16:16

RAW-Dream enables zero-shot VLA adaptation via task-agnostic world models

Researchers have introduced RAW-Dream, a novel approach to adapt Vision-Language-Action (VLA) models for new tasks using reinforcement learning within task-agnostic world models. This method disentangles world model lea…
TOOL · CL_29433 · May 12 · 08:27

DreamAvoid framework prevents VLA model failures in robotics

Researchers have developed DreamAvoid, a novel framework designed to prevent failures in Vision-Language-Action (VLA) models during critical manipulation tasks. The system uses a "dreaming" process at test time to antic…
TOOL · CL_27521 · May 11 · 07:11

Robotic VLAs learn from past successes with new adaptation method

Researchers have developed a new framework called Retrieve-then-Steer to improve the reliability of Vision-Language-Action (VLA) models in robotic manipulation tasks. This method allows a partially competent, frozen VLA…

GEAR-VLA framework enhances robotic manipulation generalization

ActionMap improves robot policy learning with voxel heatmap

Hugging Face paper: Robots need better data interfaces, not just bigger models

VISTA framework improves robot training with validated data

New S2 framework boosts VLA model generalization with evidence budgets

New TRAP attack hijacks VLA models via adversarial patches

New research probes VLM susceptibility to visual persuasion and influence

New framework detects robot execution failures using trajectory data

New X-Foresight model enhances VLA systems with predictive world modeling

VLA-Pruner enhances embodied AI efficiency by optimizing visual token pruning

New RAW-Dream paradigm enables zero-shot VLA model adaptation

Driving AI models show reasoning fragility under sensor perturbations

HandITL method improves robotic hand manipulation via seamless intervention

RAW-Dream enables zero-shot VLA adaptation via task-agnostic world models

DreamAvoid framework prevents VLA model failures in robotics

Robotic VLAs learn from past successes with new adaptation method