PulseAugur
EN
LIVE 21:27:05
ENTITY Vision-Language-Action (VLA) models

Vision-Language-Action (VLA) models

PulseAugur coverage of Vision-Language-Action (VLA) models — every cluster mentioning Vision-Language-Action (VLA) models across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
16
16 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
16
16 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

11 day(s) with sentiment data

RECENT · PAGE 1/1 · 16 TOTAL
  1. RESEARCH · CL_79104 ·

    GEAR-VLA framework enhances robotic manipulation generalization

    Researchers have developed GEAR-VLA, a new framework designed to improve the generalizability of Vision-Language-Action (VLA) models in robotic manipulation tasks. This approach addresses limitations in current VLA mode…

  2. RESEARCH · CL_76937 ·

    ActionMap improves robot policy learning with voxel heatmap

    Researchers have developed ActionMap, a novel voxel heatmap action head designed to improve robot policy learning in vision-language-action (VLA) models. This new head replaces the traditional action decoder, predicting…

  3. TOOL · CL_78002 ·

    Hugging Face paper: Robots need better data interfaces, not just bigger models

    A new position paper from Hugging Face argues that advancing robot intelligence requires more than just scaling existing Vision-Language-Action (VLA) models. The paper highlights the need for specialized interfaces to p…

  4. RESEARCH · CL_70319 ·

    VISTA framework improves robot training with validated data

    Researchers have developed VISTA, a framework designed to improve the training of Vision-Language-Action (VLA) models using real-world robot data. The framework addresses challenges such as distorted camera views and ph…

  5. TOOL · CL_68309 ·

    New S2 framework boosts VLA model generalization with evidence budgets

    Researchers have developed a new framework called S2 (See Less, Specify More) to enhance the generalization capabilities of vision-language-action (VLA) models. S2 refines the executor's training by preserving high-leve…

  6. TOOL · CL_68285 ·

    New TRAP attack hijacks VLA models via adversarial patches

    Researchers have developed a novel attack method called TRAP that exploits the Chain-of-Thought (CoT) reasoning in Vision-Language-Action (VLA) models. This attack uses adversarial patches, such as a tablecloth, to mani…

  7. RESEARCH · CL_62767 ·

    New research probes VLM susceptibility to visual persuasion and influence

    Researchers are developing new frameworks to evaluate the susceptibility of Vision-Language Models (VLMs) to multimodal persuasion and visual influences. One study introduces MMPersuade to test agent-to-agent persuasion…

  8. RESEARCH · CL_62173 ·

    New framework detects robot execution failures using trajectory data

    Researchers have developed a new framework called Hide-and-Seek to improve the reliability of robots using Vision-Language-Action (VLA) models. This method detects execution failures by identifying specific actions that…

  9. TOOL · CL_51592 ·

    New X-Foresight model enhances VLA systems with predictive world modeling

    Researchers have developed X-Foresight, a new predictive world model integrated into Vision-Language-Action (VLA) models. This model aims to equip VLA systems with physical world knowledge by predicting future video seq…

  10. TOOL · CL_51142 ·

    VLA-Pruner enhances embodied AI efficiency by optimizing visual token pruning

    Researchers have developed VLA-Pruner, a new method to make Vision-Language-Action (VLA) models more efficient for embodied AI tasks. Existing visual token pruning techniques, designed for Vision-Language Models, degrad…

  11. TOOL · CL_44730 ·

    New RAW-Dream paradigm enables zero-shot VLA model adaptation

    Researchers have introduced RAW-Dream, a new paradigm for adapting Vision-Language-Action (VLA) models without task-specific data. This approach leverages a pre-trained, task-agnostic world model for predicting future t…

  12. RESEARCH · CL_42457 ·

    Driving AI models show reasoning fragility under sensor perturbations

    A new research paper titled "Lost in Fog" investigates the reasoning fragility of Vision-Language-Action (VLA) models in autonomous driving. The study subjected the Alpamayo R1 model to various sensor perturbations, inc…

  13. TOOL · CL_32722 ·

    HandITL method improves robotic hand manipulation via seamless intervention

    Researchers have developed a new method called Hand-in-the-Loop (HandITL) to improve the performance of Vision-Language-Action (VLA) models in complex robotic manipulation tasks. This technique addresses the issue of "g…

  14. TOOL · CL_29381 ·

    RAW-Dream enables zero-shot VLA adaptation via task-agnostic world models

    Researchers have introduced RAW-Dream, a novel approach to adapt Vision-Language-Action (VLA) models for new tasks using reinforcement learning within task-agnostic world models. This method disentangles world model lea…

  15. TOOL · CL_29433 ·

    DreamAvoid framework prevents VLA model failures in robotics

    Researchers have developed DreamAvoid, a novel framework designed to prevent failures in Vision-Language-Action (VLA) models during critical manipulation tasks. The system uses a "dreaming" process at test time to antic…

  16. TOOL · CL_27521 ·

    Robotic VLAs learn from past successes with new adaptation method

    Researchers have developed a new framework called Retrieve-then-Steer to improve the reliability of Vision-Language-Action (VLA) models in robotic manipulation tasks. This method allows a partially competent, frozen VLA…