ENTITY vision transformer

PulseAugur coverage of the vision transformer: every cluster that mentions it across labs, papers, and developer communities, ranked by signal.

Total · 30d

55 over 90d

Releases · 30d

0 over 90d

Papers · 30d

55 over 90d

SENTIMENT · 30D

16 days with sentiment data

RECENT · PAGE 2/3 · 55 TOTAL

TOOL · CL_40922 · May 19 · 12:12

New anomaly detection uses vision transformers for autonomous driving

Researchers have developed a new anomaly detection method for autonomous driving that uses pre-trained vision transformer embeddings. This approach models normality from a single reference image, avoiding the need for e…
TOOL · CL_38387 · May 19 · 04:00

CutMix training protocol induces spatial locality in Vision Transformers

Researchers have found that specific training techniques can encourage spatial locality in Vision Transformers. By using a 'Modern' protocol involving data augmentation like CutMix and ColorJitter, along with label smoo…
TOOL · CL_38820 · May 18 · 15:22

LESSViT architecture improves hyperspectral model generalization across sensors

Researchers have developed LESSViT, a novel architecture for hyperspectral imagery that addresses the challenge of generalizing models across different sensors. This Low-rank Efficient Spatial-Spectral ViT uses a struct…
TOOL · CL_37948 · May 18 · 10:20

TokenMask improves vision transformer segmentation efficiency

Researchers have developed TokenMask, a novel approach for vision transformer segmentation that bypasses the need for explicit image-space reconstruction. This method computes mask logits directly from query-token affin…
TOOL · CL_38007 · May 18 · 02:02

New GLIA framework enhances Vision Transformer use in image quality assessment

Researchers have developed a new framework called the Global-Local Interaction Adapter (GLIA) to improve Blind Image Quality Assessment (BIQA). This method leverages pre-trained Vision Transformers by using a dual-strea…
TOOL · CL_31312 · May 13 · 17:20

VoxCor method enables training-free volumetric features for medical imaging

Researchers have developed VoxCor, a novel method for creating reusable volumetric feature representations from pre-trained 2D Vision Transformer models. This training-free approach combines triplanar inference with a w…
TOOL · CL_29284 · May 12 · 12:08

What-Where Transformer separates object appearance from location

Researchers have introduced the What-Where Transformer (WWT), a novel visual backbone designed to better separate object appearance from spatial location. This new architecture uses a slot-based design where tokens repr…
TOOL · CL_27971 · May 11 · 17:51

Diffusion augmentation boosts Bangla character recognition accuracy

Researchers have developed a confidence-guided diffusion augmentation method to improve the recognition of handwritten Bangla compound characters. This approach uses diffusion models to generate high-quality synthetic c…
TOOL · CL_27505 · May 11 · 08:35

Foundation model learns from Dutch satellite data for global benchmarks

Researchers have developed a new foundation model for high-resolution remote sensing data, specifically trained on satellite images of the Netherlands. This model combines Convolutional Neural Networks and Vision Transf…
TOOL · CL_22428 · May 8 · 04:00

LC4-DViT uses generative AI and transformers for accurate land-cover mapping

Researchers have developed LC4-DViT, a novel framework for land-cover classification using a deformable Vision Transformer. This approach combines generative data creation with a deformation-aware backbone to improve ac…
TOOL · CL_22391 · May 8 · 04:00

New framework fuses facial and physiological signals for better emotion recognition

Researchers have developed a new framework for video-based emotion recognition that combines facial expressions with physiological signals from remote photoplethysmography (rPPG). Their method uses prompt tuning to inte…
TOOL · CL_21919 · May 8 · 04:00

Researchers develop robust foundation model for conservation laws using recurrent Vision Transformers

Researchers have developed a new architecture that enhances Flux Neural Operators (Flux NO) by incorporating context through Recurrent Vision Transformers. This hypernetwork model extracts solution dynamics over time, e…
RESEARCH · CL_20294 · May 6 · 14:12

DART vision-language model offers comprehensive rope condition monitoring

Researchers have developed DART, a vision-language foundation model designed for comprehensive rope condition monitoring. This model integrates a Vision Transformer with Llama-3.2-3B-Instruct to handle the entire inspec…
TOOL · CL_18721 · May 6 · 04:00

Hebbian Fast Weights enhance Vision Transformers for few-shot character recognition

Researchers have developed a new approach to few-shot character recognition by integrating Hebbian Fast-Weight (HFW) modules into Vision Transformer architectures. This method aims to mimic biological neural systems' ab…
RESEARCH · CL_18667 · May 5 · 17:21

RD-ViT cuts data needs for segmentation, outperforming standard ViT with fewer parameters

Researchers have developed RD-ViT, a novel Recurrent-Depth Vision Transformer designed for semantic segmentation tasks. This architecture significantly reduces data dependence by using a single, shared transformer block…
RESEARCH · CL_18682 · May 5 · 13:05

OneTrackerV2 unifies multimodal visual tracking with Dual Mixture-of-Experts

Researchers have developed a new event-based visual object tracking framework that addresses limitations of existing methods by explicitly modeling event density variations across multiple temporal scales. This approach…
TOOL · CL_16148 · May 5 · 04:00

Researchers develop AI framework for fluid-structure interaction prediction

Researchers have developed a new machine learning framework for predicting fluid-structure interactions (FSI) over long periods on deforming meshes. The system integrates a graph neural operator with a Vision Transforme…
TOOL · CL_16142 · May 5 · 04:00

New framework enhances 3D ocean temperature reconstruction using AI

Researchers have developed an adaptive framework using spatiotemporal clustering to reconstruct 3D ocean subsurface temperature from surface observations. This method integrates with deep learning models like DP-CNN, At…
TOOL · CL_15745 · May 5 · 04:00

Researchers adapt Vision Transformers for fMRI analysis using flat maps

Researchers have developed a new family of models called CortexMAE, which adapt Vision Transformers for analyzing functional MRI data by projecting 3D volumes into 2D flat maps. This approach, tested on over 2,000 hours…
RESEARCH · CL_15610 · May 5 · 04:00

AI models advance plant disease detection with new datasets and efficient distillation

Researchers have developed new methods for plant leaf disease classification to aid in early detection and treatment. One approach involves training a new base model using the DenseNet201 architecture on a custom datase…
