Swin Transformer
PulseAugur coverage of Swin Transformer — every cluster mentioning Swin Transformer across labs, papers, and developer communities, ranked by signal.
3 day(s) with sentiment data
-
New AMN network improves nuclei segmentation in histopathology images
Researchers have developed AMN, an Adaptive Multi-Scale Fusion Network designed for precise nuclei segmentation in histopathology images. This dual-encoder framework uniquely combines a Swin Transformer and a ResNet-50 …
-
New Transformer Model Enhances Metal Defect Detection
Researchers have developed a new framework called Contrastive Augmented Transformer (CAT) to improve the detection of metal surface defects in industrial manufacturing. This framework utilizes a hierarchical Swin Transf…
-
SMIT method leads in transferability for medical image segmentation
Researchers have benchmarked nine self-supervised learning (SSL) methods for their transferability in medical image segmentation tasks. The study found that the Self-Distilled Masked Image Transformer (SMIT) method, whi…
-
AI model accurately detects rectal tumor regrowth from endoscopy images
Researchers have developed a novel Siamese Swin Transformer with Dual Cross-Attention (SSDCA) designed to detect local regrowth of rectal tumors from endoscopic images. This model analyzes sequential images from patient…
-
New MorphoFormer AI model improves building height and footprint estimation
Researchers have developed MorphoFormer, a novel framework for jointly estimating building height and footprint using remote sensing data. This approach explicitly encodes the relationship between these two parameters, …
-
TwistNet-2D learns second-order channel interactions for texture recognition
Researchers have developed TwistNet-2D, a novel module designed to enhance texture recognition by capturing second-order channel interactions. This module computes local pairwise channel products with directional spatia…
-
InfiltrNet combines CNN and Transformer for brain tumor infiltration risk prediction
Researchers have developed InfiltrNet, a novel dual-branch architecture designed to predict brain tumor infiltration risk. This system combines a CNN encoder with a Swin Transformer encoder, utilizing cross-attention fu…
-
Vision Transformers leverage DCT for improved attention and efficiency
Researchers have developed a novel approach using the Discrete Cosine Transform (DCT) to enhance Vision Transformers. This method includes a DCT-based initialization strategy for self-attention, which improves classific…
-
New MSR framework improves CT-MRI cervical spine registration with hybrid modeling
Researchers have developed a new framework called MSR for rigid-deformable hybrid modeling in CT-MRI registration of the cervical spine. This approach combines rigid alignment of individual vertebrae with deformable mod…
-
100,000 Yuan Investment: Latest Interview with Princeton's Zhuang Liu: Architecture Isn't That Important, Data is King
Princeton Assistant Professor Liu Zhuang argues that AI architecture is less critical than previously thought, with data scale and diversity being the primary drivers of progress. In a recent interview, he highlighted t…
-
New AI models enhance hyperspectral image analysis for classification and super-resolution
Researchers have developed several new deep learning models for hyperspectral image analysis. The Dual-stage Spectrum-Constrained Clustering-based Classifier (DSCC) framework aims to improve classification accuracy by d…
-
A Graph-Augmented knowledge Distillation based Dual-Stream Vision Transformer with Region-Aware Attention for Gastrointestinal Disease Classification with Explainable AI
Researchers have developed a novel dual-stream deep learning framework for classifying gastrointestinal diseases from medical imagery. This system utilizes a teacher-student knowledge distillation approach, combining a …
-
New method improves parameter-efficient multi-task learning for AI models
Researchers have developed a new parameter-efficient method for multi-task learning in computer vision. Their approach, called progressive task-specific adaptation, uses adapter modules that are shared in earlier layers…
-
New methods QFlash and ELSA boost Vision Transformer attention efficiency
Researchers have developed two new methods to improve the efficiency of attention mechanisms in vision transformers. QFlash focuses on enabling integer-only operations for FlashAttention, achieving significant speedups …