ENTITY diffusion model

diffusion model

PulseAugur coverage of diffusion model — every cluster mentioning diffusion model across labs, papers, and developer communities, ranked by signal.

Total · 30d

51 over 90d

Releases · 30d

0 over 90d

Papers · 30d

50 over 90d

TIER MIX · 90D

research 21
tool 29
commentary 1

SENTIMENT · 30D

3 day(s) with sentiment data

RECENT · PAGE 1/2 · 23 TOTAL

TOOL · CL_27495 · May 11 · 09:23

Cold diffusion tackles percussive audio dereverberation

Researchers have developed a novel cold diffusion framework to address the challenge of dereverberating percussive audio signals, such as drums, which have been largely overlooked in favor of speech processing. This new…
TOOL · CL_27730 · May 10 · 00:32

New theory resolves instability in MeanFlow generative models

Researchers have developed a theoretical framework to address instability issues in MeanFlow training, a one-step generative modeling technique. They identified that the conditional velocity field is misused in the loss…
TOOL · CL_25766 · May 8 · 15:09

New BRIDGE method improves local image editing by controlling mask influence

Researchers have developed a new method called BRIDGE for local image editing, which aims to modify specific regions of an image while keeping the background intact. This approach tackles the issue of "mask-shape bias,"…
TOOL · CL_20798 · May 7 · 04:00

X-Cache accelerates world model inference for autonomous driving simulations

Researchers have developed X-Cache, a novel method to accelerate the inference of autoregressive world models used in autonomous driving simulations. This technique caches residual computations across generation chunks …
TOOL · CL_19909 · May 6 · 19:34

AI researchers explore learning the integral of diffusion models

A new paper explores the mathematical concept of integrating diffusion models, which are foundational to many generative AI systems. The research delves into the theoretical underpinnings of these models, potentially le…
TOOL · CL_16005 · May 5 · 04:00

StyleShield framework evades AI content detectors with controllable style transfer

Researchers have developed StyleShield, a novel framework that manipulates text style in the continuous token embedding space to evade AI-generated content detectors. This method utilizes a DiT backbone with cross-atten…
RESEARCH · CL_18705 · May 5 · 00:48

Ortho-Hydra paper introduces new method to improve LoRA fine-tuning for diffusion transformers

Researchers have introduced Ortho-Hydra, a novel re-parameterization technique designed to improve LoRA fine-tuning for diffusion transformers (DiT) on multi-style data. This method addresses the issue of 'style bleed' …
RESEARCH · CL_15510 · May 4 · 14:26

Mamoda2.5 model integrates multimodal AI with efficient DiT-MoE for top video editing

Researchers have introduced Mamoda2.5, a unified AR-Diffusion framework designed for multimodal understanding and generation. This model utilizes a Diffusion Transformer backbone enhanced with a Mixture-of-Experts (MoE)…
RESEARCH · CL_14333 · May 4 · 04:00

New AI methods enhance time series forecasting accuracy and interpretability

Researchers have introduced several new methods for time-series forecasting, aiming to improve accuracy and generalization. MeLISA, a latent-free autoregressive model, enhances rollout efficiency and long-horizon statis…
RESEARCH · CL_14344 · May 4 · 04:00

Video Generation with Predictive Latents

Researchers have developed several new methods to improve the efficiency and quality of visual generative models. DC-DiT introduces dynamic chunking to Diffusion Transformers, adaptively compressing visual data for fast…
RESEARCH · CL_11834 · May 1 · 04:00

YOSE framework speeds up video object removal with token selection

Researchers have developed YOSE, a new framework designed to significantly speed up video object removal using Diffusion Transformer (DiT) models. YOSE achieves this efficiency by adaptively selecting only the essential…
RESEARCH · CL_11361 · Apr 30 · 14:53

Researchers release TripVVT dataset and framework for in-the-wild video virtual try-on

Researchers have introduced TripVVT, a new framework for in-the-wild video virtual try-on, addressing limitations caused by scarce data and improper mask usage. The system utilizes a Diffusion Transformer and a stable h…
RESEARCH · CL_10493 · Apr 30 · 08:26

Sora's architect Bill Peebles departs OpenAI citing commercialization and copyright issues

Bill Peebles, a key figure behind OpenAI's Sora and the inventor of the DiT architecture, has departed the company. His exit is attributed to OpenAI's aggressive push towards an IPO, which has reportedly shifted the com…
RESEARCH · CL_10165 · Apr 30 · 04:00

Omni2Sound model unifies video, text to audio generation with new dataset

Researchers have developed Omni2Sound, a unified diffusion model capable of generating audio from video, text, or a combination of both. The model addresses challenges in data scarcity and cross-task competition by intr…
RESEARCH · CL_10149 · Apr 30 · 04:00

New Keyframe-Driven Method Enhances Video Virtual Try-On Realism

Researchers have introduced KeyTailor, a new framework designed to improve video virtual try-on (VVT) by addressing challenges in capturing garment dynamics and maintaining background consistency. The method utilizes a …
RESEARCH · CL_09744 · Apr 29 · 14:01

X-WAM model unifies robotic action and 4D world synthesis with asynchronous denoising

Researchers have developed X-WAM, a novel Unified 4D World Model designed to integrate real-time robotic action execution with high-fidelity 4D world synthesis. This framework addresses limitations in previous models by…
RESEARCH · CL_08581 · Apr 29 · 04:00

UniSER foundation model unifies soft effects removal in images

Researchers have developed UniSER, a novel foundation model designed to address a variety of soft visual degradations in digital images, such as lens flare, haze, shadows, and reflections. Unlike previous specialized mo…
RESEARCH · CL_09783 · Apr 29 · 02:58

MetaSR framework uses Diffusion Transformer for adaptive metadata in generative super-resolution

Researchers have developed MetaSR, a novel framework for generative super-resolution that adaptively selects and injects relevant metadata to enhance image and video quality. This Diffusion Transformer-based approach is…
RESEARCH · CL_06435 · Apr 28 · 04:00

AI research advances 3D asset generation and anomaly detection for autonomous driving

Researchers have developed a novel approach called GenAssets for generating high-quality 3D assets from in-the-wild LiDAR and camera data, crucial for autonomous driving simulations. This method utilizes a "reconstruct-…
RESEARCH · CL_06609 · Apr 28 · 04:00

Audio-Omni framework unifies audio generation, editing, and understanding

Researchers have introduced Audio-Omni, a novel framework designed to unify audio understanding, generation, and editing across diverse domains like speech, music, and general sounds. This system integrates a frozen Mul…

Cold diffusion tackles percussive audio dereverberation

New theory resolves instability in MeanFlow generative models

New BRIDGE method improves local image editing by controlling mask influence

X-Cache accelerates world model inference for autonomous driving simulations

AI researchers explore learning the integral of diffusion models

StyleShield framework evades AI content detectors with controllable style transfer

Ortho-Hydra paper introduces new method to improve LoRA fine-tuning for diffusion transformers

Mamoda2.5 model integrates multimodal AI with efficient DiT-MoE for top video editing

New AI methods enhance time series forecasting accuracy and interpretability

Video Generation with Predictive Latents

YOSE framework speeds up video object removal with token selection

Researchers release TripVVT dataset and framework for in-the-wild video virtual try-on

Sora's architect Bill Peebles departs OpenAI citing commercialization and copyright issues

Omni2Sound model unifies video, text to audio generation with new dataset

New Keyframe-Driven Method Enhances Video Virtual Try-On Realism

X-WAM model unifies robotic action and 4D world synthesis with asynchronous denoising

UniSER foundation model unifies soft effects removal in images

MetaSR framework uses Diffusion Transformer for adaptive metadata in generative super-resolution

AI research advances 3D asset generation and anomaly detection for autonomous driving

Audio-Omni framework unifies audio generation, editing, and understanding