PulseAugur
EN
LIVE 20:10:19
ENTITY mixture of experts

mixture of experts

PulseAugur coverage of mixture of experts — every cluster mentioning mixture of experts across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
100
100 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
76
76 over 90d
TIER MIX · 90D
TOPICS
RELATIONSHIPS
TIMELINE
  1. 2026-05-11 research_milestone A new paper proposes an enhanced Mixture-of-Experts framework for faster time series forecasting model training. source
SENTIMENT · 30D

19 day(s) with sentiment data

RECENT · PAGE 3/5 · 100 TOTAL
  1. RESEARCH · CL_41759 ·

    New tool DODOCO reveals flaws in MoE model dispatch benchmarks

    A new research paper introduces DODOCO, a tool designed to diagnose overhead in dispatch operations for Mixture-of-Experts (MoE) models. The study found that common assumptions about workload representation in benchmark…

  2. TOOL · CL_41905 ·

    New HDMoE framework enhances cancer survival prediction with multimodal data

    Researchers have developed a new framework called HDMoE to improve multimodal cancer survival prediction. This hierarchical decoupling-fusion mixture-of-experts approach aims to better integrate data from sources like w…

  3. RESEARCH · CL_41793 ·

    Dynamic TMoE framework improves time series forecasting with adaptive experts

    Researchers have developed Dynamic TMoE, a novel framework designed to improve non-stationary time series forecasting. This approach addresses the limitations of existing Mixture-of-Experts (MoE) models by dynamically a…

  4. RESEARCH · CL_41804 ·

    Vision MoE models show stable animate-inanimate expert specialization

    Researchers have developed new methods to analyze the internal workings of Mixture-of-Experts (MoE) models in computer vision. Their work moves beyond simply examining how data is routed to specific "experts" within the…

  5. TOOL · CL_41191 ·

    New MoE framework enhances brain decoding with network-aware experts

    Researchers have developed FPED, a novel Mixture-of-Experts (MoE) framework designed for interpretable brain decoding using fMRI data. This approach explicitly models different functional brain networks as specialized e…

  6. FRONTIER RELEASE · CL_33854 ·

    DeepSeek V4 debuts with MegaMoE optimizations for efficient MoE

    DeepSeek has released its V4 model, featuring significant optimizations through a new system called MegaMoE. This system utilizes a 1400-line fused CUDA kernel to enhance performance by fine-grained pipelining of commun…

  7. RESEARCH · CL_36345 ·

    New $\phi$-balancing framework improves MoE model training

    Researchers have introduced a new framework called $\phi$-balancing to improve the training of Mixture-of-Experts (MoE) models. This method aims to achieve better expert utilization by directly targeting population-leve…

  8. RESEARCH · CL_32718 ·

    MetaMoE unifies private MoE models using public proxy data

    Researchers have introduced MetaMoE, a novel framework designed to unify independently trained Mixture-of-Experts (MoE) models without requiring access to private client data. The system utilizes public proxy data to ap…

  9. COMMENTARY · CL_29758 ·

    MoE architectures are workarounds for LLM training instability, not ideal solutions

    Mixture-of-Experts (MoE) architectures are often presented as an efficient solution for scaling large language models, but this analysis argues they are primarily a workaround for training instability in dense transform…

  10. RESEARCH · CL_28307 ·

    New research optimizes Sparse Mixture-of-Experts for efficient LLM scaling

    Researchers are exploring new methods to optimize Sparse Mixture-of-Experts (SMoE) models, which are crucial for scaling large language models efficiently. One paper reveals a geometric coupling between routers and expe…

  11. TOOL · CL_27710 ·

    New MoE framework speeds up time series forecasting training

    Researchers have developed a new Mixture-of-Experts (MoE) framework designed to accelerate the training of time series forecasting models. This method integrates expert-specific loss information directly into the traini…

  12. RESEARCH · CL_25314 ·

    EMO AI Model Achieves High Performance with Minimal Experts

    Researchers from the Allen Institute for AI and UC Berkeley have developed a new Mixture-of-Experts (MoE) model architecture named EMO. This model achieves nearly full performance while utilizing only 12.5% of its avail…

  13. SIGNIFICANT · CL_23645 ·

    DeepSeek releases open-source coding model matching GPT-4o

    DeepSeek has released V3-0324, an open-source coding model that matches or surpasses leading models like GPT-4o and Claude 3.5 Sonnet in coding performance. This Mixture-of-Experts model, with 671 billion total paramete…

  14. RESEARCH · CL_25612 ·

    New research explores speculative decoding for faster LLM inference

    Multiple research papers published on arXiv explore advancements in speculative decoding for Large Language Models (LLMs). These studies focus on improving inference speed and efficiency by using a smaller "draft" model…

  15. TOOL · CL_25610 ·

    MoE models misroute tokens on complex reasoning tasks, study finds

    Researchers have identified a significant issue in Mixture-of-Experts (MoE) language models where the routing mechanism, which directs tokens to specific experts, often selects suboptimal paths. While the standard route…

  16. TOOL · CL_22046 ·

    New MoE inference design uses pooled HBM to cut communication latency on Ascend

    Researchers have developed a new communication design for Mixture-of-Experts (MoE) inference on Ascend systems, aiming to reduce bottlenecks in token exchange. This approach eliminates intermediate relay and reordering …

  17. TOOL · CL_21909 ·

    Graph Normalization offers differentiable approximation for NP-hard MWIS problem

    Researchers have developed Graph Normalization (GN), a novel dynamical system that approximates the NP-hard Maximum Weight Independent Set (MWIS) problem. GN offers a principled and differentiable approach, converging t…

  18. TOOL · CL_21907 ·

    New research explores finite expert banks for communication-efficient MoE architectures

    Researchers have developed a new framework for analyzing sparse Mixture-of-Experts (MoE) architectures, focusing on communication efficiency. They propose treating the MoE gate as a stochastic channel and quantifying ro…

  19. RESEARCH · CL_22189 ·

    EMO model enables modularity in large language models with selective expert use

    Researchers have developed EMO, a novel Mixture-of-Experts (MoE) model designed for emergent modularity. Unlike traditional monolithic large language models, EMO activates only specific subsets of its parameters for dif…

  20. RESEARCH · CL_21995 ·

    New SAMoE-C method improves CSI-based HAR with scene-adaptive experts

    Researchers have developed a new method called Scene-Adaptive Mixture of Experts with Clustered Specialists (SAMoE-C) to improve human activity recognition using channel state information (CSI). This approach addresses …