PulseAugur
LIVE 08:20:32
ENTITY ggml-org

ggml-org

PulseAugur coverage of ggml-org — every cluster mentioning ggml-org across labs, papers, and developer communities, ranked by signal.

Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
0
0 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 1 TOTAL
  1. TOOL · CL_03576 ·

    llama.cpp CUDA pull request optimizes MMQ stream-k overhead for MoE models

    A pull request to the llama.cpp project aims to reduce overhead in CUDA's MMQ stream-k operations. This optimization targets Mixture of Experts (MoE) models, potentially leading to faster prompt processing speeds. The c…