PulseAugur
EN
LIVE 21:14:05
ENTITY Cutlass

Cutlass

PulseAugur coverage of Cutlass — every cluster mentioning Cutlass across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
4
4 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

2 day(s) with sentiment data

RECENT · PAGE 1/1 · 4 TOTAL
  1. TOOL · CL_75452 ·

    CUDA/C++ inference engine built for NVIDIA's DVLT 3D model

    A new inference engine called dvlt.cu has been developed from scratch using CUDA/C++ for NVIDIA's DVLT 3D transformer model. This standalone 5MB binary has minimal dependencies, relying only on cuBLASLt and the header-o…

  2. TOOL · CL_51969 ·

    TileLang simplifies GPU kernel writing with Python interface

    A new programming language called TileLang aims to simplify GPU kernel development by offering a middle ground between high-level frameworks like Triton and low-level control like CUTLASS. TileLang allows developers to …

  3. RESEARCH · CL_13517 ·

    CuTeDSL emerges as new GPU kernel path for LLM inference, challenging CUTLASS

    The landscape of GPU kernel engineering for LLM inference is shifting, with CuTeDSL emerging as a potential successor to C++ CuTe/CUTLASS. This evolution is highlighted by industry trends in technologies like FlashAtten…

  4. RESEARCH · CL_11176 ·

    Moonshot AI open-sources FlashKDA, boosting Kimi Delta Attention 2.5x on H200 GPUs

    Moonshot AI has released FlashKDA, an open-source implementation of Kimi Delta Attention. This new kernel achieves up to 2.5 times faster inference speeds on NVIDIA H200 GPUs. It is built using CUTLASS and optimized for…