research · [2 sources] · 2026-05-10 10:01 · Deutsch(DE) RT @antirez: DS4 läuft auf DGX Spark (GB10 / CUDA), derzeit in einer privaten Branch. 12 Tokens pro Sekunde, die Speicherdurchsatz ist in diesem System auf 270

research

DS4 model runs on NVIDIA DGX Spark hardware at 12 tokens/sec

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 2 sources

The DS4 model is reportedly running on NVIDIA's DGX Spark hardware, utilizing GB10 and CUDA. Initial performance metrics indicate a speed of 12 tokens per second, with observed memory throughput limited to 270 GB/s. This setup is currently confined to a private branch, suggesting it is in an experimental or developmental phase. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT This indicates potential advancements in AI hardware utilization and performance benchmarks for large models.

RANK_REASON The cluster describes a model running on specific hardware, with performance metrics, which constitutes a research milestone or technical report.

Read on Mastodon — mastodon.social →

COVERAGE [2]

Mastodon — mastodon.social TIER_1 Deutsch(DE) · [email protected] · 2026-05-11 16:01

RT @antirez: DS4 runs on DGX Spark (GB10 / CUDA), for now in a private branch. 12 tokens/second, memory bandwidth is limited in this system

RT @antirez: DS4 läuft auf DGX Spark (GB10 / CUDA), vorerst in einem privaten Branch. 12 Tokens/Sekunde, die Speicherbandbreite ist in diesem System begrenzt auf 270 GB/Sekunde. Der Prefill-Prozess ist jedoch deutlich effizienter als beim M3 Max mit ~200 t/s. Ich werde es veröffe…
Mastodon — mastodon.social TIER_1 Deutsch(DE) · [email protected] · 2026-05-10 10:01

RT @antirez: DS4 runs on DGX Spark (GB10 / CUDA), currently in a private branch. 12 Tokens per second, memory throughput is 270 on this system

RT @antirez: DS4 läuft auf DGX Spark (GB10 / CUDA), derzeit in einer privaten Branch. 12 Tokens pro Sekunde, die Speicherdurchsatz ist in diesem System auf 270 GB/s begrenzt. Aber der Prefill ist deutlich besser auf den M3 Max (~200 t/s) abgestimmt. Ich werde es veröffentlichen, …

COVERAGE [2]

RT @antirez: DS4 runs on DGX Spark (GB10 / CUDA), for now in a private branch. 12 tokens/second, memory bandwidth is limited in this system

RT @antirez: DS4 runs on DGX Spark (GB10 / CUDA), currently in a private branch. 12 Tokens per second, memory throughput is 270 on this system

RELATED ENTITIES

RELATED TOPICS