ENTITY
llama-server
PulseAugur coverage of llama-server: every cluster mentioning it across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 2 TOTAL
- Quantized Qwen3.6-27B model achieves 100k context on 16GB VRAM
A user on Reddit's r/LocalLLaMA has detailed a method for running the Qwen3.6-27B model on a system with 16GB of VRAM, achieving a context length of 100,000 tokens. The process involves creating a custom GGUF quantizati…
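  The summary doesn't include the exact command, but long-context setups like this typically hinge on llama-server's GPU-offload and KV-cache quantization flags. A minimal sketch under stated assumptions (the GGUF filename and flag values below are illustrative, not from the post):

  ```shell
  # Hypothetical invocation; the post's exact GGUF file and settings are assumed.
  # -c sets the context window to 100k tokens.
  # -ngl 99 offloads all layers to the GPU.
  # -fa enables flash attention, required for a quantized V cache.
  # --cache-type-k/--cache-type-v quantize the KV cache so the long
  # context fits alongside the model weights in 16 GB of VRAM.
  llama-server \
    -m qwen3.6-27b-custom.gguf \
    -c 100000 \
    -ngl 99 \
    -fa \
    --cache-type-k q8_0 \
    --cache-type-v q8_0 \
    --port 8080
  ```

  With 8-bit K and V caches the KV memory footprint roughly halves versus f16, which is the usual lever for stretching context on a fixed VRAM budget.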
- Qwen3.6-27B model offers flagship coding performance in a smaller package
Qwen has released Qwen3.6-27B, an open-weight model that reportedly matches flagship-level coding performance. This new model significantly outperforms its predecessor, Qwen3.5-397B-A17B, while being substantially small…