DeepSeek has significantly reduced its API prices by up to 90% following the release of its V4 model. The company attributes these price cuts, which establish a new industry low, to its sparse attention architecture. This new architecture reportedly lowers per-token compute needs and supports context windows of up to 1 million tokens.
Summary written by gemini-2.5-flash-lite from 1 source.
IMPACT Accelerates the trend of falling AI inference costs, potentially enabling wider adoption of large-context-window models.
RANK_REASON Model release from a significant AI lab with a notable price reduction and technical innovation.