ENTITY AI accelerator

AI accelerator

PulseAugur coverage of AI accelerator — every cluster mentioning AI accelerator across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

35 over 90d

Releases · 30d

0 over 90d

Papers · 30d

10 over 90d

TIER MIX · 90D

significant 6
research 9
tool 20

TOPICS

RELATIONSHIPS

SENTIMENT · 30D

9 day(s) with sentiment data

RECENT · PAGE 2/2 · 35 TOTAL

SIGNIFICANT · CL_07216 · Apr 28 · 07:47

Google Cloud's AI compute market share rises with surging TPU demand

Google Cloud's market share is projected to increase significantly by 2026, driven by a massive surge in demand for Tensor Processing Units (TPUs). The company is expected to control a quarter of the global AI computing…
RESEARCH · CL_08328 · Apr 28 · 07:42

AHASD architecture boosts LLM speculative decoding on mobile devices

Researchers have developed AHASD, a novel asynchronous heterogeneous architecture designed to optimize large language model (LLM) inference on mobile devices. This architecture employs task-level decoupling for parallel…
SIGNIFICANT · CL_07091 · Apr 28 · 05:22

Google unveils TPU V8 with two chips for training and inference at massive scale

Google has unveiled its eighth-generation Tensor Processing Units (TPUs), marking a significant shift by introducing two distinct chip designs for the first time. These new TPUs are engineered for specific, crucial task…
RESEARCH · CL_06821 · Apr 28 · 04:00

Tessera offers secure, near-line-rate weight streaming for edge AI accelerators

Researchers have developed Tessera, a new architecture designed to securely stream model weights to edge accelerators in Unified Memory Architecture (UMA) systems. This approach addresses the challenge of protecting pro…
RESEARCH · CL_14463 · Apr 27 · 04:00

New research explores LLM security, efficiency, and training optimization

Researchers are developing novel methods to enhance the efficiency and security of Large Language Models (LLMs). One approach, "Widening the Gap," exploits outlier injection to compromise LLM quantization, demonstrating…
RESEARCH · CL_06213 · Apr 27 · 03:48

New techniques ZipCCL and FlashOverlap accelerate LLM training by optimizing communication

Researchers have developed ZipCCL, a lossless compression library designed to accelerate the distributed training of large language models by addressing communication bottlenecks. The library utilizes novel techniques l…
SIGNIFICANT · CL_03252 · Apr 24 · 22:20

Google Cloud Next unveils new TPUs, Gemini Enterprise Agent Platform

Google Cloud has announced new AI innovations, including their eighth-generation Tensor Processing Units (TPUs) designed for both inference and reasoning. The company also unveiled the Gemini Enterprise Agent Platform, …
SIGNIFICANT · CL_02794 · Apr 24 · 12:00

AMD gains server CPU revenue share; Arm AGI CPUs secure $2B; Meta expands AWS Graviton use

AMD has achieved a significant milestone, capturing 46.2% of the x86 server CPU revenue share in Q1 2026, while Intel maintains a 70.4% share of the overall consumer PC market. In related news, Arm's new AGI CPUs have g…
RESEARCH · CL_03108 · Apr 22 · 22:00

Photonic processors offer energy-efficient alternative for deep learning computations

The future of deep learning may involve photonic processors that use light instead of electrons to perform calculations. This approach aims to reduce the significant energy demands of current neural networks, which rely…
SIGNIFICANT · CL_03621 · Apr 22 · 12:15

Google launches new TPUs for AI training and inference

Google has unveiled its eighth-generation Tensor Processing Units (TPUs), featuring two specialized chips: TPU 8t for training and TPU 8i for inference. These new chips are designed to enhance the capabilities of AI mod…
TOOL · CL_17328 · Apr 22 · 12:00

CoreWeave enhances multi-cloud AI stack with Google Cloud interconnect and unified orchestration

CoreWeave has announced a suite of services aimed at simplifying multi-cloud AI infrastructure, including a direct interconnect with Google Cloud to reduce deployment times. The company also introduced SUNK Anywhere, a …
RESEARCH · CL_17329 · Apr 22 · 00:35

Microsoft and Google adapt data center planning for volatile AI demand

Microsoft and Google are adapting their data center planning strategies to accommodate the unpredictable nature of AI workloads. Both companies are moving away from fixed roadmaps towards continuous rebalancing, utilizi…
SIGNIFICANT · CL_02658 · Apr 21 · 15:22

Anthropic commits $200B to Google Cloud for AI compute

Anthropic has committed to spending approximately $200 billion over the next five years on Google Cloud services and next-generation TPUs, with the compute capacity expected to come online starting in 2027. This deal si…
RESEARCH · CL_01721 · Nov 17 · 15:09

Google DeepMind launches WeatherNext 2, an AI model for faster, more accurate weather forecasts

Google DeepMind has unveiled WeatherNext 2, an advanced AI model for weather forecasting that generates predictions 8x faster and with hourly resolution. This new model, built on a Functional Generative Network (FGN) ap…
RESEARCH · CL_01569 · May 26 · 00:00

Graphcore and Hugging Face partner to offer new IPU-ready AI models

Graphcore has partnered with Hugging Face to optimize its Intelligence Processing Unit (IPU) hardware for transformer models. This collaboration aims to improve the efficiency and performance of training and deploying l…

Google Cloud's AI compute market share rises with surging TPU demand

AHASD architecture boosts LLM speculative decoding on mobile devices

Google unveils TPU V8 with two chips for training and inference at massive scale

Tessera offers secure, near-line-rate weight streaming for edge AI accelerators

New research explores LLM security, efficiency, and training optimization

New techniques ZipCCL and FlashOverlap accelerate LLM training by optimizing communication

Google Cloud Next unveils new TPUs, Gemini Enterprise Agent Platform

AMD gains server CPU revenue share; Arm AGI CPUs secure $2B; Meta expands AWS Graviton use

Photonic processors offer energy-efficient alternative for deep learning computations

Google launches new TPUs for AI training and inference

CoreWeave enhances multi-cloud AI stack with Google Cloud interconnect and unified orchestration

Microsoft and Google adapt data center planning for volatile AI demand

Anthropic commits $200B to Google Cloud for AI compute

Google DeepMind launches WeatherNext 2, an AI model for faster, more accurate weather forecasts

Graphcore and Hugging Face partner to offer new IPU-ready AI models