AI accelerator
PulseAugur coverage of AI accelerator — every cluster mentioning AI accelerator across labs, papers, and developer communities, ranked by signal.
- instance of Tensor Processing Unit 90%
- used by Tensor Processing Unit 70%
- used by Arm Holdings 70%
- used by central processing unit 70%
- instance of graphics processing unit 60%
- affiliated with Tensor Processing Unit 50%
- used by graphics processing unit 50%
- competes with central processing unit 50%
- competes with Arm Holdings 50%
9 day(s) with sentiment data
-
Google Cloud's AI compute market share rises with surging TPU demand
Google Cloud's market share is projected to increase significantly by 2026, driven by a massive surge in demand for Tensor Processing Units (TPUs). The company is expected to control a quarter of the global AI computing…
-
AHASD architecture boosts LLM speculative decoding on mobile devices
Researchers have developed AHASD, a novel asynchronous heterogeneous architecture designed to optimize large language model (LLM) inference on mobile devices. This architecture employs task-level decoupling for parallel…
-
Google unveils TPU V8 with two chips for training and inference at massive scale
Google has unveiled its eighth-generation Tensor Processing Units (TPUs), marking a significant shift by introducing two distinct chip designs for the first time. These new TPUs are engineered for specific, crucial task…
-
Tessera offers secure, near-line-rate weight streaming for edge AI accelerators
Researchers have developed Tessera, a new architecture designed to securely stream model weights to edge accelerators in Unified Memory Architecture (UMA) systems. This approach addresses the challenge of protecting pro…
-
New research explores LLM security, efficiency, and training optimization
Researchers are developing novel methods to enhance the efficiency and security of Large Language Models (LLMs). One approach, "Widening the Gap," exploits outlier injection to compromise LLM quantization, demonstrating…
-
New techniques ZipCCL and FlashOverlap accelerate LLM training by optimizing communication
Researchers have developed ZipCCL, a lossless compression library designed to accelerate the distributed training of large language models by addressing communication bottlenecks. The library utilizes novel techniques l…
-
Google Cloud Next unveils new TPUs, Gemini Enterprise Agent Platform
Google Cloud has announced new AI innovations, including their eighth-generation Tensor Processing Units (TPUs) designed for both inference and reasoning. The company also unveiled the Gemini Enterprise Agent Platform, …
-
AMD gains server CPU revenue share; Arm AGI CPUs secure $2B; Meta expands AWS Graviton use
AMD has achieved a significant milestone, capturing 46.2% of the x86 server CPU revenue share in Q1 2026, while Intel maintains a 70.4% share of the overall consumer PC market. In related news, Arm's new AGI CPUs have g…
-
Photonic processors offer energy-efficient alternative for deep learning computations
The future of deep learning may involve photonic processors that use light instead of electrons to perform calculations. This approach aims to reduce the significant energy demands of current neural networks, which rely…
-
Google launches new TPUs for AI training and inference
Google has unveiled its eighth-generation Tensor Processing Units (TPUs), featuring two specialized chips: TPU 8t for training and TPU 8i for inference. These new chips are designed to enhance the capabilities of AI mod…
-
CoreWeave enhances multi-cloud AI stack with Google Cloud interconnect and unified orchestration
CoreWeave has announced a suite of services aimed at simplifying multi-cloud AI infrastructure, including a direct interconnect with Google Cloud to reduce deployment times. The company also introduced SUNK Anywhere, a …
-
Microsoft and Google adapt data center planning for volatile AI demand
Microsoft and Google are adapting their data center planning strategies to accommodate the unpredictable nature of AI workloads. Both companies are moving away from fixed roadmaps towards continuous rebalancing, utilizi…
-
Anthropic commits $200B to Google Cloud for AI compute
Anthropic has committed to spending approximately $200 billion over the next five years on Google Cloud services and next-generation TPUs, with the compute capacity expected to come online starting in 2027. This deal si…
-
Google DeepMind launches WeatherNext 2, an AI model for faster, more accurate weather forecasts
Google DeepMind has unveiled WeatherNext 2, an advanced AI model for weather forecasting that generates predictions 8x faster and with hourly resolution. This new model, built on a Functional Generative Network (FGN) ap…
-
Graphcore and Hugging Face partner to offer new IPU-ready AI models
Graphcore has partnered with Hugging Face to optimize its Intelligence Processing Unit (IPU) hardware for transformer models. This collaboration aims to improve the efficiency and performance of training and deploying l…