PulseAugur / Pulse
LIVE 08:33:01

Pulse

last 48h
[50/57] 89 sources

What AI is actually talking about — clusters surfacing on Bluesky, Reddit, HN, Mastodon and Lobsters, re-ranked to elevate originality and crush noise.

  1. Nous Research Releases Token Superposition Training to Speed Up LLM Pre-Training by Up to 2.5x Across 270M to 10B Parameter Models

    Nous Research has developed Token Superposition Training (TST), a new method designed to significantly accelerate the pre-training of large language models. This technique can reduce pre-training time by up to 2.5 times for models ranging from 270 million to 10 billion parameters, without altering the model's architecture or how it performs inference. TST achieves this by modifying the training loop in two phases: an initial 'superposition' phase where token embeddings are averaged and processed in larger bags, followed by a 'recovery' phase that reverts to standard training. Experiments showed TST achieving lower final training loss with substantially less compute time compared to traditional methods. AI

    Nous Research Releases Token Superposition Training to Speed Up LLM Pre-Training by Up to 2.5x Across 270M to 10B Parameter Models

    IMPACT Accelerates LLM pre-training, potentially reducing compute costs and time for developing new large language models.

  2. Cisco Systems cuts 4,000 jobs despite hitting record $15.84B revenue. The company is spending $1B to pivot entirely toward AI infrastructure hardware as hypersc

    Cisco Systems is laying off 4,000 employees while simultaneously investing $1 billion to shift its focus towards AI infrastructure hardware. This strategic pivot is driven by the surging demand from hyperscale cloud providers. AI

    IMPACT Major tech companies are reallocating resources and workforce towards AI infrastructure, signaling a significant industry-wide shift.

  3. First, AI companies exploit their workers. Now, they want to exploit their resources. Microsoft's massive Kenya AI data center would require switching off 'half

    Microsoft's proposed $1 billion AI data center in Kenya is facing significant hurdles due to immense power demands. The project's electricity needs are so substantial that Kenyan officials estimate it could require shutting off power to half the country. Disagreements over power capacity and insufficient infrastructure have stalled the development. AI

    First, AI companies exploit their workers. Now, they want to exploit their resources. Microsoft's massive Kenya AI data center would require switching off 'half

    IMPACT Highlights the immense energy demands of AI infrastructure and potential conflicts with local resources and policy.

  4. Nevada electric company says it's going to cut off electricity to 50,000 people to use it for datacenters instead, tells multiple towns to take a hike https://

    A Nevada electric company plans to divert power from 50,000 residents to supply data centers. This decision affects multiple towns, which have been told to find alternative power sources. The move highlights the growing demand for energy to support data center infrastructure, particularly for AI. AI

    IMPACT Highlights the immense energy demands of AI infrastructure and potential conflicts with public utility needs.

  5. 🔬 Fervo Energy IPO Soars 33% as AI Data Centers Drive Demand Fervo Energy's IPO sees a 33% surge, fueled by AI data center demand, pushing its valuation past $1

    Fervo Energy's initial public offering experienced a significant 33% increase on its first day of trading. This surge in valuation, which propelled the company's worth beyond $10 billion, is largely attributed to the escalating demand for AI data centers. The company's focus on geothermal energy solutions is seen as a key factor in meeting this growing need for sustainable power in the AI infrastructure sector. AI

    🔬 Fervo Energy IPO Soars 33% as AI Data Centers Drive Demand Fervo Energy's IPO sees a 33% surge, fueled by AI data center demand, pushing its valuation past $1

    IMPACT The surge in Fervo Energy's valuation highlights the critical need for sustainable power solutions to support the exponential growth of AI data centers.

  6. Cerebras — Faster Tokens Please

    Cerebras Systems has announced a new wafer-scale engine designed to accelerate AI model training and inference. The company claims this new hardware significantly reduces the time required for processing tokens, a key metric in large language model performance. This advancement aims to address the growing computational demands of complex AI workloads. AI

    Cerebras — Faster Tokens Please

    IMPACT This new hardware could significantly speed up AI training and inference, potentially lowering costs and enabling more complex models.

  7. https://www. theregister.com/on-prem/2026/0 5/13/utah-mega-datacenter-could-dump-23-atomic-bombs-worth-of-energy-per-day/5239670 # ai # datacenter # energy

    A massive data center planned for Utah is projected to consume an enormous amount of energy, potentially releasing the equivalent of 23 atomic bombs' worth of heat daily. This facility is intended to support artificial intelligence operations, raising concerns about its environmental impact and energy demands. The project highlights the growing tension between the rapid expansion of AI infrastructure and the need for sustainable energy solutions. AI

    IMPACT Highlights the immense energy demands of AI infrastructure and the resulting environmental concerns.

  8. Wow. A # datacenter twice the size of # manhattan and they are going to use # lng to power it. # ai # utah # stratos # oleary # emissions https://www. theguardi

    A massive data center, projected to be twice the size of Manhattan, is planned for Utah. This facility will be powered by liquefied natural gas (LNG), a decision that has drawn criticism regarding its environmental impact and emissions. The project is moving forward despite concerns about its energy source and scale. AI

    IMPACT This massive data center's energy source and scale could influence future AI infrastructure development and environmental policy.

  9. 🤨 "Microsoft's massive Kenya # AI data center would require switching off 'half the country' to meet power requirements." government says. # tomshardware # Micr

    A proposed Microsoft AI data center in Kenya faces significant power challenges, with government officials stating it would necessitate shutting down power to half the country. The scale of the energy demand for the AI facility has raised concerns about its impact on the national grid and existing power supply. AI

    IMPACT Highlights the immense energy demands of AI infrastructure and potential conflicts with national power resources.

  10. The Deployment Company, Back to the 70s, Apple and Intel

    OpenAI has launched a new entity, the OpenAI Deployment Company, backed by over $4 billion in initial investment. This new venture aims to help organizations integrate and deploy AI systems by embedding specialized engineers. The move follows a trend of tech companies, including Google and Anthropic, establishing dedicated teams and partnerships to facilitate enterprise AI adoption. AI

    The Deployment Company, Back to the 70s, Apple and Intel

    IMPACT Accelerates enterprise AI adoption by providing dedicated deployment resources and expertise, potentially setting a new standard for AI integration services.

  11. RT by @ ZEK_Praha : Europe is building its # AI infrastructure. Czechia is now part of it. Today, Czech AI Factory ( # CZAI ) was officially launched in Ostrava

    The Czech AI Factory (CZAI) has officially launched in Ostrava, marking Czechia's entry into Europe's AI infrastructure development. This initiative is part of the broader EuroHPC AI Factories project, aiming to bolster AI services and supercomputing capabilities across the continent. The launch signifies a strategic move to integrate Czechia into the growing European AI ecosystem. AI

    IMPACT Strengthens European AI capabilities by providing dedicated infrastructure and services.

  12. In the face of an unprecedented race for AI resources, major tech companies like Microsoft and Google are funding the construction of factories at silicon suppliers.

    Major tech companies like Microsoft and Google are investing heavily in chip manufacturers to secure essential AI resources. This strategic move aims to guarantee access to critical components amidst an intense competition for AI hardware. The financial backing for these factories is expected to impact consumers through increased costs or limited availability. AI

    IMPACT Secures critical AI hardware supply chains, potentially influencing future AI development costs and accessibility.

  13. Rivian unveils groundbreaking AI autonomy strategy, developing custom silicon chips to revolutionize electric vehicle technology and drive innovation forward #

    Robo.ai has secured $180 million in financing from ATW Partners to advance its AI innovation in areas like smart logistics and eVTOL technologies. In parallel, Rivian has announced its own AI autonomy strategy, which includes the development of custom silicon chips to enhance its electric vehicle technology. AI

  14. US government site removes AI test details from MS, Google, xAI — TradingView News https://www.yayafa.com/2800233/ # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialIntelligenc

    A new, lightweight AI model named Needle has been developed by distilling Gemini's tool-calling capabilities into a 26 million parameter model. This smaller model is designed to run on smartphones, making it easier for developers to build AI agents for mobile devices. The project aims to bring advanced AI functionalities to edge devices. AI

    US government site removes AI test details from MS, Google, xAI — TradingView News https://www.yayafa.com/2800233/ # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialIntelligenc

    IMPACT Enables more powerful AI agents to run directly on mobile devices, reducing reliance on cloud processing.

  15. Persian Gulf countries are building advanced AI infrastructure, but this digital power has a hidden cost: the region's gigantic demand for water and energy, where

    Persian Gulf nations are rapidly developing advanced AI infrastructure, but this digital expansion comes with a significant environmental cost. The region faces a critical shortage of water and energy, resources essential for powering these AI systems. A report from QIASS highlights the potential for infrastructural gridlock if water policies are not integrated into technological planning. AI

    IMPACT Accelerated AI development in the Persian Gulf risks exacerbating regional water and energy scarcity, potentially leading to infrastructural bottlenecks.

  16. Samsung faces a two-week strike from May 21 that could disrupt AI chip production. Samsung and SK Hynix are two of only three companies worldwide making high-ba

    Samsung's AI chip production faces potential disruption due to a planned two-week strike by its workers, set to begin on May 21. This strike could impact the supply of high-bandwidth memory (HBM), a critical component for AI computing, as Samsung and SK Hynix are among the few global manufacturers. The company recently reported a significant increase in chip income and achieved a valuation of over one trillion USD. AI

    IMPACT A strike at Samsung could disrupt the supply of high-bandwidth memory, a key component for AI, potentially slowing down AI development and deployment.

  17. Imminent Samsung Strike Could Be an Earthquake for AI https://gizmodo.com/imminent-samsung-strike-could-be-an-earthquake-for-ai-2000757819 # AI # Tech # Labor

    A potential strike by South Korean workers at Samsung Electronics could significantly disrupt the global supply of AI chips. The union is demanding better wages and working conditions, and a strike could halt production of crucial components. This disruption could impact the development and deployment of AI technologies worldwide. AI

    IMPACT A strike at Samsung could halt AI chip production, delaying AI development and deployment globally.

  18. Meta’s $27 billion AI data center is transforming rural Louisiana https://www. byteseu.com/2014324/ # AI # AIDATACENTERLOUISIANA # ArtificialIntelligence # CONS

    Meta is constructing a massive $27 billion AI data center in rural Louisiana, a project that is significantly impacting the local economy and job market. The facility is expected to bring numerous construction and operational jobs to Richland Parish. This development highlights the growing infrastructure demands for advanced AI technologies and Meta's substantial investment in this area. AI

    Meta’s $27 billion AI data center is transforming rural Louisiana https://www. byteseu.com/2014324/ # AI # AIDATACENTERLOUISIANA # ArtificialIntelligence # CONS

    IMPACT Accelerates AI development by providing necessary compute infrastructure, potentially lowering costs for AI training and deployment.

  19. US Top News and Analysis | This metal just set a new record, boosted by AI data center demand. Citi says it’s time to 'chase the move higher' AI generated summa

    Copper prices have reached new all-time highs, driven significantly by the escalating demand from artificial intelligence data centers. Analysts at Citi predict further price increases, projecting copper could reach approximately $15,000 per metric ton. This surge is attributed to both AI-related demand and strategic inventory stockpiling, indicating a strong macro backdrop supporting higher commodity prices. AI

    IMPACT Accelerates demand for critical raw materials, potentially increasing costs for AI infrastructure build-out.

  20. It’s time to build affordable homes, a national network of public grocery stores, electric buses and an east-west clean energy grid. Not massive corporate AI da

    A call is being made for a pause on the construction of new AI data centers in Canada until strong federal regulations are established. The argument is that these massive corporate data centers are being built without democratic debate and raise concerns about data sovereignty, especially with Canadian data being transferred to U.S. tech giants like Google Cloud. The author emphasizes the need for oversight and robust regulatory frameworks before proceeding with AI development. AI

    IMPACT Urges a pause on AI infrastructure development, highlighting risks to data sovereignty and the need for regulatory oversight.

  21. THE MORE U BUY, THE MORE U SAVE: By ganging up multiple B200 8-GPU machines together over RoCEv2 CX-7 ethernet with Tomahawk switches with an inference optimiza

    Nvidia's B200 GPUs are being deployed in large clusters, utilizing RoCEv2 Ethernet and Tomahawk switches for efficient inference. This setup allows for significant cost savings as more machines are added, indicating a trend towards scaled-out AI infrastructure. AI

    THE MORE U BUY, THE MORE U SAVE: By ganging up multiple B200 8-GPU machines together over RoCEv2 CX-7 ethernet with Tomahawk switches with an inference optimiza

    IMPACT Highlights cost-saving strategies for large-scale AI inference deployments using advanced hardware.

  22. Investors are going # nuclear to keep # UK 's # AI # datacenters fed Tracxn, market intelligence that monitors startups, says institutional capital is "quietly

    Investors are channeling significant capital into private nuclear energy ventures to power the UK's growing AI data centers. Over $370 million has been invested, with a notable surge of $170 million in 2024 alone. This pivot towards nuclear innovation is driven by the need for sustainable and sovereign energy solutions to meet the immense power demands of AI infrastructure. AI

    IMPACT Accelerates the development of sustainable, high-capacity power infrastructure essential for scaling AI data centers.

  23. New Jersey residents say they can't even wash their clothes due to data centers https://www. thecooldown.com/green-business /ai-data-center-vineland-new-jersey-

    Residents in Vineland, New Jersey, are experiencing significant disruptions, including an inability to do laundry, due to the proliferation of data centers in their area. The increased demand for water by these facilities is straining local resources, leading to water pressure issues and impacting daily life for the community. This situation highlights the growing environmental and resource-management challenges posed by the expansion of data center infrastructure. AI

    IMPACT Data center expansion for AI is straining local water resources, impacting communities and raising environmental concerns.

  24. Florida Phoenix: Florida has a new law regulating AI data centers. “AI data centers will be required to pay for their own utilities and not shift the costs to c

    Florida has enacted a new law, SB 484, that specifically targets AI data centers. The legislation mandates that these facilities must cover their own utility costs, preventing them from passing these expenses onto customers. Governor Ron DeSantis expressed gratitude to the legislature for passing the bill, noting it was a step forward, though less comprehensive than his initial proposals. AI

    IMPACT Requires AI data centers to bear their own utility costs, potentially impacting operational expenses and deployment strategies in Florida.

  25. 📰 There’s an internet choke point in the Middle East — is the solution in the North Pole? The vast majority of the world's data - emails, financial transactions

    A new internet cable proposal, Polar Connect, aims to bypass critical choke points in the Middle East by running fiber optic lines through the Arctic. This route would connect Asia and Europe via North America, offering a more resilient and potentially faster data transfer path. The project seeks to mitigate risks associated with the current undersea cable infrastructure, which is vulnerable to disruptions. AI

    IMPACT Proposes a new resilient data infrastructure that could support future AI development and deployment.

  26. Ghent-based Holmes has launched with 1.1 million EUR in pre-seed funding to build an autonomous quality assurance platform for AI development. The platform lear

    Holmes, a startup based in Ghent, has secured 1.1 million EUR in pre-seed funding to develop an autonomous quality assurance platform specifically for AI development. This platform is designed to learn product workflows and automatically generate tests for essential user journeys, aiming to streamline the AI testing process. AI

    IMPACT This funding will accelerate the development of specialized AI testing tools, potentially improving the reliability and efficiency of AI product development cycles.

  27. Italian construction technology startup Pillar has raised 12 million EUR in seed funding to build an AI-powered operating system for the construction industry.

    Pillar, an Italian construction technology startup, has secured 12 million EUR in seed funding. The company plans to use this capital to develop an AI-powered operating system specifically designed for the construction industry. This platform aims to automate key processes such as quote generation, margin tracking, and workforce management. AI

    IMPACT This funding could accelerate AI adoption in the construction sector, streamlining operations and improving efficiency.

  28. Short prompt ≠ cheap prompt: how optimization breaks prefix cache in LLM agents. 32 tools in the prompt - cheaper than 7. Yes, yes - if you are building agents, this is not

    A technical article explores how optimizing prompts for LLM agents can inadvertently break the prefix cache, leading to higher costs than expected. The author explains that while fewer tokens in a prompt might seem cheaper, the underlying mechanism of prefix caching in agent cycles can cause inefficiencies. This issue arises because local optimizations can disrupt the cache's effectiveness across the entire agent's workflow. AI

    IMPACT Explains a potential inefficiency in LLM agent design that could impact cost and performance.

  29. OpenAI Establishes New Company "OpenAI Deployment Company" to Help Companies Adopt AI https://web.brid.gy/r/https://gigazine.net/news/20260512-openai-deployment-company/

    OpenAI has established a new subsidiary, the OpenAI Deployment Company, to assist organizations in building and operating AI systems. This new entity aims to accelerate AI adoption by operating with increased speed and a customer-centric focus, distinct from OpenAI's core research and product development. The company is launching with an initial investment of $4 billion from 19 partners, including Bain Capital and SoftBank, and will integrate approximately 150 Forward Deployed Engineers from AI consulting firm Tomoro, which OpenAI is acquiring. AI

    OpenAI Establishes New Company "OpenAI Deployment Company" to Help Companies Adopt AI https://web.brid.gy/r/https://gigazine.net/news/20260512-openai-deployment-company/

    IMPACT Accelerates enterprise AI adoption by providing dedicated expertise and infrastructure support for complex deployments.

  30. Exio G and IIJ Collaborate to Provide Edge Data Centers for AI

    Exeo Group and Internet Initiative Japan (IIJ) have partnered to offer edge data centers specifically designed for AI workloads. This collaboration aims to provide the necessary infrastructure to support the growing demand for AI processing power. The service will leverage Exeo Group's expertise in data center construction and IIJ's network and operational capabilities. AI

    IMPACT Provides specialized infrastructure to meet the increasing demand for AI processing power.

  31. Building Blocks for Foundation Model Training and Inference on AWS

    Hugging Face and AWS have collaborated to detail the infrastructure required for training and running large foundation models. The blog post outlines a layered architecture, emphasizing the interplay between AWS's compute, networking, and storage services with open-source software frameworks. It highlights the importance of efficient resource management and observability for large-scale AI operations. AI

    IMPACT Provides a technical blueprint for optimizing AI infrastructure, crucial for scaling model development and deployment.

  32. FoxConn Satellites? Huh? Oh, wait this strategy does have some logic behind it! Foxconn launches 2nd gen PEARL-1A and PEARL-1B satellites - core component of it

    Foxconn has launched its second generation of PEARL satellites, the PEARL-1A and PEARL-1B, as part of its "3+3" expansion strategy. This initiative focuses on electric vehicles, digital health, and robotics, leveraging AI, semiconductors, and next-generation communications. By controlling both satellite hardware and communication infrastructure, Foxconn intends to provide integrated services for the automotive and industrial sectors, including real-time data for its EV platform. AI

    FoxConn Satellites? Huh? Oh, wait this strategy does have some logic behind it! Foxconn launches 2nd gen PEARL-1A and PEARL-1B satellites - core component of it

    IMPACT Enables integrated services for AI-driven automotive and industrial applications through dedicated satellite communication infrastructure.

  33. Already passed and twice as big as Bryce Canyon Kevin O’Leary’s proposed data centre just got approved in Utah. It is estimated to consume 9GW of power per year

    A proposed data center championed by Kevin O'Leary has received approval in Utah, despite concerns about its immense power consumption. The facility is projected to require 9GW of electricity annually, a figure that dwarfs the current power usage of the entire state of Utah. This approval comes amidst broader discussions about the energy demands of large-scale data infrastructure. AI

    IMPACT This large-scale data center approval highlights the growing energy demands of digital infrastructure, which is critical for AI development and deployment.

  34. Shanghai Xiyu Jizhi, the entity behind AI startup MiniMax, has increased its registered capital from 1B to 4B RMB, a 300% surge signalling major AI infrastructu

    Shanghai Xiyu Jizhi, the parent company of AI startup MiniMax, has significantly boosted its registered capital. The capital injection represents a 300% increase, raising it from 1 billion to 4 billion RMB. This substantial financial move indicates a strong commitment to scaling up AI infrastructure and operations. AI

    IMPACT Signals substantial investment in scaling AI infrastructure, potentially accelerating development and deployment in the region.

  35. Super Micro Computers (SMCI), a key AI infrastructure provider, surprises the market with dynamic profitability growth, achieving revenues of US$10.2 billion

    Super Micro Computer (SMCI), a key AI infrastructure provider, has reported a significant increase in profitability with revenues reaching $10.2 billion. Despite strong sales, the company consumed $6.6 billion in cash last quarter due to aggressive component procurement and production line expansion. This rapid growth and cash burn highlight the intense demand and investment required in the AI hardware sector. AI

    IMPACT Confirms high demand and capital expenditure required for AI infrastructure, potentially impacting supply chains and other hardware providers.

  36. OpenAI is launching a majority-controlled enterprise unit with $4B+ backing and acquiring Tomoro, which brings ~150 deployment specialists. The bet: embedding e

    OpenAI is establishing a new enterprise unit, backed by over $4 billion, to focus on integrating AI into production systems. This unit will also absorb Tomoro, a company with approximately 150 deployment specialists. The move aims to bridge the gap between AI demonstrations and actual customer implementation by embedding engineers within client organizations. AI

    IMPACT Accelerates enterprise AI adoption by addressing the production gap with dedicated specialists and significant investment.

  37. https:// winbuzzer.com/2026/05/11/tesla -ai5-ignites-samsung-lg-race-for-abf-substrat-xcxwbn/ Tesla's AI hardware plans are turning ABF substrates into a live c

    Tesla's AI hardware development is creating significant demand for ABF substrates, a key component in advanced semiconductor manufacturing. This demand has sparked a competitive race between LG Innotek and Samsung Electro-Mechanics to supply these critical materials. The focus on AI hardware by Tesla, particularly for its autonomous driving and robotics initiatives, is driving innovation and investment in the semiconductor supply chain. AI

    https:// winbuzzer.com/2026/05/11/tesla -ai5-ignites-samsung-lg-race-for-abf-substrat-xcxwbn/ Tesla's AI hardware plans are turning ABF substrates into a live c

    IMPACT Accelerates demand for specialized semiconductor components, potentially driving innovation and supply chain competition for AI hardware.

  38. OpenADR and Matter are collaborating to let your smart home talk to the grid

    The Matter smart home connectivity standard is partnering with the OpenADR protocol to enable seamless communication between smart home appliances and the energy grid. This collaboration aims to simplify demand response programs, allowing devices like EV chargers and HVAC systems to automatically adjust energy consumption based on grid needs. The integration could eliminate the need for separate demand response hardware, embedding this functionality directly into appliances for more efficient energy management and potential cost savings for consumers. AI

    OpenADR and Matter are collaborating to let your smart home talk to the grid

    IMPACT Enables more efficient energy management in homes, potentially reducing strain on power grids and lowering consumer costs.

  39. OpenAI (@OpenAI) has launched 'OpenAI Deployment Company' to help companies build and deploy AI. OpenAI will hold a majority stake and control, supporting the production deployment of frontier AI with 19 investment firms, consulting firms, and system integrators.

    OpenAI has launched a new entity, the 'OpenAI Deployment Company,' aimed at assisting businesses in building and deploying AI solutions. This new venture will be majority-owned and controlled by OpenAI, and it will collaborate with 19 initial partners, including investment firms, consulting companies, and system integrators. The company's focus is on facilitating the production-level deployment of frontier AI technologies. AI

    IMPACT This initiative could streamline enterprise adoption of advanced AI by providing dedicated support for deployment and integration.

  40. Developers are showing increased interest in unincorp co land for putting up # DataCenters ,🚨reducing regulatory friction, & allowing const to proceed at a much

    Developers are increasingly targeting unincorporated rural land for AI data center construction to bypass city council approvals and reduce regulatory friction. This strategy allows projects to proceed more quickly, avoiding delays caused by public scrutiny and land-use reviews common in urban areas. Meta, for example, is investing $800 million in a new AI data center in Idaho, a state often associated with more favorable regulatory environments for such developments. AI

    IMPACT Accelerates AI infrastructure build-out by streamlining regulatory hurdles for data center construction.

  41. Claude has teamed up with Elon and no one expected it

    Anthropic has secured a significant compute deal with SpaceXAI, a newly merged entity combining SpaceX and xAI, to address Claude's token usage limits. This partnership is notable given Elon Musk's prior vocal criticism of Anthropic. The agreement grants Anthropic access to compute capacity at Musk's Colossus 1 data center, with future discussions about placing data centers in space. AI

    Claude has teamed up with Elon and no one expected it

    IMPACT Secures essential compute for Anthropic's models, potentially easing usage limits and enabling future space-based data centers.

  42. The Inference Shift

    Cerebras Systems is significantly increasing its IPO price and share count due to high demand driven by the AI industry's need for compute power. While GPUs, particularly from Nvidia, have dominated AI workloads like training, the future of AI compute is expected to be more heterogeneous. This shift acknowledges that specialized hardware beyond GPUs will be crucial for both training and inference, especially as AI agents require substantial computational resources. AI

    IMPACT Signals a shift towards heterogeneous AI compute architectures beyond GPUs, crucial for agent-based AI.

  43. Sakana AI and NVIDIA Introduce TwELL with CUDA Kernels for 20.5% Inference and 21.9% Training Speedup in LLMs

    Researchers from Sakana AI and NVIDIA have developed TwELL, a novel method that significantly speeds up large language model (LLM) operations. By targeting the feedforward layers, which are computationally intensive, TwELL induces high sparsity and translates this into practical performance gains on GPUs. This approach achieves up to a 21.9% speedup in training and a 20.5% speedup in inference without compromising model accuracy. AI

    Sakana AI and NVIDIA Introduce TwELL with CUDA Kernels for 20.5% Inference and 21.9% Training Speedup in LLMs

    IMPACT Accelerates LLM training and inference, potentially lowering costs and increasing accessibility for AI development.

  44. AI Work Is Splitting in Two

    Anthropic announced new Managed Agents features at its Code with Claude developer conference, aiming to allow users to achieve goals by simply providing an outcome and budget. The company is focusing on building the infrastructure to support agents running continuously and at scale. This development, alongside OpenAI's reported GPT-5.5 launch, suggests a bifurcation in AI development between real-time collaborative tools and long-running, delegated agents. AI

    AI Work Is Splitting in Two

    IMPACT Signals a shift towards more autonomous AI agents capable of handling complex, long-running tasks.

  45. NVIDIA AI Just Released cuda-oxide: An Experimental Rust-to-CUDA Compiler Backend that Compiles SIMT GPU Kernels Directly to PTX

    NVIDIA AI researchers have introduced cuda-oxide, an experimental compiler that enables developers to write GPU kernels in Rust and compile them directly to PTX, NVIDIA's intermediate representation for GPUs. This new tool aims to bring the CUDA programming model directly into safe Rust, bypassing the need for C++ or other intermediate languages. The project utilizes a custom rustc codegen backend and a Rust-native MLIR-like framework called Pliron, allowing host and device code to coexist in a single source file. AI

    IMPACT Enables developers to write GPU kernels in Rust, potentially improving safety and performance for AI workloads.

  46. NVIDIA AI Releases Star Elastic: One Checkpoint that Contains 30B, 23B, and 12B Reasoning Models with Zero-Shot Slicing

    NVIDIA researchers have introduced Star Elastic, a novel post-training method that embeds multiple reasoning models of varying parameter sizes within a single checkpoint. This approach allows for the extraction of smaller, nested submodels from a larger parent model without requiring additional fine-tuning. Star Elastic utilizes a trainable router and knowledge distillation to optimize the selection of model components, enabling efficient resource utilization and tailored model performance for different reasoning tasks. AI

    NVIDIA AI Releases Star Elastic: One Checkpoint that Contains 30B, 23B, and 12B Reasoning Models with Zero-Shot Slicing

    IMPACT Enables efficient deployment of multiple model sizes from a single checkpoint, potentially reducing inference costs and complexity.

  47. Maryland citizens hit with $2B power grid upgrade for out-of-state AI https://www. tomshardware.com/tech-industry /artificial-intelligence/maryland-citizens-sla

    Maryland is challenging a $2 billion charge for power grid upgrades, arguing that the costs should not be borne by state citizens. The state's Office of People’s Counsel contends that these upgrades are primarily to benefit out-of-state AI data centers, which are disproportionately consuming power. Maryland is appealing to federal energy regulators to reallocate these costs, citing a broken cost allocation system that unfairly burdens its ratepayers. AI

    IMPACT Highlights the growing tension between AI's energy demands and the equitable distribution of infrastructure costs, potentially influencing future data center siting and energy policy.

  48. A Coding Implementation to Build Agent-Native Memory Infrastructure with Memori for Persistent Multi-User and Multi-Session LLM Applications

    This tutorial demonstrates how to implement Memori, an agent-native memory infrastructure, to enable persistent, context-aware LLM applications across multiple users and sessions. It guides users through setting up Memori in a Google Colab environment, integrating it with OpenAI clients for automatic memory interception, and storing/retrieving user data across different identities and sessions. The implementation showcases how Memori allows AI agents to retain context across interactions, moving beyond stateless conversations. AI

    IMPACT Enables AI agents to retain context across interactions, improving user experience and reducing repetitive input for complex tasks.

  49. Fast Byte Latent Transformer

    Researchers have developed the Fast Byte Latent Transformer (BLT) to address the slow generation speeds of byte-level language models. The new BLT Diffusion (BLT-D) method uses a block-wise diffusion objective during training, allowing for parallel byte generation during inference and reducing memory bandwidth usage by over 50%. Additional techniques like BLT Self-speculation (BLT-S) and BLT Diffusion+Verification (BLT-DV) offer further trade-offs between speed and generation quality, making byte-level LMs more practical. AI

    IMPACT Accelerates byte-level language models, potentially enabling more efficient processing of text without tokenization.