Pulse

last 48h

[11/11] 97 sources

What AI is actually talking about — clusters surfacing on Bluesky, Reddit, HN, Mastodon and Lobsters, re-ranked to elevate originality and crush noise.

TOOL · X — MiniMax AI English(EN) · 3h · [2 sources] · X

MiniMax is live on @RespanAI Gateway

MiniMax AI has announced its models are now available on the Respan AI Gateway. This integration aims to provide developers with easier access to MiniMax's suite of AI models for various applications including text, speech, image, video, and music. AI

IMPACT Increases accessibility of MiniMax AI models for developers building multimodal AI applications.
TOOL · X — Together (inference / OSS) English(EN) · 5h · X

The best AI infrastructure shouldn't be reserved for the biggest companies. Together AI is partnering with @pax8 to bring powerful, cost-efficient AI and leadi

Together AI has partnered with Pax8 to make advanced AI infrastructure and open-source models accessible to small and medium-sized businesses. This collaboration aims to democratize access to powerful AI tools, ensuring they are not exclusively available to large corporations. The partnership will focus on delivering cost-efficient AI solutions to a broader market. AI

IMPACT Expands access to AI tools for SMBs, potentially increasing adoption and innovation in smaller businesses.
RESEARCH · X — Together (inference / OSS) English(EN) · 1d · X

RT @vipulved: PSA: Just added a few thousand chips, including B200s and B300s to our Dedicated Model Inference (https://t.co/sD3mEZtSAa).…

Together AI has significantly expanded its cloud computing resources, adding thousands of new chips including NVIDIA's B200 and B300 accelerators. This move is aimed at bolstering their dedicated model inference services, providing enhanced capabilities for AI model deployment and operation. AI

IMPACT Increases available compute for AI model inference, potentially lowering costs and improving performance for users.
SIGNIFICANT · NVIDIA Blog English(EN) · 1w · [56 sources] · MASTOBLOGX

NVIDIA Levels Up Local AI Agents Across RTX PCs and DGX Spark

NVIDIA is expanding its AI infrastructure and agentic AI capabilities through strategic partnerships and new product releases. The company is collaborating with the UK government and various partners to build sovereign AI deployments, including the powerful Isambard-AI supercomputer. In South Korea, NVIDIA is working with LG Group to develop AI factories for robotics and autonomous driving, while also partnering with Doosan Group on similar initiatives. Additionally, NVIDIA is enhancing local AI agent deployment on Windows PCs with new hardware like RTX Spark and DGX Station, and integrating its NemoClaw framework across its Jetson platform for edge AI applications. AI

IMPACT NVIDIA's expanded AI infrastructure and agentic AI capabilities will accelerate development and deployment across various industries and edge devices.
SIGNIFICANT · Mastodon — mastodon.social Italiano(IT) · 1w · [49 sources] · MASTOX

📈 The tech crash doesn't stop NVIDIA: AI is still in its infancy. Between volatility and chips, the real game is played in the long run. # NVIDIA # AI 🔗 https://www. tom

Nvidia's CEO Jensen Huang has highlighted a new trillion-dollar growth opportunity in AI chips, sparking discussions about the company's future valuation and market position. Several reports predict that specific AI semiconductor stocks may outperform Nvidia in the coming years. Meanwhile, companies like LG Group are significantly increasing their adoption of Nvidia GPUs, with LG planning to use 10,000 units, and ASUS is integrating Nvidia's AI Factory Platform to accelerate revenue generation. AI

IMPACT Nvidia's strategic focus on AI chips and increasing adoption by major corporations like LG and ASUS signal continued growth and competition in the AI hardware sector.
RESEARCH · X — SemiAnalysis English(EN) · 1mo · [3 sources] · X

@manicely6005 The public documentation can be found here too (3/3)

NVIDIA has open-sourced parts of its cuDNN library, a significant move after 12 years of it being closed-source. This release includes over 20 Mixture-of-Experts (MoE) kernels and NSA sparse attention kernels. The codebase for these kernels is largely written in Python CuTe-DSL, with public documentation now available. AI

IMPACT Open-sourcing of cuDNN kernels could accelerate research and development in AI infrastructure and model optimization.
SIGNIFICANT · Exponential View (Azeem Azhar) English(EN) · 1mo · [220 sources] · MASTOBLOGX

🔮 The AI boom is becoming an entrepreneurship boom #577

Nvidia's reliance on Asian supply chains for components has increased to 90% of its production costs, impacting newer products like the Jetson Thor platform and automotive SoCs. This dependency strains wafer capacity and memory supply, even as the company commits to U.S. manufacturing. Meanwhile, the broader AI market faces scrutiny, with concerns about a potential bubble and the financial health of older startups, while some AI stocks are outperforming Nvidia and others are experiencing dips. AI

IMPACT Nvidia's supply chain shifts and broader market concerns about AI valuations could impact hardware availability and investment strategies.
RESEARCH · X — Qwen (Alibaba) English(EN) · 1mo · [3 sources] · X

Forward and backward benchmark results across common configurations. https://t.co/IHMCZRw9AW

Alibaba's Qwen team has released FlashQLA, a new set of high-performance linear attention kernels developed using TileLang. These kernels are designed to improve the efficiency of attention mechanisms in large language models. The team also shared benchmark results for their Qwen models, showcasing performance across various configurations. AI

IMPACT Introduces optimized kernels that could improve LLM inference speed and efficiency.
RESEARCH · X — Google DeepMind English(EN) · 1mo · [6 sources] · X

This is Decoupled DiLoCo: our new resilient and flexible way to train advanced AI models across multiple data centres. 🧵 https://t.co/YRmPrqIbYE

Google DeepMind has introduced Decoupled DiLoCo, a novel approach to training advanced AI models that enhances resilience and flexibility across data centers. This system can train models like Google's 12B Gemma model across geographically dispersed regions using low-bandwidth networks and can even mix different generations of hardware, such as TPU6e and TPUv5p. Decoupled DiLoCo is designed to be self-healing, isolating and continuing training through artificial hardware failures and reintegrating units when they come back online, addressing the synchronization issues that typically stall AI training. AI

IMPACT Enables more robust and flexible large-scale AI model training, potentially reducing costs and increasing accessibility.
SIGNIFICANT · OpenAI News English(EN) · 40mo · [1395 sources] · HNLOBSTERSMASTOBLOGREDDITX

Computer-Using Agent

OpenAI and Google DeepMind are advancing AI agents for software development and security. OpenAI's Codex is being leveraged to write entire codebases with minimal human intervention, as demonstrated by Harness Engineering's internal beta product. Google DeepMind has introduced CodeMender, an AI agent designed to automatically identify and fix software vulnerabilities, and AlphaEvolve, which uses Gemini models to discover and optimize algorithms for applications like data center efficiency and chip design. Meta is also investing heavily in its own AI infrastructure with the development of its MTIA chip family, aiming to power AI experiences for billions of users. AI

IMPACT These advancements signal a rapid evolution in AI agent capabilities and infrastructure, potentially accelerating software development, improving code security, and optimizing complex computational tasks.
SIGNIFICANT · Wired — AI English(EN) · 88mo · [455 sources] · HNMASTOBLOGX

Can OpenAI’s ‘Master of Disaster’ Fix AI’s Reputation Crisis?

OpenAI has announced a significant partnership with SAP to launch 'OpenAI for Germany,' aiming to bring advanced AI capabilities to the German public sector while prioritizing data sovereignty and security on Microsoft Azure. The company also proposed policy recommendations to the U.S. White House for the national AI Action Plan, focusing on innovation freedom, export controls, copyright, infrastructure, and government adoption. Additionally, OpenAI is collaborating with U.S. National Laboratories to leverage its reasoning models for scientific breakthroughs and national security initiatives. AI

IMPACT OpenAI's strategic partnerships and policy proposals signal a push for broader AI adoption in public sectors and national infrastructure, influencing future AI development and regulation.