The Machine Learning Compilation (MLC) group, led by Tianqi Chen at CMU, is developing frameworks such as MLC Chat and Web LLM to run large language models on consumer hardware, including iPhones and web browsers. The effort aims to ease the current GPU shortage by letting models run locally on devices with AMD cards or even CPUs alone. Projects like Hugging Face's text-to-webapp generator and Gradio are also making ML models easier to deploy and more accessible for developers and end users.
Summary written by gemini-2.5-flash-lite from 8 sources.