Show HN: Cactus – Ollama for Smartphones
Cactus has released an open-source AI engine designed for mobile devices and wearables, prioritizing low latency and reduced RAM usage. The engine supports multimodal capabilities, including speech, vision, and language models, with an option to fall back to cloud-based models. It features NPU acceleration for energy efficiency and offers OpenAI-compatible APIs for integration into various applications. AI
IMPACT Enables on-device AI processing, potentially reducing reliance on cloud services and improving user privacy for mobile applications.