Google Unveils MediaPipe API for On-Device LLM Execution

2024-03-11

Google has launched the MediaPipe LLM Inference API, enabling large language models (LLMs) to run entirely on-device. The release marks a new chapter for AI applications on consumer hardware and could meaningfully reshape where inference workloads run.

The MediaPipe LLM Inference API continues Google's sustained investment in on-device machine learning. Until now, on-device ML has been constrained by compute power and storage, so only relatively small models could run locally; large language models, with their enormous parameter counts and compute demands, typically had to run on cloud servers. The new API removes that limitation, letting LLMs run on devices such as laptops and smartphones without relying on cloud resources.

MediaPipe initially supports four large language models: Gemma, Phi 2, Falcon, and Stable LM. These models can run on the Web, Android, and iOS platforms, giving developers considerable flexibility. Google says it will expand the set of supported models and platforms over time to meet the needs of more scenarios.
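As an illustration of what on-device usage looks like, the Web version of the API can be called from TypeScript roughly as sketched below. This is a hedged sketch based on the published `@mediapipe/tasks-genai` npm package; the CDN URL, model file path, and option values are assumptions and must match a model bundle you have actually downloaded.

```typescript
// Sketch of browser-side usage via the @mediapipe/tasks-genai npm package.
// The model path and generation options below are illustrative assumptions.
import {FilesetResolver, LlmInference} from '@mediapipe/tasks-genai';

async function askOnDeviceLlm(prompt: string): Promise<string> {
  // Load the WASM runtime that backs the GenAI tasks.
  const genaiFileset = await FilesetResolver.forGenAiTasks(
      'https://cdn.jsdelivr.net/npm/@mediapipe/tasks-genai/wasm');

  // Point the task at a locally hosted model bundle (e.g. a Gemma variant).
  const llm = await LlmInference.createFromOptions(genaiFileset, {
    baseOptions: {modelAssetPath: '/models/gemma-2b-it-gpu-int4.bin'},
    maxTokens: 512,
    temperature: 0.8,
  });

  // Inference runs entirely in the browser; no server round-trip.
  return llm.generateResponse(prompt);
}
```

Because the model weights are served as a static file and inference happens in the browser, no prompt data ever leaves the device.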

Under the hood, the MediaPipe LLM Inference API relies on a set of optimization techniques: new operations (ops), quantization, caching, and weight sharing. Together, these reduce the memory footprint and compute cost of running large language models on-device while preserving their performance and accuracy.
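To make the quantization step concrete, here is a minimal sketch of symmetric int8 weight quantization. This illustrates the general technique only; it is not MediaPipe's actual implementation, and the function names are my own.

```typescript
// Symmetric int8 quantization: store each float32 weight as one signed byte
// plus a shared per-tensor scale, cutting weight memory roughly 4x.

function quantize(weights: number[]): {q: Int8Array; scale: number} {
  // The scale maps the largest-magnitude weight onto the int8 range [-127, 127].
  const maxAbs = Math.max(...weights.map(Math.abs));
  const scale = maxAbs / 127;
  const q = Int8Array.from(weights.map(w => Math.round(w / scale)));
  return {q, scale};
}

function dequantize(q: Int8Array, scale: number): number[] {
  // Recover approximate float weights; each is off by at most half a step.
  return Array.from(q, v => v * scale);
}

const weights = [0.12, -0.5, 0.33, 0.9, -0.07];
const {q, scale} = quantize(weights);
const restored = dequantize(q, scale);
```

The trade-off is a small rounding error per weight (bounded by half the scale step) in exchange for a 4x reduction in weight storage versus float32, which is what makes multi-billion-parameter models fit in device memory at all.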

The API also opens new possibilities for researchers and developers, who can now prototype and test popular open-source large language models directly on-device, further advancing the development and application of AI technology.

In short, Google's MediaPipe LLM Inference API is a notable step forward for on-device machine learning. It opens new doors for AI applications that run without the cloud and should accelerate their adoption.