Apple recently introduced LazyLLM, a novel technique designed to improve the efficiency of LLM inference. Detailed in a recent research paper, it aims to accelerate response generation in transformer-based language models without compromising accuracy.
The paper proposes LazyLLM as a technique for efficient LLM inference, particularly in long-context scenarios. LazyLLM selectively computes the KV (key-value) cache for tokens that matter for the next-token prediction and ‘lazily’ defers the computation of the remaining tokens to later steps, when they become relevant.
Developed by Qichen Fu, Thomas Merth, Sachin Mehta, and Mahyar Najibi of Apple, alongside Mohammad Rastegari, who now works at Meta, LazyLLM offers flexibility by allowing the model to revive tokens that were previously pruned, making the process more adaptive and efficient.
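To make the idea concrete, here is a minimal sketch of the core mechanism, not Apple’s implementation: score prompt tokens by the attention the final token pays them, compute the KV cache only for the top-scoring tokens now, and remember which positions were deferred so a later step can revive them. The function name, the `keep_ratio` parameter, and the tensor shapes are illustrative assumptions.

```python
import torch

def select_tokens_to_compute(attn_weights: torch.Tensor, keep_ratio: float = 0.5):
    """Rank prompt tokens by the attention the final position pays them
    (one hypothetical importance measure) and keep the top fraction.

    attn_weights: [num_heads, seq_len, seq_len] attention from one layer.
    Returns a boolean mask over positions: True = compute KV now,
    False = defer ('lazily') until a later decoding step needs the token.
    """
    # Importance of each token = attention it receives from the last
    # position, averaged over heads.
    importance = attn_weights[:, -1, :].mean(dim=0)  # [seq_len]
    k = max(1, int(keep_ratio * importance.numel()))
    keep_idx = importance.topk(k).indices
    mask = torch.zeros_like(importance, dtype=torch.bool)
    mask[keep_idx] = True
    mask[-1] = True  # the current token is always computed
    return mask

# Toy usage: 8 heads over a 16-token prompt.
attn = torch.softmax(torch.randn(8, 16, 16), dim=-1)
compute_mask = select_tokens_to_compute(attn, keep_ratio=0.5)
deferred = (~compute_mask).nonzero().flatten()
print(f"computing KV for {compute_mask.sum().item()} tokens now")
print(f"deferring positions {deferred.tolist()} to later steps")
```

In the full technique such a mask would be applied layer by layer during prefilling, with deferred tokens computed on demand if a later step attends to them, which is the revival behavior described above.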
By reducing the heavy computation of the prefilling stage, LazyLLM paves the way for more responsive and agile AI systems, potentially transforming applications that rely on large language models.
Apple Generative AI Innovations
Apple recently released a new open-source LLM, DCLM-Baseline 7B, featuring 7 billion parameters. The release includes the model weights, training code, and dataset; the model was trained on 2.5 trillion tokens from open datasets, primarily in English, and has a 2048-token context window.
The new model is licensed under the Apple Sample Code License, available on Hugging Face, and usable with the Transformers library. Trained with PyTorch and OpenLM, it matches closed-dataset models like Mistral in performance.
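Since the model is published on Hugging Face and works with the Transformers library, loading it would typically look like the sketch below. The repository id `apple/DCLM-7B` and the `open_lm` dependency for its custom architecture are assumptions based on how OpenLM-trained models are usually published; check the actual model card before use.

```python
# Hypothetical loading sketch -- the repo id and the open_lm dependency
# are assumptions; verify against the model card on Hugging Face.
#   pip install transformers torch
#   pip install git+https://github.com/mlfoundations/open_lm.git
from open_lm.hf import *  # registers the OpenLM architecture with Transformers (assumed)
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "apple/DCLM-7B"  # assumed repository id
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

# Generate within the model's 2048-token context window.
inputs = tokenizer("Machine learning is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```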
This comes after Apple introduced Apple Intelligence at WWDC 2024 to enhance Siri’s capabilities with generative AI.