Last updated August 30, 2024
In AI News

Meta FAIR Releases Transfusion for Multimodal AI Training

Transfusion is a state-of-the art approach at advancing text and image modalities

Share

Published on August 29, 2024

by Aditi Suresh

Listen to this story

In a collaborated effort with Waymo & University of Southern California, Meta FAIR released its research on the importance of multi-modal generative models. Transfusion aims to unite and simplify the gap between discrete sequence modeling and continuous media generation.

The Transfusion Model

The model is trained equally on text and image. Per Meta, Transfusion is more advanced than quantising images and training a language model over discrete image tokens. The model’s performance can be enhanced through “modality-specific” encoding and decoding layers. The model predicts the next word in a sequence. Trained on improving predictions, it reduces the difference between guessing and actual words. It is imperative to note that with 7 billion parameters and 2 trillion multi modal tokens, Transfusion is at par with other larger models that create image and text – and outperforms models like DALL-E 2 and SDXL. It works better than Chameleon as it takes lesser computing power and generates better results.

One limitation is perhaps that diffusion models do not perform at par with traditional language models. A lot of research is yet to be done in this area to improve overall performance.

Transformer’s Uniqueness & the Future of Innovation in AI Research

What differentiates Transformer from the rest is its unified architecture that runs end to end to generate text and images. Existing models like Flamingo, LLaVA, GILL, and DreamLLM combine separate architectures for different types of data, which are trained separately.

The goal of this Transfusion is to synergise two modalities in a single joint model – with each of them fulfilling their objective. The incentives are that these are versatile, resource efficient, and cost effective for handling different types of data without any additional costs.

📣 Want to advertise in AIM? Book here

Aditi Suresh

Aditi is a political science graduate, and is interested in technology, social media, and culture.

Meta AI

Related Posts

Why Mark Zuckerberg Is Selfish With Open Source

The ‘Linux Moment’ in AI Has Finally Arrived, Says Meta Chief Mark Zuckerberg

Meta Llama 3.1 is Officially Out, Dethrones GPT-4o

Meta Launches AI Assistant in India Across WhatsApp, Facebook, Instagram

9 Must-Know Open Source Models From Meta in 2023

Meta Announces Four New AI Models and Additional Research Artifacts

Transformers Can Now Work Pixel by Pixel, Says Meta AI’s New Study

meta husky language agent

Meta AI Unveils Husky, a Unified, Open-Source Language Agent

Comprehensive RAG Benchmark Aims to Advance Retrieval-Augmented Question Answering

Association of Data Scientists

Tailored Generative AI Training for Your Team

Upcoming Large format Conference

Cypher 2024
India's Biggest AI Summit

Sep 25-27, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

The 10 Best Videos Created by MiniMax

Tarunya S

The brand-new text-to-video model stands out due to its realistic AI visual content.

PhysicsWallah’s ‘Alakh AI’ is Making Education Accessible to Millions in India

Siddharth Jindal

Why AI Can’t Get Software Testing Right

Sagar Sharma

Top Editorial Picks

Google Cloud Partners with ParallelDots to Enhance Retail Shelf Monitoring with AI

Aditi Suresh

Tech Veteran Jaspreet Bindra launches ‘AI&Beyond’ to Democratise AI Literacy

Aditi Suresh

HARMAN Introduces ForecastGPT, a GenAI Platform for Enterprises

Aditi Suresh

“When Will Mira Get Married?” OpenAI CTO’s Mother Asked ChatGPT

Siddharth Jindal

Meet Melty, Open Source Alternative to Cursor

Mohit Pandey

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA.

GenAI
Corner

View All

Will AI Coding Tools Mark the End of IDEs?

Channel-Specific and Product-Centric GenAI Implementation in Enterprises Leads to Data Silos and Inefficiencies

Vayu Robotics

These Ex-Apple Employees from India are Building the Foundation Model for Robotics

Vaishali Kasture

Microsoft Appoints Former AWS Head Vaishali Kasture as GM for India and South Asia

Anthropic Claude

Anthropic Claude Artifacts to Kill App Store Soon

Revrag Unveils Its First AI Agent Emma

How-Joule-is-Helping-SAP-Find-Moat-in-Spend-Management

How Joule is Helping SAP Find Moat in Spend Management

Wake Me Up When Companies Start Hiring Clueless Modern ‘Developers'

Wake Me Up When Companies Start Hiring Clueless Modern ‘Developers’

World's Biggest Media & Analyst firm specializing in AI

Advertise with us

AIM publishes every day, and we believe in quality over quantity, honesty over spin. We offer a wide variety of branding and targeting options to make it easy for you to propagate your brand.

Branded Content

AIM Brand Solutions, a marketing division within AIM, specializes in creating diverse content such as documentaries, public artworks, podcasts, videos, articles, and more to effectively tell compelling stories.

Corporate Upskilling

ADaSci Corporate training program on Generative AI provides a unique opportunity to empower, retain and advance your talent

Hackathons

With MachineHack you can not only find qualified developers with hiring challenges but can also engage the developer community and your internal workforce by hosting hackathons.

Talent Assessment

Conduct Customized Online Assessments on our Powerful Cloud-based Platform, Secured with Best-in-class Proctoring

Research & Advisory

AIM Research produces a series of annual reports on AI & Data Science covering every aspect of the industry. Request Customised Reports & AIM Surveys for a study on topics of your interest.

Conferences & Events

Immerse yourself in AI and business conferences tailored to your role, designed to elevate your performance and empower you to accomplish your organization’s vital objectives.

Launching 8th Edition of Cypher: Now in USA too

Discover how Cypher 2024 expands to the USA, bridging AI innovation gaps and tackling the challenges of enterprise AI adoption

© Analytics India Magazine Pvt Ltd & AIM Media House LLC 2024