How Does Apple MGIE Work?

The Apple MGIE works its magic on images through a two-way approach:

1. Instruction Derivation

It uses MLLMs to solve user prompts (e.g., “make the sky more blue”) and convert them into clear, concise directives for editing (e.g., “increase sky saturation by 20%”). This ensures both accuracy and efficiency.

2. Visual Imagination

Apple’s MGIE employs MLLMs to build a “latent representation” of the desired edit, essentially capturing its essence. This representation serves as a guide for pixel-level manipulation, leading to precise and natural-looking changes.

What is Apple MGIE? How to Use MGIE’s AI Image Editing

Apple has unveiled a groundbreaking AI model, MGIE (Multimodal Large Language Model-Guided Image Editing), that is set to revolutionize the field of image editing. Developed in collaboration with researchers at the University of California, Santa Barbara, MGIE leverages multimodal large language models (MLLMs) to interpret text instructions and translate them into pixel-level image edits.

Similar Reads

What is Apple MGIE?

MGIE, which stands for Multimodal Large Language Model-Guided Image Editing, is a revolutionary AI model developed by Apple. It’s designed to interpret text instructions and translate them into pixel-level image edits....

How Does Apple MGIE Work?

The Apple MGIE works its magic on images through a two-way approach:...

MGIE vs Adobe Photoshop: Similarities and Differences

The MGIE AI and Photoshop are both image editing tools with distinct approaches and target audiences. While Photoshop is the industry standard, catering to professionals and experienced users with its extensive toolbox and manual controls, MGIE takes a revolutionary approach by leveraging natural language processing, making it highly accessible even for beginners unfamiliar with traditional editing software....

The Future of Image Editing with Apple MGIE

With the release of MGIE, Apple has opened up a new frontier for image editing, eliminating the need for complex software and technical expertise. Users can now easily edit their photos by simply typing out what they want to change about the picture. For example, if a user wants to make an image of a pepperoni pizza look healthier, they can type “make it more healthy” and the model will add vegetable toppings....