

source link: https://www.theverge.com/2024/2/14/24073253/stablity-ai-image-generation-stable-cascade-diffusion-model

Stability AI tries to stay ahead of the pack with a new image-generating AI model


Stable Cascade uses less compute power to train and is better at following prompts.

By Emilia David, a reporter who covers AI. Prior to joining The Verge, she covered the intersection between technology, finance, and the economy.

Feb 14, 2024, 11:13 PM UTC


Collage of Stable Cascade art. Image: Stability AI

Stability AI’s newest model for image generation, Stable Cascade, promises to be faster and more powerful than its industry-leading predecessor, Stable Diffusion, which is the basis of many other text-to-image generation AI tools.

Stable Cascade can generate photos and give variations of the exact image it created, or try to increase an existing picture’s resolution. Other text-to-image editing features include inpainting and outpainting, where the model fills in or edits only a specific part of the image, as well as canny edge, where users can make a new photo using only the edges of an existing picture.

Stable Cascade images generated from the prompt “Cinematic photo of an anthropomorphic penguin sitting in a cafe reading a book and having a coffee.” Image: Stability AI

The new model is available on GitHub for researchers but not for commercial use, and it brings more options even as companies like Google and Apple release their own image generation models.

Unlike Stability’s flagship Stable Diffusion models, Stable Cascade isn’t one large model; it’s three different models that rely on the Würstchen architecture. The first stage, stage C, compresses text prompts into latents (compact, encoded representations) that are then passed to stages A and B to decode the request.
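The hand-off between the three stages can be pictured with a toy sketch. This is not Stability AI's code and involves no real model: the `Latent` class, the `stage_c`, `stage_b`, and `stage_a` functions, and all sizes are hypothetical stand-ins, and the ordering of B before A is an assumption, since the article only says the latents are passed to stages A and B. It only illustrates the idea of doing the prompt-driven work on a small representation and then expanding it.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Latent:
    """Hypothetical stand-in for a compressed representation."""
    values: List[float]


def stage_c(prompt: str, latent_size: int = 4) -> Latent:
    """Toy 'stage C': fold a text prompt into a small, fixed-size latent."""
    values = [0.0] * latent_size
    for i, ch in enumerate(prompt):
        values[i % latent_size] += ord(ch) / 1000.0
    return Latent(values)


def stage_b(latent: Latent, factor: int = 4) -> Latent:
    """Toy 'stage B' (assumed to run second): expand the small latent."""
    return Latent([v for v in latent.values for _ in range(factor)])


def stage_a(latent: Latent, width: int = 4) -> List[List[float]]:
    """Toy 'stage A' (assumed to run last): reshape into a grid of 'pixels'."""
    vals = latent.values
    return [vals[i:i + width] for i in range(0, len(vals), width)]


def generate(prompt: str) -> List[List[float]]:
    # The prompt only ever touches the smallest representation (stage C);
    # the later stages just decode what stage C produced.
    return stage_a(stage_b(stage_c(prompt)))


image = generate("an anthropomorphic penguin in a cafe")
print(len(image), "rows x", len(image[0]), "columns")
```

In this toy version, the work that depends on the prompt happens on four numbers rather than sixteen, which is the general intuition behind running the expensive stage on a compressed representation.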

Comparison of inference time: Stable Cascade vs. other models. Image: Stability AI

Breaking the request into smaller pieces means it requires less memory (and fewer hours of training on those hard-to-find GPUs) and runs faster, while performing better “in both prompt alignment and aesthetic quality.” It took about 10 seconds to create an image, compared to 22 seconds for the SDXL model used currently.
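To put the two quoted timings in perspective, the short calculation below works out the implied speedup. It uses only the approximate 10-second and 22-second figures above; the variable names are illustrative, and real timings will vary with hardware and settings.

```python
# Approximate per-image generation times quoted in the article.
cascade_seconds = 10.0
sdxl_seconds = 22.0

# How many times faster, and what share of the time is saved.
speedup = sdxl_seconds / cascade_seconds
time_saved_pct = (1 - cascade_seconds / sdxl_seconds) * 100

print(f"Speedup: {speedup:.1f}x")          # Speedup: 2.2x
print(f"Time saved: {time_saved_pct:.0f}%")  # Time saved: 55%
```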

Stability AI helped popularize the stable diffusion method and has also been the subject of several lawsuits alleging Stable Diffusion trained on copyrighted data without permission from rights holders. A UK lawsuit by Getty Images against Stability AI is scheduled to go to trial in December. The company began offering commercial licenses through a subscription last December, which it said was necessary to help fund its research.
