Stability AI tries to stay ahead of the pack with a new image-generating AI model

Stable Cascade uses less compute power to train and is better at following prompts.

By Emilia David, a reporter who covers AI. Prior to joining The Verge, she covered the intersection between technology, finance, and the economy.

Feb 14, 2024, 11:13 PM UTC

Collage of Stable Cascade art. Image: Stability AI

Stability AI’s newest image generation model, Stable Cascade, promises to be faster and more powerful than its industry-leading predecessor, Stable Diffusion, which is the basis of many other text-to-image generation AI tools.

Stable Cascade can generate photos and give variations of the exact image it created, or try to increase an existing picture’s resolution. Other text-to-image editing features include inpainting and outpainting, where the model fills in or edits only a specific part of the image, as well as canny edge, where users can make a new photo just by using the edges of an existing picture.

Stable Cascade images generated from the prompt “Cinematic photo of an anthropomorphic penguin sitting in a cafe reading a book and having a coffee.” Image: Stability AI

The new model is available on GitHub for researchers but not commercial use, and brings more options even as companies like Google and even Apple release their own image generation models.

Unlike Stability’s flagship Stable Diffusion models, Stable Cascade isn’t one large model; it’s three different models that rely on the Würstchen architecture. The first stage, stage C, compresses text prompts into latents (or smaller pieces of code) that are then passed to stages A and B to decode the request.
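To make the hand-off between the three stages concrete, here is a toy sketch in Python. It is not Stability AI's code and involves no real neural networks: the function names, the `Latent` type, and all sizes are hypothetical stand-ins, and it assumes the small latent from stage C passes through stage B and then stage A on its way to pixels.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Latent:
    """A compact numeric stand-in for what a stage passes along."""
    values: List[float]


def stage_c(prompt: str, size: int = 4) -> Latent:
    """Toy stage C: squeeze a text prompt into a small, fixed-size latent."""
    buckets = [0.0] * size
    for i, ch in enumerate(prompt):
        buckets[i % size] += ord(ch)
    total = sum(buckets) or 1.0  # avoid dividing by zero on an empty prompt
    return Latent([b / total for b in buckets])


def stage_b(latent: Latent, factor: int = 4) -> Latent:
    """Toy stage B: expand the small latent into a larger one."""
    return Latent([v for v in latent.values for _ in range(factor)])


def stage_a(latent: Latent, width: int = 4) -> List[List[int]]:
    """Toy stage A: decode the larger latent into a grid of 0-255 pixel values."""
    peak = max(latent.values) or 1.0
    pixels = [round(255 * v / peak) for v in latent.values]
    return [pixels[i:i + width] for i in range(0, len(pixels), width)]


def generate(prompt: str) -> List[List[int]]:
    """Chain the stages: prompt -> small latent -> larger latent -> pixels."""
    return stage_a(stage_b(stage_c(prompt)))


image = generate("penguin reading a book in a cafe")
print(len(image), "rows of", len(image[0]), "pixels")
```

The point of the sketch is only the shape of the pipeline: the expensive, prompt-driven step works on a very small representation, and the later stages are responsible for blowing it back up to a full image.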

Comparison of inference time: Stable Cascade vs. other models. Image: Stability AI

Breaking the request into smaller pieces means the model requires less memory (and fewer hours of training on those hard-to-find GPUs) and runs faster, while performing better “in both prompt alignment and aesthetic quality.” It took about 10 seconds to create an image, compared to 22 seconds for the SDXL model used currently.
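For a sense of scale, the two timings quoted above can be turned into a speedup figure with a few lines of Python. The helper names here are made up for illustration, and the inputs are simply the rounded 10-second and 22-second figures from the comparison, so the result is a rough estimate.

```python
def speedup(baseline_s: float, new_s: float) -> float:
    """How many times faster the new model is than the baseline."""
    return baseline_s / new_s


def time_saved_pct(baseline_s: float, new_s: float) -> float:
    """Percentage of the baseline generation time that is saved."""
    return 100 * (baseline_s - new_s) / baseline_s


sdxl_seconds = 22.0     # rounded figure quoted for SDXL
cascade_seconds = 10.0  # rounded figure quoted for Stable Cascade

print(f"{speedup(sdxl_seconds, cascade_seconds):.1f}x faster")            # 2.2x faster
print(f"{time_saved_pct(sdxl_seconds, cascade_seconds):.0f}% less time")  # 55% less time
```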

Stability AI helped popularize the stable diffusion method and has also been the subject of several lawsuits alleging Stable Diffusion trained on copyrighted data without permission from rights holders — a UK lawsuit by Getty Images against Stability AI is scheduled to go to trial in December. It began offering commercial licenses through a subscription in December, which the company said was necessary to help fund its research.
