Text-to-Image Summary – Part 8

This is Part 8. There is also Part 1, Part 2, Part 3, Part 4, Part 5, Part 6 and Part 7.

This post continues listing the Text-to-Image scripts included with Visions of Chaos and some example outputs from each script.


Name: Deforum Stable Diffusion v0.4
Author: Original script by Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer
Original script: https://colab.research.google.com/github/deforum/stable-diffusion/blob/main/Deforum_Stable_Diffusion.ipynb
Time for 512×512 on a 3090: 34 seconds
Maximum resolution on a 24 GB 3090: 1280×640
Maximum resolution on an 8GB 2080: 640×576
Description: Incredible. Latest and greatest. Beats all previous Text-to-Image systems. If you only use one, use this one. Deforum builds upon Stable Diffusion with animation support. v0.4 is the latest version.

'a canal' Deforum Stable Diffusion v0.4
a canal

'a forest path' Deforum Stable Diffusion v0.4
a forest path

'a loft' Deforum Stable Diffusion v0.4
a loft

'a matte painting of a river hyperdetailed and CryEngine' Deforum Stable Diffusion v0.4
a matte painting of a river hyperdetailed and CryEngine

'a painting of the tropics' Deforum Stable Diffusion v0.4
a painting of the tropics

'a pastel of a nightmare 4K HD realism and trending on Flickr' Deforum Stable Diffusion v0.4
a pastel of a nightmare 4K HD realism and trending on Flickr

'a photorealistic painting of Cookie Monster rendered in unreal engine and CGSociety' Deforum Stable Diffusion v0.4
a photorealistic painting of Cookie Monster rendered in unreal engine and CGSociety

'a tropical beach by Karl Hagedorn and Michalis Oikonomou' Deforum Stable Diffusion v0.4
a tropical beach by Karl Hagedorn and Michalis Oikonomou

'an etching of King Kong' Deforum Stable Diffusion v0.4
an etching of King Kong

'concept art of Gandalf CGSociety and 4K HD realism' Deforum Stable Diffusion v0.4
concept art of Gandalf CGSociety and 4K HD realism

lovecraftian cthulhu tentacle horrors by giger and beksinski, highly textured, 8K 4K HD

roses in the rain, rosebuds, rain drops, 8K 4K HD


Name: Deforum Stable Diffusion v0.5
Author: Original script by Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer
Original script: https://colab.research.google.com/github/deforum/stable-diffusion/blob/main/Deforum_Stable_Diffusion.ipynb
Time for 512×512 on a 3090: 34 seconds
Maximum resolution on a 24 GB 3090: 1280×640
Maximum resolution on an 8GB 2080: 640×576
Description: Incredible. Latest and greatest. Beats all previous Text-to-Image systems. If you only use one, use this one. Deforum builds upon Stable Diffusion with animation support. v0.5 is the latest version.

'a castle' Deforum Stable Diffusion v0.5
a castle

'a cute monster' Deforum Stable Diffusion v0.5
a cute monster

'a fine art painting of humans rendered in unreal engine and trending on pixiv' Deforum Stable Diffusion v0.5
a fine art painting of humans rendered in unreal engine and trending on pixiv

'a pop art painting of Frankenstein by Kim Hwan-gi and Zha Shibiao' Deforum Stable Diffusion v0.5
a pop art painting of Frankenstein by Kim Hwan-gi and Zha Shibiao

'a sorceress by Adolf Fényes and Rodolfo Morales for sale on Facebook Marketplace and trending on ArtStation' Deforum Stable Diffusion v0.5
a sorceress by Adolf Fényes and Rodolfo Morales for sale on Facebook Marketplace and trending on ArtStation

'a watercolor painting of a farm by József Breznay and John Zephaniah Bell' Deforum Stable Diffusion v0.5
a watercolor painting of a farm by József Breznay and John Zephaniah Bell

'an eagle made of feathers and silver' Deforum Stable Diffusion v0.5
an eagle made of feathers and silver

'an ugly face' Deforum Stable Diffusion v0.5
an ugly face

'puppies' Deforum Stable Diffusion v0.5
puppies

'street art of Jason Vorhees' Deforum Stable Diffusion v0.5
street art of Jason Vorhees

colorful surrealism by dali, giger, beksinski and haeckel

nebula galaxy planets hubble


Name: Deforum Stable Diffusion v0.6
Author: Original script by Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer
Original script: https://colab.research.google.com/github/deforum-art/deforum-stable-diffusion/blob/main/Deforum_Stable_Diffusion.ipynb
Time for 512×512 on a 3090: 34 seconds
Maximum resolution on a 24 GB 3090: 1280×640
Maximum resolution on an 8GB 2080: 640×576
Description: Incredible. Latest and greatest. Beats all previous Text-to-Image systems. If you only use one, use this one. Deforum builds upon Stable Diffusion with animation support. v0.6 is the latest version.

'a bedroom' Deforum Stable Diffusion v0.6
a bedroom

'a bronze sculpture of Robert DeNiro rendered in unreal engine and trending on Flickr' Deforum Stable Diffusion v0.6
a bronze sculpture of Robert DeNiro rendered in unreal engine and trending on Flickr

'a chinese painting of a peacock by Agnes Lawrence Pelton and Bob Thompson' Deforum Stable Diffusion v0.6
a chinese painting of a peacock by Agnes Lawrence Pelton and Bob Thompson

'a cute girl 4K HD realism and 8K 3D' Deforum Stable Diffusion v0.6
a cute girl 4K HD realism and 8K 3D

'a fine art painting of a palace made of mist' Deforum Stable Diffusion v0.6
a fine art painting of a palace made of mist

'a green tree frog' Deforum Stable Diffusion v0.6
a green tree frog

'a lion' Deforum Stable Diffusion v0.6
a lion

'a storybook illustration of the Australian outback' Deforum Stable Diffusion v0.6
a storybook illustration of the Australian outback

'ballpoint pen art of Frankenstein' Deforum Stable Diffusion v0.6
ballpoint pen art of Frankenstein

'Brad Pitt by Rhea Carmi and Robert Bechtle' Deforum Stable Diffusion v0.6
Brad Pitt by Rhea Carmi and Robert Bechtle

beauty, 4K, 8K, HD, hyper detailed, high detail, surrealism

an oil painting by Picasso and van Gogh, 4K, 8K, HD, hyper detailed, high detail, surrealism


Name: Stable Diffusion v2
Author: Original script by Robin Rombach et al
Original script: https://github.com/Stability-AI/stablediffusion
Time for 768×768 on a 3090: 42 seconds
Maximum resolution on a 24 GB 3090: 1664×704
Maximum resolution on an 8GB 2080: Unable to run on an 8GB GPU.
Description: Uses a newly trained version of the Stable Diffusion model that renders native at 768×768. The following examples show 768×768 sized output.

'a cave' Stable Diffusion v2
a cave

'a detailed painting of fear IMAX and Flickr' Stable Diffusion v2
a detailed painting of fear IMAX and Flickr

'a digital rendering of a human made of chrome and gold' Stable Diffusion v2
a digital rendering of a human made of chrome and gold

'a mansion' Stable Diffusion v2
a mansion

'a portrait of a sad clown' Stable Diffusion v2
a portrait of a sad clown

'a spooky forest' Stable Diffusion v2
a spooky forest

'a storybook illustration of a lush rainforest for sale on Facebook Marketplace and #film' Stable Diffusion v2
a storybook illustration of a lush rainforest for sale on Facebook Marketplace and #film

'an etching of a babbling brook' Stable Diffusion v2
an etching of a babbling brook

'an oil painting of a castle in the mountains' Stable Diffusion v2
an oil painting of a castle in the mountains

'Yoda' Stable Diffusion v2
Yoda


Name: Stable Diffusion v2.1
Author: Original script by Robin Rombach et al
Original script: https://github.com/Stability-AI/stablediffusion
Time for 768×768 on a 3090: 42 seconds
Maximum resolution on a 24 GB 3090: 1664×704
Maximum resolution on an 8GB 2080: Unable to run on an 8GB GPU.
Description: Updated Stable Diffusion model. The following examples show 768×768 sized output.

'a detailed drawing of Frankenstein' Stable Diffusion v2.1
a detailed drawing of Frankenstein

'a forest clearing' Stable Diffusion v2.1
a forest clearing

'a frog hyperrealistic and photorealistic' Stable Diffusion v2.1
a frog hyperrealistic and photorealistic

'a mountain cabin' Stable Diffusion v2.1
a mountain cabin

'a sad clown' Stable Diffusion v2.1
a sad clown

'a surrealist sculpture of eyeballs' Stable Diffusion v2.1
a surrealist sculpture of eyeballs

'a swamp hyperdetailed and rendered in unreal engine' Stable Diffusion v2.1
a swamp hyperdetailed and rendered in unreal engine

'a townhouse photorealistic and lens flare' Stable Diffusion v2.1
a townhouse photorealistic and lens flare

'an ink drawing of Al Pacino' Stable Diffusion v2.1
an ink drawing of Al Pacino

'an ugly creature' Stable Diffusion v2.1
an ugly creature


Name: Deforum Stable Diffusion v0.7
Author: Original script by Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer
Original script: https://colab.research.google.com/github/deforum-art/deforum-stable-diffusion/blob/main/Deforum_Stable_Diffusion.ipynb
Time for 768×768 on a 3090: 2 minutes 50 seconds
Maximum resolution on a 24 GB 3090: 2496×1088
Maximum resolution on an 8GB 2080: 640×576
Description: Now supports Stable Diffusion v2.1 model for 768×768 resolution.

'a cave' Deforum Stable Diffusion v0.7
a cave

'a forest clearing' Deforum Stable Diffusion v0.7
a forest clearing

'a lion' Deforum Stable Diffusion v0.7
a lion

'a pastel of Big Bird by John Blair and Christoph Ludwig Agricola CryEngine and 4K HD realism' Deforum Stable Diffusion v0.7
a pastel of Big Bird by John Blair and Christoph Ludwig Agricola CryEngine and 4K HD realism

'a tributary' Deforum Stable Diffusion v0.7
a tributary

'a watercolor painting of a western town trending on ArtStation and Tri-X 400 TX' Deforum Stable Diffusion v0.7
a watercolor painting of a western town trending on ArtStation and Tri-X 400 TX

'a werewolf' Deforum Stable Diffusion v0.7
a werewolf

'an abstract painting of Gandalf' Deforum Stable Diffusion v0.7
an abstract painting of Gandalf

'an alien forest IMAX and vivid colors' Deforum Stable Diffusion v0.7
an alien forest IMAX and vivid colors

'an engraving of a cute girl' Deforum Stable Diffusion v0.7
an engraving of a cute girl

a hyperrealistic matte painting of melting color, 4K, 8K, HD, high detail, hyper detailed

a hyperrealistic matte painting of a lush rainforest, 4K, 8K, HD, high detail, hyper detailed

a hyperrealistic matte painting of a magical glowing mushroom forest at night, 4K, 8K, HD, high detail, hyper detailed


Any Others I Missed?

Do you know of any other colabs and/or github Text-to-Image systems I have missed? Let me know and I will see if I can convert them to work with Visions of Chaos for a future release. If you know of any public Discords with other colabs being shared let me know too.

Jason.