This is Part 7. There is also Part 1, Part 2, Part 3, Part 4, Part 5, Part 6 and Part 8.
This post continues listing the Text-to-Image scripts included with Visions of Chaos and some example outputs from each script.
Name: Multi-Perceptor VQGAN+CLIP v4
Author: Remi Durant
Original script: https://colab.research.google.com/drive/1peZ98vBihDD9A1v7JdH5VvHDUuW5tcRK
Time for 512×512 on a 3090: 2 minutes 36 seconds
Maximum resolution on a 24 GB 3090: 1120×480
Maximum resolution on an 8GB 2080: Unable to run on 8GB VRAM
Description: Version 4 of Remi’s Multi-Perceptor VQGAN+CLIP script.
a bronze sculpture of a garden
a church by Tadeusz Kantor
a color pencil sketch of a monkey hyperdetailed
a comic book panel of a lush rainforest
a matte painting of a witch by William Geissler
a peninsula by Ei-Q CGSociety
a surrealist sculpture of hell
an eyeball made of flowers
cyberpunk art of a canyon
lineart of dense woodland
Name: V-Majesty Diffusion v1.2
Authors: Original script by Dango233 and multimodalart
Original script: https://colab.research.google.com/github/multimodalart/MajestyDiffusion/blob/main/v.ipynb
Time for 512×512 on a 3090: 3 minutes 08 seconds
Maximum resolution on a 24 GB 3090: 1664×704.
Maximum resolution on an 8GB 2080: Unable to run on 8GB VRAM
Description: A new diffusion based script.
a black and white photo of war
a brownstone by Oliver Sin super detailed
a cute creature
a doctor
a drawing of a babbling brook photorealistic
a hill
a ninja by Michael Ford psychedelic
a photo of a beautiful young girl in a summer garden at dusk
a storybook illustration of a cozy den
New York City
Name: Latent Majesty Diffusion v1.3
Authors: Original script by Dango233 and multimodalart
Original script: https://colab.research.google.com/github/multimodalart/MajestyDiffusion/blob/main/latent.ipynb
Time for 512×512 on a 3090: 2 minutes 24 seconds
Maximum resolution on a 24 GB 3090: 512×512 (when using GFPGAN upscaling)
Maximum resolution on an 8GB 2080: Unable to run on 8GB VRAM
Description: Starts with a smaller resolution image (usually 256×256 pixels), upscales it with GFPGAN, and then does a few more diffusion passes. GFPGAN can really help get better coherency in faces.
a hyperrealistic painting of a cute creature
a hyperrealistic painting of an evil clown
a picture of a tree
a surrealist painting of kittens
an engraving of an angry woman made of voxels
an oil painting of an attractive woman by Eileen Aldridge
an ultrafine detailed painting of Bruce Willis 4K HD realism
Robert DeNiro ZBrush
Tweety Pie
Yoda
Name: Huemin JAX Diffusion v2.7
Author: Huemin
Original script: https://colab.research.google.com/github/huemin-art/jax-guided-diffusion/blob/v2.7/Huemin_Jax_Diffusion_2_7.ipynb
Time for 512×512 on a 3090: 3 minutes 55 seconds
Maximum resolution on a 24 GB 3090: 2496×1088
Maximum resolution on an 8GB 2080: Unable to run on 8GB VRAM
Description: Starts with a smaller resolution image (usually 256×256 pixels), upscales it with GFPGAN, and then does a few more diffusion passes. GFPGAN can really help get better coherency in faces.
a babbling brook
a mountain path Flickr
a river CGSociety
a spooky forest by John F. Peto
a storybook illustration of a mansion by Donald Roller Wilson
a surrealist sculpture of an alien forest
a watercolor painting of fear made of bones ZBrush
a wetland by Alexander Robertson super detailed
New York City
vector art of a bouquet of flowers ZBrush
Name: Disco Diffusion v5.2
Authors: Original script by @somnai, @gandamu and @zippy731
Original script: https://colab.research.google.com/github/zippy731/disco-diffusion-turbo/blob/skunk/Disco_Diffusion.ipynb
Time for 512×512 on a 3090: 2 minutes 2 seconds
Maximum resolution on a 24 GB 3090: 2496×1088
Maximum resolution on an 8GB 2080: 768×768
Description: Latest version of Disco Diffusion.
a bay by Louis Valtat Tri-X 400 TX
a hyperrealistic painting of a mansion
a hyperrealistic painting of a nightmare
a pond
a tributary by Jacob Marrel
an alien landscape
an island by Andrew Robertson IMAX
Frankenstein made of liquid metal CryEngine
medusa by Gai Qi
the Amazon Rainforest
Name: DALL-E Mini
Author: Original script by Boris Dayma
Original script: https://colab.research.google.com/github/borisdayma/dalle-mini/blob/main/tools/inference/inference_pipeline.ipynb
Time for 512×512 on a 3090: Locked to 256×256 – 1 minute 13 seconds
Maximum resolution on a 24 GB 3090: 256×256
Maximum resolution on an 8GB 2080: 256×256
Description: Capable of rendering multiple images in one pass. Very nice results. Limited to 256×256 at this time. These examples show a 4×4 grid of 16 images for each prompt.
a fine art painting of a fire breathing dragon
a hyperrealistic painting of an ugly monster
a surrealist painting of a king
an ultrafine detailed painting of fear
Name: Latent Majesty Diffusion v1.6
Authors: Original script by Dango233 and multimodalart
Original script: https://colab.research.google.com/github/multimodalart/MajestyDiffusion/blob/main/latent.ipynb
Time for 512×512 on a 3090: 2 minutes 07 seconds
Maximum resolution on a 24 GB 3090: 512×512 (when using GFPGAN upscaling)
Maximum resolution on an 8GB 2080: Unable to run on 8GB VRAM
Description: The latest amazing update to Latent Diffusion. Awesome colors, textures, lighting, details, coherency. Highly recommended.
a colorful parrot
a detailed matte painting of puppies
a gallery
a mountain cabin by Tom Palin 4K HD realism
a renaissance painting of a spooky forest
a school of tropical fish by Jane Carpanini
an ugly creature
the Amazon Rainforest photorealistic
The Grinch
Yoda trending on Flickr
Name: Disco Diffusion v5.4
Authors: Original script by @somnai, @gandamu, @zippy731 and @devdef
Original script: https://colab.research.google.com/github/alembics/disco-diffusion/blob/main/Disco_Diffusion.ipynb
Time for 512×512 on a 3090: 2 minutes 20 seconds
Maximum resolution on a 24 GB 3090: 2496×1088
Maximum resolution on an 8GB 2080: 768×768
Description: Latest version of Disco Diffusion.
a 3D render of the Grand Canyon by Dóra Keresztes
a hyperrealistic painting of a zombie made of cheese and feathers CryEngine and rendered in Cinema4D
a macro photograph of an ugly creature
a matte painting of a monument
a picture of a vast city
a skyscraper
a thunder storm
Jason Vorhees by Chen Chi
reflective spheres hyperrealistic
the country by Robert Thomas and Chen Jiru rendered in unreal engine and 4K photo
Name: Pixel Art Diffusion v3
Authors: Original script by @somnai, @gandamu, @zippy731 and @KaliYuga_ai
Original script: https://colab.research.google.com/github/KaliYuga-ai/Pixel-Art-Diffusion/blob/main/Pixel_Art_Diffusion_v3_0_(With_Disco_Symmetry).ipynb
Time for 512×512 on a 3090: 3 minutes 14 seconds
Maximum resolution on a 24 GB 3090: 1920×1088
Maximum resolution on an 8GB 2080: Unable to run on 8GB VRAM
Description: Uses a fine tuned model to generate pixel art like imagery.
a bedroom hyperrealistic and hyperdetailed #pixelart
a cephalopod IMAX and lens flare #pixelart
a cinematic painting of a cottage #pixelart
a cross stitch of a townhouse made of voxels and timber #pixelart
a pastel of a cute girl #pixelart
a surrealist sculpture of Charmander #pixelart
an alien city by Anders Zorn and Laura Muntz Lyall photorealistic and CGSociety #pixelart
concept art of a wetland ZBrush and CryEngine #pixelart
Frankenstein made of string and vines trending on pixiv and hyperrealistic #pixelart
pixel art of a cloudy sunset #pixelart
Name: Disco Diffusion v5.6
Authors: Original script by @somnai and @gandamu
Original script: https://colab.research.google.com/github/alembics/disco-diffusion/blob/main/Disco_Diffusion.ipynb
Time for 512×512 on a 3090: 3 minutes 34 seconds
Maximum resolution on a 24 GB 3090: 2496×1088
Maximum resolution on an 8GB 2080: 768×768
Description: Latest version of Disco Diffusion.
a collage painting of a lush rainforest by Doc Hammer and Alexander Ivanov hyperrealistic and CryEngine
a cubist painting of a lion and a sunset CryEngine and trending on pixiv
a fine art painting of a zombie
a gulf by I Ketut Soki and Alfons von Czibulka
a monastery trending on Flickr and #film
a morning landscape
a prairie CGSociety and CryEngine
a werewolf
ballpoint pen art of a monument
cyberpunk art of heaven filmic and CryEngine
Name: CLIP Guided k-diffusion
Author: Original script by Katherine Crowson
Original script: https://colab.research.google.com/drive/1w0HQqxOKCk37orHATPxV8qb0wb4v-qa0
Time for 512×512 on a 3090: 6 minutes 56 seconds
Maximum resolution on a 24 GB 3090: Fixed to 512×512 resolution.
Maximum resolution on an 8GB 2080: Unable to run on 8GB VRAM
Description: A new script by Katherine. Seems to generate more abstract results and these example images needed a long run of random prompts to select from.
a jigsaw puzzle of paranoia by Petr Brandl and Sasha Putrya
a landscape vivid colors
a pastel of Cookie Monster by Ren Bonian and Ángel Botello for sale on Facebook Marketplace and CryEngine
a reef
a renaissance painting of Al Pacino
a statue of a submarine made of metal and crystals by James Sessions American painter and Elfriede Lohse-Wächtler
an airbrush painting of a nightmare creature vivid colors and rendered in Cinema4D
an oil painting of a cephalopod made of paper and mist
an ugly person and an area 4K HD realism and trending on pixiv
conceptual art of an ugly monster
Name: CLIP Prior + VQGAN (MSE method)
Author: Original script by Katherine Crowson
Original script: https://colab.research.google.com/drive/1yOpCY9eXvzELHppvh-o0DevhxVYOGr5i
Time for 512×512 on a 3090: 3 minutes 31 seconds
Maximum resolution on a 24 GB 3090: 832×512
Maximum resolution on an 8GB 2080: Unable to run on 8GB VRAM
Description: A new script by Katherine. Can give some interesting details but coherence may suffer at larger resolutions.
a collage painting of a tiger vivid colors and photorealistic
a cove 4K photo and CryEngine
a cute creature
a glacier
a space nebula
a townhouse
a valley
an oil painting of a peacock by Wu Hong and Eve Ryder
Cthulhu
digital art of a wetland made of cheese and timber by Jacob Duck and Jacob Gerritsz Cuyp
Name: Latent Diffusion LAION_400M v2
Author: Original script by pesser
Original script: https://github.com/pesser/stable-diffusion
Time for 16 256×256 images on a 3090: 49 seconds
Maximum resolution on a 24 GB 3090: 512×512
Maximum resolution on an 8GB 2080: Unable to run on 8GB VRAM
Description: Renders multiple images quickly. Coherency is best at 256×256 so these example images are 2×2 tiled results. Each took 35 seconds on a 3090.
a babbling brook
a colorful parrot
a fine art painting of a castle
a matte painting of a rose
a pencil sketch of a cave 4K photo and hyperrealistic
a photorealistic painting of Cthulhu for sale on Facebook Marketplace and Flickr
a surrealist painting of a cloudy sunset
a surrealist painting of a monkey
an illustration of of a tiger by Stanley Twardowicz and Antoni Pitxot
an impressionist painting of a cottage
Name: Stable Diffusion
Author: Original script by pesser
Original script: https://github.com/CompVis/stable-diffusion
Time for 512×512 on a 3090: 34 seconds
Maximum resolution on a 24 GB 3090: 1280×640
Maximum resolution on an 8GB 2080: 640×576
Description: Incredible. Latest and greatest. Beats all previous Text-to-Image systems. If you only use one, use this one.
a black and white photo of puppies
a cathedral rendered in unreal engine and super detailed
a city made of mist trending on ArtStation and trending on Flickr
a detailed matte painting of a lush rainforest made of crystals and feathers
a king
a polaroid photo of a clown vivid colors and 8K 3D
an airbrush painting of the Terminator CryEngine and for sale on Facebook Marketplace
an ambient occlusion render of a wetland by William Forsyth and Victorine Foot trending on pixiv and CryEngine
poster art of a farm by Frederic Leighton and Yang Borun rendered in unreal engine and 8K 3D
the Australian outback
Name: Deforum Stable Diffusion v0.3
Author: Original script by Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer
Original script: https://colab.research.google.com/github/deforum/stable-diffusion/blob/main/Deforum_Stable_Diffusion.ipynb
Time for 512×512 on a 3090: 34 seconds
Maximum resolution on a 24 GB 3090: 1280×640
Maximum resolution on an 8GB 2080: 640×576
Description: Incredible. Latest and greatest. Beats all previous Text-to-Image systems. If you only use one, use this one. Deforum builds upon Stable Diffusion with animation support.
a babbling brook
a forest path
a photo of a lake
a ranch by Nikolai Alekseyevich Kasatkin and Harriet Zeitlin
a watercolor painting of a rectory hyperdetailed and trending on ArtStation
Al Pacino by Lam Qua and George Frederick Harris
an angry person by Kazys Varnelis and Dóra Keresztes
an astronaut
puppies
war
giger xenomorphs, airbrush, HD, 4K, 8K, hyperrealistic, highly detailed, highly textured
Any Others I Missed?
Do you know of any other colabs and/or github Text-to-Image systems I have missed? Let me know and I will see if I can convert them to work with Visions of Chaos for a future release. If you know of any public Discords with other colabs being shared let me know too.
Jason.