This is Part 6. There is also Part 1, Part 2, Part 3, Part 4, Part 5, Part 7 and Part 8.
This post continues listing the Text-to-Image scripts included with Visions of Chaos and some example outputs from each script.
Name: Augmented CLIP Guided Diffusion
Author: Peter Baylies
Original script: https://github.com/pbaylies/Augmented_CLIP
Time for 512×512 on a 3090: 1 minutes 16 seconds
Maximum resolution on a 24 GB 3090: 1664×704
Maximum resolution on an 8GB 2080: 256×256 57 seconds
Description: Another CLIP Guided Diffusion script. Fast. Gives unique textured results.
a detailed painting of people by Nicolette Macnamara
a diagram of a nightmare creature made of gold
a nightmare creature
a painting of a cabin next to a stream in a secluded forest
a storybook illustration of Jabba The Hutt by Carle Hessay
a werewolf by A R Middleton Todd
an oil painting of Big Bird
Gandalf trending on pixiv
Lovecraftian horror
poster art of the Las Vegas strip by George Passantino
Name: Princess Generator
Author: Dango233
Original script: https://colab.research.google.com/drive/1QgH9TvQMXR3PpEGBcHnghtEcwFDXLaYE
Time for 512×512 on a 3090: 2 minutes 38 seconds
Maximum resolution on a 24 GB 3090: 1664×704
Maximum resolution on an 8GB 2080: Unable to run on 8GB VRAM.
Description: The latest update to “CLIP Guided Diffusion v6” from Dango233. Can give some superb results. Worth exploring and experimenting with further.
a cloudy sunset
a fireplace by Jacob More
a happy alien by James Jarvaise
a mountain path by Stephen Pace
a raytraced image of a western town
a teddy bear
Charmander made of wood by Hua Yan
dense woodland by Marie Angel
paranoia by Floris van Dyck
portrait of Princess Victoria trending on artstation
Name: Disco Diffusion v4.1
Author: @Somnai
Original script: https://colab.research.google.com/drive/1sHfRn5Y0YKYKi1k-ifUSBFRNJ8_1sa39
Time for 512×512 on a 3090: 1 minute 57 seconds
Maximum resolution on a 24 GB 3090: 2496×1088
Maximum resolution on an 8GB 2080: 1152×512. 4 minutes 39 seconds.
Description: The latest update to Disco Diffusion. Really nice detailed outputs. Low VRAM requirments allow huge sized images. I didn’t realise I had 3 zombie themed results in this random batch.
a bronze sculpture of a zombie
a fantasy land
a pencil sketch of Cthulhu by Rudolf Koller
a pop art painting of zombies
a portrait of a young boy by Hendrick Cornelisz. van Vliet
a tree by Philips Wouwerman
a western town
a zombie
Han Solo psychedelic
vector art of the Amazon Rainforest
Name: Hypertron v2
Author: Philipuss
Original script: https://colab.research.google.com/drive/10fa8X6EsfZfda1dfhJ_BtfPZ7Te1WGoX
Time for 512×512 on a 3090: 1 minute 57 seconds
Maximum resolution on a 24 GB 3090: 1120×480.
Maximum resolution on an 8GB 2080: 256×256 2 minutes 18 seconds
Description: Version 2 of Hypertron. More models, more flavors. Works OK. Can give the “image in a sea of purple/grey” that previous MSE based scripts suffered from. Can give good results if you let it run a large random batch overnight.
a bronze sculpture of a spooky forest by Herb Aach
a diamond made of flowers
a gouache of an android by Wu Bin
a photo of a kitchen
a photorealistic painting of a cemetery
a sketch of a haunted house
a tattoo of Squirtle made of clay
an art deco painting of a human by Nicolas Lancret 8K 3D
goldfish by Elfriede Lohse-Wächtler
Lovecraftian horror by Aileen Eagleton
Name: CC12M Diffusion
Author: Katherine Crowson
Original script: https://colab.research.google.com/drive/1TBo4saFn1BCSfgXsmREFrUl3zSQFg6CC
Time for 512×512 on a 3090: 1 minute 48 seconds
Maximum resolution on a 24 GB 3090: 1664×704.
Maximum resolution on an 8GB 2080: 832×512 2 minutes 59 seconds
Description: Can support higher resolutions, but the coherance really falls apart with anything over 256×256. It handles multiple images at once, so these examples are 4 256×256 results.
a beachside resort
a bouquet of flowers
a castle
a cemetery
a cephalopod by Walter Stuempfig super detailed
a color pencil sketch of a bedroom super detailed
a kitchen
a mountainscape
a nightclub
a vast city
Name: Disco Diffusion v5
Authors: @Somnai and @Gandamu
Original script: https://colab.research.google.com/github/alembics/disco-diffusion/blob/main/Disco_Diffusion.ipynb
Time for 512×512 on a 3090: 2 minutes 02 seconds
Maximum resolution on a 24 GB 3090: 2496×1088
Maximum resolution on an 8GB 2080: 1152×512. 4 minutes 43 seconds.
Description: The latest update to Disco Diffusion.
a cloudy sunset
a crying person made of wrought iron by František Jakub Prokyš psychedelic
a flemish baroque of a school of tropical fish
a low poly render of puppies
a morning landscape
a mosaic of a worried man by Paul Lohse
a thunder storm by Cornelis Claesz van Wieringen
a tropical beach
computer rendering of an evil alien 4K HD realism
the human condition Flickr
Name: Disco Diffusion v5 Turbo Smooth
Authors: Chris Allen
Original script: https://colab.research.google.com/github/zippy731/disco-diffusion-turbo/blob/main/Disco_Diffusion_v5_Turbo_%5Bw_3D_animation%5D.ipynb
Time for 512×512 on a 3090: 1 minutes 14 seconds
Maximum resolution on a 24 GB 3090: 2496×1088
Maximum resolution on an 8GB 2080: 832×384. 2 minutes 21 seconds.
Description: An updated version of Disco Diffusion v5 that gives fast and smooth movie outputs.
a black and white photo of a lush rainforest trending on Flickr
a detailed matte painting of a factory
a hacker by Mykola Burachek
a sea monster CGSociety
a surrealist painting of a happy person
a tardigrade by Cosmo Alexander
an anime drawing of an evening landscape by Daphne Fedarb photorealistic
an art deco painting of a happy person by John Uzzell Edwards
chalk art of a bouquet of flowers
the human condition
Name: Augmented CLIP Guided Diffusion v2
Author: Peter Baylies
Original script: https://github.com/pbaylies/Augmented_CLIP
Time for 512×512 on a 3090: 2 minutes 48 seconds
Maximum resolution on a 24 GB 3090: 1664×704
Maximum resolution on an 8GB 2080: 512×512 4 minutes 56 seconds
Description: Updaterd version of the Augmented CLIP Guided Diffusion script.
a bungalow 4K HD realism
a forest fire
a lush rainforest CryEngine
a painting of a kitchen by Betye Saar
a portrait of a princess trending on artstation
a spooky forest
a tattoo of a zombie
a werewolf by David Cooke Gibson
an oil painting of a lake
an ugly man
Name: v-diffusion
Author: Katherine Crowson
Original script: https://github.com/crowsonkb/v-diffusion-pytorch
Time for 512×512 on a 3090: 3 minutes 57 seconds
Maximum resolution on a 24 GB 3090: 896×512 or 640×640.
Maximum resolution on an 8GB 2080: 128×128 1 minute 19 seconds
Description: Updated version of Velocity-Diffusion. Tends to make incoherant collage images over 256×256.
a black and white photo of a portrait of a young girl
a cityscape by Lujo Bezeredi
a cloudy sunset
a hologram of a sad face by Josef Šíma
a lounge room by Riad Beyrouti IMAX
a mountain path
a portrait of a young boy made of metal
a portrait of a young girl
a space nebula
an acrylic painting of a mountain range
Name: GLID-3
Author: Jack Qiao
Original script: https://github.com/Jack000/glid-3
Time for 512×512 on a 3090: 35 seconds
Maximum resolution on a 24 GB 3090: 768×768.
Maximum resolution on an 8GB 2080: 512×512 50 seconds
Description: Great textures and lighting. Poor image coherency.
a cemetery
a drawing of a cloudy sunset
a drawing of a human lens flare
a lake
a large waterfall made of silver
a marina
a minimalist painting of a teddy bear by Johann Ludwig Bleuler
a renaissance painting of paranoia made of vines
an abbey by Cornelis Pietersz
an art deco painting of a rose
Name: Disco Diffusion v5.1
Authors: @Somnai, @Gandamu and Chris Allen
Original script: https://colab.research.google.com/github/alembics/disco-diffusion/blob/main/Disco_Diffusion.ipynb
Time for 512×512 on a 3090: 2 minutes 05 seconds
Maximum resolution on a 24 GB 3090: 2496×1088
Maximum resolution on an 8GB 2080: 1152×512. 4 minutes 40 seconds.
Description: Latest version of Disco Diffusion incorporating the “Turbo” features of v5 that gives fast and smooth movie outputs.
a flemish baroque of a sunset
a marsh
a mid-nineteenth century engraving of New York City
a minimalist painting of a cephalopod
a photo of Dracula
a watercolor painting of a knight
an ugly person by Samuel Colman trending on ArtStation
chalk art of Gandalf
lineart of a zombie
the Amazon Rainforest 4K HD realism
Name: Latent Diffusion LAION_400M
Authors: @multimodalart
Original script: https://colab.research.google.com/github/multimodalart/latent-diffusion-notebook/blob/main/Latent_Diffusion_LAION_400M_model_text_to_image.ipynb
Time for 512×512 on a 3090: 57 seconds
Maximum resolution on a 24 GB 3090: 1152×512 or 768×768
Maximum resolution on an 8GB 2080: 256×256. 1 minute 12 seconds.
Description: A new script based on the newly trained LAION_400M moidel. Impressive results at 256×256. Loses coherency at larger sizes. These examples are 4 256×256 images of each prompt.
a black and white photo of a nightmare creature
a futuristic city
a hyperrealistic painting of a queen made of flowers
a painting of a happy clown
a skeleton
a stained glass window 4K HD realism
a watercolor painting of a lounge room
an eagle
an ultrafine detailed painting of Harry Potter
vector art of a zombie by Oskar Kokoschka
Name: JAX CLIP Guided Diffusion v2.7
Author: nshepperd
Original script: https://colab.research.google.com/drive/1nmtcbQsE8sTjfLJ1u3Y4d6vi9ZTAvQph
Time for 512×512 on a 3090: 2 minutes 37 seconds
Maximum resolution on a 24 GB 3090: 2496×1088
Maximum resolution on an 8GB 2080: 512×512. 3 minutes 59 seconds.
Description: ANother diffusion based script. Can give very nice high detail results.
a Dalek made of feathers
a haunted house
a picture of a chateau by Odhise Paskali
a refinery
a studio by Allan Ramsay trending on ArtStation
a sunset
a thunder storm
a watercolor painting of a fire breathing dragon
a witch made of mist
the tropics by Thomas de Keyser
Name: GLID-3-XL
Author: Jack Qiao
Original script: https://github.com/Jack000/glid-3-xl
Time for 512×512 on a 3090: 1 minute 04 seconds
Maximum resolution on a 24 GB 3090: 512×512.
Maximum resolution on an 8GB 2080: Unable to run on 8GB VRAM.
Description: Improved/updated version of GLID-3. Uses CLIP for better accuracy. Great textures and lighting. Poor image coherency when over 256×256.
a demon
a detailed matte painting of a bouquet of flowers
a kitchen
a photorealistic painting of a movie monster hyperrealistic
a picture of The Incredible Hulk by Kazimir Malevich
a pop art painting of an angry woman
a spooky forest
an abbey
New York City by Marie Courtois
poster art of Gandalf vivid colors
Name: ruDALL-E Aspect Ratio
Author: Alex Shonenkov
Original script: https://github.com/shonenkov-AI/rudalle-aspect-ratio
Time for 512×512 on a 3090: N/A
Maximum resolution on a 24 GB 3090: N/A
Maximum resolution on an 8GB 2080: Unable to run on 8GB VRAM.
Description: Version of ruDALL-E that generates wide and/or tall aspect ratio images. The shorter side is limited to 256 pixels. Results can be very nice. Will generate multiple images at once, so these sample images have 4 results per prompt.
a black and white photo of a werewolf
a cartoon of a swamp
a large waterfall made of metal
a lounge room
a matte painting of a townhouse
a palace made of mist
a photo of an ugly woman
a tropical beach
an evil clown
dense woodland
Any Others I Missed?
Do you know of any other colabs and/or github Text-to-Image systems I have missed? Let me know and I will see if I can convert them to work with Visions of Chaos for a future release. If you know of any public Discords with other colabs being shared let me know too.
Jason.
Love the work you’ve been doing compiling these! I was trying to play with the midi composer features and was getting an instant failure on each one… the console displays
ModuleNotFoundError: No module named ‘music21’
Any idea what might be the cause?
Missing the music21 library. You need to have followed all these steps.
https://softology.pro/tutorials/tensorflow/tensorflow.htm
There is a pip install for music21.
Ah, looks like I’d missed that they changed as of version 92. Thanks for the heads up!
Not sure if you’re aware, but your main VoC website seems to be down right now and has been for several days now
Domain has changed from softology.com.au to softology.pro. All links in these blogs and Visions of Chaos are updated.
Is there a way to get things like Disco Diffusion, and other notebooks like it on Visions of Chaos? Someone posted in a FB group about it, but I can’t find where or how to get it on your software.
You need to enable machine learning mode. Select that mode menu and you will be taken to the instructions page.
I am so blind. 🙂 thank you!
Feel free to join my Discord if you have other issues or just want to ask questions of other users or post some art. Link under the Help menu in Visions of Chaos.