Further Explorations Into Text-to-Image Machine Learning

NOTE: Also see this post, which summarizes all the Text-to-Image scripts supported by Visions of Chaos, with example images.

After my initial experiments with Big Sleep Text-to-Image generation, I looked around for more examples to play with. I was really impressed with Big Sleep, and you can see examples of its output in that original post. I still think Big Sleep is the best Text-to-Image code I have used so far, and better than any of the methods covered in this post.

Deep Daze

Deep Daze is by Phil Wang and the source code is available here.

Deep Daze tends to generate collage-like images. As the first example image shows, the raw output has a washed-out or faded look, so I put the rest of the example Deep Daze images through a quick Auto White Balance pass in GIMP.
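For anyone curious what that kind of correction does, here is a rough sketch. As I understand the GIMP documentation, Auto White Balance stretches the red, green and blue channels separately after discarding the rarest 0.05% of pixel values at each end of each histogram. The code below is a minimal pure-Python approximation of that idea, not GIMP's actual implementation; the function names, the `clip_fraction` parameter and the tiny four-pixel "image" are all made up for illustration.

```python
def stretch_channel(values, clip_fraction=0.0005):
    """Stretch one colour channel (ints 0-255) to the full 0-255 range,
    ignoring the darkest and brightest clip_fraction of the pixels."""
    ordered = sorted(values)
    n = len(ordered)
    k = int(n * clip_fraction)          # pixels to discard at each end
    lo, hi = ordered[k], ordered[n - 1 - k]
    if hi == lo:                        # flat channel: nothing to stretch
        return list(values)
    scale = 255.0 / (hi - lo)
    # Values outside [lo, hi] are clamped to 0 or 255.
    return [min(255, max(0, round((v - lo) * scale))) for v in values]


def auto_white_balance(pixels, clip_fraction=0.0005):
    """pixels is a list of (r, g, b) tuples; returns a new list of tuples."""
    channels = list(zip(*pixels))       # split into R, G and B sequences
    stretched = [stretch_channel(c, clip_fraction) for c in channels]
    return list(zip(*stretched))        # recombine into (r, g, b) tuples


# A hypothetical washed-out image: no channel reaches true black or white.
faded = [(100, 110, 120), (150, 150, 160), (180, 170, 200), (200, 190, 220)]
balanced = auto_white_balance(faded)
```

After the stretch, the darkest value in each channel maps to 0 and the brightest to 255, which is what removes the faded look. On a real image you would read the pixels with an image library rather than typing them in by hand.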

“H R Giger”

DeepDaze - H R Giger

“rainforest”

DeepDaze - Rainforest

“night club”

DeepDaze - Night Club

“seascape painting”

DeepDaze - Seascape Painting

“flowing water”

DeepDaze - Flowing Water

VQGAN-CLIP z+quantize

VQGAN-CLIP using a z+quantize method is from Katherine Crowson. Source code is available here.

This method also has the option to seed the initial model with an image rather than just random noise, but the following examples were all seeded with noise. The output tends to be divided up into rectangular regions, but the imagery is still interesting.

“H R Giger”

VQGAN-CLIP z+quantize - H R Giger

“rainforest”

VQGAN-CLIP z+quantize - Rainforest

“night club”

VQGAN-CLIP z+quantize - Night Club

“seascape painting”

VQGAN-CLIP z+quantize - Seascape Painting

“flowing water”

VQGAN-CLIP z+quantize - Flowing Water

VQGAN-CLIP codebook

VQGAN-CLIP using a codebook method is also from Katherine Crowson. Source code is available here.

The VQGAN-CLIP codebook method seems to give very similar images for different seeds, so I have only shown two examples for each phrase.

“H R Giger”

VQGAN-CLIP codebook - H R Giger

“rainforest”

VQGAN-CLIP codebook - Rainforest

“night club”

VQGAN-CLIP codebook - Night Club

“seascape painting”

VQGAN-CLIP codebook - Seascape Painting

“flowing water”

VQGAN-CLIP codebook - Flowing Water

Other Text-to-Image Models?

If you know of any other available Text-to-Image systems (that are freely available and shareable) let me know.

Availability

You can follow the above links and download the Python code yourself if you are so inclined.

I also include a basic GUI front-end for these Text-to-Image generators in Visions of Chaos. As long as you have the prerequisites installed (which you would need anyway to run these scripts outside Visions of Chaos), you can experiment with these models yourself without needing to use the command line.

Jason.

Softology's Blog

Fractals, Cellular Automata, Chaos Theory, Machine Learning, Science, Space, etc
