AI扩展 - Tools and Resources for AI Art - 《Python1024》

Image Super-resolution
Image to Text
StyleGAN
Other
Book Reviews
Cool Apps

原链接：https://pharmapsychotic.com/tools.html

以下留存备份

Looking to get started with AI art? A good place to start is one of the popular apps like Wombo or NightCafe. You can get a quick sense of how you can use words and phrases to guide image generation. Read up on prompt engineering to improve your results. Then you can graduate up to using Google Colab notebooks linked below like Disco Diffusion. If you have a good nVidia GPU of your own then you can also use Visions of Chaos to run the most popular notebooks locally.

Image Super-resolution

Gigapixel AI by Topaz Labs (costs $99) <- voted #1
Real-ESRGAN- (github) <- voted #2
Real-ESRGAN Sber - a nice fine tuned ESRGAN model
SuperRes Diffusion - Batch upscaling and super resolution with latent-diffusion
SwinIR - Hugging Face space
Upscale Model Database - big set of pretrained models for upscaling different types of content

Image to Text

StyleGAN
[2022/04/25] StyleGAN-Humans + CLIP modified by Diego Porres to use StyleGAN3
StyleGAN2-ADA - train your own StyleGAN2 model from an image set you create
StyleCLIP - Text-drive manipulation of StyleGAN imagery
Structured Dreaming - Styledreams With helpers
Structured Dreaming (CLIP+StyleGAN) by @ArYoMo (tweet)
StyleGAN 2 pretrained models - can use these with Structured Dreaming
StyleGAN 2 awesome pretrained models - BIG collection of models
StyleGAN 3 training - train a StyleGAN and do interpolation video by @dvsch
StyleGAN 3 music video generation - (tweet)
StyleGAN 3 + CLIP by Annas
StyleGAN3 + CLIP by @nshepperd1 and @RiversHaveWings
StyleGANXL + CLIP by Eugenio Herrera and Rodrigo Mello
Lucid Sonic Dreams - animate path through StyleGAN latent space with music (github)

Text
Goose.ai Playground - can use their playground to generate text with GPT-Neo
GPT Neo Colab notebook - use GPT-neo 1.3B and 2.7B from Google colab
GPT Neo HuggingFace - run GPT-neo 2.7B on HuggingFace
Neuralism Generative Art Prompt Generator - generate prompts to use for text to image
OpenAI GPT3 Playground - generate text with GPT-3 (requires free account)
Prompt Parrot by @KyrickYoung - train GPT2 on a list of your prompts
MadLib Prompt Generator - makes interesting prompts for you, by @remi_durant
Neuralism Prompt Generator - generative art prompt generator

Video
Video Enhance AI by Topaz Labs - commercial upscaling and frame interpolation <- excellent
AnimationKit AI - synthetic video upscaling and interpolation tool <- great
audio-reactive-video - by @devdef, skip video frames based on volume
PyTTI-Tools: FILM - @DigThatData ‘s version of FILM for video frames
FILM: Frame Interpolation for Large Motion- (replicate link) smooth interpolation/morphing
Sequence Frame Interpolation - batch version of FILM
FILM colab - by @KyrickYoung has pause, loops, reverse
Flowframes - free tool with patreon, uses RIFE and other models
ESRGAN 4 Video- increase resolution of video with ESRGAN
RIFE - smooth interpolation of video to increase frame rate
Super Slomo - another way to increase frame rate of video
3D Photo in-painting - give cool 3D effect to 2D image
3D Ken Burns Effect from single image - animated video from 2D image
Animating Pictures with Eulerian Motion Fields- code not out yet, looks like it’ll be awesome
Audio to keyframe string generator for AI animation notebooks
Keyframe string generator for AI animation notebooks
Wiggle animation key frame generator by @zippy731

Music

You can generate music with AI using OpenAI’s Jukebox. You can prompt Jukebox with an artist and music genre or with a short audio clip in WAV format. It generates new music for you in phases of increasing quality (level_2, level_1, level_0) and takes about 8 hours on Colab.

boomy - let’s you create and publish music with Ai but they hold the copyright
D3Net-MSS - colab for splitting music into separate clips for drums, vocals, etc
Easy One Click Jukebox - this is my favorite currently
Jukebox Community Build - download this ipynb and put in Colab Notebooks folder on Google Drive to use
lalal.ai - commercial music to stems service
mubert - nft friendly music remixed by Ai
Official OpenAI Jukebox - the official notebook from OpenAI
Spleeter colab - split music into stems
Zags Jukebox v3.7 - (youtube tutorial)

Other
JAX CLIP Guided Diffusion 2.7 Guide - Google doc from huemin
text2mesh - Kaggle notebook for text to 3D mesh
ClipMatrix - text controlled 3D mesh deformation and stylization
Zippy’s Disco Diffusion Cheatsheet - Google Doc guide to Disco and all the parameters
EZ Charts - Google Doc Visual Reference Guides for CLIP-Guided Diffusion (see what all the parameters do!)
A Guide to Writing Prompts for Text to Image - Google Doc guide and advice
Hitchhiker’s Guide To The Latent Space - a guide that’s been put together with lots of colab notebooks too
Resources for GAN Artists - another big Google Doc with notebooks and resources for AI art
Way of the TTI Artist - pytti guide
Guide to install Disco Diffusion 5 on Windows with WSL- haven’t tried this yet challenge is pytorch3d
Great explanation of VQGAN+CLIP - https://ljvmiranda921.github.io/notebook/2021/08/08/clip-vqgan/
Nice overview of lots of different optimization algorithms SGD, Adam, RMSProp etc and their differences (also covered in this lecture)
Stanford’s Convolutional Neural Networks class on YouTube - https://www.youtube.com/playlist?list=PL3FW7Lu3i5JvHM8ljYj-zLfQRF3EO8sYv
PIFuHD Colab - Human photo to 3D mesh of the human
Watermark images - little notebook to add text watermark to images
Zero-Shot Text-Guided Object Generation with Dream Fields - text to 3D render

Book Reviews

Generative Deep Learning: Teaching Machines to Paint, Write, Compose, and Play by David Foster [2019]

Really enjoyed this and it’s a great book! It’s from 2019 so doesn’t cover the very latest like VQGAN, CLIP, guided diffusion though.

Deep Learning with Python by Francois Chollet [2021]

Very good to get up and running with TensorFlow building all the basic/standard model types

Deep Learning for Coders with fastai and PyTorch by Jeremy Howard [2020]

I’m still at the beginning of this. I wish it used PyTorch directly and not their fastai library built on top. I was hoping to learn hands on with PyTorch. I’ll continue with it and update my review when I get further along.

The Artist in the Machine: The world of AI-powered creativity by Arthur I. Miller [2020]

Not very technical but engaging and inspiring view of many Ai art projects so far.

Dive into Deep Learning (online, free, interactive)

This looks fantastic and I wish I found it earlier! I love how you can toggle between TensorFlow and PyTorch for all the code examples.

ml4a.net - online textbook, classes, and learning resources

Cool Apps
No Code AI Art tools
Artbreeder - StyleGAN model with “genes” (directions in latent space) for editing
CogView- text to image, Chinese model like DALL-E (interview)
conjure.art - new text to image site currently in beta
DeepDreamGenerator - deep style, thin style, deep dream
midjourney - still in private beta, they have link to apply on the @midjourney twitter
NightCafe - style transfer, VQGAN, diffusion image generation
Nebula41 - under development, VQGAN videos from text
Ostagram - style transfer
pollinations.ai - run lots of popular notebooks
ProsePainter - interactive tool to “paint with words”
runwayml - video editing powered by AI
snowpixel - text to image and variations
StarryAI - text to image with easy selection of styles
uberduck.ai - text to speech with lots of different voices
Visions of Chaos - run the popular AI notebooks locally on Windows (see the Machine Learning setup steps)
Wombo - Super fast and free but may have restrictions for use
wzrd.ai - give it a music file and produce animation from big set of pretrained GANs

Online Galleries to Showcase Art

OnCyber art galleries - https://oncyber.io - Cool 3D art gallery to showcase your art with links to NFT market
Spatial - https://spatial.io

Has this guide been helpful to you? 🙂 You can follow me on twitter @pharmapsychotic and I’d love if you would

Tools and Resources for AI Art

Image Super-resolution

Image to Text

StyleGAN

Text

Video

Music

Other

Book Reviews

Cool Apps