原链接:https://pharmapsychotic.com/tools.html
以下留存备份
Looking to get started with AI art? A good place to start is one of the popular apps like Wombo or NightCafe. You can get a quick sense of how you can use words and phrases to guide image generation. Read up on prompt engineering to improve your results. Then you can graduate up to using Google Colab notebooks linked below like Disco Diffusion. If you have a good nVidia GPU of your own then you can also use Visions of Chaos to run the most popular notebooks locally.
Image Super-resolution
- Gigapixel AI by Topaz Labs (costs $99) <- voted #1
- Real-ESRGAN- (github) <- voted #2
- Real-ESRGAN Sber - a nice fine tuned ESRGAN model
- SuperRes Diffusion - Batch upscaling and super resolution with latent-diffusion
- SwinIR - Hugging Face space
Upscale Model Database - big set of pretrained models for upscaling different types of content
Image to Text
StyleGAN
[2022/04/25] StyleGAN-Humans + CLIP modified by Diego Porres to use StyleGAN3
- StyleGAN2-ADA - train your own StyleGAN2 model from an image set you create
- StyleCLIP - Text-drive manipulation of StyleGAN imagery
- Structured Dreaming - Styledreams With helpers
- Structured Dreaming (CLIP+StyleGAN) by @ArYoMo (tweet)
- StyleGAN 2 pretrained models - can use these with Structured Dreaming
- StyleGAN 2 awesome pretrained models - BIG collection of models
- StyleGAN 3 training - train a StyleGAN and do interpolation video by @dvsch
- StyleGAN 3 music video generation - (tweet)
- StyleGAN 3 + CLIP by Annas
- StyleGAN3 + CLIP by @nshepperd1 and @RiversHaveWings
- StyleGANXL + CLIP by Eugenio Herrera and Rodrigo Mello
Lucid Sonic Dreams - animate path through StyleGAN latent space with music (github)
Text
Goose.ai Playground - can use their playground to generate text with GPT-Neo
- GPT Neo Colab notebook - use GPT-neo 1.3B and 2.7B from Google colab
- GPT Neo HuggingFace - run GPT-neo 2.7B on HuggingFace
- Neuralism Generative Art Prompt Generator - generate prompts to use for text to image
- OpenAI GPT3 Playground - generate text with GPT-3 (requires free account)
- Prompt Parrot by @KyrickYoung - train GPT2 on a list of your prompts
- MadLib Prompt Generator - makes interesting prompts for you, by @remi_durant
Neuralism Prompt Generator - generative art prompt generator
Video
Video Enhance AI by Topaz Labs - commercial upscaling and frame interpolation <- excellent
- AnimationKit AI - synthetic video upscaling and interpolation tool <- great
- audio-reactive-video - by @devdef, skip video frames based on volume
- PyTTI-Tools: FILM - @DigThatData ‘s version of FILM for video frames
- FILM: Frame Interpolation for Large Motion- (replicate link) smooth interpolation/morphing
- Sequence Frame Interpolation - batch version of FILM
- FILM colab - by @KyrickYoung has pause, loops, reverse
- Flowframes - free tool with patreon, uses RIFE and other models
- ESRGAN 4 Video- increase resolution of video with ESRGAN
- RIFE - smooth interpolation of video to increase frame rate
- Super Slomo - another way to increase frame rate of video
- 3D Photo in-painting - give cool 3D effect to 2D image
- 3D Ken Burns Effect from single image - animated video from 2D image
- Animating Pictures with Eulerian Motion Fields- code not out yet, looks like it’ll be awesome
- Audio to keyframe string generator for AI animation notebooks
- Keyframe string generator for AI animation notebooks
- Wiggle animation key frame generator by @zippy731
Music
You can generate music with AI using OpenAI’s Jukebox. You can prompt Jukebox with an artist and music genre or with a short audio clip in WAV format. It generates new music for you in phases of increasing quality (level_2, level_1, level_0) and takes about 8 hours on Colab.
- boomy - let’s you create and publish music with Ai but they hold the copyright
- D3Net-MSS - colab for splitting music into separate clips for drums, vocals, etc
- Easy One Click Jukebox - this is my favorite currently
- Jukebox Community Build - download this ipynb and put in Colab Notebooks folder on Google Drive to use
- lalal.ai - commercial music to stems service
- mubert - nft friendly music remixed by Ai
- Official OpenAI Jukebox - the official notebook from OpenAI
- Spleeter colab - split music into stems
Zags Jukebox v3.7 - (youtube tutorial)
Other
JAX CLIP Guided Diffusion 2.7 Guide - Google doc from huemin
- text2mesh - Kaggle notebook for text to 3D mesh
- ClipMatrix - text controlled 3D mesh deformation and stylization
- Zippy’s Disco Diffusion Cheatsheet - Google Doc guide to Disco and all the parameters
- EZ Charts - Google Doc Visual Reference Guides for CLIP-Guided Diffusion (see what all the parameters do!)
- A Guide to Writing Prompts for Text to Image - Google Doc guide and advice
- Hitchhiker’s Guide To The Latent Space - a guide that’s been put together with lots of colab notebooks too
- Resources for GAN Artists - another big Google Doc with notebooks and resources for AI art
- Way of the TTI Artist - pytti guide
- Guide to install Disco Diffusion 5 on Windows with WSL- haven’t tried this yet challenge is pytorch3d
- Great explanation of VQGAN+CLIP - https://ljvmiranda921.github.io/notebook/2021/08/08/clip-vqgan/
- Nice overview of lots of different optimization algorithms SGD, Adam, RMSProp etc and their differences (also covered in this lecture)
- Stanford’s Convolutional Neural Networks class on YouTube - https://www.youtube.com/playlist?list=PL3FW7Lu3i5JvHM8ljYj-zLfQRF3EO8sYv
- PIFuHD Colab - Human photo to 3D mesh of the human
- Watermark images - little notebook to add text watermark to images
- Zero-Shot Text-Guided Object Generation with Dream Fields - text to 3D render
Book Reviews
- Generative Deep Learning: Teaching Machines to Paint, Write, Compose, and Play by David Foster [2019]
Really enjoyed this and it’s a great book! It’s from 2019 so doesn’t cover the very latest like VQGAN, CLIP, guided diffusion though.
- Deep Learning with Python by Francois Chollet [2021]
Very good to get up and running with TensorFlow building all the basic/standard model types
- Deep Learning for Coders with fastai and PyTorch by Jeremy Howard [2020]
I’m still at the beginning of this. I wish it used PyTorch directly and not their fastai library built on top. I was hoping to learn hands on with PyTorch. I’ll continue with it and update my review when I get further along.
- The Artist in the Machine: The world of AI-powered creativity by Arthur I. Miller [2020]
Not very technical but engaging and inspiring view of many Ai art projects so far.
- Dive into Deep Learning (online, free, interactive)
This looks fantastic and I wish I found it earlier! I love how you can toggle between TensorFlow and PyTorch for all the code examples.
ml4a.net - online textbook, classes, and learning resources
Cool Apps
No Code AI Art tools
Artbreeder - StyleGAN model with “genes” (directions in latent space) for editing
- CogView- text to image, Chinese model like DALL-E (interview)
- conjure.art - new text to image site currently in beta
- DeepDreamGenerator - deep style, thin style, deep dream
- midjourney - still in private beta, they have link to apply on the @midjourney twitter
- NightCafe - style transfer, VQGAN, diffusion image generation
- Nebula41 - under development, VQGAN videos from text
- Ostagram - style transfer
- pollinations.ai - run lots of popular notebooks
- ProsePainter - interactive tool to “paint with words”
- runwayml - video editing powered by AI
- snowpixel - text to image and variations
- StarryAI - text to image with easy selection of styles
- uberduck.ai - text to speech with lots of different voices
- Visions of Chaos - run the popular AI notebooks locally on Windows (see the Machine Learning setup steps)
- Wombo - Super fast and free but may have restrictions for use
- wzrd.ai - give it a music file and produce animation from big set of pretrained GANs
Online Galleries to Showcase Art
- OnCyber art galleries - https://oncyber.io - Cool 3D art gallery to showcase your art with links to NFT market
- Spatial - https://spatial.io
Has this guide been helpful to you? 🙂 You can follow me on twitter @pharmapsychotic and I’d love if you would