Ep 8: Image-to-Image
7 min
text to image vs image to image in text to image, the ai starts from noise and generates from scratch in image to image, the ai starts from a photo or sketch you provide and transforms it based on your prompt we'll take the text to image workflow from ep 4 text to image from scratch docid\ p7cv7o3yziw6dvngg5vev and modify it the change is small swap one node group the swap delete the empty latent image node add a load image node drop in the image you want to transform add a vae encode node between load image and the ksampler you can't connect load image (blue) directly to the ksampler (pink) vae encode converts the visible image into latent space so the ai can work on it connect your existing load vae to both the vae encode and vae decode nodes resize your input diffusion models work at specific resolutions (usually around 1024x1024) feeding a 4k image directly will fail or produce artifacts add a resize image node between load image and vae encode set it to 1024 pixels make sure it keeps the aspect ratio so nothing gets stretched add a resize image node between load image and vae encode set it to 1024 pixels make sure it keeps the aspect ratio so nothing gets stretched denoise is everything here in image to image, the ksampler's denoise setting controls how much changes value result 0 2 0 4 subtle structure and colors stay mostly the same 0 5 0 7 noticeable changes style shifts but composition holds 0 8 1 0 dramatic the original is barely a suggestion default is 1 0, which ignores your input entirely lower it the ai doesn't "see" objects in your image it sees pixels and colors a low denoise preserves those color patterns the prompt controls what the ai thinks it's looking at for actual structural control (poses, edges, composition), you need ep 10 controlnet basics docid 85 ftbnp59jo18bu8 lhr faq why does my output look nothing like my input? denoise is too high at 1 0 the ai ignores your image entirely try 0 3 0 5 what resolution should my input be? match your model's training resolution for most modern models, that's around 1024x1024 add a resize image node to handle this automatically
