SDXL用のCounterfeitとネガティブエンベッディングスでたー!【stable diffusion】
TLDRAlice from AI's in Wonderland introduces a model supporting both Counterfeit and SDXL, highlighting the impact of three Negative Embeddings tailored for SDXL. She compares the effects of Standard, Realistic, and Anime-like Embeddings on image quality and shares her experience using Comfy UI for generating cleaner images. Alice also discusses the results of adding a negative prompt for deformity prevention, showcasing the potential of Negative Embeddings in enhancing image generation.
Takeaways
- 🚀 Introduction of a model supporting both Counterfeit and SDXL, named CounterfeitXL.
- 📈 The models are large, approximately 7GB, potentially filling up storage space.
- 🌟 Exclusive to SDXL, three Negative Embeddings (A-Standard, B-Realistic, C-Anime-like) are discussed.
- 🖼️ CIVITA showcases prompts and images for reference, with a note that they may not match the original Counterfeit's quality.
- 🎨 Plans to experiment with Comfy UI for quicker and cleaner image generation.
- 🎥 A video on Comfy UI is planned, albeit delayed due to breaking news.
- 👧 The demonstration begins with drawing a girl in a school uniform using Counterfeit XLα without LoRA.
- 📊 The image settings include 1024x1024 size, 35 total steps, and a CFG scale of 7 with clip skip 2.
- 🔍 Evaluation of the sampler with DPM++2MSD crow for image generation.
- 📈 Testing the Upscale Model with different styles, starting with Anime, using Real ESRGAN 4x and Anime6B.
- 🌈 Exploration of the effects of Negative Embeddings on image quality and detail, with variations observed.
Q & A
What models are discussed in the video?
-The models discussed in the video are CounterfeitXL and SDXL.
How large are the CounterfeitXL and SDXL models?
-The CounterfeitXL and SDXL models are about 7GB each.
What are Negative Embeddings and how many are there for SDXL?
-Negative Embeddings are a technique used to refine the output of the model, and there are three of them exclusively for SDXL.
What are the three types of Negative Embeddings mentioned in the script?
-The three types of Negative Embeddings are A for Standard, B for Realistic, and C for Anime-like.
What is the purpose of the CIVITA's side prompts and images?
-The CIVITA's side prompts and images are provided as references to compare the generated images with the original Counterfeit.
What is the image size and total steps used in the demonstration?
-The image size used is 1024 by 1024, and the total steps are 35, with up to 28 steps being the Base model.
What sampler and upscale model is Alice planning to use?
-Alice plans to use the sampler with DPM++2MSD crow for image generation and an Upscale Model for 1.5x upscale with denoising strength 0.3.
What is the effect of using Negative Embeddings from category A?
-Using Negative Embeddings from category A makes the face more solid and the cherry blossoms more distinct.
What issue is observed with the Realistic Negative Embeddings (category B)?
-With the Realistic Negative Embeddings (category B), the hand becomes a little distorted, and it doesn't lean towards a real image.
What is the outcome of using Anime-style Negative Embeddings (category C)?
-Using Anime-style Negative Embeddings (category C) doesn't result in significant changes, but there is some variation.
How does adding a negative prompt from the template affect the image?
-Adding a negative prompt from the template results in a cleaner hand but changes the composition, making direct comparison difficult.
Outlines
🖌️ Introduction to Counterfeit XL and SDXL Models
Alice from AI's in Wonderland introduces the CounterfeitXL and SDXL models, noting their large size of about 7GB. She mentions the challenge of limited storage but commits to continuing their use. Alice also discusses the three Negative Embeddings exclusive to SDXL, which are Standard (A), Realistic (B), and Anime-like (C), expressing her intent to explore their effects. She encourages viewers to check out prompts and images on CIVITA's side, acknowledges the potential quality differences, and shares her plans to experiment with Comfy UI. Alice explains the model settings, including the Counterfeit XLα model without LoRA, a 1024x1024 image size, 35 total steps, and specific parameters for the sampler and Upscale Model. She plans to post a video on Comfy UI the following week, despite a delay due to breaking news, and provides a walkthrough of the Comfy UI screen settings.
Mindmap
Keywords
💡Counterfeit and SDXL
💡Negative Embeddings
💡Comfy UI
💡DPM++2MSD crow
💡Upscale Model
💡Refiner
💡CFG scale
💡No LoRA
💡Prompt
💡Real ESRGAN 4x
💡Anime6B
Highlights
Introduction of a model that supports both Counterfeit and SDXL.
The CounterfeitXL and SDXL models are approximately 7GB in size.
Three Negative Embeddings are exclusively for SDXL: Standard (A), Realistic (B), and Anime-like (C).
CIVITA features prompts and images for reference.
Comfy UI is considered for producing quicker and cleaner images.
The model used is Counterfeit XLα without LoRA.
The image size is set to 1024 by 1024 with a total of 35 steps.
Using the sampler with DPM++2MSD crow for image generation.
Upscale Model is set to generate images at 1.5x upscale with denoising strength 0.3.
A video on Comfy UI is planned for the following week.
Base model leaves some noise in the image.
Negative Embeddings from A make the face more solid and distinct.
Negative Embeddings from B make the hand distorted, but the face becomes cute when upscaled.
Anime-style (C) doesn't show significant changes with Negative Embeddings.
Negative Embeddings work well for cherry blossom petals.
The addition of a negative prompt from a template improves the image significantly.
The hand becomes clean, but the composition changes with the negative prompt.
Various settings, including realistic ones, were tested for image generation.