【Stable-Diffusion】🔰 After installing, add these three things first! #stablediffusion #prompt_all_in_one #vae #easynegative

ざすこ (道草_雑草子)
5 Nov 2023 19:25

TLDR: The video is a beginner's guide to AI image generation with Stable Diffusion. It recommends three things to set up right after installation: the prompt-all-in-one extension (with its translation API switched to the free Google Translate option, so prompts can be entered in Japanese), a VAE for cleaner image output, and EasyNegative for negative prompts. The tutorial also shows how negative prompts help avoid common generation problems and gives tips for producing more accurate, polished images. The presenter encourages viewers to experiment with the tools and settings to improve their AI-generated images.


  • 📌 Introduction to AI image generation for beginners, aiming to explain concepts in an easy-to-understand manner.
  • 🔧 Post-installation recommendations for Stable Diffusion, including setting up an initial environment for ease of use.
  • 🛠️ Importance of installing essential extensions like 'prompt-all-in-one' for streamlined operation without needing to memorize complex English prompts.
  • 🔄 Use of the free Google Translate option for automatic translation of prompts.
  • 🎨 Implementation of VAE (Variational Autoencoder) for enhancing image quality and making adjustments to generated images.
  • 🚫 Addressing common issues like difficulty in creating beautiful images and providing solutions through proper setup and extensions.
  • 📋 Explanation of how to use negative prompts effectively to avoid unwanted image features.
  • 🔗 Introduction to Hugging Face as the site from which the VAE file is downloaded to improve image generation.
  • 🌐 Mention of additional tools and versions available for text-to-image generation, with promises of future videos detailing these options.
  • 🎓 Encouragement for beginners to experiment with the tools and learn from hands-on experience, as it aids in understanding the mechanics of prompt-based image generation.
  • 📈 Final thoughts on the benefits of using the discussed tools and extensions, emphasizing their convenience and potential for creating high-quality images.

Q & A

  • What is the main purpose of the video from AI Michikusa Channel?

    -The main purpose of the video is to provide an easy-to-understand explanation and guide for beginners on how to use AI image generation, specifically focusing on after installing Stable Diffusion, including what steps to take next and how to improve image quality.

  • What are the first three things to do after installing Stable Diffusion according to the video?

    -After installing Stable Diffusion, the video recommends: 1) installing the prompt-all-in-one extension and switching its translation API to the free Google Translate option, 2) installing a VAE to improve image quality, and 3) installing EasyNegative to simplify negative prompts.

  • What extension is suggested to be installed first in the Stable Diffusion setup?

    -The video suggests installing an extension called 'prompt-all-in-one' from the URL provided in the description, which is aimed at simplifying the process of creating prompts for image generation.

  • How does the video suggest improving the translation setup in Stable Diffusion?

    -The video recommends changing the translation API to Google's free translation service to facilitate understanding and inputting prompts, especially for those who are not fluent in English.

  • What is VAE, and why is it recommended to be installed according to the video?

    -VAE stands for Variational Autoencoder. It is recommended to be installed to enhance the quality of the generated images, making them more refined and reducing distortions.

  • What is the purpose of installing 'EasyNegative' as mentioned in the video?

    -EasyNegative is installed to streamline negative prompts: a single keyword stands in for a long list of unwanted traits, which helps prevent unwanted elements or errors in the generated images and improves overall quality.

  • What does the video suggest doing if the generated images have issues with body parts like hands?

    -For issues with body parts in generated images, the video suggests utilizing negative prompts to specify undesired outcomes, like distorted hands, to prevent such errors in new images.

  • How does the video address the challenge of entering complex English prompts for non-English speakers?

    -The video addresses this challenge by showing how to use automatic translation and providing extensions that simplify prompt creation, allowing users to input in Japanese and have it automatically translated to English.

  • What benefit does the video highlight about using the 'prompt-all-in-one' extension with Stable Diffusion?

    -The 'prompt-all-in-one' extension offers a simplified, intuitive interface for building prompts, so users can create targeted images without needing to remember or type complex English text.

  • What is the overall message of the video regarding the use of Stable Diffusion for beginners?

    -The overall message of the video is to encourage beginners to start using Stable Diffusion by introducing user-friendly tools and extensions that simplify the image generation process, making it accessible and easy to produce high-quality images.



🎨 Introduction to AI Image Generation for Beginners

This paragraph introduces the video's purpose, which is to explain AI image generation in an easy-to-understand way for beginners. The speaker acknowledges that some viewers may have installed Stable Diffusion but are unsure what to do next. The video aims to help these viewers by recommending three things to set up after installation, which will let them generate more beautiful images with simple operations and without needing to memorize complex English text.


🛠️ Setting Up Your Environment with Essential Extensions

The speaker guides viewers through setting up their environment by installing the necessary extensions for Stable Diffusion. The process includes launching Stability Matrix, installing an extension called 'prompt-all-in-one' from its repository URL via the Extensions tab, and selecting Google Translate as the translation API. The speaker emphasizes the convenience of these extensions and gives a step-by-step guide to installing and configuring them, so that viewers can generate better images than the initial setup produces.
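
As a rough sketch of what the install-from-URL step amounts to: in AUTOMATIC1111-style web UIs, it is roughly equivalent to cloning the extension's git repository into the `extensions` folder. The helper name `clone_command`, the example URL, and the install directory below are placeholders for illustration, not values from the video; the real extension URL is the one given in the video description.

```python
from pathlib import Path


def clone_command(repo_url, webui_dir):
    """Build the git command that mirrors installing an extension from a URL.

    Hypothetical helper: it only builds the command, it does not run it.
    """
    # Derive the folder name from the last URL segment, dropping ".git".
    name = repo_url.rstrip("/").split("/")[-1]
    if name.endswith(".git"):
        name = name[: -len(".git")]
    dest = Path(webui_dir) / "extensions" / name
    return ["git", "clone", repo_url, str(dest)]


# Placeholder URL and directory; substitute your own.
cmd = clone_command(
    "https://example.com/someone/some-extension.git",
    "stable-diffusion-webui",
)
print(" ".join(cmd))
```

Using the Extensions tab as shown in the video is the simpler route; the sketch only makes explicit what ends up on disk.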


🌟 Enhancing Image Quality with VAE and Negative Prompts

This paragraph discusses the introduction of VAE (Variational Autoencoder) and the use of negative prompts to enhance image quality. The speaker explains how to download and install VAE, as well as how to set up negative prompts using a downloaded package. The video demonstrates how these additions can help generate more refined images, with a focus on correcting common issues such as awkward hand positions. The speaker also mentions that more detailed instructions for hand corrections will be provided in a separate video.
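
To make the "download and install" step concrete, here is a minimal sketch of where the two downloaded files usually end up. It assumes the default folder layout of an AUTOMATIC1111-style install (`models/VAE` for VAE files, `embeddings` for EasyNegative); Stability Matrix and other launchers may use different shared folders. The helper name and file names are hypothetical.

```python
from pathlib import Path

# Assumed default layout; check your own install before moving files.
DEST_BY_KIND = {
    "vae": Path("models") / "VAE",
    "embedding": Path("embeddings"),
}


def destination(webui_dir, kind, filename):
    """Return the path where a downloaded file of the given kind would go."""
    return Path(webui_dir) / DEST_BY_KIND[kind] / filename


# Hypothetical file names, for illustration only.
vae_path = destination("stable-diffusion-webui", "vae", "my-vae.safetensors")
neg_path = destination("stable-diffusion-webui", "embedding", "EasyNegative.safetensors")
print(vae_path.as_posix())
print(neg_path.as_posix())
```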


🎉 Exploring Advanced Features and Customization Options

The speaker invites viewers to explore advanced features and customization options in Stable Diffusion. They demonstrate how to use group tags to change the scene and items, such as generating an image in a water park or changing the character's outfit. The video also shows how to input Japanese keywords for prompts, allowing for easier image generation without needing to type in English. The speaker concludes by encouraging viewers to experiment with the newly introduced features and promises to cover more details in future videos.




💡AI画像生成 (AI Image Generation)

AI画像生成 refers to the process of creating images using artificial intelligence. In the context of the video, it involves using specific software and tools to generate visual content based on user inputs. The video aims to provide an accessible guide for beginners to understand and utilize AI image generation tools, such as Stable Diffusion, to create high-quality images without the need for complex technical knowledge.

💡ステーブル・ディフュージョン (Stable Diffusion)

Stable Diffusion is the AI image generation model the video is built around. It is a deep learning model that uses a process called diffusion to create images from textual descriptions. The video treats it as the foundational tool for beginners, highlighting how approachable it becomes with the right setup and its potential to produce visually appealing content.

💡拡張機能 (Extensions)

Extensions, in the context of the video, refer to additional software components that enhance the functionality of the primary AI image generation tools. These extensions allow users to customize their environment and improve the image generation process by adding new features or capabilities. The video emphasizes the importance of selecting and installing the right extensions to achieve the desired image outcomes.

💡プロンプト (Prompts)

Prompts are textual inputs or descriptions that guide the AI image generation process. They are crucial in defining the characteristics, style, and elements of the images to be created. The video discusses the use of prompts to communicate the user's intentions to the AI, and how they can be structured and refined to produce high-quality images.

💡VAE (Variational Autoencoder)

VAE, or Variational Autoencoder, is a type of generative model used in machine learning and AI. In the video, VAE is introduced as a tool to improve the quality and coherence of AI-generated images. It works by learning a compressed representation of the input data and can be used to refine the generation process, ensuring that the images produced are more consistent and realistic.
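
In the web UI, the active VAE is chosen through a setting that the video adds to the quick settings bar, together with the clip-skip option. The sketch below shows that change as a plain list update. The keys `sd_vae` and `CLIP_stop_at_last_layers` are the ones named in the video; treating `sd_model_checkpoint` as the existing default entry, and the helper name itself, are assumptions.

```python
def add_quicksettings(current, wanted):
    """Return a new list with each wanted key appended once, order preserved."""
    result = list(current)
    for key in wanted:
        if key not in result:
            result.append(key)
    return result


# Assumed starting point: only the checkpoint selector is shown.
quicksettings = add_quicksettings(
    ["sd_model_checkpoint"],
    ["sd_vae", "CLIP_stop_at_last_layers"],
)
print(quicksettings)
```

Running the helper again with the same keys leaves the list unchanged, so the same setting is never added twice.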

💡ネガティブプロンプト (Negative Prompts)

Negative prompts are a technique used in AI image generation to specify what should not be included in the generated images. They serve as a form of constraint or filter to prevent certain elements from appearing, thereby improving the accuracy and relevance of the generated content. The video emphasizes the importance of using negative prompts to avoid common issues such as distorted limbs or unwanted objects in the images.
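
To show how a prompt and a negative prompt travel together, the sketch below builds a request body of the kind accepted by the optional HTTP API of AUTOMATIC1111-style web UIs at `/sdapi/v1/txt2img`. The video works entirely in the browser UI, so this is only an illustration: the helper name, sample prompt text, and step count are assumptions, and the `EasyNegative` keyword only has an effect if that embedding is installed.

```python
import json


def build_txt2img_payload(prompt, negative_prompt, seed=-1, steps=20):
    """Assemble a txt2img request body (hypothetical helper, not from the video)."""
    return {
        "prompt": prompt,
        # Things the image should NOT contain, e.g. an embedding keyword.
        "negative_prompt": negative_prompt,
        # -1 asks the web UI to pick a random seed.
        "seed": seed,
        "steps": steps,
    }


payload = build_txt2img_payload(
    prompt="1girl, water park, swimsuit",
    negative_prompt="EasyNegative, bad hands",
)
print(json.dumps(payload, indent=2))
```

The sketch sends nothing over the network; it only shows that the negative prompt is a second text field alongside the main prompt.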

💡グループタグ (Group Tags)

Group tags are organizational labels used to categorize and filter the elements or features that can be included in the AI-generated images. They allow users to quickly select and combine different attributes or items to create a desired scene or character. The video highlights the use of group tags as a way to streamline the image generation process and make it easier for beginners to produce targeted visual content.
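
The idea behind group tags can be sketched as a lookup from a Japanese label to an English prompt keyword, with the selected keywords joined by commas. The tag groups, labels, and function name below are invented for illustration; the real extension ships its own, much larger lists.

```python
# Hypothetical tag groups mapping Japanese labels to English prompt keywords.
TAG_GROUPS = {
    "scene": {"ウォーターパーク": "water park", "教室": "classroom"},
    "outfit": {"水着": "swimsuit", "制服": "school uniform"},
}


def build_prompt(base, selections):
    """Append the English keyword for each (group, Japanese label) selection."""
    parts = [base]
    for group, label in selections:
        parts.append(TAG_GROUPS[group][label])
    return ", ".join(parts)


prompt = build_prompt("1girl", [("scene", "ウォーターパーク"), ("outfit", "水着")])
print(prompt)  # 1girl, water park, swimsuit
```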

💡クイック操作 (Quick Operations)

Quick operations refer to the straightforward and efficient methods used in AI image generation to achieve the desired results with minimal effort. The video emphasizes the convenience of being able to generate images without needing to understand complex technical jargon or processes. It highlights the use of intuitive interfaces and features that allow users to quickly generate and refine images based on their preferences.

💡シード値 (Seed Value)

Seed value is a parameter used in AI image generation to ensure consistency and reproducibility of the output. By setting a specific seed value, users can generate images with a particular combination of features and elements. The video discusses the use of seed values as a way to fix the randomness in image generation, allowing users to recreate the same image or a similar one with consistent characteristics.
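
The role of the seed can be illustrated with any seeded random number generator: the same seed gives the same starting numbers, so the same settings give the same result. The sketch uses Python's standard `random` module as a stand-in for the image generator's initial noise; `fake_noise` is a made-up name and does not reflect how Stable Diffusion itself samples noise.

```python
import random


def fake_noise(seed, n=4):
    """Stand-in for the initial noise an image generator starts from."""
    rng = random.Random(seed)
    return [round(rng.random(), 3) for _ in range(n)]


same_a = fake_noise(1234)
same_b = fake_noise(1234)
other = fake_noise(1235)

print(same_a == same_b)  # identical seed, identical starting noise
print(same_a == other)   # a different seed gives a different start
```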

💡AI翻訳 (AI Translation)

AI translation refers to the process of automatically converting text from one language to another using artificial intelligence. In the context of the video, AI translation is used to facilitate the image generation process for users who may not be fluent in the language of the prompts. The video highlights the integration of Google Translate as a means to simplify the input of prompts and make the AI image generation tools more accessible to a wider audience.

💡画像生成 (Image Generation)

Image generation is the process of creating visual content using computational methods, such as AI models. In the video, image generation is the primary goal, with the focus on teaching beginners how to use AI tools to produce high-quality images based on their preferences and inputs. The video provides a step-by-step guide on how to set up the environment, input prompts, and refine the generation process to achieve the desired image outcomes.

💡初心者 (Beginners)

Beginners, in the context of the video, refers to users who are new to AI image generation and may lack prior experience or technical knowledge in this area. The video is tailored to provide an easy-to-understand guide for these users, helping them navigate the AI tools and features without being overwhelmed by complex terminology or processes.


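This is Zasuko from the AI Michikusa Channel. Once again, I'd like to explain AI image generation for beginners in an easy-to-understand way.

I think there are quite a few people who have installed Stable Diffusion but don't know what to do next, who wonder what a prompt even is, who find the prompt "spells" intimidating, or who have tried generating an image but found that it came out strange and not very pretty.

There really are so many settings that, honestly, it's hard to follow. So for you, this time I'll introduce my recommended initial setup: the three things you should do first after installing Stable Diffusion.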

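First, let's install the extension. From the Extensions tab, go to "Install from URL", click the "URL for extension's git repository" field, and paste in the URL of the prompt-all-in-one extension from the video description.

It's Google Translate, so this time we'll go with that. Below it there's a list of options marked as requiring an API key. You can also do automatic translation with DeepL, but that requires setting up an API key, so this time we'll just use the free version of Google Translate.

Open it from this cloud "API" icon, and at the very top, set "Translation API" to the free Google option.

I want to enter the prompt again, and you can re-enter the one you used just before from this button at the lower left.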

Next, let's install something called a VAE. As before, go from the video description to this page on the Hugging Face site, open "Files and versions", and download the one at the very bottom, the vae-ft-mse-840000 safetensors version.

We'll add an item for selecting the VAE to the "Quicksettings list". Click this; the list is long, but near the bottom you'll find "sd_vae", so click it to add it. While we're at it, add "CLIP_stop_at_last_layers" too, since we'll need it later.

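With the negative prompt in place, I think the images now come out more accurate than before.

I'll explain how to fix hands in detail in a separate video.

Let's try a photorealistic model as well. Let's generate with the same prompt using MajicMix.

The seed is still fixed, so press this dice button, which resets the seed to random.

Let's add the VAE and EasyNegative and generate once more. That gives a result like this, which I think is a bit cleaner than before.

With the VAE and EasyNegative in place, I think you can see that image quality has improved from the initial state.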

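I think keyword input in Japanese will be the main way to use it.

You can easily generate images with Japanese input, without having to type English every time. Nice, isn't it?

Let's try it with an anime-style model too, this one called Counterfeit-V3.0.

prompt-all-in-one has many more detailed features that I couldn't cover this time, but I think the main ways to use it are prompt input by selecting from group tags and keyword input in Japanese.

There are various other kinds as well, and I hope to explain those in another video. And that's all for this time.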