AUTOMATIC1111のおすすめ拡張機能９選んんんほおお！【Stable Diffusion】

テルルとロビン【てるろび】旧やすらぼ

30 Jun 202310:55

TLDRIn this informative video, the presenters introduce nine useful expansion functions for Automatic1111, a tool designed to enhance image browsing and editing. The Image Browser simplifies viewing and managing output images, while Tag Complete offers an auto-complete feature for prompt input. System Info provides real-time machine status, and Style Editor allows for easy editing of saved prompts. Aspect Ratio Selector and Aspect Ratio Helper assist with image sizing, and Config Preset saves common settings for quick recall. After Detailer adds detail to faces in illustrations, and InPaint Anything modifies images using object detection. These extensions aim to streamline the image generation process, making it more efficient and user-friendly.

Takeaways

🖼️ The Image Browser is an expansion function that simplifies the viewing and management of output images directly from the web UI.
🔍 Tag Complete is a prompt auto-complete feature that suggests Danbooru-tags and helps with input errors and related words, enhancing the ease of prompt entry.
💻 System Info provides real-time information about the machine, including VRAM usage and a list of recognized models and learning files.
📝 Style Editor allows for easy editing and deletion of saved styles, eliminating the need to manually edit a Style.csv file.
🎨 Aspect Ratio Selector is a tool that assists users in maintaining the aspect ratio of images during generation, with preset buttons and a simple calculator.
📐 Aspect Ratio Helper is a slide bar auxiliary function that maintains the aspect ratio while adjusting the resolution.
🔧 Config Preset enables users to save and quickly recall their frequently used prompt and sampling settings.
🖌️ After Detailer is an extension that automatically enhances the details of an illustration's face, improving the quality with minimal effort.
🎭 InPaint Anything uses a Facebook object detection model to modify illustrations and create masks, offering various processing options like InPaint and Control Net.
📈 The ranking function in Image Browser allows users to sort images based on a ranking they assign, streamlining the organization process after mass production.
📊 The metadata search and sorting capabilities in Image Browser enable users to filter and organize images based on specific criteria like prompt, model name, and sampling method.

Q & A

What is the first expansion function introduced in the video?
-The first expansion function introduced is the Image Browser, which allows users to easily browse and manage images in the Outputs Folder through a web UI.
How can users search for images based on metadata using the Image Browser?
-Users can search for images by entering a specific prompt in the Image Browser's search function, which will display only the images associated with that prompt. It also allows sorting by model name, sampling method, and the name of the expansion function used.
What is the unique function of the Image Browser that helps with sorting images?
-The unique function is the ranking function, which allows users to rank images they think are good. By opening the tab and sorting by rank, it will display only the images with that rank.
What is the purpose of the Tag Complete expansion function?
-Tag Complete is a prompt auto-complete function that suggests Danbooru-tags, which are used on overseas image posting sites like Danbooru. It also helps with typing errors and related word suggestions to facilitate prompt input.
How does the System Info expansion function benefit users?
-System Info displays the current state of the machine in real time, which is useful for monitoring VRAM consumption during image generation. It also lists recognized models and learning files, making it easier to find this information without manually searching through folders.
What is the main advantage of using the Style Editor expansion?
-The Style Editor allows users to easily edit the Style.csv file, which contains saved prompts, directly from the web UI. It simplifies the process of adding, editing, or deleting registered styles.
How does the Aspect Ratio Selector help users generate images of a specific size?
-The Aspect Ratio Selector assists users in maintaining the aspect ratio when generating images of a certain size. It provides preset buttons for common ratios and a simple calculator to help with aspect ratio adjustments.
What is the difference between the Aspect Ratio Selector and Aspect Ratio Helper?
-The Aspect Ratio Selector is a tool with preset buttons for quick ratio adjustments, while the Aspect Ratio Helper is an auxiliary function of the slide bar that allows users to maintain the aspect ratio while adjusting the resolution.
What does the Config Preset expansion function allow users to save?
-The Config Preset allows users to save their frequently used prompt settings, sampling settings, and high-resolution settings, making it easier to recall these settings quickly when needed.
How does the After Detailer extension enhance the details of an illustration?
-After Detailer automatically detects the face in an illustration and adds more details to it. It can be used with T2i and I2i and provides a significant visual enhancement with minimal operational effort.
What is the primary function of the InPaint Anything expansion?
-InPaint Anything allows users to modify illustrations and create masks for specific parts using the Segment Anything model. It offers different processing options, including InPaint, Cleaner, Control Net, and Mask Only, for various editing tasks.
What is the significance of the ranking function in the Image Browser for mass production?
-The ranking function is particularly useful after mass production with Generate Forever, as it allows users to sort images based on their quality or preference, making it easier to manage and select images from a large batch.

Outlines

00:00

🖼️ Image Browser Overview

The first expansion function introduced is the Image Browser, which simplifies the process of viewing output images. Instead of manually opening the Outputs Folder, users can view images directly through the web UI. This tool automatically reads and displays images, allowing for easy browsing, metadata inspection, and deletion. A unique feature is the ability to search metadata, enabling users to filter images based on specific prompts, model names, sampling methods, or the name of the expansion function used. Additionally, there's a ranking function that lets users sort images by their assigned rank, which is particularly useful after mass production with the Generate Forever feature. The Image Browser can be installed as an extension and is considered an essential tool for its convenience.

05:01

📝 Tag Complete - Prompt Auto-Complete

The second feature is Tag Complete, which assists users in inputting prompts by suggesting Danbooru-tags, commonly used on image posting sites like Danbooru. This auto-complete function not only proposes tags as users type but also corrects typos and offers related word suggestions. Each tag is accompanied by a hit count, indicating its popularity and the likelihood of obtaining desired results with that tag. However, some models may not respond well to these tags, so it's advised to use it as an auxiliary input tool. The default number of suggestions is five, but it's recommended to increase this number to about ten for a better experience.

10:03

💻 System Info - Real-Time Machine Status

System Info is the third function, aimed at users interested in the system's current state. It provides real-time information about the machine, which is beneficial for monitoring VRAM consumption during image generation. The tool also lists recognized models and learning files, eliminating the need to manually search for them in folders. While not an extension that performs actions, System Info is appreciated for consolidating scattered information into one view.

✍️ Style Editor - Template Management

The fourth tool is the Style Editor, which simplifies the management of saved prompt templates. While users can save prompts as templates using the floppy mark, managing an increasing number of templates can be cumbersome. The Style.csv file, where saved styles are stored, can be edited directly through the web UI with the Style Editor extension. This allows for easy viewing, editing, and deletion of registered styles in real time, streamlining the process of managing and organizing frequently used prompts.

🎨 Aspect Ratio Selector - Image Sizing Made Easy

The fifth feature is the Aspect Ratio Selector, designed to help users generate images of a specific size while maintaining the aspect ratio. It adds an icon to the interface with preset buttons for common ratios like 1:1 and 16:9, and a template button to reset to the default size of 512. It also includes a calculator to determine the necessary aspect ratio adjustments when changing resolution settings. This tool simplifies the process of resizing images without distorting their proportions.

📐 Aspect Ratio Helper - Slide Bar Assistant

The sixth tool, Aspect Ratio Helper, complements the slide bar functionality, providing a preset aspect ratio that maintains the ratio as the user adjusts the bar. It offers a different experience from the selector and adds a small command next to the resolution setting for easy ratio adjustments.

🔄 Config Preset - Saving Custom Settings

The seventh feature, Config Preset, allows users to save not only their prompt settings but also their sampling and high-resolution settings, which are not saved by default. It adds a pull-down menu under the output results of T2i and I2i, where users can save, edit, and delete presets, making it easy to recall custom settings with each startup.

🔍 After Detailer - Enhancing Illustration Details

The eighth feature is After Detailer, a well-known extension that automatically detects and enhances the facial details of illustrations. It works with T2i and I2i and can be used by simply enabling it in the settings. After Detailer can also accommodate prompts for creating variations in facial expressions, offering a simple yet powerful tool for refining illustrations.

🖌️ InPaint Anything - Object Segmentation and Editing

The ninth and final feature is InPaint Anything, which utilizes an old Facebook object detection model to allow users to modify illustrations and create masks using Segment Anything. It offers a choice of models based on quality and speed, and once a model is selected and the image is processed, users can segment the image, create masks for specific objects, and apply various processing methods, including InPaint, Cleaner, Control Net, and Mask Only. This feature provides a straightforward way to modify specific parts of an image, such as changing the color of an object or creating a material image from a segment.

🗑️ Mask Only - Segment Extraction for Collage Materials

In addition to the main features, the Mask Only function allows users to extract only the segment mask, which can be useful for collecting materials for collages or extracting parts of an image. The video concludes by encouraging viewers to make good use of these nine expansions for efficient and convenient operation, and reminds them to stay hydrated.

Mindmap

Keywords

💡Automatic1111

Automatic1111 is a software or tool that the video is discussing. It seems to be a platform with various expansion functions that enhance its capabilities. The video focuses on introducing these functions to make the use of Automatic1111 more intuitive and efficient.

💡Image Browser

The Image Browser is an expansion function of Automatic1111 that allows users to easily browse and manage images in the Outputs Folder. It simplifies the process of viewing images by displaying them on the web UI, making it unnecessary to manually open each file. This tool is highlighted for its simplicity and the ability to search and sort images based on metadata.

💡Tag Complete

Tag Complete is a prompt auto-complete feature that suggests Danbooru-tags, which are used on image posting sites. It assists users in inputting prompts by providing suggestions for typos and related words, thereby facilitating easier and more accurate tag entry. The number next to each tag indicates its popularity, which can be an indicator of the content availability associated with that tag.

💡System Info

System Info is an extension function that displays real-time information about the machine's current state. It is particularly useful for monitoring VRAM consumption during the image generation process. It also lists recognized models and learning files, providing a convenient overview without the need to manually check folders.

💡Style Editor

The Style Editor is an extension that enables users to edit the Style.csv file, which contains saved prompts, directly on the web UI. This simplifies the management of saved styles by allowing users to easily add, edit, or delete styles without the need to manually handle the underlying files.

💡Aspect Ratio Selector

The Aspect Ratio Selector is a tool that helps users generate images of a specific size while maintaining the desired aspect ratio. It simplifies the process of calculating and setting the correct aspect ratio by providing preset buttons and a calculator for more complex adjustments.

💡Aspect Ratio Helper

The Aspect Ratio Helper is a slide bar auxiliary function that assists in maintaining the aspect ratio while adjusting image dimensions. It provides a preset aspect ratio and allows users to make adjustments while keeping the ratio constant, offering an alternative method to the Aspect Ratio Selector.

💡Config Preset

Config Preset is an extension that allows users to save and quickly recall their frequently used prompt and sampling settings. It adds a pull-down menu for easy access to presets, streamlining the process of starting new image generation tasks with preferred settings.

💡After Detailer

After Detailer is an extension feature that automatically enhances the details of an illustration's face. It works with T2i and I2i and can significantly improve the quality of facial features with minimal effort from the user. It's particularly useful for adding finer details to an image post-generation.

💡InPaint Anything

InPaint Anything is a feature that leverages an object detection model to modify illustrations and create masks for specific parts of an image. It allows users to select, mask, and replace parts of an image with ease, offering various processing options such as InPaint, Cleaner, Control Net, and Mask Only.

💡Segment Anything

Segment Anything is a model used by the InPaint Anything feature for object detection and segmentation. It enables users to divide an image into segments and selectively process parts of the image, such as changing the color of an object or creating a mask for a specific segment.

Highlights

Introduces useful expansion functions for Automatic1111, focusing on intuitive and simple ones.

Image Browser allows easy browsing and management of output images through a web UI.

Image Browser enables searching and sorting of metadata, including prompts, model names, and sampling methods.

A unique ranking function in Image Browser helps sort images based on user-defined criteria.

Tag Complete is a prompt auto-complete feature for Danbooru-tags, aiding in input efficiency.

System Info provides real-time machine status, including VRAM consumption and recognized models.

Style Editor allows for easy editing and deletion of saved prompt templates on the web UI.

Aspect Ratio Selector simplifies the process of generating images with specific aspect ratios.

Aspect Ratio Helper is a slide bar auxiliary tool that maintains aspect ratio during adjustments.

Config Preset enables saving and quick access to frequently used prompt and sampling settings.

After Detailer is an extension that automatically enhances facial details in illustrations.

InPaint Anything uses an object detection model to modify illustrations and create masks.

InPaint Anything's Control Net mode allows for model-specific processing of selected segments.

Mask Only feature in InPaint Anything can be used to extract and collect segment masks for collage materials.

Automatic1111's recommended 9 expansions aim for ease of operation and high efficiency.

The presentation encourages viewers to make good use of the highlighted expansions for Automatic1111.

The video concludes with a reminder to stay hydrated and a thank you for watching.

Casual Browsing

おすすめのStable Diffusion web UI拡張機能3選！（After Detailer / TensorRT / TrainTrainとDataset Tag Editor）

2024-05-17 10:15:01

【おすすめ】WebUIを便利にカスタマイズ！Stable Diffusionで画像生成AIを使うなら導入したい拡張機能10選【ずんだもん解説】

2024-03-25 21:55:02

Stable Diffusion お姉さんの(没)な手を修正する Embedding ControlNet

2024-04-17 00:50:01

Stable Diffusionの新機能『IP Adapter』でトレースが可能に。コントロールの超おすすめ機能

2024-03-24 00:30:01

【神拡張機能】regional prompterを上手に使おう【stable diffusion】

2024-04-12 14:20:01