Creating and Composing on the Unified Canvas (Invoke - Getting Started Series #6)

19 Feb 202421:24

TLDRThe video script introduces the concept of using a unified canvas for AI-assisted image creation and editing. It explains the process of importing an image into the canvas, utilizing layers for base editing and mask layer for inpainting to refine details. The script also covers the use of bounding boxes for focused image generation, and the staging area for iterating on edits. Techniques like automatic and manual infills for extending images, and adjusting denoising strength for coherence are discussed. The video serves as a guide to harness the power of AI in enhancing and transforming images, emphasizing the importance of understanding the tool for effective results.


  • 🎨 The purpose of the unified canvas is to facilitate the creation and enhancement of images using AI-assisted technologies.
  • πŸ–ΌοΈ Users can start with an AI-generated image or enhance their own by using the unified canvas for further creative control.
  • 🌟 The canvas allows for iteration and improvement of AI-generated images that may be close but not perfect.
  • πŸŽ“ It's recommended to familiarize oneself with the 'image to image' concept before diving into canvas editing.
  • 🏷️ The canvas introduces layers - the base layer for direct image content changes and the mask layer for inpainting.
  • πŸ–ŒοΈ The brush tool on the base layer can be used to introduce new colors and structures, while the mask layer allows for selective image editing.
  • πŸ”„ Switching between layers is simplified with hotkeys, such as pressing 'Q' to toggle between the mask and base layer.
  • 🎯 The bounding box, or the dotted box, is crucial for defining the AI's focus area and ensuring the prompt matches the content within it.
  • πŸ› οΈ The canvas provides a staging area for multiple iterations, allowing users to compare, accept, or discard different versions.
  • πŸŒ„ The unified canvas supports extending images through automatic and manual infill methods, ensuring seamless integration of new content.
  • πŸ’‘ The process of AI image editing is exploratory and may involve trial and error, but it's part of mastering the creative tool.

Q & A

  • What is the primary purpose of the unified canvas?

    -The primary purpose of the unified canvas is to enable users to create and composite a perfect image using AI-assisted technologies, whether starting from an AI-generated image or augmenting an existing one.

  • How can you initiate the editing process on the unified canvas?

    -To initiate the editing process, you can either navigate to the unified canvas and drag the image onto it or use the three-dot menu on any image within the studio and select 'send to canvas' to bring it directly to the canvas tab.

  • What are the two layers available for direct editing on the canvas?

    -The two layers available for direct editing are the base layer, where changes are made directly to the image content, and the mask layer, which allows selection of portions of the image for modification through a process called inpainting.

  • What is the function of the brush tool on the base layer?

    -The brush tool on the base layer is used to add new colors and structure to the image, making direct modifications to the underlying image layer.

  • How does the mask layer facilitate image editing?

    -The mask layer enables users to select specific regions of the image for editing through inpainting. It allows users to add new content or refine details within the selected areas, guiding the generation process.

  • What is the significance of the bounding box in the AI model's interpretation of the image?

    -The bounding box is crucial as it defines the area the AI model focuses on for generation. It effectively tells the AI where to concentrate its attention, and the prompt should describe everything inside this box to ensure accurate and contextually relevant image generation.

  • How does the staging area work in the canvas?

    -The staging area presents a toolbar at the bottom, allowing users to create multiple iterations of the same content. Users can accept the current iteration to apply it to the base layer or discard it to continue with the original image. It also enables comparison of before and after generations and saving of iterations to the gallery.

  • What is the role of the 'scale before processing' feature?

    -The 'scale before processing' feature ensures that the image generated uses the maximum power available with the selected model, generating the image at the model's trained size (e.g., 1024x1024) and then compositing the details into the smaller region of the image being edited.

  • What are the four infill methods available for extending images on the canvas?

    -The four infill methods, which provide different mechanisms for pulling colors from the original image into the new area, are patch match (default), and three other methods not explicitly named in the script. These methods help in generating a seamless extension of the image by using colors from the original image.

  • How can you enhance details in characters or objects in the background of an image?

    -Details in characters or objects in the background can be enhanced using inpainting with mini models. This technique allows for the addition of fine-grained details like improved facial features and crisper elements, especially in smaller regions that are prone to artifacts.

  • What is the importance of maintaining a balance between denoising strength and generation when extending images?

    -Maintaining a balance between denoising strength and generation is important to ensure that new colors being outpainted are transformed appropriately and that structural irregularities are fixed without losing the desired level of detail or causing the image to look significantly different from the original.



🎨 Introduction to Unified Canvas and AI-Assisted Image Editing

This paragraph introduces the concept of the Unified Canvas, a tool designed to enhance and composite images using AI-assisted technologies. It emphasizes the utility of the canvas in refining AI-generated images or augmenting user-created images. The speaker guides the audience through the process of importing an image into the canvas and highlights the importance of understanding the 'image to image' concept before proceeding. The paragraph outlines the basics of working with the canvas, including the use of layers such as the base layer for direct image modifications and the mask layer for inpainting techniques to add or alter content within the image. The speaker also explains how to switch between layers using the 'Q' hotkey and touches on the potential of the canvas to improve the editing process.


πŸ–ŒοΈ Utilizing Masks and Inpainting for Detailed Image Editing

This paragraph delves deeper into the specifics of using masks and inpainting within the Unified Canvas. The speaker describes how the mask layer allows users to select portions of the image for targeted changes, using the concept of inpainting to modify details and add new content. The process of selecting regions with the 'B' hotkey and adjusting the mask display is explained. The paragraph also covers the use of the 'H' hotkey to toggle the mask's visibility and the ability to save and clear masks. The speaker provides a practical example of changing an item of clothing in the image from a corduroy jacket to a leather jacket, emphasizing the importance of the bounding box and prompt accuracy for effective AI interpretation and image generation.


🌟 Enhancing Image Details with Bounding Box and Scaling

The speaker discusses the use of the bounding box to control the focus of the AI model and the 'scale before processing' feature to maintain image quality when editing smaller regions of the image. The paragraph explains how the AI model uses the maximum power available to generate images at a specific size, and how 'scale before processing' ensures that the generated details are composited into the smaller selected region. The speaker then demonstrates how to add finer details to a model's face using the bounding box and how to adjust the prompt accordingly. The process of generating new looks for the model and saving the edited images to the gallery is also covered.


πŸ“ Techniques for Extending and Outpainting Images

This paragraph focuses on the techniques for extending and outpainting images using the Unified Canvas. The speaker explains the importance of having enough context from the original image to inform the generated content in the empty spaces. The concept of the 'rule of threes' is introduced to ensure a proper balance between empty and filled regions of the image for effective outpainting. Four infill methods are mentioned, with the default 'patch match' method being recommended for most use cases. The speaker also discusses the 'coherence pass' feature within the compositing dropdown to control for seams in the generated images and the use of denoising strength to achieve a balanced result. The paragraph concludes with a practical example of outpainting an image and the importance of clear suggestions to the AI model for successful results.


πŸ› οΈ Advanced Editing and Confidence Building with AI Tools

The final paragraph addresses the realities of working with AI-assisted image editing tools and the importance of building confidence through exploration and experimentation. The speaker encourages embracing the exploratory nature of the process and learning how the tool works to achieve desired results. The paragraph also touches on the potential for future advanced techniques to offer more control, such as IP adapter control net. The speaker concludes by reassuring that unexpected results are part of the creative process and that with practice, users will gain the skills to effectively use the system for image editing.



πŸ’‘Unified Canvas

The Unified Canvas is a platform that enables users to create and composite images with the assistance of AI technologies. It serves as a tool to enhance images generated by AI or to augment user-created images with additional creative control. In the context of the video, the canvas is introduced as a means to refine AI-generated images that are close to perfect but may require further iteration and improvement.

πŸ’‘AI-Assisted Technologies

AI-Assisted Technologies refer to the use of artificial intelligence to aid in various tasks, such as image creation and editing. In the video, these technologies are used to generate images that can be further modified and improved upon by the user. The AI assists in creating a base image, which can then be fine-tuned and customized through the Unified Canvas.


Inpainting is a technique used in image editing where missing or unwanted parts of an image are filled in or 'painted' with new content that matches the surrounding area. In the context of the video, inpainting is a key feature of the Unified Canvas, allowing users to select specific portions of an image for modification through the mask layer, adding new details or correcting imperfections.

πŸ’‘Mask Layer

The Mask Layer is a feature within the Unified Canvas that enables users to select and isolate specific areas of an image for editing. This layer is crucial for making targeted adjustments without affecting the rest of the image, and it works in conjunction with the base layer to achieve the desired edits.

πŸ’‘Base Layer

The Base Layer in the Unified Canvas is the fundamental layer where the original image content resides. Users can make direct modifications to this layer, such as adding new colors or structures, which are then processed by the AI denoising process to refine the image.


Denoising is a process in image editing that aims to reduce or eliminate visual noise or artifacts in an image. In the context of the video, the denoising process is part of the AI-assisted technology that smooths out imperfections and enhances the image quality after direct modifications are made on the base layer.

πŸ’‘Bounding Box

The Bounding Box is a selection tool used in image editing to define a specific area or region of an image for processing or manipulation. In the video, the bounding box is crucial for guiding the AI model to focus its attention and generate content within the specified area, ensuring that the AI understands the context and the desired output.

πŸ’‘Staging Area

The Staging Area in the Unified Canvas is a feature that allows users to create and manage multiple iterations of an image. It provides a toolbar for comparing different versions, accepting or discarding changes, and saving the preferred versions to the gallery. This area facilitates the process of refining and finalizing the edited images.


Outpainting is a technique in image editing where the AI generates new content to extend the boundaries of an existing image, filling in the empty or selected areas with content that matches the style and context of the original image. This process is used to expand images or add details to incomplete sections.

πŸ’‘Coherence Pass

The Coherence Pass is a feature within the Unified Canvas that helps to ensure the seamless blending of newly generated content with the existing image. It involves a two-step process where the image is generated and then composited, with the area where the two parts meet being blurred together to create a smooth transition.


The purpose of the unified canvas is to create and composite a perfect image using AI-assisted technologies.

The unified canvas allows for the combination of AI tooling and creative control to refine images generated or augmented by AI.

Users can navigate to the unified canvas and drag an image onto it or send an image from the studio to the canvas.

The base layer is where changes are made directly to the image content, which will be denoised in the process.

The mask layer is used for inpainting, allowing users to select portions of the image for modification.

Switching between the mask and base layer can be done using the Q hotkey for efficient editing.

Masks can be saved for future use or cleared entirely from the canvas.

The bounding box, indicated by a dotted box, guides the AI's focus and should match the prompt describing the image content.

The staging area allows for multiple iterations of the same content and the ability to save or discard each iteration.

Inpainting with mini models can enhance details in characters or objects, especially those further in the background.

The scale before processing mode ensures that images are generated at the maximum size the model can handle, regardless of the bounding box size.

The rule of threes is recommended for out painting, where at most one-third of the image should be empty to provide enough context for accurate generation.

There are four infill methods to extract colors from the original image for out painting, with patch match being the default and most effective for most cases.

Adjusting the denoising strength and blur methods can help control for inconsistencies in out painting generations.

Manual infills allow users to source colors and block in areas for out painting, providing more control over the generation process.

The AI model requires clear suggestions and understanding of spatial relationships, especially when adding complex elements like trees.

Saving the edited image to the gallery is straightforward and allows for future use of the refined content.