I Ran Stable Diffusion 3 Prompts in Midjourney | SD3 vs. Midjourney Prompt Battle

Lexie AI
31 Mar 202403:50

TLDRThe video script presents a comparative analysis of Stable Diffusion 3 and Mid Journey AI art generation models. It showcases five different prompts, including an Elven Ranger, a child riding a llama, an alien banana cop, an anime girl, and a stack of animals. The comparison highlights the strengths and weaknesses of each model, with Stable Diffusion 3 generally producing more accurate and detailed images, despite some minor issues. The video ends with a call to action for viewers to explore the potential of AI-generated art and subscribe to the channel for more content.

Takeaways

  • 🎨 The video discusses a comparison between Stable Diffusion 3 (SD3) and Mid Journey (MJ) AI art generation models.
  • 🚀 Stable Diffusion 3 is not yet widely available to the public, but early access results are impressive.
  • 🏹 The first prompt compared was for an 'Elven Ranger' with specific characteristics; SD3 missed the bow, while MJ had a minor issue with the arrow.
  • 🦙 The 'Llama Kid' prompt resulted in a cute image from SD3, but MJ struggled with the desert setting and the depiction of the child.
  • 👮‍♂️ The 'Alien Banana Cop' prompt saw SD3 create a more accurate and creative image compared to MJ's less convincing cop portrayal.
  • 👩‍🎤 The 'Let's Go Girl' anime-style girl prompt was better handled by SD3, with MJ's text rendering needing improvement.
  • 🤣 The final prompt, 'Stack of Animals', showcased SD3's difficulty with the dog and mule, while MJ produced a humorous and creative interpretation.
  • 🏆 SD3 won the majority of the matchups, demonstrating its strength in generating detailed and accurate images.
  • 📈 The video encourages viewers to subscribe to the channel, which is new and appreciates support.
  • 🎥 The video includes a call to action to like and subscribe, emphasizing viewer engagement with the content.
  • 🌐 The video script provides insights into the capabilities and potential of AI art generation models like Stable Diffusion 3 and Mid Journey.

Q & A

  • What is the main subject of the video?

    -The main subject of the video is a comparison between the results produced by Stable Diffusion 3 and Mid Journey, two AI image generation models, based on various prompts.

  • How does the video demonstrate the capabilities of Stable Diffusion 3?

    -The video demonstrates the capabilities of Stable Diffusion 3 by showcasing the images it generated from different prompts and comparing them with the images produced by Mid Journey.

  • What is the first prompt compared in the video?

    -The first prompt compared in the video is 'badass Elven Archer', which describes an Elven Ranger with braided platinum hair, a rune-etched bow, glowing eyes, and aiming at a roaring Dragon.

  • What is the issue with the Stable Diffusion 3 image for the 'badass Elven Archer' prompt?

    -The issue with the Stable Diffusion 3 image for the 'badass Elven Archer' prompt is that the bow is missing, which is a crucial detail for the character.

  • Which model performed better for the 'badass Elven Archer' prompt according to the video?

    -According to the video, Mid Journey version 6 performed better for the 'badass Elven Archer' prompt, despite the arrow going through the elf's thumb.

  • What is the second prompt featured in the comparison?

    -The second prompt featured in the comparison is 'llama kid', which asks for a digital art picture of a child riding a llama with a bell on its tail through a desert.

  • What is the main critique of the Mid Journey image for the 'llama kid' prompt?

    -The main critique of the Mid Journey image for the 'llama kid' prompt is that it's hard to tell if the character is a kid, and it doesn't really look like a desert, despite the llama being accurately depicted.

  • Which model won the 'alien banana cop' prompt comparison?

    -Stable Diffusion 3 won the 'alien banana cop' prompt comparison, as it generated an image that was more in line with the prompt, showing a Xenomorph police officer enjoying a banana during golden hour in Hawaii.

  • What was the final prompt compared in the video?

    -The final prompt compared in the video was 'stack of um animals', depicting a rooster standing on a cat, which is standing on a dog, which is standing on a mule, which is standing on a turtle.

  • What was the verdict for the 'stack of um animals' prompt comparison?

    -Mid Journey's interpretation of the 'stack of um animals' prompt was deemed more creative and amusing, particularly because it showed a chicken, dog, and turtle stacked on top of each other in a humorous way.

  • How does the video encourage viewer engagement?

    -The video encourages viewer engagement by asking viewers to like the video, subscribe to the channel, and share their opinions in the comments section, particularly on the quality of the AI-generated images and the performance of each model.

Outlines

00:00

🎨 Stable Diffusion 3 Art Comparison

This paragraph introduces a video comparing the outputs of Stable Diffusion 3, an AI art generation model, with another AI model, Mid Journey. The comparison is based on five different prompts provided by a user who has early access to Stable Diffusion 3. The video promises a variety of images, with the last one being particularly surprising. The first prompt involves an Elven Ranger with a specific description, and the paragraph discusses the results from both AI models, highlighting the details and discrepancies in their outputs.

Mindmap

Keywords

💡stable diffusion 3

Stable diffusion 3 is a term that refers to an advanced AI model used for generating images from text prompts. In the context of the video, it is one of the two AI systems being compared to create digital art based on given prompts. The results from stable diffusion 3 are used as a benchmark to evaluate the performance of the other AI system, Mid Journey.

💡prompts

In the context of AI and the video, prompts are textual descriptions or requests that guide the AI to generate specific images. They are the inputs provided to the AI system, which then produces a visual representation based on the prompt's content.

💡Elven Ranger

Elven Ranger is a fictional character concept described in the video as an elf with braided platinum hair, a rune-etched bow, glowing eyes, and aiming at a roaring dragon. This character serves as a prompt for the AI systems to generate an image, showcasing their ability to understand and visualize complex fantasy elements.

💡Mid Journey

Mid Journey refers to another AI system being compared against stable diffusion 3 in the video. It is used to generate images based on text prompts and is evaluated on its performance in creating accurate and detailed digital art.

💡llama kid

Llama kid is a concept prompt describing a child riding a llama with a bell on its tail, set in a desert environment. This prompt is used to test the AI systems' ability to generate images with specific settings and characters.

💡alien banana cop

Alien banana cop is a creative prompt that challenges the AI systems to visualize a Xenomorph police officer enjoying a banana during the golden hour in Hawaii. This concept tests the AI's ability to blend elements of science fiction with everyday objects and settings.

💡anime style girl

Anime style girl refers to a prompt for an image of a girl characterized by the distinctive visual style of Japanese animation, or anime. This includes features like exaggerated expressions, vibrant colors, and stylized hair and eyes.

💡rooster standing on a cat

This phrase describes a humorous and complex prompt where a rooster is standing on a cat, which in turn is standing on a dog, which is standing on a mule, which finally is standing on a turtle. It tests the AI's ability to create a layered and intricate scene with multiple interacting animals.

💡digital art

Digital art refers to any artwork created using digital technology or computer software. In the context of the video, it highlights the output of AI systems that generate images based on text prompts, showcasing the evolving capabilities of AI in the realm of art and design.

💡AI-generated images

AI-generated images are visual outputs created by artificial intelligence systems in response to specific prompts or inputs. These images demonstrate the AI's ability to understand and interpret textual descriptions to produce corresponding visual content.

💡Xenomorph

Xenomorph is a term often used to describe a fictional extraterrestrial creature from the 'Alien' film series, characterized by its sleek, menacing appearance. In the video, it is used in a creative prompt for the AI to generate an image of an alien police officer, blending science fiction elements with a unique scenario.

Highlights

The introduction of Stable Diffusion 3, a groundbreaking AI tool not yet widely available to the public.

The unique approach of using prompt requests from random people to showcase the capabilities of Stable Diffusion 3.

An impressive image of an Elven Ranger with braided platinum hair, a rune-etched bow, and glowing eyes, despite a minor detail missing.

The humorous detail in the Stable Diffusion 3 image where the elf's middle finger is used as an arrow.

The comparison between Stable Diffusion 3 and Mid Journey version 6, with Mid Journey having a minor issue with the arrow going through the elf's thumb.

The creative prompt of a child riding a llama through a desert, highlighting the strengths and weaknesses of both AI tools.

The accurate representation of a llama in the Stable Diffusion 3 image, despite the misplaced bell.

The imaginative prompt of an alien banana cop, showcasing the AI's ability to handle complex and unusual scenarios.

The unexpected and amusing result of the alien banana cop prompt, with the Xenomorph police officer enjoying a banana in Hawaii.

The anime-style girl prompt, demonstrating the AI's capability to create detailed and expressive characters.

The critique of Mid Journey's text generation, highlighting the room for improvement in its speech bubble creation.

The final and comical prompt of a stack of animals, featuring a rooster on a cat on a dog on a mule on a turtle.

The impressive and imaginative result from Mid Journey for the stack of animals prompt, despite some issues with the dog and mule.

The overall comparison and conclusion of the AI tools, with Stable Diffusion 3 taking the lead in this best of five faceoff.