AI Struggles to Capture the Essence of Dramatic Poses with Midjourney
- 9 minutes read - 1851 wordsTable of Contents
Dramatic poses are a powerful tool in visual storytelling, used to convey emotion, action, and character. They often involve exaggerated movements, dynamic angles, and a strong sense of visual impact. However, replicating these poses accurately and effectively requires a deep understanding of artistic principles, which is a challenge for current AI models. This blog post explores the limitations of AI in capturing the essence of dramatic poses through a case study, analyzing the results of an experiment where an AI model was tasked with generating images based on specific poses and scenes.
Created with: midjourney
A Moment of Solitude Amidst Majestic Peaks
A lone figure finds peace on a rocky cliff, dwarfed by the towering mountains and swirling clouds. The scene evokes a sense of serenity and awe, highlighting the vastness of nature and the power of the landscape.
Prompt
crossed-legs crossed-legs: determined, contemplative ; A lone adventurer, sitting on a cliff edge; wide shot; Adventure; a vast, breathtaking mountain range; cinematic
Characteristic
Shot : A lone figure sits on a cliff overlooking a vast mountain valley, with clouds and a distant village visible in the distance.
Aesthetic Score : 0.8
Mood : serene, contemplative, majestic
Quality
Entropy : 6.75
Noise : 106
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors.
Roman Warrior Triumphant: A Moment of Epic Victory
A Roman warrior stands amidst the chaos of a battlefield, his armor gleaming, his sword held high. The city in the background burns, but his stance conveys strength and determination, capturing the dramatic power of a heroic victory.
Prompt
crossed-legs crossed-legs: triumphant, confident ; A victorious warrior, standing tall on a battlefield; medium shot; Heroism; fallen enemies and a burning city in the background; cinematic
Characteristic
Shot : A lone Roman soldier stands amidst a battlefield, the aftermath of a fierce battle, with bodies lying scattered around him, fires raging, and a cityscape in the background.
Aesthetic Score : 0.7
Mood : dramatic, epic, powerful
Quality
Entropy : 6.63
Noise : 117
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some minor artifacts, particularly in the smoke and fire effects, and the rendering of the soldier’s armor is slightly unrealistic.
Gamer’s Paradise: Red and Orange Lights Illuminate a Relaxed Workspace
This image captures the essence of a gamer’s haven. The vibrant red and orange lighting creates a dramatic atmosphere, highlighting the relaxed figure with their feet up on the desk and the glowing computer screen in the background. The mood is playful and edgy, reflecting the energy of the gaming world.
Prompt
crossed-legs crossed-legs: intense, focused ; A gamer, intensely focused on a screen; close-up; Gaming; a dimly lit room with glowing monitors and gaming peripherals; cinematic
Characteristic
Shot : A person sitting at a desk with their feet up, in front of a computer screen showing a video game, the desk is illuminated by warm and cool lighting, there is a string of lights around the screen.
Aesthetic Score : 0.6
Mood : chill, relaxed, gaming
Quality
Entropy : 6.31
Noise : 98
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.30
Image errors : No visible errors
Contemplating the City’s Vastness
A group of men find solace on a rocky cliff, gazing out at a sprawling city and distant mountains. The panoramic view evokes a sense of tranquility, contemplation, and nostalgia, highlighting the awe-inspiring scale of urban life.
Prompt
crossed-legs crossed-legs: excited, awe-struck ; A group of tourists, admiring a breathtaking view; medium shot; Tourism; a panoramic vista of a bustling city skyline; cinematic
Characteristic
Shot : A group of young men sit on a rocky cliff overlooking a large city with mountains in the distance. The city is very dense and seems to stretch out for miles. The men are all looking out at the view. The image was taken on a cloudy day. The image has a vintage feel.
Aesthetic Score : 0.7
Mood : tranquil, contemplative, nostalgic
Quality
Entropy : 6.61
Noise : 102
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.00
Image errors : The image is slightly grainy and has some artifacts from the film stock used to create it.
Fleeting Moments: A Melancholy Journey Through Blurred Landscapes
A wistful glimpse of a person’s legs hanging out of a train window, the fast-moving scenery creating a sense of fleeting time and introspective reflection. The contrast between the dark train interior and the bright, blurred countryside evokes a melancholic mood.
Prompt
crossed-legs crossed-legs: reflective, nostalgic ; A traveler, gazing out of a train window; close-up; Travel; a blur of passing landscapes and towns; cinematic
Characteristic
Shot : A person’s feet dangling out of a train window as the scenery speeds by, the sky is overcast with dark clouds
Aesthetic Score : 0.6
Mood : melancholy, contemplative, journey
Quality
Entropy : 5.99
Noise : 91
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some noise and slight blurring in the image, particularly in the background.
Campfire Magic Under a Starry Sky
A group of friends gather around a crackling campfire, their laughter echoing through the cool forest air. The warm glow of the flames contrasts with the vast darkness above, creating a sense of intimacy and wonder. This scene captures the joy, warmth, and nostalgia of a perfect summer night.
Prompt
crossed-legs crossed-legs: joyful, relaxed ; A group of friends, laughing and sharing stories around a campfire; medium shot; Groups; a serene forest setting with twinkling stars above; cinematic
Characteristic
Shot : A group of friends are gathered around a campfire in a forest under a starry night.
Aesthetic Score : 0.8
Mood : joyful, serene, warm
Quality
Entropy : 6.25
Noise : 81
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some slight imperfections in the brushstrokes, which could be perceived as a stylistic choice or a slight imperfection.
A Moment of Solitude Among the Stars
An astronaut gazes out at Earth from the vastness of space, a solitary figure contemplating the beauty and fragility of our planet against the backdrop of swirling clouds and twinkling stars.
Prompt
crossed-legs crossed-legs: awe-inspired, contemplative ; A lone astronaut, gazing at Earth from a spaceship window; close-up; Heroism; a vast, blue planet against the backdrop of space; cinematic
Characteristic
Shot : An astronaut is sitting in a spaceship window, looking at the Earth. The Earth is a vibrant blue and green, with clouds and oceans. The stars are visible in the background.
Aesthetic Score : 0.8
Mood : solitude, wonder, contemplation
Quality
Entropy : 5.66
Noise : 91
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some minor artifacts, such as the edges of the Earth looking slightly pixelated.
Shadows and Secrets: A Campfire in the Cave’s Embrace
Five men huddle around a flickering campfire, their faces illuminated by the dancing flames in the cavernous darkness. The scene is steeped in suspense, mystery, and adventure, as the contrast between light and shadow hints at a perilous situation unfolding.
Prompt
crossed-legs crossed-legs: suspenseful, cautious ; A group of explorers, huddled together in a dark cave; medium shot; Adventure; flickering torches illuminating the rough stone walls; cinematic
Characteristic
Shot : A group of five men are gathered around a small campfire in a cave. The cave is dark and mysterious, with beams of light streaming in from above. The men are dressed in rugged clothing and appear to be on an adventure.
Aesthetic Score : 0.7
Mood : mysterious, adventurous, tense
Quality
Entropy : 6.21
Noise : 118
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to be AI-generated, with some artifacts and unnatural textures, specifically in the fire and the cave walls.
Victory Dance! Gamer Celebrates Triumph Amidst Confetti Shower
A young man, radiating joy and energy, throws his arms in the air, surrounded by a flurry of confetti. He’s clearly celebrating a hard-earned victory, captured in this moment of pure triumph.
Prompt
crossed-legs crossed-legs: exuberant, joyful ; A gamer, celebrating a victory with a triumphant fist pump; close-up; Gaming; a brightly lit room with a celebratory confetti explosion; cinematic
Characteristic
Shot : A young man wearing headphones is sitting in a gaming chair with confetti falling around him. He is raising his fist in the air and appears to be excited.
Aesthetic Score : 0.7
Mood : joyful, triumphant, celebratory
Quality
Entropy : 6.77
Noise : 87
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.90
Image errors : Some of the confetti particles have noticeable blur and unrealistic shapes, and the subject’s hair looks slightly too smooth.
A Symphony of Flavors: Street Food in India
Experience the vibrant energy of a bustling Indian street food scene, where colorful dishes and lively crowds create a feast for the senses. This image captures the essence of Indian street food culture, with its delicious aromas, vibrant textures, and infectious energy.
Prompt
crossed-legs crossed-legs: lively, adventurous ; A group of travelers, sharing a meal at a bustling street market; medium shot; Travel; vibrant colors and aromas of exotic food stalls; cinematic
Characteristic
Shot : A group of people are eating at a street food stall. The food is colorful and looks delicious. There are many people in the background, but the focus is on the people eating. The photo is taken from a low angle, which gives a sense of immediacy.
Aesthetic Score : 0.7
Mood : lively, bustling, colorful
Quality
Entropy : 6.78
Noise : 107
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : There is some slight graininess in the image, but it is not a major issue.
Conclusion
The results show that the generative AI model performed okay in terms of camera position and shot analysis, but not so well in terms of aesthetic analysis. Here’s a breakdown:
- Camera Position: The model scored 0.25, which is below the “good” range of 0.5 to 0.75. This suggests that the model didn’t quite capture the intended camera positions described in the prompt.
- Shot Analysis: The model scored 0.44, also below the “good” range. This indicates that the model didn’t fully understand the scene described in the prompt and didn’t create the expected shot composition.
- Aesthetic Analysis: The model scored 0.04, which is far from the “very good” range of -0.2 to 0.1. This means the generated image’s aesthetic deviated significantly from the expected aesthetic described in the prompt.
Overall, the model struggled to accurately interpret the prompt’s instructions regarding camera position, shot composition, and aesthetic style.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://midjourney.com