AI's Artistic Journey: Capturing Poses, But Missing the Essence with Midjourney
- 10 minutes read - 2003 wordsTable of Contents
Dramatic poses are a powerful tool in visual storytelling, conveying emotions and narratives through body language. From heroic stances to contemplative gazes, these poses are often used in art, photography, and film to create impactful imagery. This blog post explores the capabilities of an AI model in generating images based on specific poses and scenes, highlighting its strengths and weaknesses in capturing the essence of dramatic poses.
Created with: midjourney
A Knight’s Solitude Amidst the Storm
A lone knight stands defiant on a windswept cliff, overlooking a tumultuous sea and a distant medieval castle. The dramatic lighting, stormy backdrop, and the knight’s solitary figure create a sense of epic drama and impending danger.
Prompt
three-quarter-pose three-quarter-pose: determined, resolute, heroic ; A lone knight, standing tall on a windswept hilltop; three-quarter pose; Heroism; a vast, stormy landscape with a distant castle in the background; cinematic
Characteristic
Shot : A lone knight stands on a cliff overlooking a stormy sea, with a distant castle in the background.
Aesthetic Score : 0.7
Mood : dramatic, melancholic, epic
Quality
Entropy : 6.59
Noise : 116
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.80
Image errors : Some of the textures, especially on the clouds and the knight’s armor, appear a bit grainy and unrealistic.
Silhouetted Against the Setting Sun: A Lone Traveler’s Journey Begins
A solitary figure stands on a cliff, bathed in the warm glow of the setting sun. The jungle stretches out below, leading the eye towards a majestic Mayan temple in the distance. This evocative scene whispers of mystery, adventure, and the promise of discovery.
Prompt
three-quarter-pose three-quarter-pose: adventurous, curious, hopeful ; An intrepid explorer, silhouetted against the setting sun, holding a map; three-quarter pose; Adventure; a dense jungle with ancient ruins in the distance; cinematic
Characteristic
Shot : A solitary figure stands on a cliff, looking at a map while gazing at a large temple in the distance, against a sunset backdrop. The scene is set in a dense jungle, likely a tropical environment.
Aesthetic Score : 0.6
Mood : mysterious, adventurous, serene
Quality
Entropy : 5.95
Noise : 81
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some painting-like artifacts and the texture of the jungle foliage is a bit repetitive and artificial.
Cyberpunk City Lights: A Focused Hacker at Work
A young man, clad in a white jacket and glasses, sits before a glowing computer screen, his fingers flying across the keyboard. The neon-drenched cityscape behind him creates a futuristic, cyberpunk atmosphere, hinting at a world of secrets and intrigue. The scene is both focused and dramatic, capturing the intensity of his work.
Prompt
three-quarter-pose three-quarter-pose: focused, intense, exhilarated ; A gamer, eyes glued to the screen, fingers flying across the keyboard; three-quarter pose; Gaming; a brightly lit gaming room with neon lights and a futuristic cityscape projected on the wall; cinematic
Characteristic
Shot : A young man is sitting at a desk in front of a computer. He is wearing glasses and a pink jacket. He is typing on the keyboard. The background is a blurry city at night, with many neon lights.
Aesthetic Score : 0.8
Mood : futuristic, cyberpunk, urban
Quality
Entropy : 6.58
Noise : 94
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image is slightly blurry. The neon lights are overexposed and there is some noise in the image.
Parisian Dreams: A Moment of Wanderlust
A woman, bundled in warmth, stands on a Parisian street, her gaze fixed on the iconic Eiffel Tower. The blurred background captures the bustling energy of the city, while her silhouette against the distant landmark evokes a sense of nostalgic charm and wanderlust.
Prompt
three-quarter-pose three-quarter-pose: amazed, joyful, curious ; A tourist, gazing in awe at the Eiffel Tower, camera in hand; three-quarter pose; Tourism; a bustling Parisian street with cafes and shops lining the sidewalk; cinematic
Characteristic
Shot : A woman is taking a photo of the Eiffel Tower in Paris. The scene is a busy street with many people walking by. The photo was taken on a sunny day with a clear blue sky.
Aesthetic Score : 0.7
Mood : romantic, nostalgic, vibrant
Quality
Entropy : 6.91
Noise : 97
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some minor artifacts and blur due to motion blur
A Moment of Awe: Hiker Finds Serenity on a Mountain Peak
A lone hiker stands triumphantly on a mountain summit, arms outstretched, taking in the breathtaking panorama of snow-capped peaks and a sprawling valley below. The vastness of the landscape evokes a sense of awe and insignificance, leaving the viewer with a feeling of serenity and inspiration.
Prompt
three-quarter-pose three-quarter-pose: free, exhilarated, adventurous ; A backpacker, standing on a mountain peak, arms outstretched, enjoying the view; three-quarter pose; Travel; a breathtaking panorama of snow-capped mountains and valleys; cinematic
Characteristic
Shot : A lone hiker stands on a mountaintop with outstretched arms, taking in the view of a vast snow-capped mountain range. The sky is cloudy but the light is bright, casting long shadows on the landscape.
Aesthetic Score : 0.8
Mood : inspiring, adventurous, serene
Quality
Entropy : 6.71
Noise : 94
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image appears to have been sharpened excessively, resulting in some artifacts around edges.
Campfire Nights: A Starry Escape with Friends
A group of friends gather around a crackling campfire, bathed in the warm glow of the flames against the backdrop of a star-studded sky. The scene evokes a sense of nostalgia, peace, and warmth, capturing the essence of shared moments under the open sky.
Prompt
three-quarter-pose three-quarter-pose: happy, relaxed, connected ; A group of friends, laughing and sharing stories around a campfire; three-quarter pose; Groups; a serene forest clearing with stars twinkling in the night sky; cinematic
Characteristic
Shot : A group of six people are sitting around a campfire in a forest at night. There are many stars visible in the sky, and the campfire is casting a warm glow on the group. The scene is peaceful and serene.
Aesthetic Score : 0.8
Mood : serene, peaceful, warm
Quality
Entropy : 6.21
Noise : 120
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image appears to be painted, and there are no noticeable artifacts or errors.
Hope Rises from the Ashes: Superhero Stands Tall in Devastated City
A powerful image captures the resilience of a superhero, standing defiantly on a pile of rubble in a city ravaged by disaster. The burning fires in the distance and the hero’s red cape create a dramatic and hopeful scene, suggesting a future where hope prevails over destruction.
Prompt
three-quarter-pose three-quarter-pose: powerful, victorious, confident ; A superhero, standing triumphantly over a defeated villain; three-quarter pose; Heroism; a cityscape with smoke and debris in the background; cinematic
Characteristic
Shot : A superhero stands triumphantly atop a pile of rubble in a destroyed city, cape billowing in the wind, with a cityscape behind them.
Aesthetic Score : 0.7
Mood : triumphant, heroic, hopeful
Quality
Entropy : 6.52
Noise : 114
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.80
Image errors : Some of the textures, especially on the rubble, appear pixelated and unrealistic.
Lost in the Majesty: Hikers Conquer a Snowy Ridge
Two adventurers navigate a narrow snow-covered ridge, dwarfed by the majestic, snow-capped mountains in the distance. The cloudy sky adds a sense of mystery and adventure, while the dramatic play of light and shadow creates an awe-inspiring scene. This epic landscape evokes a feeling of serenity and wonder, capturing the essence of a breathtaking journey.
Prompt
three-quarter-pose three-quarter-pose: determined, focused, adventurous ; A group of adventurers, navigating a treacherous mountain path; three-quarter pose; Adventure; a rugged mountain range with snow-covered peaks and a deep valley below; cinematic
Characteristic
Shot : Two hikers with backpacks walk along a narrow rocky path. They are silhouetted against a majestic backdrop of snow-capped mountains and misty valleys.
Aesthetic Score : 0.8
Mood : serene, adventurous, awe-inspiring
Quality
Entropy : 6.79
Noise : 117
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly soft, particularly in the background, which could be due to lens limitations or post-processing. There is also a bit of noise in the shadows.
Hacking the System: Three Young Coders Face a High-Stakes Challenge
In a dimly lit room, three young people huddle around a computer screen, their faces illuminated by the glow of the monitor. The atmosphere is tense, the air thick with anticipation. A pizza box sits forgotten on the desk, a testament to the hours they’ve spent working. The camera angle, slightly elevated, draws the viewer into the scene, creating a sense of intimacy and claustrophobia. What are they working on? And what are the stakes?
Prompt
three-quarter-pose three-quarter-pose: focused, competitive, excited ; A group of gamers, huddled around a table, strategizing their next move; three-quarter pose; Gaming; a dimly lit room with flickering computer screens and a stack of pizza boxes; cinematic
Characteristic
Shot : Three young adults are gathered around a table in a dimly lit room. They are looking at a computer screen, likely playing a game. The room is decorated with gaming posters and other items. There is a pizza box on the table, suggesting they are taking a break from their gaming session.
Aesthetic Score : 0.6
Mood : intense, focused, competitive
Quality
Entropy : 5.73
Noise : 88
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has a slight amount of noise, which is common in low-light photography. There is also some slight chromatic aberration, particularly in the edges of the image.
A Family’s Joyful Day in a Colorful Town
A heartwarming scene of a family, radiating happiness and love, standing before a vibrant building with a church steeple in the background. The warm colors and their joyful expressions create a sense of warmth and togetherness.
Prompt
three-quarter-pose three-quarter-pose: happy, joyful, memorable ; A family, standing in front of a famous landmark, smiling for a photo; three-quarter pose; Tourism; a vibrant city square with colorful buildings and street performers; cinematic
Characteristic
Shot : A young family, a man, a woman and a baby, are standing in a European city square with colorful buildings and a church in the background.
Aesthetic Score : 0.7
Mood : happy, joyful, loving
Quality
Entropy : 6.88
Noise : 83
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some minor artifacts in the background.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.35, which is below the “good” range of 0.5 to 0.75. This suggests that the model didn’t perfectly capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.5, which falls within the “good” range. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.26, which is significantly lower than the “very good” range of -0.2 to 0.1. This suggests that the generated image didn’t quite match the expected aesthetic style described in the prompt.
Overall, the model shows promise in understanding scene composition and camera positioning, but needs improvement in capturing the desired aesthetic.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://midjourney.com