AI's Artistic Struggle: Capturing Dramatic Poses with Imagen-v2
- 9 minutes read - 1739 wordsTable of Contents
Dramatic poses are a powerful tool in visual storytelling, conveying emotion, action, and tension. They often involve dynamic movement, exaggerated angles, and a sense of urgency. This blog post delves into the challenges of generating images with dramatic poses using AI, exploring how well these models understand the nuances of composition, aesthetics, and the emotional impact of a pose.
Created with: imagen-v2
Superman Soars Through a City in Flames
A dramatic image captures Superman flying through a burning city, his cape billowing behind him. The dark sky and smoke-filled air create a sense of urgency and danger.
Prompt
poses falling: Epic, desperate, hopeful ; A lone figure in a tattered cape; wide shot; Heroism; A burning city skyline; cinematic
Characteristic
Shot : Superman, weathered and wounded, flies through a burning city, possibly in the aftermath of a battle. The city is engulfed in flames, buildings are crumbling, and debris fills the air.
Aesthetic Score : 0.7
Mood : dramatic, heroic, apocalyptic
Quality
Entropy : 6.58
Noise : 87
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image appears to have been generated by AI, with some digital artifacts and unnatural textures, especially in the fire and smoke.
Lost in the Mist: A Man’s Perilous Climb
A lone figure hangs precariously from a ladder, suspended in a lush, misty forest. The scene evokes a sense of mystery, adventure, and suspense, leaving the viewer wondering what dangers lie ahead.
Prompt
poses falling: Suspenseful, thrilling, determined ; An explorer clinging to a rope ladder; close-up; Adventure; A vast, misty jungle canyon; cinematic
Characteristic
Shot : A man is swinging from a rope ladder in a dense jungle. The scene is misty and shrouded in fog.
Aesthetic Score : 0.7
Mood : mysterious, adventurous, suspenseful
Quality
Entropy : 6.58
Noise : 115
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no noticeable artifacts or errors in the image.
Pixelated Hero Soars Through a Neon-Drenched City
A blocky character, reminiscent of Minecraft, defies gravity in a futuristic cityscape. Crumbling buildings and vibrant neon lights create a dynamic scene, hinting at a playful yet mysterious adventure.
Prompt
poses falling: Energetic, chaotic, playful ; A pixelated character plummeting through a digital landscape; medium shot; Gaming; A neon-lit cityscape with glowing buildings; cinematic
Characteristic
Shot : A blocky, robotic figure made of stone is flying in a shattered, neon-lit city. The city is made up of jagged blocks and glowing lines of light.
Aesthetic Score : 0.6
Mood : surreal, futuristic, eerie
Quality
Entropy : 6.71
Noise : 111
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some artifacts and blurriness.
Soaring Above the Peaks: A Hot Air Balloon Adventure
Experience the thrill of freedom as a hot air balloon glides over a majestic mountain range, its snow-capped peaks reaching for the sky. The low angle shot captures the vastness and scale of the landscape, creating a sense of awe and adventure.
Prompt
poses falling: Exhilarating, awe-inspiring, carefree ; A hot air balloon basket with tourists; long shot; Tourism; A breathtaking view of a mountain range with snow-capped peaks; cinematic
Characteristic
Shot : A hot air balloon flies over a mountain range, with the camera pointing upwards toward the balloon.
Aesthetic Score : 0.7
Mood : adventurous, serene, majestic
Quality
Entropy : 6.76
Noise : 99
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry, especially towards the edges of the frame.
Precarious Perch: A Hikers’s Moment of Truth
A daring hiker clings to a cliff edge, arms outstretched, with a breathtaking valley spread below. The scene captures the thrill and danger of adventure, leaving the viewer wondering if he’ll make it back to safety.
Prompt
poses falling: Adrenaline-fueled, chaotic, humorous ; A backpacker tumbling down a rocky hillside; close-up; Travel; A lush green valley with a winding river; cinematic
Characteristic
Shot : A man in a red vest and black pants is hanging off the edge of a cliff with his arms outstretched, looking out over a valley with a winding river.
Aesthetic Score : 0.6
Mood : dramatic, adventurous, daring
Quality
Entropy : 6.80
Noise : 116
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no obvious artifacts or errors in the image.
Golden Hour Adventure on the Cliff’s Edge
Two women stand on a dramatic cliff edge, arms outstretched towards a figure in the foreground, as the golden hour bathes the ocean and sky in a warm glow. This captivating scene evokes a sense of adventure, optimism, and breathtaking beauty.
Prompt
poses falling: Heart-stopping, terrifying, bonding ; A group of friends holding hands, falling from a cliff; wide shot; Groups; A dramatic ocean coastline with crashing waves; cinematic
Characteristic
Shot : Three people are standing on a cliff overlooking the ocean. The two women are reaching out towards a man who is standing on the edge of the cliff. They are positioned as if they are trying to pull him back from the edge of the cliff. The sun is setting in the background creating a dramatic and beautiful sky.
Aesthetic Score : 0.7
Mood : dramatic, adventurous, suspenseful
Quality
Entropy : 6.50
Noise : 89
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some minor artifacts around the edges of the people’s bodies, likely from compression or editing.
Superman’s Gritty Stand: A Moment of Intense Battle
A powerful image captures Superman in a dynamic pose, his determined expression reflecting the intensity of the moment. The dark and gritty background suggests a fierce battle, leaving viewers on the edge of their seats.
Prompt
poses falling: Dramatic, heroic, determined ; A superhero in mid-air, falling towards a collapsing building; close-up; Heroism; A cityscape with smoke and debris; cinematic
Characteristic
Shot : A stylized superhero, possibly Superman, is flying through a destroyed city, with debris flying around him
Aesthetic Score : 0.6
Mood : action, gritty, epic
Quality
Entropy : 6.51
Noise : 79
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.90
Image errors : There is some blurring and noise in the image, particularly in the background. The detail of the debris is lacking, and the image is slightly soft. Some artifacts are visible on the suit.
A Dance with Danger: Climbers Conquer a Towering Rock Face
Two climbers, one ascending, one descending, navigate a massive rock face. The image captures the thrill and risk of their adventure, highlighting the climbers’ determination against the imposing backdrop.
Prompt
poses falling: Thrilling, adventurous, daring ; A group of adventurers rappelling down a sheer rock face; long shot; Adventure; A towering mountain peak with a breathtaking view; cinematic
Characteristic
Shot : Two climbers are rappelling down a steep rock face. The one in the foreground is almost at the bottom, while the other is further up and looks like they are about to rappel.
Aesthetic Score : 0.6
Mood : adventure, daring, focused
Quality
Entropy : 6.62
Noise : 104
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible artifacts or errors.
Mystical Flight Through a Glowing Mushroom Field
A surreal and dreamy scene unfolds as a glowing blue figure soars through the air above a field of luminous mushrooms. The hazy background adds to the mystical atmosphere, creating a captivating and visually stunning image.
Prompt
poses falling: Magical, surreal, exciting ; A player character falling through a virtual world; medium shot; Gaming; A vibrant, fantastical landscape with glowing flora; cinematic
Characteristic
Shot : A mysterious, glowing figure is flying over a vibrant, bioluminescent forest, with large mushrooms in the foreground.
Aesthetic Score : 0.6
Mood : dreamy, fantastical, ethereal
Quality
Entropy : 6.84
Noise : 72
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image suffers from a slight amount of over-blurring, particularly in the background. The composition is slightly cluttered, making it difficult to focus on a particular area. There’s a slight impressionistic quality that some might find distracting.
Golden Sunrise Balloon Ride Over a Tranquil Valley
Experience the magic of a hot air balloon ride as the sun rises over a picturesque town nestled in a valley. The golden light creates a breathtaking scene, perfect for a romantic adventure or a moment of peaceful reflection.
Prompt
poses falling: Romantic, nostalgic, heartwarming ; A family in a hot air balloon, falling towards a picturesque village; long shot; Tourism; A charming village with cobblestone streets and colorful houses; cinematic
Characteristic
Shot : A hot air balloon ride over a town at sunrise. The image is taken from inside the balloon, with the town below and the sunrise in the distance. The image is framed by the edge of the balloon basket and the hands of a passenger.
Aesthetic Score : 0.7
Mood : peaceful, magical, adventurous
Quality
Entropy : 6.85
Noise : 90
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible errors
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.45, which is below the “good” range of 0.5 to 0.75. This suggests that the model didn’t fully capture the intended camera positions described in the prompt.
- Shot Analysis: The model scored 0.585, which falls within the “good” range. This indicates that the model was able to understand the scene and create a shot that was generally consistent with the prompt.
- Aesthetic Analysis: The model scored 0.09, which is significantly higher than the “very good” range of -0.2 to 0.1. This suggests that the generated image’s aesthetic deviated significantly from the expected aesthetic described in the prompt.
Overall, the model demonstrated a good understanding of the scene and shot composition, but struggled to achieve the desired aesthetic.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://deepmind.google/technologies/imagen-2/