AI's Artistic Journey: Capturing Dramatic Poses, But Missing the Shot with Imagen-v3-fast
- 9 minutes read - 1882 wordsTable of Contents
In the realm of artificial intelligence, the ability to generate images based on textual prompts is rapidly evolving. One intriguing area of exploration is the creation of dramatic poses within specific scenes. This involves not only understanding the physical form of the pose but also capturing the intended emotion, setting, and camera perspective. This blog post delves into the results of an AI model tasked with generating images based on dramatic poses and scenes, highlighting its strengths and weaknesses in capturing the essence of these artistic elements.
Created with: imagen-v3-fast
City on Fire, Hope Falls
A lone figure stands silhouetted against a fiery sunset, a city skyline ablaze behind them. Above, another figure plummets towards the earth, adding a sense of impending doom to the already dramatic scene. This image captures the raw emotion of an apocalyptic moment, leaving the viewer to wonder what fate awaits the lone figure below.
Prompt
poses falling: Epic, desperate, hopeful ; A lone figure in a tattered cape; wide shot; Heroism; A burning city skyline; cinematic
Characteristic
Shot : A lone figure stands in the foreground, facing a city skyline with a fiery sunset in the background. Above them, another figure falls headfirst towards the earth.
Aesthetic Score : 0.7
Mood : dramatic, apocalyptic, suspenseful
Quality
Entropy : 6.40
Noise : 47
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some minor artifacts, such as the jagged edges of the falling figure and the cityscape.
A Solitary Ascent: Hope in the Canyon’s Embrace
A lone figure scales a ladder in a deep, narrow canyon, bathed in soft light filtering through the opening above. Lush greenery clings to the walls, creating a sense of mystery and adventure. The narrowness of the canyon emphasizes the figure’s isolation and the challenge of the climb, while the light suggests a possible exit and a glimmer of hope.
Prompt
poses falling: Suspenseful, thrilling, determined ; A lone explorer clinging to a rope ladder; close-up; Adventure; A vast, misty jungle canyon; cinematic
Characteristic
Shot : A lone figure climbs a ladder in a deep, narrow canyon. Lush greenery covers the walls, and a soft light filters through the opening at the top.
Aesthetic Score : 0.7
Mood : mysterious, adventurous, hopeful
Quality
Entropy : 6.68
Noise : 69
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.80
Image errors : The figure’s hand on the ladder and the way the ladder disappears into the top of the canyon appear somewhat unnatural. The foliage in the background seems slightly blurry and lacking detail.
Neon City Chase: A Pixelated Fall From Grace
A dynamic scene unfolds in a futuristic city bathed in neon light. One figure falls, while another races through the pixelated landscape. The dramatic composition and retro aesthetic create a thrilling sense of action and suspense.
Prompt
poses falling: Energetic, chaotic, playful ; A pixelated character plummeting through a digital landscape; medium shot; Gaming; A neon-lit cityscape with glowing buildings; cinematic
Characteristic
Shot : Two figures, one falling, one running, in a futuristic city with neon lights and pixelated graphics.
Aesthetic Score : 0.6
Mood : retro, futuristic, action
Quality
Entropy : 6.20
Noise : 88
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has noticeable pixelation and jagged edges, typical of low-resolution pixel art.
Peaceful Majesty Meets Thrilling Descent
A breathtaking view of a hot air balloon soaring over majestic mountains is punctuated by the exhilarating spectacle of four skydivers plummeting towards the valley below. This scene captures the perfect balance of tranquility and adventure, leaving you with a sense of awe and excitement.
Prompt
poses falling: Exhilarating, awe-inspiring, carefree ; A hot air balloon basket with tourists; long shot; Tourism; A breathtaking view of a mountain range with snow-capped peaks; cinematic
Characteristic
Shot : A hot air balloon is flying over a mountain range with four skydivers falling towards the ground. The view is from a high vantage point, looking down at the mountain valley.
Aesthetic Score : 0.8
Mood : peaceful, adventurous, majestic
Quality
Entropy : 6.93
Noise : 72
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.10
Image errors : The skydivers are slightly blurry, suggesting a fast shutter speed was used, which may have been necessary to capture the movement. There are no obvious image artifacts.
Precipice of Peril: A Dramatic Fall Towards the Valley
A heart-stopping image captures the moment a person plunges towards a distant river and valley, the sheer height of the cliff and the vastness of the landscape creating a palpable sense of danger and suspense. The dramatic composition emphasizes the perilous situation, leaving the viewer breathless with anticipation.
Prompt
poses falling: Adrenaline-fueled, chaotic, humorous ; A backpacker tumbling down a rocky hillside; close-up; Travel; A lush green valley with a winding river; cinematic
Characteristic
Shot : A person is falling off a cliff, towards a river and a valley in the distance. The scene is set in a mountainous region.
Aesthetic Score : 0.7
Mood : dramatic, suspenseful, dangerous
Quality
Entropy : 6.64
Noise : 84
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some minor errors in the rendering of the figure and the environment. There are some visible artifacts in the rocks and foliage, and the figure’s anatomy looks slightly off.
Adrenaline Rush: Plunging into the Unknown
A breathtaking scene of four individuals plummeting from a cliff, their descent framed by a vibrant blue sky and crashing ocean waves. This dramatic image captures the raw thrill and exhilaration of adventure, leaving you breathless with anticipation.
Prompt
poses falling: Heart-stopping, terrifying, bonding ; A group of friends holding hands, falling from a cliff; wide shot; Groups; A dramatic ocean coastline with crashing waves; cinematic
Characteristic
Shot : Four people are falling from a cliff over the ocean. The sky is a light blue and the ocean is a deep blue.
Aesthetic Score : 0.6
Mood : dramatic, adventurous, exhilarating
Quality
Entropy : 6.84
Noise : 90
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image looks slightly artificial, it could be a rendering, especially the sky and waves
Superhero Plunge: A Dramatic Fall into Chaos
Witness the intensity of a superhero’s fall, captured in a gritty, action-packed scene. The red cape billows dramatically as debris flies, creating a powerful sense of danger and urgency.
Prompt
poses falling: Dramatic, heroic, determined ; A superhero in mid-air, falling towards a collapsing building; close-up; Heroism; A cityscape with smoke and debris; cinematic
Characteristic
Shot : A superhero in a red cape falls backwards against a dark, gritty backdrop, with debris flying around them.
Aesthetic Score : 0.7
Mood : intense, dramatic, action-packed
Quality
Entropy : 6.28
Noise : 47
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.90
Image errors : Some of the debris looks slightly pixelated, particularly in the upper right corner.
Silhouetted Against the Sky: Climbers Conquer a Majestic Mountain
A breathtaking scene unfolds as three climbers rappel down a steep cliff face, their figures silhouetted against the bright sky. The lush green forest covering the mountains and the soft blue sky with wispy clouds create a dramatic and awe-inspiring backdrop. This image captures the scale and danger of their adventure, leaving viewers in awe of their courage and the beauty of the natural world.
Prompt
poses falling: Thrilling, adventurous, daring ; A group of adventurers rappelling down a sheer rock face; long shot; Adventure; A towering mountain peak with a breathtaking view; cinematic
Characteristic
Shot : A scenic view of a mountain range with three climbers rappelling down a steep cliff face. The climbers are silhouetted against the bright sky. The mountains are covered in a lush green forest and the sky is a soft blue with wispy white clouds.
Aesthetic Score : 0.7
Mood : dramatic, adventurous, awe-inspiring
Quality
Entropy : 6.58
Noise : 75
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image has a slight blur in the background, and the texture of the mountain is somewhat unnatural and pixelated.
A Dreamy Descent into the Unknown
A man plummets through a fantastical forest, guided by glowing flowers towards a rushing river in a deep canyon. The scene is both beautiful and unsettling, leaving the viewer to wonder about the man’s fate and the mysteries of this surreal world.
Prompt
poses falling: Magical, surreal, exciting ; A player character falling through a virtual world; medium shot; Gaming; A vibrant, fantastical landscape with glowing flora; cinematic
Characteristic
Shot : A man is falling through a fantastical forest with large, glowing flowers. He is falling towards a river that is rushing through a canyon
Aesthetic Score : 0.8
Mood : dreamy, surreal, mysterious
Quality
Entropy : 6.77
Noise : 81
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.90
Image errors : The man’s figure is somewhat distorted and his limbs are disproportionate. The background is blurry and lacks detail. The overall image has a slightly artificial look.
Whimsical Sunset Over a Cobblestone Street
A hot air balloon drifts serenely against a golden sunset sky, casting a magical glow over a charming European town. The cobblestone streets and quaint houses evoke a sense of nostalgia and wonder, inviting you to escape into a world of dreams.
Prompt
poses falling: Melancholy, wistful, serene ; A lone hot air balloon, silhouetted against a fiery sunset, descends slowly towards a quaint, cobblestone village.; cinematic
Characteristic
Shot : A picturesque cobblestone street in a European town, with charming houses and a hot air balloon floating in the golden sunset sky.
Aesthetic Score : 0.8
Mood : serene, whimsical, nostalgic
Quality
Entropy : 6.86
Noise : 79
Prompt Clip Score : 0.36
AI Evaluation
Likelihood of AI : 0.70
Image errors : There are no visible errors in the image.
Conclusion
The results show that the generative AI model performed well in understanding the camera position and shot composition, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.45, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.55, which is considered average. This indicates that the model was able to understand the scene and shot type described in the prompt, but not exceptionally well.
- Aesthetic Analysis: The model scored 0.07, which is considered very good. This means that the generated image closely matched the expected aesthetic style described in the prompt.
Overall, the model seems to be better at understanding the aesthetic style than the camera position and shot composition. This suggests that the model might need further training to improve its ability to accurately interpret and implement camera positions and shot types.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://deepmind.google/technologies/imagen-3/