AI's Artistic Struggle: Capturing the Essence of Poses with Imagen-v3
- 9 minutes read - 1843 wordsTable of Contents
Dramatic poses are a powerful tool in visual storytelling, conveying emotions, actions, and narratives through the way a subject positions their body. From the iconic silhouette of a lone adventurer against a majestic mountain range to the triumphant pose of a soldier on a battlefield, these poses evoke a sense of drama and intrigue. However, teaching an AI model to understand and generate these poses with the desired aesthetic can be a challenging task. This blog post explores the results of an AI model tasked with generating images based on specific poses and scenes, highlighting its strengths and weaknesses in capturing the essence of dramatic poses.
Created with: imagen-v3
A Hiker’s Solitude Amidst Majestic Peaks
Experience the tranquility and awe of a lone hiker standing on a mountain ridge, overlooking a breathtaking panorama of snow-capped peaks, rolling hills, and lush greenery. The isolation and vastness of the landscape create a sense of wonder and inspire a sense of adventure.
Prompt
poses standing-tall: Determined, hopeful, awe-inspiring ; Lone adventurer; wide shot; Adventure; Majestic mountain range with a vast, clear sky; cinematic
Characteristic
Shot : A lone hiker stands on a mountain ridge overlooking a vast, snow-capped mountain range. The landscape is breathtaking, with rolling hills, lush greenery and a clear blue sky.
Aesthetic Score : 0.75
Mood : tranquil, adventurous, inspiring
Quality
Entropy : 6.78
Noise : 99
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly underexposed, which makes the colors appear a bit muted. The mountains are slightly blurry in the distance, which could be improved with a sharper focus.
Amidst the Ashes, a Soldier Stands
A powerful image captures the intensity of war, with a soldier in full combat gear standing resolute against a backdrop of smoke and fire. The scene evokes a sense of drama, intensity, and somber reflection, highlighting the strength and determination of those who face conflict.
Prompt
poses standing-tall: Brave, defiant, resolute ; Soldier standing on a battlefield; medium shot; Heroism; Smoke and debris from a recent explosion; cinematic
Characteristic
Shot : A soldier in full combat gear stands in a war-torn landscape with smoke and fire in the background.
Aesthetic Score : 0.6
Mood : dramatic, intense, somber
Quality
Entropy : 6.89
Noise : 106
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.20
Image errors : Slight noise in the background, but it’s barely noticeable.
Victory Dance! Friends Celebrate Triumph with Joyful Fist Pumps
Four friends erupt in celebration, their smiles and raised fists radiating pure joy after a hard-fought victory. The image captures the raw emotion and camaraderie of shared success, making it a truly heartwarming moment.
Prompt
poses standing-tall: Joyful, triumphant, celebratory ; Group of friends celebrating a victory in a video game; close-up; Gaming; Neon lights and glowing screens of a gaming setup; cinematic
Characteristic
Shot : Four friends are celebrating a victory in front of a computer screen. They are all smiling and have their fists raised in the air.
Aesthetic Score : 0.6
Mood : joyful, celebratory, energetic
Quality
Entropy : 6.72
Noise : 76
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major image errors detected. Some noise may be present but it’s not disruptive.
Solitude and Serenity: A Moment of Tranquility on the Hilltop
A lone figure stands silhouetted against the setting sun, gazing out at a breathtaking panorama of rolling hills, forests, and a distant coastline. The warm glow of the evening light bathes the scene in a peaceful tranquility, creating a sense of awe and contemplation. This image captures the beauty of solitude and the vastness of nature, leaving a lasting impression of serenity.
Prompt
poses standing-tall: Awe-struck, contemplative, peaceful ; Tourist standing on a cliff overlooking a breathtaking view; long shot; Tourism; Scenic landscape with rolling hills and a sparkling ocean; cinematic
Characteristic
Shot : A lone figure stands on a hilltop, looking out at a vast landscape of rolling hills, forest, and a distant coastline. The setting sun casts a warm glow over the scene.
Aesthetic Score : 0.7
Mood : tranquil, contemplative, serene
Quality
Entropy : 6.89
Noise : 100
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : None
Silhouettes of Love Against a Fiery Sunset
A couple embraces on a ship’s deck, their silhouettes framed against a breathtaking sunset over the ocean. The scene evokes a sense of romance, serenity, and tranquility, with a distant island adding to the picturesque backdrop.
Prompt
poses standing-tall: Romantic, adventurous, hopeful ; Couple standing on a ship’s deck; medium shot; Travel; Sunset over the ocean with a silhouette of a distant island; cinematic
Characteristic
Shot : A couple is standing on a ship’s deck, embracing and gazing at each other, with a beautiful sunset over the ocean and a distant island in the background.
Aesthetic Score : 0.7
Mood : romantic, serene, tranquil
Quality
Entropy : 6.93
Noise : 90
Prompt Clip Score : 0.36
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant artifacts or errors
Energetic Dance Performance Under Dramatic Lighting
A group of young men captivate the audience with their synchronized dance moves, illuminated by dramatic lighting that emphasizes their energy and focus. The dark setting and similar attire create a cohesive and impactful visual.
Prompt
poses standing-tall: Energetic, passionate, expressive ; Group of dancers performing on a stage; wide shot; Groups; Bright stage lights and a cheering audience; cinematic
Characteristic
Shot : A group of young men are dancing on a stage in a dark setting. The lighting is dramatic and focuses on the dancers. The dancers are all wearing similar clothing. The overall feel of the image is energetic and dramatic.
Aesthetic Score : 0.7
Mood : dramatic, energetic, focused
Quality
Entropy : 6.29
Noise : 106
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is well-composed and has no visible errors.
Awe-Inspiring Solitude: Astronauts on the Moon
A breathtaking image captures the isolation and wonder of space exploration. Three astronauts stand on the lunar surface, gazing out at the Earth hanging in the vast expanse of the cosmos. The scene evokes a sense of futuristic adventure and the profound beauty of our universe.
Prompt
poses standing-tall: Awe-inspiring, futuristic, surreal ; Astronaut standing on the surface of the moon; long shot; Adventure; Cratered lunar landscape with Earth in the distance; cinematic
Characteristic
Shot : Three astronauts standing on the surface of the moon, with the earth in the background
Aesthetic Score : 0.7
Mood : space, futuristic, awe
Quality
Entropy : 6.80
Noise : 93
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image appears to be AI-generated, with some noticeable imperfections in the astronaut’s features and the lunar surface.
Firefighters Brave Blaze in Dramatic Rescue
Two firefighters stand silhouetted against a raging inferno, their bravery evident as they face the intense flames. The scene is both dramatic and solemn, highlighting the danger and heroism of their work.
Prompt
poses standing-tall: Brave, determined, selfless ; Firefighter standing in front of a burning building; medium shot; Heroism; Flames and smoke billowing from the building; cinematic
Characteristic
Shot : Two firefighters stand in front of a building engulfed in flames. The fire is intense, and the firefighters are silhouetted against the blaze. The fire appears to be well-established.
Aesthetic Score : 0.5
Mood : intense, dangerous, solemn
Quality
Entropy : 6.73
Noise : 94
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image has a slight amount of noise, particularly in the shadows. The fire is well-exposed, but some areas of the flames appear overexposed and lack detail. The fire has a slight artificial look.
Champions Crowned: Esports Duo Celebrate Victory on Stage
Two young esports athletes bask in the glow of victory, holding aloft their trophy on a sleek, modern stage. The crowd roars in the background, their cheers a testament to the intensity of the competition and the triumphant moment captured in this image.
Prompt
poses standing-tall: Triumphant, proud, accomplished ; Gamer holding a trophy after winning a tournament; close-up; Gaming; Crowd cheering and flashing cameras; cinematic
Characteristic
Shot : Two young men in esports jerseys are holding a trophy on a stage in front of a crowd. The stage has a dark, modern aesthetic, with sleek lines and a wooden surface. The crowd is blurred in the background, creating a sense of depth and distance.
Aesthetic Score : 0.7
Mood : triumphant, celebratory, competitive
Quality
Entropy : 5.80
Noise : 80
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.00
Image errors : No visible errors.
Family Reaches Mountaintop, Smiles Radiate Joy and Adventure
A family of four, beaming with happiness, stands triumphantly on a mountain peak, their winter gear a testament to their adventurous spirit. The majestic snowy mountain range behind them serves as a breathtaking backdrop, amplifying the sense of accomplishment and exhilaration in this joyful scene.
Prompt
poses standing-tall: Joyful, united, adventurous ; Family standing on a mountain peak; wide shot; Travel; Panoramic view of snow-capped mountains and a clear blue sky; cinematic
Characteristic
Shot : A family of four, two adults and two children, stand on a mountaintop with a snowy mountain range in the background. They are all wearing winter clothing and smiling.
Aesthetic Score : 0.6
Mood : joyful, adventurous, happy
Quality
Entropy : 6.67
Noise : 85
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable image errors.
Conclusion
The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis. Here’s a breakdown:
Camera Position:
- Score: 0.35
- Interpretation: This score falls below the “good” range of 0.5 to 0.75. It suggests that the model didn’t quite capture the intended camera positions as described in the prompt.
Shot Analysis:
- Score: 0.53
- Interpretation: This score falls within the “good” range of 0.5 to 0.75. It indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it to a decent degree.
Aesthetic Analysis:
- Score: 0.115
- Interpretation: This score is significantly above the “very good” range of -0.2 to 0.1. It suggests that the generated image’s aesthetic deviated significantly from the expected aesthetic described in the prompt.
Overall:
The model demonstrates a good understanding of shot composition and scene description, but struggles to match the desired aesthetic. This suggests that the model might need further training to better understand and translate aesthetic preferences into visual outputs.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://deepmind.google/technologies/imagen-3/