AI Struggles with Poses: A Look at the Aesthetic Gap with Scenario
- 9 minutes read - 1744 wordsTable of Contents
In the realm of artificial intelligence, the ability to generate realistic and compelling images is a rapidly evolving field. One key aspect of image generation is the ability to capture the essence of a pose, conveying emotion, intention, and the overall narrative of a scene. This blog post explores the results of an experiment where an AI model was tasked with generating images based on specific poses and scenes. While the model demonstrated proficiency in understanding the technical aspects of image composition, it struggled to capture the aesthetic nuances that make a pose truly impactful. This highlights the ongoing challenge of bridging the gap between technical accuracy and artistic expression in AI-generated imagery.
Created with: scenario
A Moment of Awe on the Mountaintop
A lone woman stands on a rocky peak, dwarfed by the majestic snow-capped mountains in the distance. The setting sun casts a warm glow, creating an epic and contemplative scene. This image captures the beauty and wonder of nature, and the human spirit’s desire to explore and connect with the vastness of the world.
Prompt
poses standing-tall: Determined, hopeful, awe-inspiring ; Lone adventurer; wide shot; Adventure; Majestic mountain range with a vast, clear sky; cinematic
Characteristic
Shot : A lone woman stands on a mountain peak, looking out at a snow-capped mountain range in the distance. The sky is a clear blue, and the sun is shining brightly.
Aesthetic Score : 0.7
Mood : epic, adventurous, contemplative
Quality
Entropy : 6.61
Noise : 88
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.60
Image errors : The mountains in the background look a little blurry, as if they were generated by AI.
A Solitary Figure in the Wake of War
A woman in military uniform stands amidst the ruins of a war-torn landscape, her back to the camera, gazing towards a distant plume of smoke and fire. The image captures a moment of quiet contemplation amidst the chaos, highlighting the stark contrast between the woman’s composure and the devastation surrounding her.
Prompt
poses standing-tall: Brave, defiant, resolute ; Soldier standing on a battlefield; medium shot; Heroism; Smoke and debris from a recent explosion; cinematic
Characteristic
Shot : A female soldier in military uniform stands in front of an explosion, looking towards the flames and smoke. The scene is set in a destroyed landscape.
Aesthetic Score : 0.7
Mood : dramatic, intense, suspenseful
Quality
Entropy : 6.76
Noise : 76
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable image errors.
Neon Nights: Three Friends Dance the Night Away
Capture the vibrant energy of youth as three friends dance under dazzling neon lights. The central figure’s radiant smile and the dynamic use of light and color create a joyful and energetic atmosphere, making this a truly captivating moment.
Prompt
poses standing-tall: Joyful, triumphant, celebratory ; Group of friends celebrating a victory in a video game; close-up; Gaming; Neon lights and glowing screens of a gaming setup; cinematic
Characteristic
Shot : Three young women are dancing in a club or party, lit by neon lights.
Aesthetic Score : 0.7
Mood : joyful, energetic, celebratory
Quality
Entropy : 6.81
Noise : 85
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors in the image.
Solitude and Sunset: A Moment of Tranquility on the Cliffside
A woman, bathed in the golden light of the setting sun, stands alone on a cliff overlooking a vast ocean. The scene evokes a sense of serenity and contemplation, with the woman’s solitary figure emphasizing the vastness of the natural world.
Prompt
poses standing-tall: Awe-struck, contemplative, peaceful ; Tourist standing on a cliff overlooking a breathtaking view; long shot; Tourism; Scenic landscape with rolling hills and a sparkling ocean; cinematic
Characteristic
Shot : A woman in a white outfit is standing on a cliff overlooking a vast blue ocean. The sky is a warm sunset color, with a few clouds. The woman is facing away from the viewer, gazing out at the horizon.
Aesthetic Score : 0.7
Mood : serene, contemplative, tranquil
Quality
Entropy : 6.58
Noise : 86
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible errors or artifacts
Love on the Horizon: A Silhouette of Romance Against the Sunset
Experience the serene beauty of a romantic sunset at sea, as a couple shares a tender moment on the deck of a ship. The vast ocean and the setting sun create a hopeful and grand backdrop for their love story.
Prompt
poses standing-tall: Romantic, adventurous, hopeful ; Couple standing on a ship’s deck; medium shot; Travel; Sunset over the ocean with a silhouette of a distant island; cinematic
Characteristic
Shot : A couple embracing on a ship’s deck as the sun sets over the ocean, with a dramatic sky in the background
Aesthetic Score : 0.7
Mood : romantic, serene, dreamy
Quality
Entropy : 6.76
Noise : 88
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.20
Image errors : no visible artifacts
Sparkling Confidence: A Stage Filled with Glamour
A captivating image of a woman in a dazzling costume, radiating confidence as she takes center stage. The dramatic lighting and her outstretched arms create a powerful visual, capturing the essence of entertainment and glamour.
Prompt
poses standing-tall: Energetic, passionate, expressive ; Group of dancers performing on a stage; wide shot; Groups; Bright stage lights and a cheering audience; cinematic
Characteristic
Shot : A woman in a sparkly gold costume is dancing on a stage in a theater, surrounded by other dancers.
Aesthetic Score : 0.7
Mood : confident, glamorous, energetic
Quality
Entropy : 6.69
Noise : 100
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry, especially in the background.
A Lone Figure in the Cosmic Vastness
An astronaut stands on a desolate, rocky landscape, dwarfed by the immensity of space. Two moons hang in the sky, casting a dramatic light on the scene. This evocative image captures the solitude, wonder, and exploration that define humanity’s journey beyond Earth.
Prompt
poses standing-tall: Awe-inspiring, futuristic, surreal ; Astronaut standing on the surface of the moon; long shot; Adventure; Cratered lunar landscape with Earth in the distance; cinematic
Characteristic
Shot : An astronaut standing on a desolate, barren, alien planet, possibly Mars, with two moons in the background. There is a sense of isolation and loneliness in the scene.
Aesthetic Score : 0.6
Mood : solitary, epic, otherworldly
Quality
Entropy : 6.70
Noise : 101
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.90
Image errors : The astronaut’s suit has a slightly unnatural texture and the lighting on the scene seems flat and artificial.
Firefighter Silhouetted Against Blazing Inferno
A dramatic image captures a firefighter standing bravely in front of a burning building, silhouetted against the intense orange flames. The scene evokes a sense of danger, urgency, and heroism.
Prompt
poses standing-tall: Brave, determined, selfless ; Firefighter standing in front of a burning building; medium shot; Heroism; Flames and smoke billowing from the building; cinematic
Characteristic
Shot : A firefighter stands in front of a burning building. The flames are intense and the smoke is thick. The firefighter is wearing a protective suit and helmet.
Aesthetic Score : 0.7
Mood : dramatic, somber, intense
Quality
Entropy : 6.86
Noise : 91
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant image errors.
Golden Smile: Woman Celebrates Triumphant Victory
A young woman beams with joy, holding a golden trophy aloft, surrounded by cheering crowds. The scene captures the essence of victory and accomplishment, radiating a celebratory and triumphant mood.
Prompt
poses standing-tall: Triumphant, proud, accomplished ; Gamer holding a trophy after winning a tournament; close-up; Gaming; Crowd cheering and flashing cameras; cinematic
Characteristic
Shot : A young woman is holding a golden trophy in a crowded room with people in the background. The woman is smiling and looking at the camera. There are lights in the background.
Aesthetic Score : 0.7
Mood : happy, celebratory, triumphant
Quality
Entropy : 6.76
Noise : 87
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly overexposed. There are some artifacts in the background, but they are not distracting.
Awe-Inspiring Solitude: Two Figures Conquer a Mountain Peak
A serene and peaceful scene unfolds as an adult and child stand atop a majestic mountain peak, dwarfed by the vast expanse of snow-covered peaks. The bright blue sky and scattered clouds enhance the sense of adventure and solitude, creating a dramatic contrast that evokes awe and wonder.
Prompt
poses standing-tall: Joyful, united, adventurous ; Family standing on a mountain peak; wide shot; Travel; Panoramic view of snow-capped mountains and a clear blue sky; cinematic
Characteristic
Shot : Two people standing on a mountain peak, looking out at a snowy mountain range.
Aesthetic Score : 0.8
Mood : serene, adventurous, inspiring
Quality
Entropy : 6.45
Noise : 87
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no noticeable artifacts or errors in the image.
Conclusion
The results show that the generative AI model performed well in understanding the camera position and shot composition, but struggled with the aesthetic aspect of the image. Here’s a breakdown:
- Camera Position: The model scored 0.4, which is considered average. This means the generated image’s camera position was somewhat similar to what was requested in the prompt, but not exceptionally close.
- Shot Analysis: The model scored 0.44, also considered average. This indicates the generated image’s shot composition was somewhat similar to what was requested in the prompt, but not exceptionally close.
- Aesthetic Analysis: The model scored 0.04, which is considered poor. This means the generated image’s aesthetic was significantly different from what was expected based on the prompt.
Overall, the model seems to be better at understanding the technical aspects of the image (camera position and shot composition) than the artistic aspects (aesthetic).
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://www.scenario.com