AI Captures Poses, But Struggles with Style with Scenario
- 9 minutes read - 1829 wordsTable of Contents
Dramatic poses are a powerful tool in visual storytelling, conveying emotions and narratives through body language. From the lone adventurer perched on a cliff edge to the victorious warrior standing tall on a battlefield, these poses evoke specific feelings and create a sense of drama. This blog post explores the capabilities of an AI model in generating images based on such dramatic poses, analyzing its performance and highlighting areas for improvement.
Created with: scenario
A Moment of Serenity Amidst the Vastness
A young woman finds peace on a rocky ledge, overlooking a sprawling valley with a winding river. The scene evokes a sense of serenity, contemplation, and adventure, highlighting the grandeur of nature and the smallness of humanity in its presence.
Prompt
poses crossed-legs: determined, contemplative ; A lone adventurer, sitting on a cliff edge; wide shot; Adventure; a vast, breathtaking mountain range; cinematic
Characteristic
Shot : A young woman sits on a rocky outcropping overlooking a valley, with a winding river snaking through the green forested hills. She is wearing a beige shirt and brown boots, and has her hair pulled back in a low ponytail.
Aesthetic Score : 0.7
Mood : serene, contemplative, adventurous
Quality
Entropy : 6.69
Noise : 92
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable artifacts or errors
Warrior Amidst the Ashes
A lone female warrior, clad in silver armor, stands defiant on a rocky outcrop. The burning city behind her paints a stark backdrop of destruction, highlighting her strength and resilience in the face of devastation. This dramatic scene evokes a sense of power and epic scale.
Prompt
poses crossed-legs: triumphant, confident ; A victorious warrior, standing tall on a battlefield; medium shot; Heroism; fallen enemies and a burning city in the background; cinematic
Characteristic
Shot : A woman in silver armor stands on a rocky hill, a city in ruins behind her. A large fire burns in the distance, and a plume of smoke rises from a tall chimney. The woman is looking towards the camera with a serious expression, and she is holding a sword in her right hand.
Aesthetic Score : 0.7
Mood : dramatic, epic, powerful
Quality
Entropy : 6.75
Noise : 84
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some minor artifacts, such as blur around the edges of the subject.
Level Up Your Focus: Gamer Girl Ready for Action
A young woman exudes confidence and focus, ready to conquer the virtual world. The cool lighting, her determined pose, and the vibrant colors of the game on her screen create a dynamic and captivating scene.
Prompt
poses crossed-legs: intense, focused ; A gamer, intensely focused on a screen; close-up; Gaming; a dimly lit room with glowing monitors and gaming peripherals; cinematic
Characteristic
Shot : A woman is sitting in a gaming chair in front of a computer. She is wearing a white hoodie and headphones. There are neon lights in the background.
Aesthetic Score : 0.6
Mood : cool, techy, gamer
Quality
Entropy : 6.82
Noise : 86
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some minor artifacts, such as the slight blurriness around the edges of the woman’s hair.
Finding Peace Amidst the City’s Hustle
A solitary figure finds serenity on a rooftop overlooking the iconic New York City skyline, bathed in the warm glow of a sunset. The juxtaposition of the woman’s small form against the vast cityscape creates a powerful sense of perspective and tranquility.
Prompt
poses crossed-legs: excited, awe-struck ; A group of tourists, admiring a breathtaking view; medium shot; Tourism; a panoramic vista of a bustling city skyline; cinematic
Characteristic
Shot : A woman is sitting on the edge of a building in a city, looking out at the skyline with the Empire State Building in the distance. The sunset is in the background.
Aesthetic Score : 0.7
Mood : dreamy, hopeful, adventurous
Quality
Entropy : 6.57
Noise : 87
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.30
Image errors : No visible errors in the image.
Lost in Thought: A Moment of Nostalgia on a Retro Train
A young woman, bathed in warm light, gazes out the window of a vintage train, her wistful expression hinting at a contemplative mood. The passing landscape and the train’s retro charm evoke a sense of nostalgia and quiet longing.
Prompt
poses crossed-legs: reflective, nostalgic ; A traveler, gazing out of a train window; close-up; Travel; a blur of passing landscapes and towns; cinematic
Characteristic
Shot : A woman sitting in a train, looking out the window at a scenic view of a grassy field and trees. The train is likely moving as the view is blurry
Aesthetic Score : 0.8
Mood : tranquil, contemplative, wistful
Quality
Entropy : 6.76
Noise : 87
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to be AI-generated with some slight artifacts in the hair and some edges of the clothing
Campfire Dreams Under a Starry Sky
A group of friends gather around a crackling campfire, bathed in the warm glow of the flames and the soft light of a million stars. The scene evokes a sense of cozy nostalgia and whimsical adventure, perfect for a night under the open sky.
Prompt
poses crossed-legs: joyful, relaxed ; A group of friends, laughing and sharing stories around a campfire; medium shot; Groups; a serene forest setting with twinkling stars above; cinematic
Characteristic
Shot : A group of friends are gathered around a campfire in a forest at night. There is a teepee in the background. The scene is lit by the fire and the stars in the sky.
Aesthetic Score : 0.7
Mood : cozy, friendly, peaceful
Quality
Entropy : 6.39
Noise : 101
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : No visible errors
Lost in the Vastness: A Dreamy Glimpse of Space
A lone astronaut, clad in a pristine white suit, gazes out of a round spaceship window at the breathtaking expanse of outer space. The image evokes a sense of isolation and wonder, capturing a moment of quiet contemplation and hopeful anticipation.
Prompt
poses crossed-legs: awe-inspired, contemplative ; A lone astronaut, gazing at Earth from a spaceship window; close-up; Heroism; a vast, blue planet against the backdrop of space; cinematic
Characteristic
Shot : A woman in an astronaut suit is sitting by a large window looking out at space. She is gazing out at the Earth and the other planets. The scene is peaceful and serene.
Aesthetic Score : 0.7
Mood : peaceful, contemplative, hopeful
Quality
Entropy : 6.79
Noise : 92
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image has some artifacts in the background, particularly around the planets. The astronaut’s hair appears slightly unnatural, with some texture issues.
Campfire Tales in the Cave’s Embrace
A group of five huddle around a flickering campfire, their faces illuminated by the warm glow. The cave walls cast long shadows, adding an air of mystery to this intimate gathering. A sense of adventure and shared stories hangs in the air.
Prompt
poses crossed-legs: suspenseful, cautious ; A group of explorers, huddled together in a dark cave; medium shot; Adventure; flickering torches illuminating the rough stone walls; cinematic
Characteristic
Shot : Five people are sitting around a campfire in a cave. There is a starry sky visible in the background.
Aesthetic Score : 0.75
Mood : cozy, mysterious, adventurous
Quality
Entropy : 6.50
Noise : 112
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image appears to have some minor artifacting around the edges of the characters, especially the woman on the right. The shadows are slightly too dark in some areas.
Confetti Dreams: A Moment of Joy Captured
A woman basks in the celebratory spirit, surrounded by a cascade of confetti. Her joyful expression and the whimsical atmosphere create a captivating scene that evokes feelings of happiness and celebration.
Prompt
poses crossed-legs: exuberant, joyful ; A gamer, celebrating a victory with a triumphant fist pump; close-up; Gaming; a brightly lit room with a celebratory confetti explosion; cinematic
Characteristic
Shot : A young woman is sitting on the floor of a living room, surrounded by confetti, with a bright smile on her face. She appears to be enjoying herself.
Aesthetic Score : 0.7
Mood : joyful, celebratory, playful
Quality
Entropy : 6.59
Noise : 81
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors
Vibrant Market Life: A Feast for the Senses
Capture the energy and warmth of a bustling outdoor market, where people gather to enjoy delicious food and lively conversation. The scene is bathed in warm colors and inviting light, creating a sense of community and celebration.
Prompt
poses crossed-legs: lively, adventurous ; A group of travelers, sharing a meal at a bustling street market; medium shot; Travel; vibrant colors and aromas of exotic food stalls; cinematic
Characteristic
Shot : Four people are sitting at a street food market, enjoying a meal and talking amongst themselves. The scene is bustling with activity with people behind them and food stalls on all sides. The light is warm and inviting, creating a cozy atmosphere.
Aesthetic Score : 0.7
Mood : joyful, relaxed, friendly
Quality
Entropy : 6.71
Noise : 111
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image has a slight blurry effect, especially in the background. The colors are a bit saturated and the details are somewhat oversharpened.
Conclusion
The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis.
Here’s a breakdown:
- Camera Position: The model scored 0.5, which falls within the “good” range (0.5 to 0.75). This means the model was able to accurately capture the camera position described in the prompt.
- Shot Analysis: The model scored 0.54, also within the “good” range. This indicates the model understood the scene described in the prompt and created an image that reflects that understanding.
- Aesthetic Analysis: The model scored 0.03, which is significantly lower than the “very good” range (-0.2 to 0.1). This suggests that the generated image didn’t quite match the expected aesthetic style described in the prompt.
Overall, the model demonstrates a good understanding of camera position and scene composition, but needs improvement in capturing the desired aesthetic.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://www.scenario.com