AI's Artistic Struggle: Capturing the Essence of Poses with Scenario
In the realm of artificial intelligence, the ability to generate realistic and aesthetically pleasing images is a coveted goal. While significant progress has been made, capturing the nuances of human poses remains a challenge. This blog post examines the results of a generative AI model tasked with creating images based on specific poses and scenes, highlighting the model’s strengths and weaknesses in capturing the desired aesthetic.
Created with: Scenario
Golden Hour Serenity: A Lone Figure Contemplates the Vastness
A solitary figure stands on a rocky cliff, bathed in the warm glow of the setting sun. The winding river below and the majestic mountain valley create a breathtaking scene of tranquility and epic beauty. The play of light and shadow adds a touch of mystery and grandeur, leaving you to ponder the thoughts of the lone figure.
Prompt
poses leaning-back: epic, contemplative ; A lone adventurer, silhouetted against a setting sun; wide shot; adventure; vast, rugged mountain range; cinematic
Characteristic
Shot : A lone figure stands on a rocky mountain peak, gazing out at a vast valley with a winding river snaking through it. The sun sets in the distance, casting a warm golden glow over the landscape.
Aesthetic Score : 0.8
Mood : serene, contemplative, epic
Quality
Entropy : 6.66
Noise : 93
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has a slight unnatural appearance. The textures and lighting are a bit too perfect, making it look slightly artificial.
Heroic Silhouette: A Superhero Stands Guard Over New York City
A powerful image captures a superhero woman in a red cape, standing tall on a rooftop overlooking the cityscape of New York City at sunset. The scene evokes a sense of heroism and hope, with the superhero’s silhouette standing as a symbol of strength against the backdrop of the city.
Prompt
poses leaning-back: triumphant, powerful ; A superhero, cape billowing in the wind, looking down at a city skyline; medium shot; heroism; bustling cityscape; cinematic
Characteristic
Shot : A female superhero stands on a rooftop overlooking a city skyline at sunset. Her cape billows in the wind, and she looks determined and ready to take on any challenge.
Aesthetic Score : 0.7
Mood : heroic, confident, hopeful
Quality
Entropy : 6.76
Noise : 97
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has a slight blurriness around the edges, and the city skyline looks a bit artificial. The superhero’s pose is also a bit stiff.
Sunset Laughter: Friends Embrace the Golden Hour
Four young women bask in the warm glow of a setting sun, their laughter echoing across the sandy beach. Their carefree joy and vibrant summer attire capture the essence of friendship and happy memories. The dynamic composition invites you to share in their moment of pure bliss.
Prompt
poses leaning-back: joyful, carefree ; A group of friends, laughing and relaxing on a beach, watching the sunset; wide shot; tourism; tropical beach with palm trees; cinematic
Characteristic
Shot : Four young women are laughing together on a beach at sunset. They are wearing summer clothes and hats.
Aesthetic Score : 0.7
Mood : happy, carefree, summery
Quality
Entropy : 6.64
Noise : 84
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors
Focused and Ready: Gamer in the Zone
A young woman, radiating focus and determination, sits in a gaming chair, headphones on, controller in hand. The dramatic lighting highlights her intensity and the excitement of the game on her computer screen.
Prompt
poses leaning-back: intense, focused ; A gamer, eyes glued to a screen, leaning back in a gaming chair, surrounded by controllers and snacks; medium shot; gaming; dimly lit room with neon lights; cinematic
Characteristic
Shot : A young woman is sitting in a gaming chair, wearing headphones and holding a controller. The background features two monitors displaying video games. The room is lit with neon lights.
Aesthetic Score : 0.6
Mood : cool, futuristic, edgy
Quality
Entropy : 6.80
Noise : 89
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors
Lost in Thought: A Moment of Tranquility on the Train
A young woman gazes out the window of a moving train, her expression contemplative and serene. The passing countryside scenery creates a dreamy atmosphere, highlighting the melancholic and introspective mood of the image.
Prompt
poses leaning-back: reflective, nostalgic ; A traveler, gazing out of a train window, watching the scenery pass by; medium shot; travel; rolling hills and fields; cinematic
Characteristic
Shot : A young woman is sitting by a window in a train. She is looking out the window at a field of grass and trees. She is wearing a knitted sweater and a straw hat.
Aesthetic Score : 0.7
Mood : pensive, calm, introspective
Quality
Entropy : 6.77
Noise : 86
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some slight artifacts around the edges of the window, but they’re very minor. The lighting is good and there are no other visible errors.
Singer Electrifies Crowd with Energetic Performance
A female singer ignites the stage with a vibrant performance, captivating the audience with her dynamic moves and infectious energy. The spotlight illuminates her white crop top and denim shorts, while the crowd’s cheers and outstretched hands create a powerful sense of excitement.
Prompt
poses leaning-back: energetic, passionate ; A group of musicians, performing on stage, bathed in spotlights; wide shot; groups; concert stage with cheering audience; cinematic
Characteristic
Shot : A woman in a white crop top and denim shorts is performing in front of a large crowd of cheering people. The woman is singing and holding her microphone.
Aesthetic Score : 0.6
Mood : energetic, exuberant
Quality
Entropy : 6.65
Noise : 109
Prompt Clip Score : 0.17
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are some artifacts and errors in the image, particularly in the faces of the crowd members. Some people have blurry faces and some are strangely colored. The details of the background are blurry.
Finding Peace in the Vastness: A Woman Contemplates the Ocean’s Majesty
A solitary figure finds solace on a windswept cliff, gazing out at the crashing waves and the boundless expanse of the ocean. The soft light and cloudy sky create a serene atmosphere, inviting contemplation and a sense of awe at the power of nature.
Prompt
poses leaning-back: solitary, contemplative ; A lone figure, sitting on a cliff edge, looking out at a vast ocean; medium shot; adventure; dramatic coastline with crashing waves; cinematic
Characteristic
Shot : A woman sits on a cliff overlooking the ocean, with a large wave breaking in the distance. The sky is overcast, and the overall feeling is calm and serene.
Aesthetic Score : 0.7
Mood : serene, contemplative, tranquil
Quality
Entropy : 6.68
Noise : 97
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slight graininess, but overall is good quality.
A Moment of Awe: Astronaut Gazes at Earth from the Vastness of Space
This breathtaking image captures the profound isolation and wonder of space exploration. An astronaut, dwarfed by the immensity of the cosmos, floats amidst the stars, their gaze fixed on the blue marble of Earth. The scene evokes a sense of awe and contemplation, reminding us of the fragility of our planet and the boundless possibilities of human ambition.
Prompt
poses leaning-back: awe-inspiring, majestic ; A group of astronauts, floating weightlessly in space, looking out at Earth; wide shot; heroism; Earth from space with stars in the background; cinematic
Characteristic
Shot : A female astronaut floats in space with Earth in the background. She is looking towards the left side of the frame with a determined expression.
Aesthetic Score : 0.7
Mood : awe, wonder, hope
Quality
Entropy : 6.88
Noise : 99
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.70
Image errors : The astronaut’s helmet reflection is slightly distorted and appears unnaturally sharp, suggesting the image may be digitally manipulated.
Campfire Tranquility: A Moment of Peace in the Forest
A group of friends gather around a crackling campfire, bathed in the warm glow of the flames. The serene forest setting and the soft light create a cozy and nostalgic atmosphere, evoking a sense of peace and tranquility. The image captures the essence of a perfect evening spent in nature.
Prompt
poses leaning-back: warm, intimate ; A family, gathered around a campfire, sharing stories and laughter; medium shot; groups; forest clearing with a crackling fire; cinematic
Characteristic
Shot : A group of four people are sitting around a campfire in a forest. There is a tent in the background. The scene is lit by the fire and the setting sun.
Aesthetic Score : 0.7
Mood : cozy, warm, inviting
Quality
Entropy : 6.53
Noise : 99
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.90
Image errors : No visible image artifacts or errors.
Soaring High: A Pilot’s Serene Moment
A female pilot, radiating calm and confidence, navigates her helicopter over breathtaking mountain scenery. This image captures the empowering spirit of adventure and the tranquility found in the face of potential danger.
Prompt
poses leaning-back: exhilarating, adventurous ; A pilot, looking out of the cockpit window, flying over a breathtaking landscape; medium shot; travel; mountains and valleys covered in clouds; cinematic
Characteristic
Shot : A female pilot is flying a helicopter over a mountain range. The helicopter’s cockpit is visible in the foreground, and the pilot is wearing a white shirt and a pilot’s cap. The mountains are covered in snow, and the sky is clear and blue.
Aesthetic Score : 0.7
Mood : serene, focused, adventurous
Quality
Entropy : 6.81
Noise : 79
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.60
Image errors : The image appears to be slightly overexposed and the lighting looks a bit artificial. The pilot’s sunglasses cast an unnatural shadow on her face, and there is a slight blurring effect around the edges.
Conclusion
The results show that the generative AI model did reasonably well on camera position and shot analysis but struggled with aesthetic analysis. Here’s a breakdown:
- Camera Position: The model scored 0.45, slightly below the “good” range of 0.5 to 0.75. This suggests that the model’s ability to interpret and reproduce the camera positions specified in the prompts is decent but could be improved.
- Shot Analysis: The model scored 0.51, which falls within the “good” range. This indicates that the model is generally able to understand the scene described in the prompt and create images that reflect the intended shot type.
- Aesthetic Analysis: The model scored 0.06, against a “very good” range given as -0.2 to 0.1. The evaluation reads this as the generated images’ aesthetic deviating from the aesthetic expected from the prompts.
Overall, the model shows promise in understanding camera positions and shot types, but needs improvement in generating images that match the desired aesthetic.
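For readers who want to compare the images side by side, the per-image figures listed in this post can be tabulated and averaged. The snippet below is a minimal sketch, not the method used to produce the scores in the conclusion: the numbers are copied from the listings above, while the shortened labels, the field names, and the helpers `summarize` and `lowest_clip` are illustrative names chosen here.

```python
from statistics import mean

# Values copied from the per-image listings in this post.
# Columns: label (shortened here), aesthetic score, entropy, noise,
# prompt CLIP score, likelihood of AI.
IMAGES = [
    ("Golden Hour Serenity",     0.8, 6.66,  93, 0.27, 0.90),
    ("Heroic Silhouette",        0.7, 6.76,  97, 0.28, 0.80),
    ("Sunset Laughter",          0.7, 6.64,  84, 0.29, 0.10),
    ("Focused and Ready",        0.6, 6.80,  89, 0.32, 0.10),
    ("Lost in Thought",          0.7, 6.77,  86, 0.27, 0.20),
    ("Singer Electrifies Crowd", 0.6, 6.65, 109, 0.17, 0.80),
    ("Finding Peace",            0.7, 6.68,  97, 0.23, 0.20),
    ("A Moment of Awe",          0.7, 6.88,  99, 0.26, 0.70),
    ("Campfire Tranquility",     0.7, 6.53,  99, 0.26, 0.90),
    ("Soaring High",             0.7, 6.81,  79, 0.26, 0.60),
]

# Hypothetical field names for the five numeric columns above.
FIELDS = ("aesthetic", "entropy", "noise", "clip", "ai_likelihood")


def summarize(images):
    """Return the mean of each numeric metric across all images."""
    columns = list(zip(*images))[1:]  # transpose rows, drop the label column
    return {name: mean(col) for name, col in zip(FIELDS, columns)}


def lowest_clip(images):
    """Return the label of the image with the lowest prompt CLIP score."""
    return min(images, key=lambda row: row[4])[0]


if __name__ == "__main__":
    for name, value in summarize(IMAGES).items():
        print(f"{name}: {value:.3f}")
    print("lowest prompt CLIP score:", lowest_clip(IMAGES))
```

Across the ten images this gives a mean aesthetic score of 0.69 and a mean prompt CLIP score of about 0.26, with the concert image scoring lowest on prompt CLIP. These averages are a separate summary of the listings; the camera position, shot, and aesthetic analysis scores in the conclusion cannot be derived from them.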