AI Captures the Essence of Poses, But Struggles with Aesthetics with Scenario
- 9 minutes read - 1834 wordsTable of Contents
The ability to generate images based on text prompts is a rapidly evolving field in AI. This blog post delves into the performance of a generative AI model in capturing poses and scenes, highlighting its strengths and weaknesses. Dramatic poses, often used in photography and film to convey emotion and action, are a challenging test for AI models. For example, a lone adventurer silhouetted against a setting sun requires the model to understand the concept of silhouette and the emotional weight of the scene. This analysis explores how well the model captures these elements.
Created with: scenario
Silhouetted Against the Setting Sun: A Moment of Solitude in the Mountains
A lone figure stands on a cliff, their silhouette stark against the fiery sunset. The vast mountainous valley below stretches out, creating a scene of epic beauty and contemplative solitude. The dramatic contrast between the figure and the sky emphasizes the vastness of the landscape and the figure’s isolation, leaving a sense of serenity and awe.
Prompt
poses over-the-shoulder: epic, hopeful ; A lone adventurer, silhouetted against a setting sun; wide shot; Adventure; a vast, rugged mountain range; cinematic
Characteristic
Shot : A lone figure stands on a rocky outcrop, gazing out over a vast mountain range at sunset. The sun is setting behind a snow-capped peak in the distance, casting a warm, golden glow over the landscape.
Aesthetic Score : 0.8
Mood : epic, serene, majestic
Quality
Entropy : 6.64
Noise : 88
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.90
Image errors : There are some minor artifacts in the image, particularly in the sky and on the figure’s clothing. There is slight blurriness in the image, especially in the mountain range.
Firefighter Faces the Blaze: A Moment of Courage and Intensity
A female firefighter, clad in full gear, stands resolute before a burning building, her gaze fixed on the raging flames. The scene captures the dramatic tension and solemn mood of the situation, highlighting the firefighter’s bravery and the urgency of the fire.
Prompt
poses over-the-shoulder: intense, dramatic ; A firefighter, helmet gleaming, facing a raging inferno; medium shot; Heroism; a burning building with smoke billowing; cinematic
Characteristic
Shot : A female firefighter in full gear is looking at a burning building. The building is in the background, and the firefighter is in the foreground.
Aesthetic Score : 0.7
Mood : dramatic, intense, heroic
Quality
Entropy : 6.86
Noise : 88
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some minor artifacts, particularly in the smoke and flames. There is also some slight blurriness in the background.
Neon Glow, Focused Mind: A Gamer’s Intensity
A young woman, bathed in the vibrant glow of neon lights, is completely absorbed in her gaming session. The dimly lit room adds an air of mystery, while her focused expression reveals her determination to conquer the virtual world.
Prompt
poses over-the-shoulder: focused, intense ; A gamer, eyes glued to the screen, fingers flying across the keyboard; close-up; Gaming; a brightly lit gaming setup with flashing lights; cinematic
Characteristic
Shot : A young woman is gaming at a computer desk with a lit-up keyboard, wearing gaming headphones and a hoodie with blue lights.
Aesthetic Score : 0.6
Mood : focused, intense, determined
Quality
Entropy : 6.56
Noise : 82
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.30
Image errors : Some slight blurriness in the background, especially on the monitor.
Parisian Romance: A Moment of Joy at the Eiffel Tower
A young woman, radiant in a white dress, captures the essence of Parisian romance as she stands before the iconic Eiffel Tower. Her happy smile and stylish attire are perfectly complemented by the grandeur of the landmark, creating a picture of timeless beauty and joy.
Prompt
poses over-the-shoulder: joyful, awe-inspired ; A tourist, camera in hand, gazing at the Eiffel Tower; medium shot; Tourism; a bustling Parisian street with the Eiffel Tower in the background; cinematic
Characteristic
Shot : A young woman with long brown hair is standing in front of the Eiffel Tower in Paris. She is wearing a white dress and a brown handbag. The background is a blurred view of the city.
Aesthetic Score : 0.7
Mood : happy, romantic, Parisian
Quality
Entropy : 6.90
Noise : 91
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some minor artifacts in the background, particularly in the area of the cars. However, these are not very noticeable and do not detract significantly from the overall image.
Sunset Serenity on the Beach
A young woman finds peace and tranquility as she gazes out at the ocean at sunset. The soft golden light bathes the scene in a serene glow, with a palm tree standing tall in the foreground.
Prompt
poses over-the-shoulder: peaceful, contemplative ; A backpacker, gazing out at a breathtaking sunset over the ocean; wide shot; Travel; a serene beach with palm trees and turquoise water; cinematic
Characteristic
Shot : A young woman in a straw hat and denim shorts stands on a beach with her back to the camera, looking out at a calm ocean. Palm trees line the shore and a sunset is visible in the distance.
Aesthetic Score : 0.75
Mood : tranquil, serene, peaceful
Quality
Entropy : 6.65
Noise : 101
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image is slightly blurry and the woman’s face is not well defined.
Campfire Magic Under a Starry Sky
A group of friends gather around a crackling campfire, bathed in the warm glow of the flames and the twinkling light of a million stars. The scene evokes a sense of cozy camaraderie, peace, and wonder, making it the perfect escape from the everyday.
Prompt
poses over-the-shoulder: warm, nostalgic ; A group of friends, laughing and sharing stories, around a campfire; medium shot; Groups; a campsite under a starry night sky; cinematic
Characteristic
Shot : A group of six young people are sitting around a campfire under a starry night sky. They are all dressed casually and seem to be enjoying each other’s company. There is a tent in the background, suggesting that they are camping.
Aesthetic Score : 0.8
Mood : cozy, warm, friendly
Quality
Entropy : 6.63
Noise : 94
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.90
Image errors : No visible artifacts or errors.
Unveiling the Secrets: A Scientist’s Focused Pursuit
A captivating image of a woman in a lab coat, her eyes intently focused through a microscope, captures the essence of scientific exploration. The framing emphasizes her concentration, highlighting the dedication and precision required in the pursuit of knowledge.
Prompt
poses over-the-shoulder: focused, determined ; A scientist, peering through a microscope, engrossed in her research; close-up; Heroism; a laboratory filled with scientific equipment; cinematic
Characteristic
Shot : A woman in a white lab coat is looking through a microscope in a laboratory setting.
Aesthetic Score : 0.7
Mood : focused, scientific, professional
Quality
Entropy : 6.84
Noise : 76
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant errors, but the image appears to have been slightly over-sharpened, resulting in some halos around the subject’s hair and the microscope.
A Pilot’s Dream: Gazing into the Vastness
A woman in a pilot’s uniform, helmet in place, stares out the cockpit window at the boundless sky. The scene evokes a sense of dreamy nostalgia and adventurous anticipation, capturing the wonder of flight.
Prompt
poses over-the-shoulder: exhilarating, adventurous ; A pilot, gripping the controls, soaring through the clouds; wide shot; Adventure; a cockpit with a view of the vast, blue sky; cinematic
Characteristic
Shot : A young woman in a pilot’s uniform and helmet is looking out the window of a vintage aircraft.
Aesthetic Score : 0.8
Mood : dreamy, adventurous, hopeful
Quality
Entropy : 6.73
Noise : 85
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are some minor artifacts in the woman’s hair and the cockpit, but these are not very noticeable.
Mastering the Art of Culinary Precision
A seasoned chef, radiating calm and focus, meticulously plates a dish in a pristine professional kitchen. The warm lighting and stainless steel appliances create an atmosphere of expertise and culinary excellence.
Prompt
poses over-the-shoulder: passionate, artistic ; A chef, meticulously plating a dish, surrounded by the aromas of fresh ingredients; close-up; Tourism; a bustling kitchen in a gourmet restaurant; cinematic
Characteristic
Shot : A chef in a white uniform and a striped apron is plating a meal in a professional kitchen. The background is blurred, with the focus on the chef and the food.
Aesthetic Score : 0.7
Mood : professional, calm, focused
Quality
Entropy : 6.94
Noise : 82
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : No notable errors.
Summit Conquered: Hikers Celebrate on Majestic Mountain Ridge
Four adventurers stand triumphant on a snow-covered mountain peak, arms raised in celebration. The breathtaking panorama of snow-capped peaks and a vast mountain range creates a sense of awe and accomplishment. This exhilarating scene captures the spirit of adventure and the joy of reaching the summit.
Prompt
poses over-the-shoulder: triumphant, inspiring ; A group of hikers, silhouetted against a mountain peak, reaching the summit; wide shot; Groups; a majestic mountain range with a breathtaking view; cinematic
Characteristic
Shot : Four hikers stand on a snow-covered mountain peak with their arms raised in victory, overlooking a vast and majestic mountain range. The sky is a clear blue with fluffy white clouds, and the sun is shining brightly.
Aesthetic Score : 0.8
Mood : triumphant, adventurous, inspiring
Quality
Entropy : 6.55
Noise : 95
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Conclusion
The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis.
Here’s a breakdown:
- Camera Position: The model scored 0.5, which is considered good. This means the generated image’s camera position closely matched the prompt’s instructions.
- Shot Analysis: The model scored 0.58, also considered good. This indicates the generated image’s shot composition was fairly aligned with the prompt’s description.
- Aesthetic Analysis: The model scored 0.02, which is very good. This suggests the generated image’s aesthetic closely matched the expected aesthetic based on the prompt.
Overall, the model demonstrated a good understanding of camera position and shot composition, but its ability to capture the desired aesthetic was exceptional.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://www.scenario.com