AI's Facial Expressions: A Mixed Bag of Success with Scenario
- 9 minutes read - 1709 wordsTable of Contents
Facial expressions are a powerful tool for conveying emotions and storytelling. In the realm of generative AI, the ability to create realistic and expressive faces is a crucial benchmark. This blog post delves into the performance of a generative AI model in capturing facial expressions across a range of scenes, analyzing its strengths and weaknesses in understanding camera position, shot analysis, and aesthetic style. We’ll explore examples where the model excels and where it needs improvement, providing insights into the current state of AI’s ability to create compelling and emotionally resonant images.
Created with: scenario
Lost in the City Lights: A Moment of Melancholy
A young woman, shrouded in darkness, walks through a city bathed in the warm glow of streetlights. Her bright eyes, reflecting the urban landscape, hint at a story of nostalgia and longing. The scene evokes a sense of dreamy melancholy, leaving the viewer to ponder the woman’s thoughts and the secrets held within the city’s shadows.
Prompt
facial-expressions Excitement: Thrilled, anticipation ; A lone figure; eye-level; Single Person; bustling city street at night; cinematic
Characteristic
Shot : A woman stands on a wet city street at night, looking off to the side. The background is blurry and out of focus, with glowing lights and signs.
Aesthetic Score : 0.8
Mood : mysterious, melancholic, dreamy
Quality
Entropy : 6.67
Noise : 107
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.90
Image errors : There are no significant errors in the image, however, some slight oversharpening is noticeable on the subject’s hair and skin.
Supergirl Soars into the Sunset, Hope Takes Flight
A powerful image captures Supergirl’s epic flight over a city at sunset, radiating hope and strength. The dramatic pose, vibrant colors, and sprawling cityscape create a breathtaking scene.
Prompt
facial-expressions Excitement: Triumphant, exhilarating ; A superhero in mid-air; low-angle; Hero; cityscape with a dramatic sunset; cinematic
Characteristic
Shot : A woman dressed as Supergirl flying over a cityscape at sunset.
Aesthetic Score : 0.7
Mood : powerful, hopeful, determined
Quality
Entropy : 6.77
Noise : 96
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to be slightly blurry.
Sun-Kissed Laughter: Capturing the Joy of Summer
Three friends embrace the warmth of a sunny day, their laughter echoing through the park as they run with carefree abandon. The focus on the woman in the foreground captures the infectious energy of their shared moment, creating a vibrant and joyful scene.
Prompt
facial-expressions Excitement: Joyful, carefree ; A group of friends laughing and running; eye-level; Normal People; a sunny park with a vibrant green lawn; cinematic
Characteristic
Shot : A young woman with flowing hair is running towards the camera with a big smile on her face. She is in a park, with greenery in the background, and the sun is shining.
Aesthetic Score : 0.8
Mood : joyful, carefree, sunny
Quality
Entropy : 6.77
Noise : 93
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : None
Lost in Thought, Fueled by Pixels
A young woman, immersed in the digital world, contemplates her next move. The dimly lit room and focused gaze create a sense of mystery and intrigue, hinting at a world beyond the screen.
Prompt
facial-expressions Excitement: Intense, focused ; A gamer’s hands furiously tapping on a keyboard; close-up; Gamer; a dimly lit room with glowing screens; cinematic
Characteristic
Shot : A young woman wearing a pink hoodie and headphones is sitting in front of a computer, the background is a dimly lit room with a keyboard and monitors. She looks like she is concentrating on something.
Aesthetic Score : 0.8
Mood : focused, mysterious, hopeful
Quality
Entropy : 6.68
Noise : 86
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.90
Image errors : Some minor artifacts and blurriness are present around the edges of the image.
Sunset Serenity: A Moment of Tranquility on the Cliffside
A woman finds peace and beauty as she watches the sun dip below the horizon, casting warm hues across the ocean. The scene evokes a sense of calm and romanticism, with the sunset’s glow offering a feeling of hope and tranquility.
Prompt
facial-expressions Excitement: Awe-inspiring, liberating ; A woman standing on a cliff overlooking a vast ocean; eye-level; Single Person; dramatic clouds and a setting sun; cinematic
Characteristic
Shot : A woman stands on a cliff overlooking the ocean at sunset. The sun is setting behind her, casting a warm golden glow over the scene.
Aesthetic Score : 0.8
Mood : romantic, serene, wistful
Quality
Entropy : 6.86
Noise : 90
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some minor artifacts in the image, particularly around the woman’s hair and the ocean.
Into the Fire: A Warrior’s Grit in a Futuristic Battlefield
A woman clad in advanced combat armor charges through a chaotic battlefield, explosions and smoke swirling around her. Her determined expression and the intensity of the scene create a palpable sense of urgency and danger.
Prompt
facial-expressions Excitement: Brave, adrenaline-fueled ; A hero charging into battle; low-angle; Hero; a chaotic battlefield with explosions and smoke; cinematic
Characteristic
Shot : A young woman in a futuristic suit runs through a warzone with explosions and smoke behind her.
Aesthetic Score : 0.7
Mood : intense, action, dramatic
Quality
Entropy : 6.78
Noise : 92
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.80
Image errors : The smoke is a bit blurry and generic, the explosions could be more realistic and well-defined, and the woman’s hair is overly stylized.
Birthday Joy Captured in a Moment of Laughter
This heartwarming photo captures a family celebrating a birthday with cake and balloons. The woman’s infectious laughter and the joyful expressions of the man and boy create a sense of warmth and happiness, despite a slightly awkward composition. The natural light adds to the overall feeling of celebration.
Prompt
facial-expressions Excitement: Happy, celebratory ; A family celebrating a birthday; eye-level; Normal People; a brightly decorated living room with balloons and streamers; cinematic
Characteristic
Shot : A birthday party with a cake and balloons
Aesthetic Score : 0.7
Mood : joyful, celebratory, heartwarming
Quality
Entropy : 6.74
Noise : 96
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant image errors, although there is slight noise in the background. The colors look a bit oversaturated, which could be improved with a color correction.
Lost in Thought: A Dreamy Portrait of a Young Woman
This ethereal portrait captures a young woman lost in contemplation, her gaze directed upwards and slightly to the right. The soft lighting and blurred neon background create a sense of mystery and intrigue, leaving the viewer to wonder what thoughts are swirling in her mind.
Prompt
facial-expressions Excitement: Engrossed, focused ; A gamer’s face illuminated by the screen; close-up; Gamer; a dark room with neon lights reflecting on the screen; cinematic
Characteristic
Shot : A close-up shot of a young woman’s face. Her expression is thoughtful, with her eyes slightly upturned. The background is out of focus and features glowing purple and blue lights.
Aesthetic Score : 0.8
Mood : dreamy, melancholic, contemplative
Quality
Entropy : 6.82
Noise : 90
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.80
Image errors : The skin texture appears slightly unnatural and the hair is somewhat blurry in areas.
Joyride: A Woman’s Laughter Fills the Air
A woman, radiating joy, drives with a contagious laugh, her eyes fixed on something off-screen. The blur of the background adds to the sense of excitement and carefree energy, capturing a moment of pure happiness on the road.
Prompt
facial-expressions Excitement: Thrilling, exhilarating ; A man riding a rollercoaster; POV shot; Single Person; a fast-paced ride with twists and turns; cinematic
Characteristic
Shot : A young woman with long brown hair is riding in a roller coaster, her mouth is open in a joyful scream. She is wearing a white shirt. The shot is taken from inside the roller coaster.
Aesthetic Score : 0.8
Mood : joyful, exhilarating, carefree
Quality
Entropy : 6.80
Noise : 84
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly overexposed, there are no other major image errors.
Finding Hope in the Storm
A young woman stands on a rooftop, her bright smile a beacon of hope against the backdrop of a dark, stormy sky. The dramatic contrast captures a moment of resilience and freedom.
Prompt
facial-expressions Excitement: Victorious, powerful ; A hero standing triumphantly on a rooftop; high-angle; Hero; a cityscape with a dramatic storm in the background; cinematic
Characteristic
Shot : A young woman stands on a rooftop, looking up at a stormy sky. The cityscape behind her is mostly out of focus.
Aesthetic Score : 0.7
Mood : happy, dramatic, hopeful
Quality
Entropy : 6.72
Noise : 84
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors.
Conclusion
The generative AI model performed well in terms of understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.35, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.59, which is considered good. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.08, which is considered very good. This means that the generated image closely matched the expected aesthetic style.
Overall, the model shows promise in understanding scene descriptions and achieving a desired aesthetic, but needs improvement in accurately capturing camera positions.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://www.scenario.com