AI's Facial Expressions: A Mixed Bag of Success with Scenario
- 10 minutes read - 1940 wordsTable of Contents
Facial expressions are a powerful tool for conveying emotions and intentions in visual storytelling. Generative AI models are increasingly being used to create images with realistic facial expressions, but how well do they perform? This blog post delves into the nuances of AI-generated facial expressions, examining its strengths and weaknesses in capturing the subtle nuances of human emotion. We’ll explore examples of successful and less successful attempts, analyzing the factors that contribute to the model’s performance. From the bustling city streets to the quiet solitude of a foggy alleyway, we’ll see how AI navigates the complexities of human expression in diverse settings.
Created with: scenario
Lost in the City Lights: A Moment of Melancholy
A young woman stands alone on a vibrant city street, her gaze lost in the distance. The bustling crowd around her seems to fade away as she succumbs to a sense of longing and isolation. This evocative image captures the bittersweet beauty of urban life and the search for connection in a crowded world.
Prompt
facial-expressions Confusion: Disoriented, overwhelmed ; A lone figure; eye-level; Single Person; a bustling city street with neon signs and crowds; cinematic
Characteristic
Shot : A young woman stands in a brightly lit city street at night, looking away from the camera towards the glowing neon signs. There are other people in the background, but they are blurred and out of focus, making the woman the focal point.
Aesthetic Score : 0.7
Mood : nostalgic, urban, dreamy
Quality
Entropy : 6.76
Noise : 105
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.90
Image errors : The background is slightly blurry and the woman’s hair looks somewhat artificial. There are a few minor artifacts in the background, but they are not very noticeable.
Hope Rises from the Ashes: Superhero Stands Tall Amidst Devastation
A powerful female superhero, radiating determination and hope, stands on the edge of a ravaged city. Smoke and fire billow in the background, creating a dramatic and tense scene. This image captures the resilience of the human spirit in the face of adversity.
Prompt
facial-expressions Confusion: Doubt, uncertainty ; A superhero in a tattered costume; eye-level; Hero; a destroyed cityscape with smoke and debris; cinematic
Characteristic
Shot : A woman dressed as Supergirl stands against a backdrop of a city in ruins, with a fire and smoke in the background. The woman appears to be looking at the fire and smoke with a determined expression on her face.
Aesthetic Score : 0.6
Mood : determined, powerful, dramatic
Quality
Entropy : 6.90
Noise : 93
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.50
Image errors : The image has some blurriness and noise, and the lighting is not ideal.
A Moment of Contemplation in the Corporate World
A woman in a sharp business suit stands alone in an office, her expression serious and contemplative. The lighting casts a mysterious glow, adding to the sense of intrigue. The background figure adds a touch of context, suggesting a world of professional challenges and decisions.
Prompt
facial-expressions Confusion: Lost, unmoored ; A woman in a business suit; eye-level; Normal People; a sterile office with fluorescent lights and cubicles; cinematic
Characteristic
Shot : A woman in a business suit stands in an office setting. The background is blurred and shows cubicles and other office furniture. The woman’s face is in focus and she is looking directly at the camera.
Aesthetic Score : 0.6
Mood : serious, professional, confident
Quality
Entropy : 6.65
Noise : 98
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to be overly stylized, with the subject’s face and hair appearing to be unrealistically smooth and flawless.
Lost in the Melody: A Moment of Calm and Contemplation
A young woman, bathed in soft, warm light, finds solace in her music. Her relaxed expression and the intimate feel of the image evoke a sense of peace and quiet contemplation. The beige sweater and headphones add to the cozy and introspective atmosphere.
Prompt
facial-expressions Confusion: Frustration, bewilderment ; A gamer with headphones on; close-up; Gamer; a dimly lit room with a computer screen displaying a complex game interface; cinematic
Characteristic
Shot : A woman wearing headphones, looking thoughtfully off-camera, with a neutral background.
Aesthetic Score : 0.8
Mood : calm, contemplative, introspective
Quality
Entropy : 6.63
Noise : 91
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major image errors.
Lost in the Fog: A Moment of Suspense
A woman in a trench coat stands shrouded in fog, her gaze fixed on something unseen. A man in a hat lurks further down the alley, adding to the air of mystery. This cinematic scene evokes a sense of suspense and intrigue, leaving you wondering what secrets lie hidden in the mist.
Prompt
facial-expressions Confusion: Suspicious, wary ; A man in a trench coat; eye-level; Single Person; a foggy alleyway with flickering streetlights; cinematic
Characteristic
Shot : A woman in a trench coat standing in a foggy alleyway. There is a man in a hat walking down the alley, out of focus, behind her. The scene is lit by streetlamps.
Aesthetic Score : 0.7
Mood : mysterious, alluring, melancholic
Quality
Entropy : 6.76
Noise : 103
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.80
Image errors : The lighting in the image is a bit unnatural, with the streetlamps being too bright and the fog being too dense. The woman’s hair is also a bit too perfect, which makes it look unnatural.
A Knight’s Tale: Mystery in the Mist
A young woman in medieval armor stands amidst a misty forest, her gaze fixed on something unseen. The ethereal lighting and her pensive pose create a sense of mystery and intrigue, leaving the viewer to wonder what secrets lie hidden within the fog.
Prompt
facial-expressions Confusion: Disillusioned, lost ; A knight in shining armor; eye-level; Hero; a dark forest with twisted trees and ominous shadows; cinematic
Characteristic
Shot : A woman in full plate armor stands in a misty forest. She looks determined, yet somewhat apprehensive. She leans against a tree, suggesting a moment of respite or contemplation. The light is soft and muted, highlighting the details of the armor and her face.
Aesthetic Score : 0.8
Mood : mysterious, powerful, contemplative
Quality
Entropy : 6.77
Noise : 101
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.80
Image errors : There is some blurring and softness in the background, which might be intended to create a misty effect, but could be perceived as technical error.
Lost in Thought: A Moment of Quiet Contemplation
A woman sits at a dining table, her gaze fixed on something beyond the frame. The plate of food before her is untouched, a testament to the depth of her thoughts. The quiet mood and thoughtful expression create a sense of introspection and quiet contemplation.
Prompt
facial-expressions Confusion: Awkward, uncomfortable ; A family at a dinner table; eye-level; Normal People; a brightly lit kitchen with mismatched plates and silverware; cinematic
Characteristic
Shot : A woman is sitting at a table, looking off to the side, with a plate of food in front of her. Another woman is sitting to the left, but is out of focus.
Aesthetic Score : 0.7
Mood : pensive, thoughtful, wistful
Quality
Entropy : 6.78
Noise : 91
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible artifacts or errors.
The Moment Before Victory: A Gamer’s Focused Gaze
A young woman, bathed in warm light, sits on a couch, controller in hand, her gaze fixed upwards. The scene exudes a sense of relaxed focus and determination, hinting at a moment of intense anticipation before a crucial gaming challenge.
Prompt
facial-expressions Confusion: Overwhelmed, disoriented ; A gamer holding a controller; close-up; Gamer; a brightly lit room with a TV screen displaying a chaotic game scene; cinematic
Characteristic
Shot : A young woman is playing video games on a couch. She is looking up, perhaps at a screen or something in the distance. The room is lit with warm light, and the couch is yellow.
Aesthetic Score : 0.6
Mood : focused, contemplative, playful
Quality
Entropy : 6.82
Noise : 90
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.70
Image errors : Some minor imperfections are noticeable in the woman’s skin and hair. There’s a slight blurring around her edges.
Lost in Thought: A Moment of Contemplation on a Busy Street
A young woman, her gaze distant, walks through a bustling city scene. The soft focus of the background and her contemplative expression create a sense of mystery and intrigue, leaving the viewer wondering what thoughts occupy her mind.
Prompt
facial-expressions Confusion: Lost, alienated ; A woman walking down a crowded street; eye-level; Single Person; a bustling city street with people rushing past; cinematic
Characteristic
Shot : A young woman with long brown hair is walking through a city street. The background is blurry, but it appears to be a bustling city street.
Aesthetic Score : 0.8
Mood : dreamy, hopeful, mysterious
Quality
Entropy : 6.67
Noise : 100
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image appears to be generated using AI. The background and blur effects are unrealistic and lacking in detail.
Moonlit Reflections: A Moment of Dreamy Contemplation
In this romantic and melancholic scene, a young woman stands on a rooftop, captivated by the cityscape’s twinkling lights and the full moon’s soft glow. The dramatic effect of the moon as a focal point, combined with her thoughtful pose, creates a dreamy atmosphere of longing and reflection.
Prompt
facial-expressions Confusion: Doubt, questioning ; A superhero standing on a rooftop; eye-level; Hero; a cityscape with twinkling lights and a full moon; cinematic
Characteristic
Shot : A young woman in a white crop top and denim shorts stands on a rooftop overlooking a city skyline at night. The moon is visible in the sky, and the city lights twinkle below.
Aesthetic Score : 0.7
Mood : romantic, dreamy, nostalgic
Quality
Entropy : 6.51
Noise : 81
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.80
Image errors : Some minor artifacts are visible in the background, particularly in the cityscape and around the moon. The lighting appears a bit unnatural, and the skin tones are slightly overly saturated.
Conclusion
The results of the analysis show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.3, which is considered below average. This suggests that the generated image didn’t accurately reflect the camera position described in the prompt.
- Shot Analysis: The model scored 0.61, which is considered good. This indicates that the model was able to understand the scene described in the prompt and create an image that reflects it well.
- Aesthetic Analysis: The model scored 0.05, which is considered very good. This means that the generated image closely matched the expected aesthetic style.
Overall, the model seems to be better at understanding the scene and achieving the desired aesthetic than accurately capturing the camera position.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://www.scenario.com