AI's Facial Expressions: A Hit or Miss? with Dall-e-3
Facial expressions are a powerful tool in storytelling, conveying emotions and adding depth to characters. In the realm of AI, generating realistic and expressive faces is a challenging task. This blog post examines the results of an AI model’s attempt to generate facial expressions in various scenes, highlighting its strengths and weaknesses. We’ll explore how the model captures the scene and shot composition, but also delve into its struggles with achieving the desired aesthetic. Through this analysis, we gain insights into the current capabilities and limitations of AI in generating expressive faces.
Created with: dall-e-3
Lost in the City’s Buzz: A Moment of Introspection
A young woman finds solitude amidst the bustling city life, her focused expression and the blurred background highlighting a moment of introspection. The warm lighting and steam from her coffee create a cozy atmosphere in this trendy cafe scene.
Prompt
facial-expressions Embarrassment: Awkward and self-conscious ; A single woman; eye-level; Single Persons; A crowded cafe with loud chatter and laughter; cinematic
Characteristic
Shot : A woman is sitting at a table in a cafe with a cup of coffee. There are other people sitting at tables in the background, and buildings in the background.
Aesthetic Score : 0.6
Mood : casual, urban, contemplative
Quality
Entropy : 6.71
Noise : 103
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image is somewhat blurry, especially in the background, and there are some visible artifacts from the AI generation process. Some of the people in the background appear pixelated or unnatural.
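The post does not say how the Entropy value under Quality is computed. One common reading is the Shannon entropy of the grayscale histogram, which for an 8-bit image lies between 0 and 8 bits; the values reported here (6.00 to 6.92) would fit that scale. The following is a minimal sketch under that assumption. The helper name `shannon_entropy` is hypothetical, and the toy pixel lists stand in for a real image:

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a flat sequence of grayscale pixel values.

    Assumption: this is only one plausible definition of the "Entropy"
    metric; the post does not state which one was actually used.
    """
    counts = Counter(pixels)
    total = len(pixels)
    # Sum of -p * log2(p) over every grey level that occurs in the image.
    return sum(-(n / total) * math.log2(n / total) for n in counts.values())


# Toy data: a flat grey image and one that uses all 256 levels equally often.
constant_image = [128] * 1024
full_range_image = list(range(256)) * 4

print(shannon_entropy(constant_image))    # no variation at all
print(shannon_entropy(full_range_image))  # maximum for 8-bit data
```

Under this reading, a value close to 7 would mean the image uses most of the available tonal range, which is what one would expect from a detailed, cinematic scene.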
Hope Takes Flight: Superhero Inspires City Crowd
A beacon of hope in a bustling city, a superhero in a vibrant blue and yellow costume stands tall, inspiring awe and excitement in the crowd that looks up at him with admiration. The scene radiates optimism and heroism, capturing the essence of a brighter future.
Prompt
facial-expressions Embarrassment: Humiliated and exposed ; A superhero in a full costume; eye-level; Heroes; A bustling city street with people staring; cinematic
Characteristic
Shot : A superhero stands at the edge of a crowded street in a city with tall buildings, the crowd is looking up at the superhero, the scene is set in a busy urban environment, maybe a demonstration or a parade
Aesthetic Score : 0.6
Mood : hopeful, inspiring, optimistic
Quality
Entropy : 6.88
Noise : 105
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image appears to have been generated by AI. The faces are not fully realistic; they lack detail and look repetitive. The city and buildings in the background also look unrealistic. The car details are inaccurate and lack definition.
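The Prompt Clip Score reported for each image is not defined in the post. CLIP-based prompt scores are commonly the cosine similarity between a text embedding of the prompt and an image embedding of the result, so the sketch below shows only that similarity step. The function name `cosine_similarity` and the short vectors are hypothetical placeholders; real embeddings would come from a CLIP model, and the post's exact scoring method is an assumption here:

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical toy embeddings, NOT real CLIP outputs.
prompt_embedding = [0.2, 0.7, 0.1]
image_embedding = [0.3, 0.5, 0.4]

score = cosine_similarity(prompt_embedding, image_embedding)
print(round(score, 2))
```

A higher value means the image embedding points in a direction closer to the prompt embedding, which is why the two gamer scenes (0.31) would count as the closest prompt matches in this set.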
Tears in a Golden Cage: Loneliness Amidst Luxury
A man sits alone at a lavish dinner table, his tears a stark contrast to the opulent surroundings. While others around him seem indifferent or react with discomfort, his sadness is palpable, highlighting the isolating nature of his grief.
Prompt
facial-expressions Embarrassment: Mortified and ashamed ; A man in a business suit; eye-level; Normal People; A formal dinner party with elegant guests; cinematic
Characteristic
Shot : A man is crying at a formal dinner party. He is seated at the head of the table with other guests around him. He is dressed in a suit and tie, and he is holding his hands to his chest. There are glasses of wine on the table. The mood is somber and reflective.
Aesthetic Score : 0.7
Mood : sadness, despair, formality
Quality
Entropy : 6.00
Noise : 101
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.90
Image errors : Some slight issues with the smoothing and edge detection in the background and the subject’s hair.
The Price of Defeat: Gamer’s Despair in a Dimly Lit Room
A young woman, slumped in her gaming chair, headphones on, stares blankly ahead. Empty pizza boxes and scattered gaming equipment litter the dimly lit room, reflecting the weight of her frustration. Monitors flicker with game footage, a silent testament to her recent defeat. The scene captures the raw emotion of a gamer facing the harsh reality of failure, leaving a sense of unease and isolation.
Prompt
facial-expressions Embarrassment: Cringing and defeated ; A gamer in a gaming chair; eye-level; Gamer; A dimly lit room with flashing screens and empty pizza boxes; cinematic
Characteristic
Shot : A young woman is sitting in a gaming chair, looking distressed. The room is dimly lit, with neon lights casting a blue and purple hue. The background is cluttered with pizza boxes, gaming monitors, and controllers.
Aesthetic Score : 0.4
Mood : gloomy, stressed, overwhelmed
Quality
Entropy : 6.77
Noise : 93
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.40
Image errors : The lighting is uneven, and the image has a slight digital artifacting effect. The image has a slight blur in the background.
A Bride’s Silent Sorrow: Capturing the Melancholy of a Wedding Reception
A poignant image captures the stark contrast between a bride’s somber mood and the joyous atmosphere of her wedding reception. The scene evokes feelings of melancholy, awkwardness, and loneliness, highlighting the complex emotions that can accompany such a momentous occasion.
Prompt
facial-expressions Embarrassment: Lonely and out of place ; A woman in a wedding dress; eye-level; Single Persons; A crowded wedding reception with happy couples; cinematic
Characteristic
Shot : A bride sits alone at a wedding reception, looking sad and out of place while everyone else is laughing and having fun.
Aesthetic Score : 0.7
Mood : lonely, somber, melancholic
Quality
Entropy : 6.89
Noise : 95
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some minor artifacts in the background, particularly around the faces of some guests.
Masked Figure Commands the Spotlight Amidst a Roaring Crowd
A close-up reveals a man shrouded in mystery, his blue mask a stark contrast against the blurred cheers of the crowd. The spotlight’s dramatic illumination intensifies the intrigue, leaving us to wonder what unfolds next in this captivating scene.
Prompt
facial-expressions Embarrassment: Embarrassed and self-conscious ; A superhero in a cape; eye-level; Heroes; A cheering crowd at a victory parade; cinematic
Characteristic
Shot : A superhero in a blue mask looks out at an adoring crowd. They are cheering and holding their phones up.
Aesthetic Score : 0.6
Mood : excited, celebratory, heroic
Quality
Entropy : 6.87
Noise : 104
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.90
Image errors : The crowd in the background is very blurry and the lighting is a bit too bright and flat.
Distress at the Dinner Table: A Moment of Anxiety Captured
A close-up shot reveals a young woman’s distress as she sits alone at a restaurant table, her face hidden in her hands. The dramatic lighting and intimate framing create a palpable sense of tension and suspense, leaving the viewer wondering what has caused her anguish.
Prompt
facial-expressions Embarrassment: Uncomfortable and out of place ; A woman in a casual outfit; eye-level; Normal People; A fancy restaurant with white tablecloths and expensive wine; cinematic
Characteristic
Shot : A woman is sitting at a restaurant table, looking scared. She is being filmed by a phone on a tripod, and the camera is capturing her reaction. The scene is set in a fancy restaurant with white tablecloths and wine glasses. The restaurant is dimly lit, creating a sense of mystery and intrigue.
Aesthetic Score : 0.5
Mood : suspenseful, uneasy, mysterious
Quality
Entropy : 6.92
Noise : 95
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some artifacts in the image, particularly around the edges of the woman’s hair and the background. The image appears to be slightly blurry, indicating that the camera was moving slightly during the capture.
The Pressure is On: Young Gamer Faces the Crowd
A young man, hooded and focused, sits before a computer in a roaring arena. The dramatic lighting and his intense expression capture the pressure and excitement of the moment. This scene evokes a sense of intense focus and anticipation, leaving the viewer wondering what challenge lies ahead.
Prompt
facial-expressions Embarrassment: Humiliated and defeated ; A gamer in a hoodie; eye-level; Gamer; A crowded esports tournament with loud cheers and flashing lights; cinematic
Characteristic
Shot : A hooded gamer is sitting at a computer in an arena filled with cheering fans. The stage is lit with spotlights.
Aesthetic Score : 0.6
Mood : intense, focused, exciting
Quality
Entropy : 6.60
Noise : 113
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.60
Image errors : Some of the figures in the background look a bit blurry and pixelated. There is a slight blurring effect on the screen. The lighting on the gamer’s face seems a bit unnatural and overexposed.
Fear in the Candlelight: A Man’s Tense Dinner
A man in a tuxedo sits at a dimly lit dinner table, his face etched with fear. The flickering candlelight casts long shadows, adding to the atmosphere of suspense and mystery. What secrets lie hidden beneath the surface of this tense encounter?
Prompt
facial-expressions Embarrassment: Awkward and uncomfortable ; A man in a tuxedo; eye-level; Single Persons; A romantic dinner for two with candles and flowers; cinematic
Characteristic
Shot : A man in a tuxedo sits at a dimly lit dinner table, looking startled, with candles and flowers in the foreground.
Aesthetic Score : 0.7
Mood : suspenseful, eerie, dramatic
Quality
Entropy : 6.59
Noise : 84
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors in the image.
Masked Figure Speaks at Tense Press Conference
A mysterious figure, shrouded in a black mask, faces a barrage of questions from reporters at a tense press conference. The low camera angle and ominous lighting heighten the suspense, leaving the audience wondering about the identity and intentions of the masked individual.
Prompt
facial-expressions Embarrassment: Mortified and ashamed ; A superhero in a mask; eye-level; Heroes; A news conference with reporters asking difficult questions; cinematic
Characteristic
Shot : A man wearing a black mask is being interviewed by reporters in a room. He sits at a table with microphones in front of him.
Aesthetic Score : 0.6
Mood : tense, mysterious, dramatic
Quality
Entropy : 6.88
Noise : 109
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.50
Image errors : No noticeable errors in the image.
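To compare the ten images at a glance, the per-image figures listed above can be averaged. The snippet below is a small sketch that copies the Aesthetic Score, Prompt Clip Score, and Likelihood of AI values from this post; the scene labels and the helper name `column_mean` are shorthand introduced here for illustration:

```python
from statistics import mean

# (scene, aesthetic score, prompt CLIP score, likelihood of AI), copied from the post
results = [
    ("cafe", 0.6, 0.22, 0.80),
    ("city superhero", 0.6, 0.23, 0.90),
    ("dinner party", 0.7, 0.22, 0.90),
    ("gaming room", 0.4, 0.31, 0.40),
    ("wedding reception", 0.7, 0.23, 0.20),
    ("victory parade", 0.6, 0.25, 0.90),
    ("fancy restaurant", 0.5, 0.24, 0.20),
    ("esports arena", 0.6, 0.31, 0.60),
    ("candlelit dinner", 0.7, 0.23, 0.20),
    ("press conference", 0.6, 0.23, 0.50),
]


def column_mean(rows, index):
    """Mean of one numeric column, rounded to three decimals."""
    return round(mean(row[index] for row in rows), 3)


mean_aesthetic = column_mean(results, 1)
mean_clip = column_mean(results, 2)
mean_ai_likelihood = column_mean(results, 3)

print(mean_aesthetic, mean_clip, mean_ai_likelihood)
```

With the figures above, this gives a mean Aesthetic Score of 0.6, a mean Prompt Clip Score of 0.247, and a mean Likelihood of AI of 0.56.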
Conclusion
The results show that the generative AI model performed well in understanding the scene, but struggled with the camera position and the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.15, which is below the “good” range of 0.5 to 0.75. This suggests the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.62, falling within the “good” range. This indicates the model successfully understood the scene described in the prompt and created an image that reflects it.
- Aesthetic Analysis: The model scored 0.12, which is slightly above the “very good” range of -0.2 to 0.1. This suggests the generated image’s aesthetic deviated from the expected aesthetic described in the prompt.
Overall, the model demonstrated a good understanding of the scene, but missed the requested camera position and did not fully match the desired aesthetic.
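The range comparisons in the breakdown above can be checked mechanically. This is a minimal sketch using the scores and ranges quoted in the conclusion; the helper name `position_in_range` is hypothetical, and the "good" range for Shot Analysis is assumed to be the same 0.5 to 0.75 given for Camera Position, since the post does not state it separately:

```python
def position_in_range(score, low, high):
    """Say whether a score is below, within, or above an inclusive range."""
    if score < low:
        return "below"
    if score > high:
        return "above"
    return "within"


# metric -> (score, range low, range high), taken from the conclusion
scores = {
    "Camera Position": (0.15, 0.5, 0.75),
    "Shot Analysis": (0.62, 0.5, 0.75),  # assumption: same "good" range
    "Aesthetic Analysis": (0.12, -0.2, 0.1),
}

verdicts = {name: position_in_range(*values) for name, values in scores.items()}

for name, verdict in verdicts.items():
    print(f"{name}: {verdict} the reference range")
```

Only Shot Analysis lands inside its range; Camera Position falls well below it, while Aesthetic Analysis overshoots its upper bound by about 0.02.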