AI Captures the Drama: Facial Expressions in Generated Images with Freepik
- 9 minutes read - 1894 wordsTable of Contents
Facial expressions are a powerful tool in storytelling, conveying a wide range of emotions and adding depth to characters. In the realm of AI-generated imagery, capturing these expressions realistically and effectively is a crucial challenge. This blog post examines a case study where a generative AI model was tasked with creating images based on prompts describing facial expressions and scenes. The results highlight the model’s strengths and weaknesses, offering valuable insights into the current capabilities and limitations of AI in this domain. We’ll explore how the model performed in understanding scene and camera position, and delve into its struggles with capturing the desired aesthetic, particularly in facial expressions. Through this analysis, we aim to shed light on the potential and challenges of using AI for creative tasks, specifically in the realm of visual storytelling.
Created with: freepik
Drowning in Laundry: The Overwhelmed Young Man
A young man sits amidst a sea of laundry baskets, his expression a picture of frustration and exhaustion. The image captures the overwhelming feeling of being buried in chores, a relatable struggle for many.
Prompt
facial-expressions Frustration: Overwhelmed and defeated ; A single person; eye-level; Single Persons; A cluttered apartment with overflowing laundry baskets and takeout containers.; cinematic
Characteristic
Shot : A young man is sitting on the floor, surrounded by laundry baskets full of clothes. He looks up with a surprised expression, as if something unexpected has happened.
Aesthetic Score : 0.3
Mood : surprised, confused, overwhelmed
Quality
Entropy : 6.72
Noise : 55
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry, particularly in the background. There are some minor lighting inconsistencies, making the scene look a bit flat.
The Man of Steel Prepares for Battle in the Shadows
A lone figure in a Superman costume stands in a dark, narrow alleyway, his serious expression hinting at the danger that awaits. The image evokes a sense of tension and anticipation, leaving viewers wondering what challenge lies ahead for the iconic hero.
Prompt
facial-expressions Frustration: Powerless and angry ; A superhero; close-up; Heroes; A dark alley with flickering streetlights, the hero’s cape billowing in the wind.; cinematic
Characteristic
Shot : A man dressed as Superman stands in a dark alleyway, looking determined and ready for action.
Aesthetic Score : 0.7
Mood : serious, heroic, suspenseful
Quality
Entropy : 6.64
Noise : 46
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image appears to be slightly blurred and the lighting is a bit flat. The Superman symbol appears a bit faded and blurry, but this could be a style choice.
Terror in the Train Car: One Man’s Scream Echoes the Fear
A chilling image captures the raw terror on the faces of passengers in a crowded train car. The focus falls on a man in the foreground, his mouth wide open in a silent scream, his fear amplified by the shallow depth of field that blurs the surrounding chaos. The scene evokes a palpable sense of tension and dread, leaving viewers questioning the source of the panic.
Prompt
facial-expressions Frustration: Impatient and stressed ; A businessman; eye-level; Normal People; A crowded train with people pushing and shoving, the businessman trapped in the middle.; cinematic
Characteristic
Shot : A crowded train car with people looking frightened, one man is in the foreground with his mouth open in a scream.
Aesthetic Score : 0.7
Mood : intense, suspenseful, chaotic
Quality
Entropy : 6.87
Noise : 55
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors.
On the Edge of Discovery: A Hacker’s Focus
A young man, bathed in the blue glow of his computer screen, sits intently at his desk. The dimly lit room and his focused expression create a palpable sense of tension, hinting at a moment of critical importance. Is he about to crack a code, uncover a secret, or unleash something powerful? The mystery unfolds in this captivating image.
Prompt
facial-expressions Frustration: Focused but frustrated ; A gamer; close-up; Gamer; A dimly lit room with a computer screen displaying a frustratingly difficult level, the gamer’s hands shaking on the keyboard.; cinematic
Characteristic
Shot : A young man, wearing headphones, sits at a desk in a dimly lit room. He is looking directly at the camera, his hands hovering over a backlit keyboard. Two computer monitors are visible in the background, one of which shows a blurry, pixelated image. The lighting is muted, casting a dark and mysterious atmosphere.
Aesthetic Score : 0.6
Mood : intense, focused, serious
Quality
Entropy : 6.57
Noise : 50
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image appears slightly overexposed in the highlights, especially on the subject’s forehead and hair. The background appears to be slightly out of focus.
Autumn Reflections: A Moment of Quiet Contemplation
A young woman finds solace in the changing season, lost in thought as she scrolls through her phone on a park bench. The vibrant autumn foliage provides a backdrop of tranquility, mirroring the pensive mood of the scene.
Prompt
facial-expressions Frustration: Lonely and isolated ; A young woman; eye-level; Single Persons; A deserted park bench, the woman staring blankly at the ground, her phone lying forgotten beside her.; cinematic
Characteristic
Shot : A young woman is sitting on a park bench, looking at her phone, in a park with fall foliage in the background.
Aesthetic Score : 0.6
Mood : pensive, reflective, autumnal
Quality
Entropy : 6.77
Noise : 64
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible artifacts or errors.
Heroic Figure Amidst the Flames
A firefighter, clad in full gear, stands stoic in the face of a raging inferno. Smoke billows from the burning building, creating a dramatic backdrop for this moment of courage and determination. The firefighter’s gaze, directed away from the fire, adds a layer of mystery and intrigue to the scene.
Prompt
facial-expressions Frustration: Urgent and desperate ; A firefighter; close-up; Heroes; A burning building with smoke billowing out, the firefighter struggling to open a door.; cinematic
Characteristic
Shot : A fireman wearing a helmet and respirator is looking towards a burning building with smoke and flames in the background.
Aesthetic Score : 0.7
Mood : intense, dramatic, heroic
Quality
Entropy : 6.88
Noise : 51
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors are present.
The Quiet Focus of Study
A young man finds solace and concentration amidst the towering shelves of a library, bathed in soft light. The scene evokes a sense of calm and thoughtful focus, capturing the essence of dedicated study.
Prompt
facial-expressions Frustration: Overwhelmed and anxious ; A student; eye-level; Normal People; A crowded library with students hunched over books, the student staring at a blank page, their pen hovering over the paper.; cinematic
Characteristic
Shot : A young man wearing glasses sits at a table in a library, surrounded by books, with his focus on an open book.
Aesthetic Score : 0.6
Mood : serious, studious, focused
Quality
Entropy : 6.89
Noise : 58
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no major errors in the image.
Lost in the Game: A Gamer’s Intense Focus
A young man, headphones on and eyes glued to the screen, embodies the intensity of gaming. The dim lighting and close-up shot highlight his determined focus as he navigates the virtual world.
Prompt
facial-expressions Frustration: Focused and intense ; A gamer; close-up; Gamer; A brightly lit gaming tournament stage, the gamer staring at the screen, their controller gripped tightly in their hands.; cinematic
Characteristic
Shot : A young man is sitting in a dimly lit room, wearing headphones and holding a video game controller. He is looking directly at the camera with a focused expression.
Aesthetic Score : 0.6
Mood : intense, focused, serious
Quality
Entropy : 6.79
Noise : 54
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry and there is some noise in the shadows.
Drowning in Cash, Yet Drowned in Despair
A woman sits amidst a mountain of money, her face etched with anxiety and confusion. The scene evokes a sense of unease, suggesting that wealth hasn’t brought her happiness, but rather a deeper sense of despair.
Prompt
facial-expressions Frustration: Exhausted and defeated ; A single mother; eye-level; Single Persons; A messy kitchen with dishes piled high in the sink, the single mother staring at a pile of bills, her shoulders slumped.; cinematic
Characteristic
Shot : A woman in a blue shirt sits at a kitchen counter looking up in shock. She is surrounded by stacks of money and a bowl of crumpled bills.
Aesthetic Score : 0.6
Mood : suspenseful, dramatic, troubled
Quality
Entropy : 6.81
Noise : 54
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major errors are visible. However, the image could be slightly sharper and the color balance could be adjusted for a more natural look.
Doctor’s Serious Gaze Hints at Critical Situation
A doctor, clad in a white coat and stethoscope, stares intently at the camera with a serious expression. The sterile hospital room and surrounding medical equipment amplify the tension, suggesting a critical moment is about to unfold.
Prompt
facial-expressions Frustration: Concerned and helpless ; A doctor; close-up; Heroes; A hospital room with a patient hooked up to machines, the doctor looking at a medical chart with a furrowed brow.; cinematic
Characteristic
Shot : A doctor in a white coat with a stethoscope around his neck looks sternly at the camera in what appears to be a hospital setting. He has a serious expression on his face. Medical equipment is visible in the background.
Aesthetic Score : 0.6
Mood : serious, professional, concerned
Quality
Entropy : 6.85
Noise : 50
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable errors.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.31, which is below the “good” range of 0.5 to 0.75. This suggests that the model didn’t perfectly capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.61, falling within the “good” range. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.17, which is outside the “very good” range of -0.2 to 0.1. This suggests that the generated image’s aesthetic deviated from the expected aesthetic described in the prompt.
Overall, the model shows promise in understanding scene and camera position, but needs improvement in capturing the desired aesthetic.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://www.freepik.com