AI's Mixed Bag: Capturing Emotion in Images with Freepik
- 9 minutes read - 1816 wordsTable of Contents
In the realm of AI-generated art, capturing the essence of human emotion remains a formidable challenge. While AI models can create stunning visuals, replicating the subtle nuances of facial expressions, the very language of our emotions, is a complex endeavor. This blog post explores a case study where an AI model attempts to generate images with specific facial expressions, revealing both successes and limitations. We’ll delve into the intricacies of dramatic style facial expressions, exploring how they are used in various contexts, from film and photography to everyday human interaction. By examining the AI’s performance, we gain valuable insights into the ongoing quest to bridge the gap between human creativity and the computational power of AI.
Created with: freepik
Lost in the Neon Glow: A Moment of Melancholy in the City
A young woman, bathed in the soft light of urban neon, stands alone, her gaze lost in the distance. Her melancholic expression and the vibrant cityscape create a captivating scene of wistful loneliness and intrigue.
Prompt
facial-expressions Disappointment: Melancholy, isolation ; A lone figure; eye-level; Single Person; a bustling city street at night, with neon signs and blurred lights; cinematic
Characteristic
Shot : A young woman with brown hair and blue eyes is standing on a city street at night. The street is lined with shops and restaurants, and the buildings are lit up with neon signs. The woman is wearing a green jacket, and she is looking off into the distance. There are other people walking around in the background.
Aesthetic Score : 0.8
Mood : melancholy, dreamy, urban
Quality
Entropy : 6.69
Noise : 54
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors
Heroic Silhouette Against the Setting Sun
A powerful superhero stands tall on a rooftop, silhouetted against a breathtaking sunset. The scene evokes a sense of epic heroism and hopeful anticipation, capturing the grandeur of the moment.
Prompt
facial-expressions Disappointment: Defeated, disillusioned ; A superhero standing on a rooftop; eye-level; Hero; a cityscape bathed in the orange glow of a setting sun, with the hero’s cape billowing in the wind; cinematic
Characteristic
Shot : A superhero in a long orange cape stands on a rooftop, looking out at a city skyline during sunset.
Aesthetic Score : 0.7
Mood : heroic, hopeful, epic
Quality
Entropy : 6.76
Noise : 48
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.80
Image errors : The city skyline appears to be generated, with some of the buildings looking unrealistic. The cape also appears slightly unnatural.
The Weight of a Messy Meal
A woman sits alone in a kitchen, her meal untouched and her expression heavy with melancholy. The scene evokes a sense of loneliness and disappointment, leaving the viewer to ponder the weight of her unspoken emotions.
Prompt
facial-expressions Disappointment: Hopelessness, resignation ; A woman sitting at a kitchen table; eye-level; Normal Person; a cluttered kitchen with dirty dishes and a half-eaten meal; cinematic
Characteristic
Shot : A woman is sitting at a table in a kitchen, looking directly at the camera with a sad expression. The table is covered in crumbs and there are dishes with food on it, suggesting a meal has just been eaten.
Aesthetic Score : 0.6
Mood : sad, contemplative, melancholic
Quality
Entropy : 6.89
Noise : 52
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : None.
The Hacker’s Gaze: A Portrait of Digital Intrigue
A young man sits before a computer, his serious expression and the skull icon on the screen hinting at a world of digital mystery. The dim lighting and dark mood create a sense of intrigue, leaving you wondering what secrets lie behind his gaze.
Prompt
facial-expressions Disappointment: Frustration, anger ; A gamer sitting in front of a computer screen; eye-level; Gamer; a dimly lit room with flashing lights and the glow of the monitor reflecting in their eyes; cinematic
Characteristic
Shot : A young man sits at a desk in front of a computer, his expression is serious and intense. The background is a dimly lit room, and the computer screen shows a skull-like icon in a futuristic interface.
Aesthetic Score : 0.6
Mood : intense, serious, futuristic
Quality
Entropy : 6.20
Noise : 46
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.50
Image errors : No obvious artifacts or errors
Lost in the Shadows: A Man’s Solitary Walk Under the City Lights
A captivating image of a lone figure walking through a dimly lit street, bathed in the glow of street lamps and shop signs. The low-key lighting and the man’s posture evoke a sense of mystery and melancholy, leaving the viewer to ponder his story.
Prompt
facial-expressions Disappointment: Loneliness, despair ; A man walking down a deserted street; eye-level; Single Person; a street lined with closed shops and flickering streetlights; cinematic
Characteristic
Shot : A man in a brown coat walks down a street lit by street lamps and string lights at night.
Aesthetic Score : 0.7
Mood : mysterious, moody, lonely
Quality
Entropy : 6.86
Noise : 64
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors.
A Knight’s Lament in a Post-Apocalyptic Wasteland
A solitary knight kneels over a fallen comrade amidst the smoke and debris of a recent battle. The scene is both dramatic and somber, capturing the intensity of a post-apocalyptic world.
Prompt
facial-expressions Disappointment: Disappointment, regret ; A hero standing over a fallen villain; eye-level; Hero; a battlefield littered with debris and smoke, with the villain’s defeated form at the hero’s feet; cinematic
Characteristic
Shot : A warrior in a dark cloak kneels over the body of a fallen comrade in a post-apocalyptic cityscape
Aesthetic Score : 0.7
Mood : dark, dramatic, somber
Quality
Entropy : 6.87
Noise : 64
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some minor noise and compression artifacts are visible.
Anticipation and Warmth: A Family Meal Under Golden Light
A cozy scene unfolds with a family gathered around a table laden with food. The warm lighting creates an intimate atmosphere, hinting at a moment of shared joy and anticipation. The image evokes a sense of togetherness and the excitement of something special about to happen.
Prompt
facial-expressions Disappointment: Tension, estrangement ; A family gathered around a dinner table; eye-level; Normal People; a table set with a simple meal, but with an uncomfortable silence hanging in the air; cinematic
Characteristic
Shot : A group of people are gathered around a dinner table, lit by warm lamplight. The table is laden with food, giving the impression of a celebratory dinner.
Aesthetic Score : 0.6
Mood : intimate, cozy, contemplative
Quality
Entropy : 6.85
Noise : 67
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no noticeable artifacts or errors in the image.
The Gamer’s Focus: A Portrait of Intensity
A young man, bathed in the glow of multiple monitors, one emblazoned with the word ‘GAMER’, is locked in a battle of skill and concentration. The dimly lit room amplifies the tension, highlighting his determined expression and the intensity of his focus.
Prompt
facial-expressions Disappointment: Defeat, frustration ; A gamer staring at a game over screen; eye-level; Gamer; a darkened room with the glow of the monitor reflecting in their eyes, showing a game over message; cinematic
Characteristic
Shot : A young man is sitting at his computer, wearing headphones and typing on a keyboard. There are several monitors in the background, one of which displays the word ‘GAMER’.
Aesthetic Score : 0.6
Mood : focused, intense, determined
Quality
Entropy : 6.38
Noise : 42
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image appears to be slightly overexposed, and there is a slight amount of noise in the background. There are some artifacts in the background, which may be due to the lighting conditions.
Lost in the Rain: A Moment of Melancholy
A woman gazes out a window, her expression tinged with sadness, as rain falls softly outside. The soft lighting and blurred background create an intimate and introspective atmosphere, capturing a moment of quiet contemplation.
Prompt
facial-expressions Disappointment: Sadness, longing ; A woman standing at a window; eye-level; Single Person; a rainy day with the city streets blurred in the background; cinematic
Characteristic
Shot : A young woman is looking out of a window at a rainy city scene. The lighting is soft and the focus is on her face.
Aesthetic Score : 0.6
Mood : melancholy, contemplative, somber
Quality
Entropy : 6.85
Noise : 62
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image appears to be slightly grainy and the colors are a bit desaturated.
A Solitary Figure Contemplates the Majestic Landscape
A lone man stands on a mountain peak, bathed in sunlight, gazing out over a misty valley. The vastness of the scene evokes a sense of isolation and wonder, creating a serene and contemplative mood.
Prompt
facial-expressions Disappointment: Isolation, disillusionment ; A hero standing on a mountaintop; eye-level; Hero; a vast landscape stretching out before them, but with a sense of emptiness in the air; cinematic
Characteristic
Shot : A lone figure stands on a mountaintop, gazing out at a misty valley, with a dramatic backdrop of distant mountains.
Aesthetic Score : 0.7
Mood : serene, contemplative, vast
Quality
Entropy : 6.62
Noise : 59
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no noticeable artifacts or errors in the image.
Conclusion
The analysis of the generated image reveals mixed results:
- Camera Position: The model performed fairly well in capturing the intended camera position, scoring 0.15. This is slightly below the “good” range of 0.5 to 0.75, indicating some discrepancies between the prompt and the final image.
- Shot Analysis: The model demonstrated good understanding of the scene described in the prompt, achieving a score of 0.52. This falls within the “good” range, suggesting the model successfully translated the prompt’s description into a visually coherent scene.
- Aesthetic Analysis: The image’s aesthetic deviated significantly from the expected aesthetic, scoring -0.11. This score falls outside the “very good” range of -0.2 to 0.1, indicating a noticeable difference between the desired and actual aesthetic.
Overall, the model showed a good understanding of the scene and camera position, but struggled to achieve the desired aesthetic.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://www.freepik.com