AI's Mixed Bag: Capturing Emotion in Images with Dall-e-3
In AI-generated imagery, capturing the nuances of human facial expressions remains a significant challenge. Image models can now produce impressive scenes, but conveying subtle emotion through pixels still needs refinement. This blog post presents a case study in which dall-e-3 was asked to generate ten images expressing different shades of disappointment, revealing both successes and limitations in its ability to convey the desired emotional range.
Created with: dall-e-3
Lost in the City Lights
A solitary figure in a suit stands amidst the vibrant chaos of a bustling city street at night. The man’s downcast gaze and the blurred background evoke a sense of melancholy and isolation, highlighting the loneliness that can be found even in the heart of urban life.
Prompt
facial-expressions Disappointment: Melancholy, isolation ; A lone figure; eye-level; Single Person; a bustling city street at night, with neon signs and blurred lights; cinematic
Characteristic
Shot : A man in a suit is walking down a rainy street in a city at night. The street is lined with neon signs. The man looks sad.
Aesthetic Score : 0.7
Mood : melancholy, urban, lonely
Quality
Entropy : 6.53
Noise : 96
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some minor artifacts, particularly in the neon signs. The man’s face is also a little bit blurry.
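The post does not say how the Entropy value under Quality is computed. One common reading, assumed here, is the Shannon entropy of the 8-bit grayscale histogram, which runs from 0 bits (a single flat tone) to 8 bits (every intensity equally likely). A minimal sketch under that assumption, where `shannon_entropy` is a hypothetical helper name and not part of the post's tooling:

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a sequence of 8-bit grayscale values.

    Assumption: the post's Entropy metric is histogram-based; its actual
    formula is not documented.
    """
    total = len(pixels)
    if total == 0:
        raise ValueError("pixels must not be empty")
    counts = Counter(pixels)
    # H = sum(p * log2(1 / p)) over the intensities that actually occur.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


flat = [128] * 1000         # one tone only: no information
uniform = list(range(256))  # every intensity exactly once: maximum spread
print(shannon_entropy(flat))     # 0.0
print(shannon_entropy(uniform))  # 8.0
```

If that reading is right, the values between roughly 6.1 and 7.0 reported in this post would point to images with a fairly wide spread of tones.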
Even Heroes Cry: A Moment of Melancholy at Sunset
A powerful image captures the emotional weight of heroism. A superhero, cloaked in red and masked, stands against a breathtaking sunset cityscape, tears streaming down their face. The contrast between the beauty of the scene and the hero’s sadness creates a powerful sense of drama and melancholy.
Prompt
facial-expressions Disappointment: Defeated, disillusioned ; A superhero standing on a rooftop; eye-level; Hero; a cityscape bathed in the orange glow of a setting sun, with the hero’s cape billowing in the wind; cinematic
Characteristic
Shot : A superhero figure in a red cape stands against the silhouette of a city skyline at sunset. The figure is wearing a black mask and has a tear running down their cheek.
Aesthetic Score : 0.6
Mood : melancholy, dramatic, heroic
Quality
Entropy : 6.29
Noise : 79
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.90
Image errors : There is a slight blurriness around the edges of the figure and the cityscape, which may be due to the blending of different elements in the image. The lighting on the figure’s face appears somewhat unnatural, and the muscle detail on the figure is not well rendered and looks slightly exaggerated.
A Moment of Quiet Despair
A woman, shrouded in sadness, sits amidst a table of unwashed dishes and a forgotten bottle. The low light and cluttered scene amplify her loneliness, creating a poignant image of melancholic reflection.
Prompt
facial-expressions Disappointment: Hopelessness, resignation ; A woman sitting at a kitchen table; eye-level; Normal Person; a cluttered kitchen with dirty dishes and a half-eaten meal; cinematic
Characteristic
Shot : A woman in a headscarf sits at a table with empty plates of food. She has her head in her hands and looks sad. The scene is dimly lit and the table is cluttered with dirty dishes, suggesting a sense of poverty or hardship.
Aesthetic Score : 0.6
Mood : sad, somber, desolate
Quality
Entropy : 6.47
Noise : 86
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors in the image.
Lost in the Game: A Man’s Intense Focus Under Blue Light
A dimly lit room, bathed in blue and yellow hues, reveals a man engrossed in his computer screen. His intense focus and the mysterious atmosphere suggest a thrilling gaming session. The dramatic lighting adds to the scene’s captivating mood.
Prompt
facial-expressions Disappointment: Frustration, anger ; A gamer sitting in front of a computer screen; eye-level; Gamer; a dimly lit room with flashing lights and the glow of the monitor reflecting in their eyes; cinematic
Characteristic
Shot : A young man with a beard is staring intently at the camera, lit by blue spotlights. There is a dark background and what appears to be a gaming monitor in the background.
Aesthetic Score : 0.7
Mood : intense, focused, mysterious
Quality
Entropy : 6.27
Noise : 71
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : The lighting is a bit harsh and creates some uneven shadows on the man’s face. The background is very dark and blurry.
Lost in the City’s Shadow
A solitary figure walks through a deserted urban landscape, bathed in the melancholic glow of streetlights. The towering skyscrapers in the distance create a sense of overwhelming isolation, while the dramatic lighting and perspective amplify the feeling of loneliness and mystery.
Prompt
facial-expressions Disappointment: Loneliness, despair ; A man walking down a deserted street; eye-level; Single Person; a street lined with closed shops and flickering streetlights; cinematic
Characteristic
Shot : A lonely, dark alley in a city at night, lit by streetlamps, with a lone figure walking down the middle of the road, and two silhouettes in the distance.
Aesthetic Score : 0.7
Mood : solitude, urban, melancholic
Quality
Entropy : 6.68
Noise : 117
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.90
Image errors : Some minor aliasing artifacts on the edges of the buildings.
Lone Survivor: A Battlefield of Despair
A solitary figure stands amidst the carnage of a battlefield, their presence a stark contrast to the surrounding devastation. Smoke and dust hang heavy in the air, amplifying the sense of loss and chaos. The image evokes a powerful sense of isolation and the weight of the aftermath of battle.
Prompt
facial-expressions Disappointment: Disappointment, regret ; A hero standing over a fallen villain; eye-level; Hero; a battlefield littered with debris and smoke, with the villain’s defeated form at the hero’s feet; cinematic
Characteristic
Shot : A lone figure in a military uniform stands on a battlefield, surrounded by the bodies of fallen soldiers. There are signs of smoke in the air, suggesting a recent battle.
Aesthetic Score : 0.7
Mood : dramatic, somber, gritty
Quality
Entropy : 6.97
Noise : 111
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image is slightly blurry and the details are not as sharp as they could be. There are some artifacts visible in the background, especially around the smoke and the bodies.
A Family’s Silent Sorrow: A Portrait of Unseen Pain
A dimly lit room, a somber family gathered around a dinner table, and a haunting portrait of a young boy on the wall. This scene whispers of unspoken tension and a melancholic atmosphere, leaving the viewer to ponder the family’s hidden story.
Prompt
facial-expressions Disappointment: Tension, estrangement ; A family gathered around a dinner table; eye-level; Normal People; a table set with a simple meal, but with an uncomfortable silence hanging in the air; cinematic
Characteristic
Shot : A family sits at a dinner table, looking somber and contemplative. A large painting of a young boy’s face, also with a melancholic expression, dominates the background.
Aesthetic Score : 0.7
Mood : melancholy, somber, pensive
Quality
Entropy : 6.92
Noise : 97
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is a slight blurriness around the edges of the painting, which could be a result of the original image being cropped or resized.
The Game is Over, But the Story Continues
A woman in a hijab, headphones on, stares intently at a computer screen displaying the stark message ‘Game Over’. The low lighting and close-up shot create a sense of dramatic tension, leaving the viewer to wonder what led to this moment and what awaits her next.
Prompt
facial-expressions Disappointment: Defeat, frustration ; A gamer staring at a game over screen; eye-level; Gamer; a darkened room with the glow of the monitor reflecting in their eyes, showing a game over message; cinematic
Characteristic
Shot : A woman wearing a hijab and headphones is looking at a computer screen, with ‘GAME OVER’ displayed on it.
Aesthetic Score : 0.6
Mood : intense, dramatic, defeat
Quality
Entropy : 6.11
Noise : 79
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.40
Image errors : There is a slight blur in the background and some noise on the screen, but no major artifacts.
Lost in the City Lights
A woman in a hijab gazes out a rain-streaked window, her reflection mirroring the melancholy mood of the city lights. The scene evokes a sense of loneliness and introspection, capturing a moment of quiet contemplation.
Prompt
facial-expressions Disappointment: Sadness, longing ; A woman standing at a window; eye-level; Single Person; a rainy day with the city streets blurred in the background; cinematic
Characteristic
Shot : A woman in a hijab is looking out of a window at the city lights during a rainy night. The window is covered in rain droplets. The woman’s expression is sad and reflective.
Aesthetic Score : 0.7
Mood : sad, melancholic, contemplative
Quality
Entropy : 6.82
Noise : 114
Prompt Clip Score : 0.20
AI Evaluation
Likelihood of AI : 0.80
Image errors : Slight blurriness on the woman’s face, especially on the eyes. The rain droplets on the window appear slightly artificial.
Lost in the Vastness: A Man’s Solitary Gaze
A lone figure stands on a windswept hill, his long coat billowing in the breeze. His intense gaze pierces through the hazy distance, leaving a sense of mystery and isolation. The vast, empty landscape amplifies his solitude, creating a dramatic and evocative scene.
Prompt
facial-expressions Disappointment: Isolation, disillusionment ; A hero standing on a mountaintop; eye-level; Hero; a vast landscape stretching out before them, but with a sense of emptiness in the air; cinematic
Characteristic
Shot : A man in a long coat stands in a field with hills and a cloudy sky in the background.
Aesthetic Score : 0.7
Mood : mysterious, lonely, contemplative
Quality
Entropy : 6.58
Noise : 97
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image appears slightly blurry and there is some noise in the background. The man’s nose looks a bit unnatural.
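To see the ten entries at a glance, the per-image values listed above can be averaged. This is only a summary sketch: the lists are copied from the entries in order of appearance, and the variable names are illustrative.

```python
from statistics import mean

# Values copied from the ten image entries above, in order of appearance.
aesthetic = [0.7, 0.6, 0.6, 0.7, 0.7, 0.7, 0.7, 0.6, 0.7, 0.7]
clip_score = [0.24, 0.27, 0.27, 0.22, 0.21, 0.23, 0.24, 0.24, 0.20, 0.22]
ai_likelihood = [0.90, 0.90, 0.20, 0.20, 0.90, 0.70, 0.20, 0.40, 0.80, 0.70]

metrics = {
    "Aesthetic Score": aesthetic,
    "Prompt Clip Score": clip_score,
    "Likelihood of AI": ai_likelihood,
}

# Round to three decimals so floating-point noise does not clutter the output.
summary = {name: round(mean(values), 3) for name, values in metrics.items()}

for name, value in summary.items():
    print(f"{name}: {value}")
```

The averages come out at about 0.67 for Aesthetic Score, 0.234 for Prompt Clip Score, and 0.59 for Likelihood of AI.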
Conclusion
The analysis of the generated images reveals mixed results:
- Camera Position: The model scored 0.15 for matching the intended camera position. This is well below the “good” range of 0.5 to 0.75, indicating that the camera position in the generated images does not closely match the prompt’s description.
- Shot Analysis: The model demonstrated moderate success in understanding the scene described in the prompt, scoring 0.525. This falls within the “good” range, suggesting that the generated images capture key elements of the scene but are not entirely accurate.
- Aesthetic Analysis: The model scored about -0.1 (reported as -0.09999999999999996, a floating-point rounding artifact). This lies inside the “very good” range of -0.2 to 0.1, indicating only a small difference between the expected aesthetic and the actual aesthetic of the generated images.
Overall, the model shows strengths in understanding the scene and in achieving the desired aesthetic, but it falls short in reproducing the requested camera position.
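The comparison of each score against its quoted range can be checked mechanically. The sketch below uses the three scores and the two ranges given in the conclusion. It assumes the “good” range of 0.5 to 0.75 applies to Shot Analysis as well, since the post quotes it only for Camera Position, and `in_range` is a hypothetical helper.

```python
def in_range(score, low, high):
    """Return True when score lies inside the closed interval [low, high]."""
    return low <= score <= high


# (label, score, (low, high)) taken from the conclusion above.
# Assumption: Shot Analysis shares the 0.5 to 0.75 "good" range.
results = [
    ("Camera Position", 0.15, (0.5, 0.75)),
    ("Shot Analysis", 0.525, (0.5, 0.75)),
    ("Aesthetic Analysis", -0.09999999999999996, (-0.2, 0.1)),
]

for label, score, (low, high) in results:
    verdict = "inside" if in_range(score, low, high) else "outside"
    # Rounding hides floating-point noise such as -0.09999999999999996.
    print(f"{label}: {round(score, 3)} is {verdict} [{low}, {high}]")
```

Run as written, this reports Camera Position as outside its range and the other two scores as inside theirs.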