AI Captures the Drama: Facial Expressions in Generated Images with Imagen-v2
Dramatic facial expressions are a powerful tool in storytelling, conveying a wide range of emotions and adding depth to characters. This experiment explored how well an AI model could capture these expressions in generated images. The results show promising progress: the model generally rendered the requested scene and emotion, although it was less reliable at reproducing the requested camera position. We’ll walk through specific examples, highlighting the model’s strengths and areas for improvement, and discuss the implications of this technology for the future of visual storytelling.
Created with: imagen-v2
Screaming in the Rain: A Moment of Raw Emotion
A woman with long hair, her face contorted in a scream, stands amidst a downpour. The background blurs, highlighting the intensity of her distress. The play of light and shadow adds to the dramatic tension, capturing a raw and powerful moment of emotion.
Prompt
facial-expressions Anger: Despair and rage ; A lone figure, standing in the middle of a deserted street; eye-level; Single Person; Rain pouring down, streetlights casting long shadows; cinematic
Characteristic
Shot : A woman with long, wet hair is screaming in the rain. The background is blurred and out of focus, with only a few lights visible.
Aesthetic Score : 0.6
Mood : intense, dramatic, desperate
Quality
Entropy : 6.60
Noise : 98
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some artifacts, particularly in the woman’s hair. The background is also slightly blurry.
Superman: A Hero Rises from the Ashes
In a post-apocalyptic world shrouded in smoke, Superman stands defiant, his iconic suit a beacon of hope. The dramatic lighting and his powerful pose hint at an impending battle, promising a story of heroism and resilience.
Prompt
facial-expressions Anger: Fury and determination ; A superhero, fists clenched, facing down a horde of villains; eye-level; Hero; A crumbling cityscape, smoke and debris filling the air; cinematic
Characteristic
Shot : A close-up of Superman standing in a destroyed city with a cloudy sky behind him. He looks angry, with his fist clenched.
Aesthetic Score : 0.7
Mood : dramatic, intense, angry
Quality
Entropy : 6.64
Noise : 50
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image is slightly blurry, particularly the background. There are some artifacts in the muscle definition of the subject’s arms and shoulders.
On the Verge of Explosion: Man’s Frustration Reaches Boiling Point
A man sits at a cluttered desk, his face contorted in anger, his fist clenched tight. The scene speaks volumes of mounting frustration and a temper on the brink of eruption. The image captures the raw intensity of a moment teetering on the edge of chaos.
Prompt
facial-expressions Anger: Frustration and rage ; A man, slamming his fist on a table, surrounded by scattered papers; eye-level; Normal Person; A cluttered office, with a window showing a stormy sky; cinematic
Characteristic
Shot : A man is sitting at a desk with papers scattered around him. He looks angry and is clenching his fist.
Aesthetic Score : 0.3
Mood : angry, tense, dramatic
Quality
Entropy : 6.61
Noise : 74
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has a few artifacts, particularly around the man’s hand and face. The overall image is a little blurry, which is distracting.
Caught in the Act: A Moment of Frustration and Anger
A young man sits in a dimly lit room, his face contorted in a mixture of surprise and anger. His hands are raised in the air, as if caught in the midst of a heated moment. The scene is punctuated by several cans of soda, hinting at a long night of frustration. The image captures a raw emotion, leaving the viewer to wonder what triggered this outburst.
Prompt
facial-expressions Anger: Frustration and rage ; A gamer, throwing his headset on the floor, surrounded by empty energy drink cans; eye-level; Gamer; A dimly lit room, with a computer screen displaying a game in progress; cinematic
Characteristic
Shot : A man is sitting at a desk with a headset on. He is looking at the camera with a shocked expression. There are three cans of soda and the headset on the desk in front of him. The background is a blurry image of a room with a computer monitor and other furniture.
Aesthetic Score : 0.6
Mood : intense, shocked, frustrated
Quality
Entropy : 6.10
Noise : 78
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image has some minor artifacts around the edges of the man’s hair and around the cans of soda.
Screaming in the Dark: A Moment of Terror Captured
A woman’s face contorted in a silent scream, illuminated by a single, harsh light source. The darkness surrounding her amplifies the intensity of her fear, creating a palpable sense of suspense and dread.
Prompt
facial-expressions Anger: Despair and rage ; A woman, screaming into the void, her face contorted in anger; close-up; Single Person; A dark, empty room, with only a single flickering light; cinematic
Characteristic
Shot : A close-up shot of a woman with long brown hair screaming in a dark room, lit with a single light source, creating shadows on her face.
Aesthetic Score : 0.6
Mood : intense, fearful, dramatic
Quality
Entropy : 5.94
Noise : 93
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image appears to be slightly over-sharpened, leading to some artifacts around the edges of the woman’s face and hair.
Facing the Flames: A Man of Power in a Moment of Crisis
A figure cloaked in darkness, a man in a futuristic suit stands defiant against a backdrop of fire and smoke. The intensity of the moment is palpable, hinting at a struggle for survival or a battle against overwhelming odds. The blurred background adds to the sense of urgency, leaving the viewer to wonder what lies ahead for this enigmatic figure.
Prompt
facial-expressions Anger: Anger and determination ; A hero, standing on a rooftop, overlooking a city in flames; eye-level; Hero; A fiery inferno engulfing the city, with smoke billowing into the sky; cinematic
Characteristic
Shot : A man in a dark suit, possibly a superhero, stands in front of a massive inferno, looking intense and determined.
Aesthetic Score : 0.7
Mood : dramatic, intense, heroic
Quality
Entropy : 6.78
Noise : 86
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.80
Image errors : The flames in the background are somewhat blurry and lacking in detail, possibly due to image compression or over-processing. The man’s costume could be more textured and detailed, particularly in the leather areas.
Dinner Gone Wrong: Couple’s Heated Argument Explodes
A tense scene unfolds as a couple’s dinner conversation turns into a heated argument. The woman’s anger is palpable, while the man tries to defend himself. Close-up shots capture the raw emotion and tension in this dramatic moment.
Prompt
facial-expressions Anger: Frustration and rage ; A couple, arguing in a crowded restaurant, their voices raised in anger; eye-level; Normal People; A bustling restaurant, with other diners looking on; cinematic
Characteristic
Shot : A couple is having a heated argument at a restaurant. The woman is yelling at the man, and the other diners are out of focus in the background.
Aesthetic Score : 0.6
Mood : tense, dramatic, angry
Quality
Entropy : 6.72
Noise : 109
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry, the lighting is a bit too dark, and the white balance seems a bit off.
Rage Unleashed: A Man’s Scream Echoes in the Darkness
A raw and powerful image captures a man consumed by anger, his face contorted in a scream. The blurry background adds to the intensity, suggesting a moment of chaos and urgency. The image evokes a sense of unfiltered emotion and leaves a lasting impression.
Prompt
facial-expressions Anger: Frustration and rage ; A gamer, smashing his keyboard in a fit of rage; close-up; Gamer; A dimly lit room, with a computer screen displaying a game over screen; cinematic
Characteristic
Shot : A close-up portrait of a man with a fierce expression. His mouth is wide open, and his eyes are wild, suggesting intense emotion.
Aesthetic Score : 0.6
Mood : intense, dramatic, angry
Quality
Entropy : 6.02
Noise : 80
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is a slight blurriness in the image, which may be due to camera shake or the subject’s movement. The contrast could be better.
Caught in the Storm: A Scream of Agony
A close-up shot of a man’s face, contorted in a silent scream, as rain streaks across the screen. The image is unsettling and dramatic, capturing a moment of intense emotional turmoil.
Prompt
facial-expressions Anger: Despair and rage ; A man, standing in the rain, his face obscured by the downpour; eye-level; Single Person; A dark, deserted street, with only the sound of rain and thunder; cinematic
Characteristic
Shot : A close-up of a man’s face. His eyes are closed and he is screaming. He appears to be in a shower or heavy rain.
Aesthetic Score : 0.4
Mood : intense, dramatic, agonized
Quality
Entropy : 6.24
Noise : 110
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.50
Image errors : The image contains a lot of noise and grain, and the rain effect appears repetitive and unrealistic.
One Warrior Stands Amidst the Ashes
A lone warrior, clad in leather and a tattered yellow coat, stands defiant amidst a battlefield ravaged by war. Flames lick at the horizon, casting an ominous glow on the scene. The contrast between his determined stance and the surrounding devastation creates a powerful and dramatic image.
Prompt
facial-expressions Anger: Anger and determination ; A hero, standing on a battlefield, surrounded by fallen enemies; eye-level; Hero; A battlefield littered with bodies, with smoke and dust filling the air; cinematic
Characteristic
Shot : A lone warrior stands amidst a battlefield littered with bodies, with fire and smoke in the background.
Aesthetic Score : 0.6
Mood : dark, intense, dramatic
Quality
Entropy : 6.67
Noise : 62
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image appears to be slightly over-sharpened, and the background is somewhat blurry.
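The ten prompts above appear to share one semicolon-separated layout: a topic and emotion label, then subject, camera position, character type, setting, and style. The following Python sketch splits a prompt along those lines so that the requested camera position can be compared with the "Shot" description. The field names (`topic`, `nuance`, `persona`, and so on) are labels assumed for this illustration, not terms defined by the experiment or by Imagen.

```python
from dataclasses import dataclass


@dataclass
class PromptParts:
    # All field names are hypothetical labels chosen for this sketch.
    topic: str     # e.g. "facial-expressions"
    emotion: str   # e.g. "Anger"
    nuance: str    # e.g. "Despair and rage"
    subject: str   # who is shown and what they are doing
    camera: str    # e.g. "eye-level" or "close-up"
    persona: str   # e.g. "Single Person", "Hero", "Gamer"
    setting: str   # background and lighting
    style: str     # e.g. "cinematic"


def parse_prompt(prompt: str) -> PromptParts:
    """Split a prompt into its six ';'-separated fields."""
    fields = [part.strip() for part in prompt.split(";")]
    if len(fields) != 6:
        raise ValueError(f"expected 6 ';'-separated fields, got {len(fields)}")
    head, subject, camera, persona, setting, style = fields
    # The first field looks like "<topic> <emotion>: <nuance>".
    label, _, nuance = head.partition(":")
    topic, _, emotion = label.strip().partition(" ")
    return PromptParts(topic, emotion.strip(), nuance.strip(),
                       subject, camera, persona, setting, style)


if __name__ == "__main__":
    example = (
        "facial-expressions Anger: Despair and rage ; A lone figure, "
        "standing in the middle of a deserted street; eye-level; "
        "Single Person; Rain pouring down, streetlights casting long "
        "shadows; cinematic"
    )
    parts = parse_prompt(example)
    print(parts.camera, "|", parts.persona, "|", parts.style)
```

Parsed this way, eight of the ten prompts request an eye-level camera and two request a close-up, which gives a concrete baseline to check each "Shot" description against.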
Conclusion
The results show that the generative AI model understood the described scenes well but struggled to reproduce the requested camera position. Here’s a breakdown:
- Camera Position: The model scored 0.3, which is below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.62, which is considered good. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.24, which is considered very good. This means that the generated image’s aesthetic closely matched the expected aesthetic described in the prompt.
Overall, the model demonstrates a good understanding of the scene and shot composition, but needs improvement in accurately capturing the intended camera position. The aesthetic of the generated image is very close to the expected aesthetic.
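As a complement to the breakdown above, the per-image numbers listed in this post can be averaged with a short script. This is a minimal sketch: the values are copied from the ten listings, while the tuple layout, the field names, and the shortened titles are assumptions made for the example. The post does not say how the three summary scores in the conclusion were derived, so these averages are a separate summary and not a reproduction of them.

```python
from statistics import mean

# Values copied from the ten image listings in this post.
# Tuple layout (assumed for this sketch):
# (title, aesthetic, entropy, noise, prompt_clip, ai_likelihood)
RESULTS = [
    ("Screaming in the Rain",     0.6, 6.60,  98, 0.26, 0.10),
    ("Superman",                  0.7, 6.64,  50, 0.24, 0.90),
    ("On the Verge of Explosion", 0.3, 6.61,  74, 0.34, 0.90),
    ("Caught in the Act",         0.6, 6.10,  78, 0.31, 0.30),
    ("Screaming in the Dark",     0.6, 5.94,  93, 0.27, 0.30),
    ("Facing the Flames",         0.7, 6.78,  86, 0.23, 0.80),
    ("Dinner Gone Wrong",         0.6, 6.72, 109, 0.31, 0.10),
    ("Rage Unleashed",            0.6, 6.02,  80, 0.31, 0.20),
    ("Caught in the Storm",       0.4, 6.24, 110, 0.28, 0.50),
    ("One Warrior Stands",        0.6, 6.67,  62, 0.23, 0.20),
]

FIELDS = ("aesthetic", "entropy", "noise", "prompt_clip", "ai_likelihood")


def summarize(rows):
    """Return the mean of each metric column across all rows."""
    columns = zip(*(row[1:] for row in rows))
    return {name: mean(col) for name, col in zip(FIELDS, columns)}


def extremes(rows, field):
    """Return the titles with the lowest and highest value for a metric."""
    idx = FIELDS.index(field) + 1  # +1 skips the title column
    ordered = sorted(rows, key=lambda row: row[idx])
    return ordered[0][0], ordered[-1][0]


if __name__ == "__main__":
    for name, value in summarize(RESULTS).items():
        print(f"{name:>14}: {value:.3f}")
    print("lowest / highest aesthetic:", extremes(RESULTS, "aesthetic"))
```

Across the ten images, the mean Aesthetic Score is 0.57 and the mean Prompt Clip Score is about 0.28. The office scene ("On the Verge of Explosion") has both the lowest Aesthetic Score (0.3) and the highest Prompt Clip Score (0.34).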
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://deepmind.google/technologies/imagen-2/