AI's Struggle with Dramatic Facial Expressions with Stable-diffusion
- 10 minutes read - 1934 wordsTable of Contents
Dramatic facial expressions are a powerful tool in storytelling, capable of conveying a wide range of emotions and adding depth to characters. From the iconic scream of Edvard Munch’s ‘The Scream’ to the intense expressions of actors in film and theater, dramatic facial expressions have the power to captivate and move audiences. However, replicating these expressions in AI-generated images remains a challenge. This blog post explores the limitations of current AI models in capturing dramatic facial expressions and discusses potential solutions for improving their ability to create realistic and impactful imagery.
Created with: stability-ai-core
Lost in the Rain: A Man’s Solitary Journey
A hooded figure stands alone in a deserted city street, bathed in the soft glow of distant streetlights. The rain falls steadily, mirroring the man’s somber mood and amplifying his sense of isolation. This evocative scene captures a moment of profound loneliness and melancholy.
Prompt
facial-expressions Anger: Despair and rage ; A lone figure, standing in the middle of a deserted street; eye-level; Single Person; Rain pouring down, streetlights casting long shadows; cinematic
Characteristic
Shot : A man in a black raincoat stands alone in a rainy city street. The street is wet and glistening. The man’s face is partially obscured by his hood. The city is dimly lit with a few streetlights.
Aesthetic Score : 0.7
Mood : gloomy, mysterious, lonely
Quality
Entropy : 6.81
Noise : 84
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.30
Image errors : Some noise is visible, particularly in the rain and the shadows.
Hero Stands Tall Amidst the Flames
A costumed superhero, radiating determination, stands amidst a burning cityscape. The fiery backdrop and the flaming skull evoke a sense of danger and urgency, while the hero’s pose suggests strength and hope in the face of chaos.
Prompt
facial-expressions Anger: Fury and determination ; A superhero, fists clenched, facing down a horde of villains; eye-level; Hero; A crumbling cityscape, smoke and debris filling the air; cinematic
Characteristic
Shot : A superhero in a patriotic costume stands in a destroyed city, surrounded by flames and rubble. There is a fiery skull in the sky and other figures in the background.
Aesthetic Score : 0.6
Mood : intense, dramatic, heroic
Quality
Entropy : 6.88
Noise : 81
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry, especially in the background. The lighting is a little flat, making the image appear somewhat unrealistic.
The Weight of Expectations: A Man Crumbles Under Pressure
A businessman, trapped in a sea of paperwork, struggles to contain his mounting frustration. His clenched fist and tense expression speak volumes about the overwhelming stress he faces in this demanding office environment.
Prompt
facial-expressions Anger: Frustration and rage ; A man, slamming his fist on a table, surrounded by scattered papers; eye-level; Normal Person; A cluttered office, with a window showing a stormy sky; cinematic
Characteristic
Shot : A man in a suit sits at a desk in an office, his fist clenched on the desk and a look of frustration on his face. The desk is covered in papers and documents, and there are more papers stacked up on a desk in the background.
Aesthetic Score : 0.6
Mood : frustrated, overwhelmed, anger
Quality
Entropy : 6.78
Noise : 67
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is a slight color cast and the image is a bit blurry.
The Moment He Won It All: Gamer’s Surprise Captured
A young gamer, fueled by energy drinks and adrenaline, sits amidst a battlefield of empty cans, his face a picture of pure surprise. This image captures the intensity and excitement of a crucial gaming moment.
Prompt
facial-expressions Anger: Frustration and rage ; A gamer, throwing his headset on the floor, surrounded by empty energy drink cans; eye-level; Gamer; A dimly lit room, with a computer screen displaying a game in progress; cinematic
Characteristic
Shot : A man is sitting on the floor in front of a gaming setup, wearing a headset and looking surprised. There are many energy drink cans scattered around him and a game controller lying on the floor.
Aesthetic Score : 0.5
Mood : intense, focused, surprised
Quality
Entropy : 6.55
Noise : 73
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image seems to have a slight color cast, with the overall lighting being a bit too warm.
Screaming in the Dark: A Moment of Raw Emotion
A woman’s face contorted in anger, illuminated by a single light source in the darkness. This close-up shot captures a moment of intense, dramatic fear, leaving the viewer feeling both captivated and unsettled.
Prompt
facial-expressions Anger: Despair and rage ; A woman, screaming into the void, her face contorted in anger; close-up; Single Person; A dark, empty room, with only a single flickering light; cinematic
Characteristic
Shot : A close-up shot of a woman’s face, she is screaming, her mouth is open wide, her eyes are wide with fear. She is wearing a casual shirt and her hair is messy, indicating she is in distress. The background is blurred and the lighting is dim and moody.
Aesthetic Score : 0.6
Mood : intense, dramatic, anxious
Quality
Entropy : 6.12
Noise : 57
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors, the image quality is good and the details are clear.
Man Stands Amidst Apocalyptic Inferno
A lone figure in a long coat surveys a city consumed by flames. Smoke billows around him, creating a dramatic and intense scene of chaos and destruction. The man’s stoic expression adds to the apocalyptic mood, leaving viewers to wonder what fate awaits him.
Prompt
facial-expressions Anger: Anger and determination ; A hero, standing on a rooftop, overlooking a city in flames; eye-level; Hero; A fiery inferno engulfing the city, with smoke billowing into the sky; cinematic
Characteristic
Shot : A man in a dark coat stands on a rooftop with fire raging all around him, in the background, a city is burning, with smoke and flames filling the air.
Aesthetic Score : 0.6
Mood : intense, dramatic, apocalyptic
Quality
Entropy : 6.82
Noise : 79
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.70
Image errors : There are some artifacts in the fire and smoke, making the image less realistic.
Yelling Match: Tensions Flare at Restaurant Table
A heated argument unfolds at a restaurant table, with one man’s outburst drawing the attention of his companions. The scene is charged with intensity and drama, leaving viewers wondering what sparked the conflict.
Prompt
facial-expressions Anger: Frustration and rage ; A couple, arguing in a crowded restaurant, their voices raised in anger; eye-level; Normal People; A bustling restaurant, with other diners looking on; cinematic
Characteristic
Shot : A group of people are sitting at a table in a restaurant. One man is yelling at the table, and the other people are looking at him with concern and surprise.
Aesthetic Score : 0.4
Mood : tense, dramatic, agitated
Quality
Entropy : 6.51
Noise : 68
Prompt Clip Score : 0.36
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major errors are present, but the image is slightly overexposed, causing some details to be lost in the highlights.
Frustration at the Keyboard: A Man’s Moment of Despair
A dimly lit room, a man hunched over his computer, his face contorted in frustration. This photo captures the raw emotion of a moment of intense struggle, leaving the viewer to wonder what has driven him to this point.
Prompt
facial-expressions Anger: Frustration and rage ; A gamer, smashing his keyboard in a fit of rage; close-up; Gamer; A dimly lit room, with a computer screen displaying a game over screen; cinematic
Characteristic
Shot : A man is shown sitting in front of a computer, clearly frustrated, and shouting. His face is close to the camera. The scene is set in a dimly lit room, possibly a home office, with a computer screen in the background.
Aesthetic Score : 0.6
Mood : intense, frustrated, angry
Quality
Entropy : 6.23
Noise : 63
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : No notable artifacts or errors.
Lost in the Rain: A Man’s Solitary Journey
A figure shrouded in darkness, a black raincoat shielding him from the relentless rain. The city streets glisten under the low-key lighting, amplifying the sense of isolation and mystery surrounding this solitary figure. The dramatic effect of the scene evokes a sense of intrigue, leaving the viewer wondering about the man’s story and destination.
Prompt
facial-expressions Anger: Despair and rage ; A man, standing in the rain, his face obscured by the downpour; eye-level; Single Person; A dark, deserted street, with only the sound of rain and thunder; cinematic
Characteristic
Shot : A man walking through a rainy city street, holding a black umbrella over his head. The image is shot from a low angle, with the focus on the man’s face and the dark tones of the rain-soaked environment.
Aesthetic Score : 0.7
Mood : mysterious, moody, brooding
Quality
Entropy : 6.86
Noise : 82
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : Slight noise in the image, some of the raindrops look pixelated
Sole Survivor: A Warrior’s Lonely Victory
A lone figure, clad in futuristic armor, strides through a battlefield ravaged by smoke and fire. The weight of victory hangs heavy as they walk past fallen comrades, a testament to the intensity and drama of the conflict.
Prompt
facial-expressions Anger: Anger and determination ; A hero, standing on a battlefield, surrounded by fallen enemies; eye-level; Hero; A battlefield littered with bodies, with smoke and dust filling the air; cinematic
Characteristic
Shot : A lone, powerful figure strides through a battlefield strewn with fallen soldiers. The backdrop is a city in the throes of a cataclysmic event, with billowing smoke and fire.
Aesthetic Score : 0.7
Mood : epic, dramatic, bleak
Quality
Entropy : 6.79
Noise : 76
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no noticeable errors in the image, and the details are well rendered.
Conclusion
The analysis shows that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.15, indicating a fairly weak ability to accurately represent the camera position described in the prompt. This suggests the generated image may not have the intended perspective or angle.
- Shot Analysis: The model scored 0.44, indicating a good understanding of the scene described in the prompt. This means the generated image likely captured the overall composition and elements of the scene as intended.
- Aesthetic Analysis: The model scored 0.21, indicating a moderate deviation from the expected aesthetic. This suggests the generated image may not have the desired style, color palette, or overall visual feel.
Overall: While the model demonstrated a good understanding of the scene and its composition, it struggled to accurately represent the camera position and achieve the desired aesthetic.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://stability.ai