AI Captures the Nuance of Human Emotion in Images with Midjourney
Facial expressions are a powerful form of communication, conveying a wide range of emotions and intentions. In artificial intelligence, generating realistic facial expressions is a significant step towards more engaging and believable virtual experiences. This blog post looks at how well a new AI model captures the nuances of human emotion in images. We will explore how the model handles the scene context, camera position, and aesthetic requested in each prompt, and whether the resulting images convey the intended emotions.
Created with: Midjourney
A Glimpse into Poverty: The Messy Reality of a Struggling Life
This image captures the somber reality of living in poverty. A messy bedroom, cluttered with belongings and trash, reveals the challenges of a struggling life. The man sleeping on the bed, bathed in the light from the window, evokes a sense of melancholy and mundane existence.
Prompt
Frustration Frustration, resignation: Overwhelmed and defeated ; A single person; eye-level; Single Persons; A cluttered apartment with overflowing laundry baskets and takeout containers.; cinematic
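Every prompt in this post follows the same semicolon-separated layout, so it can be split into parts mechanically. The sketch below is a minimal illustration of that idea; the field names (`category`, `emotions`, `mood`, `subject`, `camera`, `group`, `scene`, `style`) are assumptions inferred from the prompts shown here, not names used by Midjourney or by the original pipeline.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PromptParts:
    # All field names are hypothetical labels chosen for this example.
    category: str
    emotions: List[str]
    mood: str
    subject: str
    camera: str
    group: str
    scene: str
    style: str


def parse_prompt(prompt: str) -> PromptParts:
    """Split a prompt of the form
    'Category emotion, emotion: mood ; subject; camera; group; scene; style'.
    """
    fields = [part.strip() for part in prompt.split(";")]
    if len(fields) != 6:
        raise ValueError(f"expected 6 semicolon-separated fields, got {len(fields)}")
    head, subject, camera, group, scene, style = fields

    # The head holds the category, the emotion list, and the mood after a colon.
    emotion_part, _, mood = head.partition(":")
    category, _, emotion_list = emotion_part.strip().partition(" ")
    emotions = [e.strip() for e in emotion_list.split(",") if e.strip()]

    return PromptParts(category, emotions, mood.strip(), subject,
                       camera, group, scene, style)
```

Applied to the prompt above, this would yield `eye-level` as the camera position and `cinematic` as the style, which are the two fields the conclusion later scores the images against.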
Characteristic
Shot : A messy bedroom with a man sleeping on the bed. The room is cluttered with clothes, blankets, and other items. There is a window in the background and a trash can in the foreground.
Aesthetic Score : 0.2
Mood : gloomy, cluttered, tired
Quality
Entropy : 6.70
Noise : 102
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry in some areas, particularly the subject’s face.
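The post does not say how the Entropy figure is computed. One common definition, assumed here, is the Shannon entropy of the 8-bit grayscale intensity histogram, which ranges from 0 (a single flat tone) to 8 bits (all 256 levels equally likely); the values reported in this post, roughly 5.4 to 6.8, fit that range. The function name and the choice of grayscale input are assumptions for illustration only.

```python
import math
from collections import Counter
from typing import Iterable


def shannon_entropy(pixels: Iterable[int]) -> float:
    """Return the Shannon entropy, in bits, of a sequence of pixel intensities."""
    counts = Counter(pixels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no pixels given")
    entropy = 0.0
    for count in counts.values():
        p = count / total          # probability of this intensity level
        entropy -= p * math.log2(p)
    return entropy
```

Under this definition, a busier image with many distinct tones scores higher than a dark, low-contrast one, which is at least consistent with the cluttered bedroom above scoring 6.70. The pixel values would come from whichever image library is in use, after converting the image to grayscale.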
Superman: A Shadow in the Night
A brooding Superman stands in a dark alleyway, his cape billowing dramatically. The lighting and pose create a powerful and dramatic scene, hinting at the hero’s inner turmoil.
Prompt
Frustration Frustration, anger, determination: Powerless and angry ; A superhero; close-up; Heroes; A dark alley with flickering streetlights, the hero’s cape billowing in the wind.; cinematic
Characteristic
Shot : A man dressed as Superman standing in a dark alleyway, looking serious.
Aesthetic Score : 0.7
Mood : dramatic, intense, heroic
Quality
Entropy : 5.50
Noise : 81
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.30
Image errors : No significant image errors are visible.
Lost in Thought: A Man’s Pensive Journey on the Subway
A man in a suit sits alone on a crowded subway train, his serious expression and the low-key lighting creating an atmosphere of mystery and intrigue. The urban setting adds to the sense of isolation and contemplation, leaving the viewer to wonder about his thoughts and destination.
Prompt
Frustration Frustration, annoyance, desperation: Impatient and stressed ; A businessman; eye-level; Normal People; A crowded train with people pushing and shoving, the businessman trapped in the middle.; cinematic
Characteristic
Shot : A man in a suit is standing on a crowded subway train, looking out the window, while other passengers are around him.
Aesthetic Score : 0.7
Mood : serious, contemplative, urban
Quality
Entropy : 5.79
Noise : 77
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are some minor artifacts in the image, such as graininess in the shadows and some noise in the background.
Lost in the Digital Depths: A Portrait of Frustration
A young man sits hunched over his computer, his head in his hands, reflecting the melancholy mood of the dimly lit room. The harsh lighting and cluttered surroundings amplify the sense of loneliness and despair, hinting at a character grappling with inner turmoil.
Prompt
Frustration Frustration, concentration, determination: Focused but frustrated ; A gamer; close-up; Gamer; A dimly lit room with a computer screen displaying a frustratingly difficult level, the gamer’s hands shaking on the keyboard.; cinematic
Characteristic
Shot : A young man is sitting at a desk in a dimly lit room, looking dejected as he holds his head in his hands. There’s a computer monitor in front of him with a video game displayed on it. There’s also a keyboard and a plant in the background.
Aesthetic Score : 0.4
Mood : sad, lonely, frustrated
Quality
Entropy : 6.27
Noise : 84
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors
Lost in Thought: A Moment of Melancholy in Black and White
A solitary figure sits on a park bench, her face hidden by cascading hair. The black and white composition amplifies the sense of loneliness and introspection, capturing a poignant moment of contemplation.
Prompt
Frustration Frustration, sadness, despair: Lonely and isolated ; A young woman; eye-level; Single Persons; A deserted park bench, the woman staring blankly at the ground, her phone lying forgotten beside her.; cinematic
Characteristic
Shot : A young woman sits on a bench in a park, looking down with a thoughtful expression. The image is in black and white, creating a moody and atmospheric feel.
Aesthetic Score : 0.7
Mood : melancholy, introspective, contemplative
Quality
Entropy : 6.16
Noise : 105
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image appears slightly grainy, especially in the shadows. There is also some minor noise in the background.
Firefighter Bravely Faces the Blaze
A firefighter stands in a doorway, smoke and flames billowing behind them, capturing the intensity and danger of the situation. The image evokes a sense of courage and urgency, highlighting the bravery of those who fight fires.
Prompt
Frustration Frustration, fear, determination: Urgent and desperate ; A firefighter; close-up; Heroes; A burning building with smoke billowing out, the firefighter struggling to open a door.; cinematic
Characteristic
Shot : A firefighter in full gear is looking intently into the distance, possibly toward a burning building. The scene is filled with smoke and the building is barely visible.
Aesthetic Score : 0.7
Mood : intense, focused, dangerous
Quality
Entropy : 6.69
Noise : 108
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image appears to be slightly grainy and the color saturation is a little low. There is also some noise in the background.
The Quiet Focus of Study
A young woman finds peace and concentration amidst the gentle hum of a library. Soft, warm light bathes the scene, highlighting her dedication to her work. The atmosphere is calm and studious, inviting viewers to share in the tranquility.
Prompt
Frustration Frustration, confusion, anxiety: Overwhelmed and anxious ; A student; eye-level; Normal People; A crowded library with students hunched over books, the student staring at a blank page, their pen hovering over the paper.; cinematic
Characteristic
Shot : A young woman is studying in a library, leaning over a book and writing with a pen. Other people are studying in the background, blurred.
Aesthetic Score : 0.75
Mood : focused, studious, contemplative
Quality
Entropy : 6.84
Noise : 111
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : There is some slight grain in the image, and the background is somewhat blurry.
Immersed in the Game: A Moment of Intense Focus
A young man, bathed in vibrant red and blue light, sits captivated by his game. His focused gaze and tight grip on the controller reveal the intensity of his immersion, creating a sense of suspense and excitement.
Prompt
Frustration Frustration, concentration, determination: Focused and intense ; A gamer; close-up; Gamer; A brightly lit gaming tournament stage, the gamer staring at the screen, their controller gripped tightly in their hands.; cinematic
Characteristic
Shot : A young man is sitting in a chair and playing video games. He is looking directly at the camera, his eyes are wide, and he is holding a controller in his hands. He is lit by red and blue neon lights.
Aesthetic Score : 0.6
Mood : intense, focused, dramatic
Quality
Entropy : 5.39
Noise : 61
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : None
A Moment of Quiet Melancholy
A young woman, dressed in floral, leans against a cluttered kitchen counter, her head resting in her hand. The soft, natural light filtering through the window casts a gentle glow on the scene, highlighting the weariness in her posture and the sadness in her eyes. The stacks of plates and newspapers add a sense of burden and responsibility, amplifying the mood of quiet melancholy.
Prompt
Frustration Frustration, sadness, resignation: Exhausted and defeated ; A single mother; eye-level; Single Persons; A messy kitchen with dishes piled high in the sink, the single mother staring at a pile of bills, her shoulders slumped.; cinematic
Characteristic
Shot : A woman is sitting at a kitchen counter, leaning her head on her hand. She looks tired and troubled. A large pile of newspapers and dishes sit beside her. The lighting is soft and moody, creating a sense of intimacy and introspection.
Aesthetic Score : 0.7
Mood : melancholy, contemplative, introspective
Quality
Entropy : 6.43
Noise : 94
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable errors
A Doctor’s Intense Gaze: What’s He Seeing?
A doctor stands in a hospital room, his face etched with worry as he stares intently at something off-camera. Medical equipment fills the background, adding to the tense atmosphere. The lighting and the doctor’s expression create a palpable sense of suspense, leaving the viewer wondering what he’s witnessing.
Prompt
Frustration Frustration, worry, determination: Concerned and helpless ; A doctor; close-up; Heroes; A hospital room with a patient hooked up to machines, the doctor looking at a medical chart with a furrowed brow.; cinematic
Characteristic
Shot : A doctor is looking directly at the camera in a hospital room, lit by the blueish glow of medical equipment.
Aesthetic Score : 0.7
Mood : serious, intense, concerned
Quality
Entropy : 5.83
Noise : 61
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slight amount of noise and grain, particularly in the darker areas. There is also some color banding in the background.
Conclusion
The analysis shows that the generative AI model understood the scenes well and matched the intended aesthetic, but struggled with camera position. Here’s a breakdown:
- Camera Position: The model scored 0.3, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.58, which is considered good. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.17, which is considered very good. This means that the generated image’s aesthetic closely matched the expected aesthetic described in the prompt.
Overall, the model demonstrates a good understanding of the scene and shot composition, but needs improvement in accurately capturing the intended camera position. The aesthetic quality of the generated images is rated very good.
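The aggregate scores above are reported without their derivation. As a separate cross-check, the per-image Aesthetic Score and Prompt Clip Score listed in this post can be averaged directly. The sketch below does only that; it is not the method behind the 0.3, 0.58, and 0.17 figures, and the variable names are chosen for this example.

```python
from statistics import mean

# (title, Aesthetic Score, Prompt Clip Score), copied from the entries above.
scores = [
    ("A Glimpse into Poverty", 0.2, 0.21),
    ("Superman: A Shadow in the Night", 0.7, 0.22),
    ("Lost in Thought: Subway", 0.7, 0.26),
    ("Lost in the Digital Depths", 0.4, 0.21),
    ("Lost in Thought: Black and White", 0.7, 0.27),
    ("Firefighter Bravely Faces the Blaze", 0.7, 0.26),
    ("The Quiet Focus of Study", 0.75, 0.24),
    ("Immersed in the Game", 0.6, 0.28),
    ("A Moment of Quiet Melancholy", 0.7, 0.25),
    ("A Doctor's Intense Gaze", 0.7, 0.27),
]

mean_aesthetic = mean(aesthetic for _, aesthetic, _ in scores)
mean_clip = mean(clip for _, _, clip in scores)

# Identify the images with the lowest and highest aesthetic scores.
lowest = min(scores, key=lambda row: row[1])
highest = max(scores, key=lambda row: row[1])

print(f"mean aesthetic score: {mean_aesthetic:.3f}")
print(f"mean prompt CLIP score: {mean_clip:.3f}")
print(f"lowest aesthetic: {lowest[0]} ({lowest[1]})")
print(f"highest aesthetic: {highest[0]} ({highest[1]})")
```

Across the ten images this gives a mean aesthetic score of about 0.62 and a mean prompt CLIP score of about 0.25, with the cluttered bedroom scoring lowest on aesthetics and the library scene highest.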