AI's Facial Expressions: A Mixed Bag of Success with Midjourney
- 10 minutes read - 1980 wordsTable of Contents
Facial expressions are a powerful tool for conveying emotions and intentions. In the realm of AI-generated imagery, capturing these expressions realistically is a crucial step towards creating truly immersive and engaging experiences. This blog post delves into the performance of a generative AI model in capturing facial expressions, analyzing its strengths and weaknesses. We’ll explore how the model handles different scenes, camera angles, and aesthetic styles, providing insights into its capabilities and limitations. By understanding the nuances of AI-generated facial expressions, we can better appreciate the potential and challenges of this rapidly evolving technology.
Created with: midjourney
Lost in Thought: A Moment of Melancholy in a Dimly Lit Cafe
A young woman sits alone at a table in a dimly lit cafe, her gaze lost in the distance. The soft lighting and her pensive pose evoke a sense of isolation and contemplation, capturing a moment of wistful melancholy.
Prompt
Embarrassment Flushed cheeks, downcast eyes, a slight grimace: Awkward and self-conscious ; A single woman; eye-level; Single Persons; A crowded cafe with loud chatter and laughter; cinematic
Characteristic
Shot : A young woman sitting alone at a table in a cafe, looking thoughtfully off to the side.
Aesthetic Score : 0.7
Mood : pensive, introspective, lonely
Quality
Entropy : 6.88
Noise : 102
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image appears slightly blurry with some loss of detail, particularly on the subject’s face and hair.
Superman’s Shocked Gaze: A Moment of Intensity
A stark black and white image captures Superman in a moment of dramatic intensity. His shocked expression, the stark contrast of light and shadow, and the shallow depth of field all contribute to a visually striking scene that evokes heroism and a sense of impending danger.
Prompt
Embarrassment Wide eyes, open mouth, a look of disbelief: Humiliated and exposed ; A superhero in a full costume; eye-level; Heroes; A bustling city street with people staring; cinematic
Characteristic
Shot : A close-up shot of Superman in a city, looking up in surprise.
Aesthetic Score : 0.6
Mood : dramatic, tense, suspenseful
Quality
Entropy : 6.57
Noise : 101
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly grainy, but it is not a major issue.
Secrets and Shadows: A Moment of Melancholy
A man in a suit, his face hidden in his hand, sits at a dimly lit table surrounded by wine glasses. The woman behind him and the figures in the background add to the air of mystery and intrigue. This image captures a moment of dramatic tension, hinting at unspoken emotions and a story waiting to unfold.
Prompt
Embarrassment Red face, sweating, avoiding eye contact: Mortified and ashamed ; A man in a business suit; eye-level; Normal People; A formal dinner party with elegant guests; cinematic
Characteristic
Shot : A man is sitting at a table in a dimly lit room, looking down, possibly crying. There are other people at the table but they are mostly obscured by shadows. A painting hangs on the wall behind the man.
Aesthetic Score : 0.7
Mood : melancholy, somber, introspective
Quality
Entropy : 5.70
Noise : 83
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to be somewhat blurry, and the detail on the painting is not very sharp. The focus seems off.
Lost in the Glow: A Moment of Introspection in a Digital Maze
A young man sits alone in a dimly lit room, bathed in the glow of multiple computer screens. The clutter around him suggests a life consumed by technology, while his contemplative expression hints at a deeper sense of loneliness and introspection. The scene evokes a feeling of being overwhelmed and trapped, highlighting the potential downsides of our increasingly digital world.
Prompt
Embarrassment Slumped shoulders, a defeated sigh, a look of despair: Cringing and defeated ; A gamer in a gaming chair; eye-level; Gamer; A dimly lit room with flashing screens and empty pizza boxes; cinematic
Characteristic
Shot : A young man sits in a gamer chair in a dimly lit room with a cluttered background. He is looking down, possibly at a keyboard on his lap. There is a pizza box on the floor, suggesting a late-night gaming session.
Aesthetic Score : 0.5
Mood : dark, lonely, focused
Quality
Entropy : 6.42
Noise : 74
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image is slightly blurry, particularly in the background. The lighting is uneven and there are some artificial-looking highlights on the man’s skin.
A Moment of Intrigue at the Wedding Reception
A woman in a white dress, captured in a moment of thoughtful curiosity, gazes over her shoulder at something in the distance. The bokeh-filled background of the bustling wedding reception adds a layer of mystery to this intriguing scene.
Prompt
Embarrassment Sad eyes, a forced smile, a longing gaze: Lonely and out of place ; A woman in a wedding dress; eye-level; Single Persons; A crowded wedding reception with happy couples; cinematic
Characteristic
Shot : A young woman in a white dress sits at a table, looking over her shoulder at something out of frame. The scene is likely a wedding reception or other formal event. The background is blurred, with people sitting at tables in the background.
Aesthetic Score : 0.7
Mood : intrigued, thoughtful, contemplative
Quality
Entropy : 6.16
Noise : 92
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable artifacts or errors.
Hope Rises in the City: A Man in Red Cape Inspires the Crowd
A powerful image captures a man in a red cape, arms raised high, standing before a cheering crowd. The vibrant colors and dynamic pose evoke a sense of hope and inspiration, making him the focal point of this city scene bathed in sunlight.
Prompt
Embarrassment Blushing, fidgeting, trying to hide: Embarrassed and self-conscious ; A superhero in a cape; eye-level; Heroes; A cheering crowd at a victory parade; cinematic
Characteristic
Shot : A man wearing a red cape stands with his arms raised in the air, facing away from the viewer. Behind him is a crowd of people in the city.
Aesthetic Score : 0.7
Mood : hopeful, inspiring, dramatic
Quality
Entropy : 6.03
Noise : 103
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.90
Image errors : There are some minor artifacts in the image, particularly in the background. The image also has a slightly blurry quality, which could be improved.
Lost in Thought: A Woman’s Solitary Elegance
A woman in a black dress sits alone at a dimly lit table, her face illuminated by the soft glow of the setting. The atmosphere is pensive and elegant, with a touch of mystery. The dramatic lighting creates a captivating effect, drawing the viewer’s attention to her thoughtful expression.
Prompt
Embarrassment Nervous laughter, fidgeting with her clothes, avoiding eye contact: Uncomfortable and out of place ; A woman in a casual outfit; eye-level; Normal People; A fancy restaurant with white tablecloths and expensive wine; cinematic
Characteristic
Shot : A woman sits alone at a table in a dimly lit restaurant, with a window behind her casting a spotlight on her face.
Aesthetic Score : 0.8
Mood : melancholy, mysterious, sensual
Quality
Entropy : 5.55
Noise : 86
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors
Lost in the Shadows: A Moment of Solitude
A figure shrouded in darkness, their face hidden, sits amidst a blur of lights and activity. The dimly lit room and somber mood create a sense of mystery and intrigue, leaving the viewer to wonder about the person’s thoughts and emotions.
Prompt
Embarrassment Head in hands, a defeated sigh, a look of shame: Humiliated and defeated ; A gamer in a hoodie; eye-level; Gamer; A crowded esports tournament with loud cheers and flashing lights; cinematic
Characteristic
Shot : A person wearing a hooded sweatshirt and headphones is sitting in a dimly lit room. The person’s face is obscured by their hand. The scene is likely a gaming tournament, with a screen and other people in the background.
Aesthetic Score : 0.5
Mood : intense, lonely, contemplative
Quality
Entropy : 5.71
Noise : 91
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly grainy and has some noise. There is a slight color shift towards blue in some areas.
A Startled Gentleman in a Tuxedo: Mystery Unveiled?
A man in a tuxedo, caught off guard, stares directly at the camera with a look of surprise. The dimly lit scene, adorned with candles and flowers, adds to the air of suspense and mystery. What has startled him? The answer may lie hidden within the shadows.
Prompt
Embarrassment Stuttering, avoiding eye contact, a nervous smile: Awkward and uncomfortable ; A man in a tuxedo; eye-level; Single Persons; A romantic dinner for two with candles and flowers; cinematic
Characteristic
Shot : A man in a tuxedo is seated at a table with lit candles, looking startled. The setting suggests a formal dinner or a wedding reception.
Aesthetic Score : 0.6
Mood : suspenseful, dramatic, tense
Quality
Entropy : 6.67
Noise : 97
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.30
Image errors : Some slight blurring in the background, no other noticeable errors
Batman Emerges from the Shadows, Intrigue Follows
A shadowy figure, clad in the iconic Batsuit, stands before a throng of photographers, their flashes illuminating the scene with an intense, dramatic light. The low-key lighting and the Batman’s enigmatic pose create an atmosphere of mystery and intrigue, leaving onlookers wondering what secrets lie behind the mask.
Prompt
Embarrassment Sweating, avoiding eye contact, a look of panic: Mortified and ashamed ; A superhero in a mask; eye-level; Heroes; A news conference with reporters asking difficult questions; cinematic
Characteristic
Shot : A man dressed as Batman stands in front of a group of photographers, he has a serious expression on his face and is looking directly at the camera.
Aesthetic Score : 0.6
Mood : intense, brooding, mysterious
Quality
Entropy : 6.63
Noise : 112
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some noise is visible in the image, particularly in the shadows. There is also some slight overexposure in the highlights.
Conclusion
The results of the analysis show that the generative AI model performed well in terms of understanding the scene and creating an aesthetically pleasing image, but struggled with accurately capturing the camera position. Here’s a breakdown:
- Camera Position: The model scored 0.2, indicating a significant difference between the intended camera position in the prompt and the actual camera position in the generated image. This suggests the model may not be very good at understanding and implementing specific camera angles.
- Shot Analysis: The model scored 0.58, which falls within the “good” range. This means the model was able to understand the scene described in the prompt and create a shot that was generally consistent with it.
- Aesthetic Analysis: The model scored 0.1, which is considered “very good”. This indicates that the generated image closely matched the expected aesthetic style described in the prompt.
Overall, the model demonstrates a strong ability to understand the scene and create aesthetically pleasing images, but needs improvement in accurately capturing the intended camera position.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://midjourney.com