AI's Facial Expressions: A Mixed Bag of Results with Flux-pro
- 9 minutes read - 1710 wordsTable of Contents
Facial expressions are a powerful tool in storytelling, conveying emotions and intentions without words. In the realm of generative AI, the ability to create images with specific facial expressions is a crucial aspect of achieving realistic and engaging visuals. This blog post delves into the performance of a generative AI model in capturing facial expressions, camera angles, and aesthetic styles, analyzing its strengths and weaknesses through a series of prompts.
Created with: flux-pro
Lost in the City Lights
A solitary figure stands bathed in the glow of urban lights, their gaze lost in the night. The darkness whispers of loneliness and contemplation, while the glimmering cityscape hints at a hidden story waiting to be unveiled.
Prompt
facial-expressions Anxiety: Overwhelmed, isolated ; A lone figure; eye-level; Single Person; bustling city street at night; cinematic
Characteristic
Shot : A young woman, in a dark jacket, is silhouetted against a bright city lightscape. Her face is illuminated by a single street lamp, and she looks up at the sky with a thoughtful expression.
Aesthetic Score : 0.7
Mood : melancholy, contemplative, urban
Quality
Entropy : 6.63
Noise : 78
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : Minor noise and grain, slight blurriness in the background.
Superman Takes Flight at Dusk
A dramatic silhouette of Superman stands on a rooftop, overlooking a city bathed in the warm hues of twilight. The blurred cityscape emphasizes his heroic pose, creating a sense of power and hope.
Prompt
facial-expressions Anxiety: Pressure, responsibility ; A superhero standing on a rooftop; high angle; Hero; cityscape with flashing lights; cinematic
Characteristic
Shot : A man dressed as Superman is standing on a rooftop overlooking a city. The city lights are visible in the background.
Aesthetic Score : 0.7
Mood : heroic, dramatic, futuristic
Quality
Entropy : 6.90
Noise : 87
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.30
Image errors : There are no visible artifacts or errors in the image.
The Weight of the World: A Portrait of Stress
A woman sits at her desk, overwhelmed by the weight of her responsibilities. The low lighting and her slumped posture convey a sense of despair and hopelessness, capturing the raw emotion of stress.
Prompt
facial-expressions Anxiety: Overwhelmed, stressed ; A person sitting at a desk, surrounded by paperwork; close-up; Normal Person; cluttered office; cinematic
Characteristic
Shot : A person is sitting at a desk, with their head in their hands, surrounded by paperwork. The image is likely taken in an office setting.
Aesthetic Score : 0.4
Mood : sad, stressed, overwhelmed
Quality
Entropy : 6.49
Noise : 70
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry, and the lighting is a little too flat.
Lost in the Game: A Moment of Intense Focus
A young man, bathed in the glow of red and blue lights, is completely absorbed in his video game. The dramatic lighting and composition highlight his focused expression, creating a sense of mystery and intensity.
Prompt
facial-expressions Anxiety: Focused, intense ; A gamer hunched over a computer screen; close-up; Gamer; dimly lit room with flashing lights; cinematic
Characteristic
Shot : A young man wearing headphones is focused on playing a video game on his computer. He is illuminated by colorful lights, giving the scene an intense atmosphere. The background is a bit blurry, drawing the eye to the subject.
Aesthetic Score : 0.7
Mood : focused, intense, serious
Quality
Entropy : 6.59
Noise : 68
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible artifacts or errors in the image.
A Moment of Mystery in the City
A young woman, shrouded in green, stands on a bustling city street, her gaze locked directly on the viewer. The shallow depth of field blurs the background, creating a sense of intimacy and drawing you into her enigmatic world. Is she lost in thought, or is there something more to her gaze? This image captures a fleeting moment of urban mystery.
Prompt
facial-expressions Anxiety: Anxious, uncomfortable ; A woman walking down a crowded street; eye-level; Single Person; blurred background of people; cinematic
Characteristic
Shot : A young woman with long brown hair is standing in a city street, looking at the camera. She is wearing a green jacket and a beige scarf.
Aesthetic Score : 0.7
Mood : melancholy, thoughtful, urban
Quality
Entropy : 6.75
Noise : 68
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image is slightly blurry, especially around the edges of the frame.
The Shadow Knows: A Portrait of Menace
A close-up portrait of a man with a menacing expression, captured in a dimly lit environment. The intense gaze and dramatic lighting create a sense of suspense and danger, drawing the viewer into a world of mystery.
Prompt
facial-expressions Anxiety: Fear, anticipation ; A hero facing a menacing villain; medium shot; Hero; dark and ominous setting; cinematic
Characteristic
Shot : A man with a dark and intense expression, lit by a blue-ish light source. He is wearing a dark jacket and appears to be in a shadowy environment.
Aesthetic Score : 0.7
Mood : dark, intense, mysterious
Quality
Entropy : 6.23
Noise : 67
Prompt Clip Score : 0.19
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible errors or artifacts.
Lost in the Crowd: A Man’s Intense Gaze Speaks Volumes
A solitary figure, clad in black leather and a white shirt, stands amidst the bustling chaos of a train station or airport. His piercing gaze, directed straight at the viewer, conveys a sense of seriousness and contemplation. The blurred background emphasizes his isolation, adding to the air of mystery surrounding this enigmatic individual.
Prompt
facial-expressions Anxiety: Impatient, restless ; A person waiting in a long line; eye-level; Normal Person; crowded waiting room; cinematic
Characteristic
Shot : A young man with brown hair and a serious expression looks directly at the camera. He is wearing a black leather jacket and a white shirt. He is standing in a public place, possibly a train station, as blurry figures are visible behind him.
Aesthetic Score : 0.6
Mood : serious, intense, contemplative
Quality
Entropy : 6.80
Noise : 69
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is a slight amount of noise in the image, particularly in the background.
Lost in the Glow: A Portrait of Intensity
A close-up portrait captures a young man’s focused gaze, illuminated by the ethereal glow of a computer screen. The dimly lit room and his intense expression create a sense of mystery and intrigue, leaving the viewer wondering what secrets lie within the digital world.
Prompt
facial-expressions Anxiety: Adrenaline, pressure ; A gamer’s hands frantically moving across a keyboard; close-up; Gamer; glowing computer screen; cinematic
Characteristic
Shot : A man with dark hair is looking at a computer screen in a dimly lit room. There’s a red glow coming from the screen.
Aesthetic Score : 0.7
Mood : focused, intense, mysterious
Quality
Entropy : 6.34
Noise : 78
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some noise and grain are present in the image. Also, the image is slightly blurry.
A Solitary Figure Under a Stormy Sky
A man in a suit stands alone in a field, his gaze fixed on a dramatic, stormy sky. The scene evokes a sense of melancholy and contemplation, with a hint of hope amidst the darkness.
Prompt
facial-expressions Anxiety: Loneliness, despair ; A man standing alone in a vast field; wide shot; Single Person; open sky with dark clouds; cinematic
Characteristic
Shot : A lone man in a suit standing in a field, looking up at a dramatic stormy sky.
Aesthetic Score : 0.6
Mood : melancholy, contemplative, hopeful
Quality
Entropy : 6.52
Noise : 68
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some slight noise and the focus is slightly off.
A Solitary Figure Contemplates the Ashes of a Lost City
A lone man, shrouded in the smoky haze of a post-apocalyptic cityscape, stands with his back to the camera, lost in contemplation. The scene evokes a sense of melancholic longing and uncertainty, as the smoke billows ominously, a stark reminder of the destruction that has befallen the world.
Prompt
facial-expressions Anxiety: Guilt, responsibility ; A hero looking out over a devastated city; high angle; Hero; destroyed buildings and smoke; cinematic
Characteristic
Shot : A man stands with his back to the camera looking out at a city skyline with smoke in the distance. The image is taken from a high vantage point, so you can see the city spread out below.
Aesthetic Score : 0.6
Mood : gloomy, melancholic, contemplative
Quality
Entropy : 6.89
Noise : 74
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry, especially in the background.
Conclusion
The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis. Here’s a breakdown:
- Camera Position: The model scored 0.45, which is considered below average. This suggests that the model didn’t accurately capture the intended camera positions described in the prompt.
- Shot Analysis: The model scored 0.52, which is considered average. This indicates that the model was able to understand the scene in the prompt to a reasonable degree, but not exceptionally well.
- Aesthetic Analysis: The model scored 0.1, which is considered very good. This means that the generated image closely matched the expected aesthetic style described in the prompt.
Overall, the model seems to be better at understanding the aesthetic style of the prompt than it is at accurately capturing camera positions and shot composition.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://fal.ai/models/fal-ai/flux-pro/api