AI's Facial Expressions: A Mixed Bag of Emotions with Flux-schnell
Facial expressions are a powerful tool for conveying emotions and telling stories. In the realm of generative AI, the ability to create realistic and expressive faces is a crucial step towards creating truly immersive experiences. This blog post explores the current state of AI’s facial expression capabilities, analyzing its performance in various scenarios and highlighting its strengths and weaknesses.
Created with: flux-schnell
Lost in the Rain: A Moment of Melancholy in the City
A young man walks alone in the rain-soaked streets, his contemplative gaze meeting the camera. The dimly lit urban landscape and the steady patter of rain create a mood of pensive isolation, hinting at a story waiting to be told.
Prompt
facial-expressions Guilt: Desolate, regretful ; A lone figure; eye-level; Single Person; Empty street at night, rain falling; cinematic
Characteristic
Shot : A young man is standing in the rain, looking directly at the camera with a thoughtful expression. The scene is lit by street lights and there is a building in the background.
Aesthetic Score : 0.6
Mood : melancholy, pensive, introspective
Quality
Entropy : 6.29
Noise : 75
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some noise and grain, especially in the shadows. There are also some artifacts around the edges of the man’s face.
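Every prompt in this post follows the same semicolon-separated layout, so it can be split into named parts for comparison against the generated image. The sketch below is only an illustration: the field names (topic, emotion, descriptors, subject, shot, character, setting, style) are assumptions read off the prompts themselves, not a documented format.

```python
from dataclasses import dataclass


@dataclass
class PromptParts:
    # Field names are hypothetical labels inferred from the prompts in this post.
    topic: str
    emotion: str
    descriptors: list[str]
    subject: str
    shot: str
    character: str
    setting: str
    style: str


def parse_prompt(prompt: str) -> PromptParts:
    """Split a 'topic Emotion: a, b ; subject; shot; character; setting; style' prompt."""
    fields = [part.strip() for part in prompt.split(";")]
    if len(fields) != 6:
        raise ValueError(f"expected 6 semicolon-separated fields, got {len(fields)}")
    head, subject, shot, character, setting, style = fields

    # The first field looks like "facial-expressions Guilt: Desolate, regretful".
    label, _, descriptor_text = head.partition(":")
    topic, _, emotion = label.strip().partition(" ")
    descriptors = [d.strip() for d in descriptor_text.split(",") if d.strip()]

    return PromptParts(topic, emotion.strip(), descriptors,
                       subject, shot, character, setting, style)


parts = parse_prompt(
    "facial-expressions Guilt: Desolate, regretful ; A lone figure; eye-level; "
    "Single Person; Empty street at night, rain falling; cinematic"
)
print(parts.emotion, parts.descriptors, parts.shot)
```

Splitting the prompt this way makes it easier to check, field by field, whether the requested shot or setting actually shows up in the image description.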
Superman: Ready to Save the Day
A close-up portrait captures Superman’s unwavering determination, his gaze fixed on the city skyline he vows to protect. The dramatic lighting and composition heighten the intensity of the moment and underline the hero’s resolve.
Prompt
facial-expressions Guilt: Heavy, burdened, conflicted ; A superhero, cape billowing in the wind; medium shot; Hero; City skyline, destroyed buildings in the background; cinematic
Characteristic
Shot : A close-up portrait of a man dressed as Superman, with a cityscape in the background. The man is looking intently at the viewer.
Aesthetic Score : 0.7
Mood : intense, serious, heroic
Quality
Entropy : 6.86
Noise : 89
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible errors in the image.
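The post does not say how the Entropy values are computed. The reported range (roughly 5.7 to 6.9) would be consistent with the Shannon entropy, in bits, of an 8-bit grayscale histogram, which tops out at 8. Under that assumption, a minimal sketch looks like this; the function name and the plain-list input are illustrative choices, not the pipeline used here.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy in bits of an iterable of pixel intensity values."""
    counts = Counter(pixels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no pixel values given")
    # H = -sum(p * log2(p)) over the observed intensity levels.
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# A flat image carries no information; an even spread over 256 levels is the maximum.
flat = [128] * 1000
spread = list(range(256)) * 4
print(shannon_entropy(flat), shannon_entropy(spread))
```

Read this way, a higher value means the image uses a wider and more even range of tones, while a lower one points to large uniform areas such as deep shadow.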
A Glimpse into the Past: A Woman’s Melancholy Reflection
A woman holds a faded photograph, her gaze fixed on a younger version of herself. The lighting casts long shadows, adding to the sense of mystery and longing. The image evokes a poignant feeling of nostalgia, as she contemplates the passage of time and the memories captured in the photograph.
Prompt
facial-expressions Guilt: Nostalgic, melancholic ; A woman holding a photo of a loved one; close-up; Normal Person; A cluttered kitchen, dishes piled in the sink; cinematic
Characteristic
Shot : A woman is holding a photo of a couple, possibly romantic, in a kitchen or living room with a window in the background. The lighting is slightly dim. The woman is looking at the camera, while the photo is held in front of her face.
Aesthetic Score : 0.6
Mood : melancholy, thoughtful, nostalgic
Quality
Entropy : 6.81
Noise : 84
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant errors or artifacts are visible in the image.
Lost in the Code: A Moment of Intense Focus
A young man, bathed in the glow of his computer screen, is completely absorbed in his work. The dimly lit room and his focused expression create a sense of suspense and intensity, highlighting the power of concentration in the digital age.
Prompt
facial-expressions Guilt: Isolated, self-loathing ; A gamer, hunched over a computer screen; close-up; Gamer; Neon lights reflecting in their eyes, empty pizza boxes scattered around; cinematic
Characteristic
Shot : A young man wearing headphones is sitting in front of a computer. He is using a keyboard and mouse. There is a pizza on the table in front of him.
Aesthetic Score : 0.6
Mood : focused, intense, gaming
Quality
Entropy : 6.10
Noise : 64
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slight graininess and some noise. The colors are a little bit muted.
Lost in the Crowd: One Man’s Anxiety Amidst the Blur
A solitary figure stands amidst a sea of faces, his apprehension palpable. The sharp focus on his worried expression, contrasted with the blurred background, amplifies his sense of isolation and the weight of his uncertainty.
Prompt
facial-expressions Guilt: Alienated, invisible ; A man standing in a crowded room, looking lost; wide shot; Single Person; A party, people laughing and dancing, oblivious to him; cinematic
Characteristic
Shot : A man stands in a crowded bar, looking directly at the camera with a concerned expression. The background is blurry, indicating a social gathering.
Aesthetic Score : 0.6
Mood : tense, worried, observant
Quality
Entropy : 6.33
Noise : 67
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible errors.
Heroic Figure Stands Amidst the Ruins of War
A lone warrior in dark armor and a crimson cloak stands over a fallen comrade on a smoke-filled battlefield. The dramatic contrast between the figure and the fallen soldier evokes a powerful sense of loss and heroism, capturing the somber mood of the scene.
Prompt
facial-expressions Guilt: Torn, conflicted, remorseful ; A hero, standing over a fallen villain; medium shot; Hero; A battlefield, smoke and debris everywhere; cinematic
Characteristic
Shot : A man dressed as a knight stands over a fallen soldier on a scarred battlefield. The scene is full of smoke and debris, hinting at a recent battle.
Aesthetic Score : 0.6
Mood : dramatic, somber, intense
Quality
Entropy : 6.80
Noise : 82
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major artifacts or errors; the image is well composed and high resolution. However, the lighting may be too flat and lacks depth.
A Tense Dinner: Mystery and Suspense Linger in the Shadows
Four figures gather around a table, their faces etched with seriousness. The low light casts long shadows, adding to the palpable tension and hinting at a hidden story. Is this a family gathering gone wrong, or a meeting with secrets to keep?
Prompt
facial-expressions Guilt: Awkward, strained, unspoken ; A family gathered around a table, but the atmosphere is tense; medium shot; Normal People; A dimly lit dining room, empty chairs at the table; cinematic
Characteristic
Shot : Four people are seated at a table in a dimly lit dining room. The scene is reminiscent of a family gathering or a suspenseful drama.
Aesthetic Score : 0.7
Mood : serious, subdued, melancholic
Quality
Entropy : 5.69
Noise : 63
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors.
Lost in the Pixelated World: A Moment of Focus in Dim Light
A young person, bathed in the soft glow of a television screen, is engrossed in a video game. The dimly lit room creates an atmosphere of introspection and focus, highlighting the player’s intense concentration. Shadows dance around the scene, adding a touch of drama to this intimate moment.
Prompt
facial-expressions Guilt: Disillusioned, defeated, empty ; A gamer, staring at a blank screen, controller in hand; close-up; Gamer; A dimly lit room, empty energy drink cans scattered around; cinematic
Characteristic
Shot : A young man sits on a couch in a dimly lit living room, playing video games, with a TV in the background.
Aesthetic Score : 0.5
Mood : melancholy, introspective, lonely
Quality
Entropy : 6.05
Noise : 48
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slightly grainy texture and the lighting is uneven.
Lost in the City: A Moment of Melancholy
A young woman walks through a bustling city street, her gaze averted, lost in thought. The shallow depth of field isolates her, creating a sense of quiet contemplation amidst the urban chaos. The mood is melancholic, reflecting a sense of loneliness and introspection.
Prompt
facial-expressions Guilt: Lonely, isolated, rejected ; A woman walking away from a group of friends; long shot; Single Person; A bustling city street, people rushing by; cinematic
Characteristic
Shot : A young woman walks down a busy city street, focused on something beyond the camera. The city is in the background.
Aesthetic Score : 0.7
Mood : melancholy, contemplative, urban
Quality
Entropy : 6.77
Noise : 84
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is a bit blurry, particularly in the background.
Silhouetted Against the City, a Moment of Contemplation
A solitary figure stands against the backdrop of a blurred cityscape, gazing at a distant moon. The image evokes a sense of melancholy and contemplation, highlighting the loneliness and isolation of urban life while offering a glimmer of hope and mystery.
Prompt
facial-expressions Guilt: Reflective, contemplative, seeking redemption ; A hero, standing on a rooftop, looking out at the city; wide shot; Hero; A cityscape bathed in moonlight, a sense of peace; cinematic
Characteristic
Shot : A man is standing on a rooftop in a city at night. He is looking at the cityscape and there is a full moon in the sky.
Aesthetic Score : 0.6
Mood : melancholy, contemplative, peaceful
Quality
Entropy : 6.39
Noise : 54
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image is slightly overexposed, making the sky too bright.
Conclusion
The analysis shows that the generative AI model performed best on shot analysis, adequately on camera position, and weakest on aesthetic analysis.
Here’s a breakdown:
- Camera Position: The model scored 0.4, which is considered okay. This means the camera position in the generated image was somewhat different from what was specified in the prompt.
- Shot Analysis: The model scored 0.54, which is considered good. This indicates that the model was able to understand the scene in the prompt and create a shot that was fairly close to what was expected.
- Aesthetic Analysis: The model scored 0.16, which is considered okay. This suggests that the generated image’s aesthetic was somewhat different from what was expected.
Overall, the model seems to be better at understanding the scene and shot composition than it is at capturing the desired aesthetic.
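The three scores in the breakdown are reported without their derivation, so the sketch below does not try to reproduce them. It only averages the per-image metrics listed earlier in this post, as a quick sanity check on the gallery as a whole. The values are transcribed from the image records above; the short titles and dictionary keys are illustrative.

```python
from statistics import mean

# (title, aesthetic score, entropy, noise, prompt CLIP score, likelihood of AI),
# transcribed from the ten image records in this post, in order.
images = [
    ("Lost in the Rain",      0.6, 6.29, 75, 0.23, 0.20),
    ("Superman",              0.7, 6.86, 89, 0.24, 0.20),
    ("Glimpse into the Past", 0.6, 6.81, 84, 0.31, 0.20),
    ("Lost in the Code",      0.6, 6.10, 64, 0.25, 0.20),
    ("Lost in the Crowd",     0.6, 6.33, 67, 0.23, 0.10),
    ("Heroic Figure",         0.6, 6.80, 82, 0.23, 0.20),
    ("A Tense Dinner",        0.7, 5.69, 63, 0.27, 0.20),
    ("Pixelated World",       0.5, 6.05, 48, 0.28, 0.20),
    ("Lost in the City",      0.7, 6.77, 84, 0.22, 0.10),
    ("Silhouetted",           0.6, 6.39, 54, 0.24, 0.30),
]

columns = ["aesthetic", "entropy", "noise", "prompt_clip", "likelihood_ai"]

# Average each metric column across all images.
summary = {
    name: mean(row[i + 1] for row in images)
    for i, name in enumerate(columns)
}

# The image whose prompt CLIP score is highest, i.e. the closest text-image match.
best_match = max(images, key=lambda row: row[4])

for name, value in summary.items():
    print(f"{name}: {value:.2f}")
print("best prompt match:", best_match[0])
```

Across the ten images, the mean prompt CLIP score comes out at about 0.25 and the mean aesthetic score at about 0.62, with the woman-and-photograph image scoring the highest prompt match (0.31).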
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://fal.ai/models/fal-ai/flux/schnell/api