AI's Facial Expressions: A Mixed Bag with Flux-dev
Facial expressions are a powerful tool in storytelling, conveying emotions and adding depth to characters. In the realm of AI-generated content, capturing these expressions accurately is crucial for creating engaging and believable narratives. This blog post examines the performance of a generative AI model in capturing facial expressions, analyzing its strengths and weaknesses in translating complex visual instructions.
Created with: flux-dev
Lost in the Code: A Portrait of Focus and Determination
A young man, bathed in a dramatic blue and red light, stares intently at the camera, headphones on, lost in the world of coding. His serious expression and the intense lighting create a powerful image of focus and determination.
Prompt
facial-expressions Worry: intense, focused ; Gamer’s hands on a keyboard; close-up; Gamer; flashing lights and sounds from the game; cinematic
Characteristic
Shot : A young man wearing headphones is looking directly at the camera while typing on a keyboard. The lighting is blue and red, creating a dramatic and intense atmosphere.
Aesthetic Score : 0.6
Mood : intense, focused, dramatic
Quality
Entropy : 6.57
Noise : 67
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are some minor image artifacts, especially around the edges of the subject’s hair and the headphones. The color balance is slightly off, with the blue being more dominant than the red.
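The prompts in this post all appear to follow the same semicolon-separated template. The sketch below splits the prompt above into its parts. The field names (category, emotion, moods, subject, shot, group, setting, style) are assumptions made for illustration; the post itself does not name the slots.

```python
from dataclasses import dataclass


@dataclass
class PromptParts:
    """Hypothetical field names; the post does not label the prompt slots."""
    category: str
    emotion: str
    moods: list[str]
    subject: str
    shot: str
    group: str
    setting: str
    style: str


def parse_prompt(prompt: str) -> PromptParts:
    # Split on semicolons and trim the whitespace around each field.
    fields = [part.strip() for part in prompt.split(";")]
    if len(fields) != 6:
        raise ValueError(f"expected 6 fields, got {len(fields)}")
    head, subject, shot, group, setting, style = fields
    # The first field looks like "<category> <emotion>: <mood>, <mood>".
    label, _, mood_text = head.partition(":")
    category, _, emotion = label.strip().partition(" ")
    moods = [m.strip() for m in mood_text.split(",") if m.strip()]
    return PromptParts(category, emotion, moods, subject, shot, group, setting, style)


parts = parse_prompt(
    "facial-expressions Worry: intense, focused ; Gamer’s hands on a keyboard; "
    "close-up; Gamer; flashing lights and sounds from the game; cinematic"
)
print(parts.shot, parts.moods)
```

Separating the requested shot type and setting from the rest of the prompt makes it easier to compare them with the "Shot" description reported for each image.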
Lost in the Blue: A Teenager’s Digital Focus
A young person, bathed in blue light, sits intently at their computer, headphones on, lost in the digital world. The scene evokes a sense of focus, intensity, and a touch of isolation, highlighting the power of technology to captivate and immerse.
Prompt
facial-expressions Worry: intense, focused ; Gamer with headphones on; close-up; Gamer; dimly lit room with glowing computer screen; cinematic
Characteristic
Shot : A young person wearing headphones is sitting at a computer desk, looking at a monitor displaying a game interface. The room is lit with blue and purple neon light.
Aesthetic Score : 0.6
Mood : focused, intense, gamer
Quality
Entropy : 6.17
Noise : 52
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible artifacts or errors
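The post does not say how the Entropy value is computed. The reported values, which fall between roughly 5.9 and 6.9, would be consistent with the Shannon entropy of an 8-bit intensity histogram, which ranges from 0 to 8 bits. Under that assumption, a minimal sketch looks like this:

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a flat sequence of 8-bit grayscale values.

    Assumption: this is one common definition of image entropy, not
    necessarily the one used to produce the numbers in this post.
    """
    total = len(pixels)
    if total == 0:
        raise ValueError("need at least one pixel")
    counts = Counter(pixels)
    # Sum p * log2(1 / p) over every grey level that occurs.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


flat = [128] * 64            # a single grey level carries no information
uniform = list(range(256))   # every level equally likely: the 8-bit maximum
print(shannon_entropy(flat), shannon_entropy(uniform))
```

On this reading, a higher value means the image uses a wider and more even spread of intensity levels, which tends to go with more texture and detail.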
Lost in the Rain: A Moment of Melancholy
A young woman finds solace in the quiet contemplation of a rainy day, her wistful gaze reflecting a sense of introspection and loneliness. The intimate lighting and composition enhance the mood of melancholy, drawing the viewer into her private world.
Prompt
facial-expressions Worry: melancholy, lonely ; Single woman; eye-level; Single Persons; dimly lit coffee shop with rain outside; cinematic
Characteristic
Shot : A young woman with long dark hair is looking out of a window. The background is blurry and the woman is in focus. There is a raindrop pattern on the window.
Aesthetic Score : 0.7
Mood : melancholy, pensive, wistful
Quality
Entropy : 6.30
Noise : 71
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : no visible errors
The Man of Steel, Cast in Shadows
A brooding Superman stands amidst the blurred lights of a nighttime city, his expression hinting at a dark and serious moment. The dramatic lighting amplifies the intensity of the scene, leaving the viewer questioning what lies ahead.
Prompt
facial-expressions Worry: intense, burdened ; Man in a superhero costume; medium shot; Heroes; cityscape at night with flashing sirens; cinematic
Characteristic
Shot : A man dressed as Superman, standing in a city at night with a blurry background of street lights and buildings
Aesthetic Score : 0.7
Mood : serious, dramatic, superheroic
Quality
Entropy : 5.93
Noise : 58
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some noise and grain, particularly in the darker areas, as well as some minor artifacting in the subject’s hair.
Lost in Thought: A Moment of Melancholy in Autumn
A solitary figure sits on a park bench, lost in contemplation. The warm light of fall bathes the scene, while the blurred background suggests a sense of isolation and introspection. The image evokes a mood of wistful melancholy, capturing a fleeting moment of quiet reflection.
Prompt
facial-expressions Worry: sad, reflective ; Man sitting alone on a park bench; long shot; Single Persons; empty park with falling leaves; cinematic
Characteristic
Shot : A man sits on a bench in a park, with a soft focus background of autumn foliage
Aesthetic Score : 0.6
Mood : melancholy, contemplative, introspective
Quality
Entropy : 6.87
Noise : 70
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : Minor artifacts are noticeable in the background foliage, particularly in the areas with more intense color.
A Soldier’s Focus: Anticipation in the Face of Uncertainty
A lone soldier, clad in military garb, stands amidst a blurred backdrop of comrades and a military vehicle. His intense gaze fixed on the horizon, a map clutched in his hand, speaks volumes of the seriousness and anticipation hanging in the air. The scene evokes a sense of dramatic tension, leaving the viewer to ponder the unfolding events.
Prompt
facial-expressions Worry: serious, strategic ; Hero looking at a map; medium shot; Heroes; war-torn battlefield with smoke and debris; cinematic
Characteristic
Shot : A man in a military uniform is looking off to the side while holding a map. There are other soldiers in the background and a truck.
Aesthetic Score : 0.6
Mood : serious, pensive, war-torn
Quality
Entropy : 6.78
Noise : 75
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has a slight blur around the edges. There is also some graininess present in the image.
Lost in the Smoke: A Woman’s Pensive Gaze Over the City
A woman in a black blazer stands against a cityscape backdrop, her gaze lost in the distance. A cloud of smoke hangs in the air, adding a layer of mystery and tension to the scene. The mood is both pensive and urban, leaving the viewer to wonder about the woman’s thoughts and the secrets hidden within the smoke.
Prompt
facial-expressions Worry: determined, resolute ; Heroine standing on a rooftop; medium shot; Heroes; cityscape with smoke and fire in the distance; cinematic
Characteristic
Shot : A woman in a black jacket and white shirt stands in front of a cityscape with smoke in the background.
Aesthetic Score : 0.7
Mood : dramatic, mysterious, contemplative
Quality
Entropy : 6.73
Noise : 53
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry, and the colors are a bit muted. There is also a slight artifact in the top-left corner of the image.
Lost in the Shadows: A Moment of Melancholy on a Lonely Street
A young woman walks alone under the soft glow of streetlights, her expression hinting at a contemplative mood. The empty street and low lighting create a sense of mystery and solitude, adding a dramatic touch to this evocative image.
Prompt
facial-expressions Worry: lonely, vulnerable ; Woman walking alone at night; long shot; Single Persons; deserted street with streetlights; cinematic
Characteristic
Shot : A woman stands on a street at night, lit by streetlights. The background is blurry and the focus is on the woman.
Aesthetic Score : 0.7
Mood : melancholy, introspective, lonely
Quality
Entropy : 6.17
Noise : 43
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.30
Image errors : No visible errors.
Tense Kitchen Encounter: A Silent Argument Unfolds
A man and a woman stand locked in a heated exchange, their emotions palpable in the air. The man’s anger is evident, while the woman’s sadness is reflected in her averted gaze. The dramatic lighting and close proximity of the characters heighten the tension, leaving the viewer to wonder what secrets lie beneath the surface.
Prompt
facial-expressions Worry: tense, frustrated ; Couple arguing in a kitchen; eye-level; Normal People; cluttered kitchen with dirty dishes; cinematic
Characteristic
Shot : A man and woman are arguing in a kitchen. The man is wearing a blue polo shirt and the woman is wearing a grey tank top. There is a window in the background, and the kitchen is fairly clean.
Aesthetic Score : 0.6
Mood : tense, frustrated, confrontational
Quality
Entropy : 6.63
Noise : 77
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are some minor image errors, such as some noise in the shadows. The composition is a bit too tight.
A Look of Intensity: Mystery Unfolds in the Subway’s Depths
A young woman stands amidst the throngs of a crowded subway car, her gaze locked directly on the viewer. The dim, moody lighting casts an air of suspense, hinting at an impending event. Her intense expression and the darkened environment create a palpable sense of unease and anticipation.
Prompt
facial-expressions Worry: anxious, overwhelmed ; Young woman in a crowded subway; eye-level; Normal People; blurred faces of commuters; cinematic
Characteristic
Shot : A woman with long brown hair is standing in a crowded subway car. She is looking at the camera with a concerned expression. The photo is taken from a slightly elevated angle, which gives the viewer a sense of intimacy. The lighting is soft and diffused, creating a moody atmosphere.
Aesthetic Score : 0.7
Mood : mysterious, contemplative, somber
Quality
Entropy : 6.66
Noise : 60
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : None
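To see the ten images side by side, the per-image numbers listed above can be collected and summarized. The sketch below only aggregates the values reported in this post; the column names are shorthand chosen for illustration.

```python
from statistics import mean

# One row per image, in the order the images appear in the post:
# (aesthetic, entropy, noise, clip_score, ai_likelihood)
rows = [
    (0.6, 6.57, 67, 0.27, 0.10),
    (0.6, 6.17, 52, 0.24, 0.10),
    (0.7, 6.30, 71, 0.24, 0.20),
    (0.7, 5.93, 58, 0.27, 0.10),
    (0.6, 6.87, 70, 0.29, 0.10),
    (0.6, 6.78, 75, 0.26, 0.10),
    (0.7, 6.73, 53, 0.32, 0.20),
    (0.7, 6.17, 43, 0.28, 0.30),
    (0.6, 6.63, 77, 0.26, 0.10),
    (0.7, 6.66, 60, 0.31, 0.10),
]
names = ["aesthetic", "entropy", "noise", "clip_score", "ai_likelihood"]

# Transpose rows into columns, then record (min, mean, max) per metric.
summary = {
    name: (min(col), round(mean(col), 3), max(col))
    for name, col in zip(names, zip(*rows))
}
for name, (low, avg, high) in summary.items():
    print(f"{name:14s} min={low} mean={avg} max={high}")
```

Across the ten images the Aesthetic Score stays between 0.6 and 0.7 and the Prompt Clip Score between 0.24 and 0.32, so the per-image metrics vary far less than the prompts do.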
Conclusion
The results show that the generative AI model performed well in aesthetic analysis but struggled with camera position and shot analysis. Judging by the labels attached to the scores below, lower values appear to indicate a closer match. Here’s a breakdown:
- Camera Position: The model scored 0.35, which is considered below average. This suggests that the model didn’t accurately capture the camera positions requested in the prompts.
- Shot Analysis: The model scored 0.48, also below average. This indicates that the model didn’t fully reproduce the scenes described in the prompts or create the expected shot composition. For example, one prompt asked for a cluttered kitchen with dirty dishes, while the generated kitchen is described as fairly clean.
- Aesthetic Analysis: The model scored 0.11, which is considered very good. This means the generated images closely matched the desired aesthetic style.
Overall, the model is better at capturing the desired aesthetic style than at following instructions about camera position and shot composition. This suggests that it might need further training to improve its ability to interpret and translate complex visual instructions.