AI's Facial Expressions: A Mixed Bag of Success with Flux-pro
- 9 minutes read - 1770 wordsTable of Contents
Facial expressions are a powerful tool in storytelling, conveying emotions and intentions with a single glance. Generative AI models are increasingly being used to create images with realistic facial expressions, but how well do they capture the nuances of human emotion? This blog post delves into the performance of a generative AI model in creating images with facial expressions, analyzing its strengths and weaknesses across various scenes and styles. We’ll explore how the model handles camera position, shot analysis, and aesthetic style, providing insights into the current state of AI’s ability to capture the complexities of human expression.
Created with: flux-pro
Lost in the Neon Glow: A Woman’s Mysterious Journey Through the Night Market
A woman shrouded in mystery walks through a vibrant night market, the blurry lights and neon signs creating a sense of depth and movement. The scene evokes a mood of urban intrigue, leaving you wondering about her destination and the secrets she holds.
Prompt
facial-expressions Confusion: Disoriented, overwhelmed ; A lone figure; eye-level; Single Person; a bustling city street with neon signs and crowds; cinematic
Characteristic
Shot : A woman in a dark leather jacket is walking down a street in a city at night, with colorful neon signs and lights in the background.
Aesthetic Score : 0.7
Mood : mysterious, urban, introspective
Quality
Entropy : 6.69
Noise : 69
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable image errors or artifacts.
Superman Surveys the Devastation
A somber Superman stands amidst the ruins of a city, a burning building casting an ominous glow. The image captures the hero’s determination to rebuild and the weight of the destruction he faces.
Prompt
facial-expressions Confusion: Doubt, uncertainty ; A superhero in a tattered costume; eye-level; Hero; a destroyed cityscape with smoke and debris; cinematic
Characteristic
Shot : A man dressed as Superman standing in a destroyed city, likely after a battle. There are fires in the background.
Aesthetic Score : 0.7
Mood : dramatic, intense, heroic
Quality
Entropy : 6.55
Noise : 87
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.40
Image errors : There are some artifacts in the image, particularly around the edges of Superman’s suit. The smoke and fire in the background look somewhat artificial.
Contemplation in the Corporate Jungle
A woman in a business suit stands alone in a blurred office setting, her serious expression suggesting deep thought and contemplation. The shallow depth of field draws the viewer’s attention to her face, highlighting the intensity of her moment of reflection.
Prompt
facial-expressions Confusion: Lost, unmoored ; A woman in a business suit; eye-level; Normal People; a sterile office with fluorescent lights and cubicles; cinematic
Characteristic
Shot : A woman in a business suit stands in an office setting. She looks up slightly, with a thoughtful expression. The office is empty, suggesting a moment of solitude.
Aesthetic Score : 0.6
Mood : serious, thoughtful, professional
Quality
Entropy : 6.60
Noise : 72
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image appears slightly overexposed, resulting in a flat appearance. There is some noise in the shadows, particularly in the background.
Lost in the Code: A Hacker’s Focus
A young programmer, bathed in the cool glow of his monitors, is deeply engrossed in a complex coding project. The dimly lit room, awash in blue and purple hues, adds to the intensity of his focus, creating a sense of mystery and intrigue.
Prompt
facial-expressions Confusion: Frustration, bewilderment ; A gamer with headphones on; close-up; Gamer; a dimly lit room with a computer screen displaying a complex game interface; cinematic
Characteristic
Shot : A man is sitting in front of a computer screen, wearing headphones, focused on the screen. There is another screen in the background, and the scene is lit by soft, blueish light.
Aesthetic Score : 0.6
Mood : focused, intense, concentrated
Quality
Entropy : 6.66
Noise : 74
Prompt Clip Score : 0.17
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant errors in the image. The lighting is a little bit flat, but that could be intentional.
Lost in the Fog: A Mysterious Figure in the Shadows
A man in a trench coat stands shrouded in fog, his serious expression hinting at a hidden story. The atmosphere is thick with mystery and suspense, drawing you into a world of shadows and secrets.
Prompt
facial-expressions Confusion: Suspicious, wary ; A man in a trench coat; eye-level; Single Person; a foggy alleyway with flickering streetlights; cinematic
Characteristic
Shot : A man in a trench coat stands in a foggy alley, lit by a single lamppost in the distance.
Aesthetic Score : 0.6
Mood : mysterious, brooding, atmospheric
Quality
Entropy : 6.43
Noise : 66
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors.
A Knight’s Melancholy in the Misty Forest
A lone knight stands amidst a misty forest, his armor reflecting the muted light of the surrounding trees. His serious gaze and solitary posture create a sense of mystery and isolation, evoking a melancholic mood. The dramatic lighting enhances the scene’s aesthetic appeal.
Prompt
facial-expressions Confusion: Disillusioned, lost ; A knight in shining armor; eye-level; Hero; a dark forest with twisted trees and ominous shadows; cinematic
Characteristic
Shot : A man wearing armor stands in a dark, misty forest, looking towards the camera.
Aesthetic Score : 0.7
Mood : serious, intense, brooding
Quality
Entropy : 6.85
Noise : 93
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.30
Image errors : Some noise in the image, minor sharpening artifacts on the subject’s armor.
Cozy Family Dinner: A Moment of Intimacy and Connection
This heartwarming scene captures a family enjoying dessert after dinner in a cozy home setting. The warm lighting and intimate composition highlight the close bond between them, creating a sense of comfort and togetherness.
Prompt
facial-expressions Confusion: Awkward, uncomfortable ; A family at a dinner table; eye-level; Normal People; a brightly lit kitchen with mismatched plates and silverware; cinematic
Characteristic
Shot : A family is sitting around a table eating, likely dessert. The lighting is warm and the table is covered in a floral tablecloth, but the scene is somewhat cluttered.
Aesthetic Score : 0.6
Mood : cozy, intimate, mundane
Quality
Entropy : 6.81
Noise : 78
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry, particularly the background. It has some minor color artifacts.
Lost in the Game: A Moment of Focused Play
A young person sits captivated, controller in hand, as the warm glow of the television screen illuminates their face. The image captures the immersive power of video games, showcasing a moment of focused, relaxed, and playful engagement.
Prompt
facial-expressions Confusion: Overwhelmed, disoriented ; A gamer holding a controller; close-up; Gamer; a brightly lit room with a TV screen displaying a chaotic game scene; cinematic
Characteristic
Shot : A person is sitting in a chair, facing a TV screen, holding a video game controller in their hand. The TV screen is showing a video game scene.
Aesthetic Score : 0.6
Mood : focused, relaxed, calm
Quality
Entropy : 6.65
Noise : 53
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image is slightly blurry, particularly in the background.
Lost in the City’s Embrace
A solitary figure navigates the bustling urban landscape, her gaze lost in thought. The low angle and blurred background create a sense of isolation and mystery, capturing the fleeting moments of solitude amidst the city’s relentless energy.
Prompt
facial-expressions Confusion: Lost, alienated ; A woman walking down a crowded street; eye-level; Single Person; a bustling city street with people rushing past; cinematic
Characteristic
Shot : A woman in a black puffer jacket walks through a busy city street. The street is blurred and out of focus.
Aesthetic Score : 0.6
Mood : urban, melancholic, lonely
Quality
Entropy : 6.68
Noise : 74
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.30
Image errors : Some minor blurring and noise around the edges of the image, especially in the background.
Superman Stands Tall Against the City Lights
A heroic figure in a Superman suit stands silhouetted against a backdrop of a vibrant city skyline and a full moon. The dramatic lighting and composition evoke a sense of hope and power, capturing the essence of the iconic superhero.
Prompt
facial-expressions Confusion: Doubt, questioning ; A superhero standing on a rooftop; eye-level; Hero; a cityscape with twinkling lights and a full moon; cinematic
Characteristic
Shot : A man in a superhero costume stands in a city, looking out over the cityscape. The sun is setting in the background and there is a full moon in the sky.
Aesthetic Score : 0.6
Mood : dramatic, heroic, contemplative
Quality
Entropy : 6.65
Noise : 94
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant artifacts or errors in the image. The composition is slightly off, with the subject being too close to the edge of the frame.
Conclusion
The analysis shows that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis.
Here’s a breakdown:
- Camera Position: The model scored 0.3, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.51, which is considered good. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.13, which is considered very good. This means that the generated image closely matched the expected aesthetic style.
Overall, the model demonstrates a good understanding of the scene and shot composition, but needs improvement in accurately capturing the intended camera position.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://fal.ai/models/fal-ai/flux-pro/api