AI's Facial Expressions: A Mixed Bag of Triumph and Struggle with Flux-dev
In the realm of artificial intelligence, the ability to generate realistic and expressive images is a coveted skill. This blog post examines the performance of a generative AI model in capturing facial expressions across a range of scenes and camera angles. We’ll explore how the model excels in understanding the context of a scene and its camera position, but also highlight its limitations in achieving the desired aesthetic. Through a detailed analysis of its strengths and weaknesses, we aim to provide insights into the current state of AI-generated facial expressions and the potential for future advancements.
Created with: flux-dev
Silhouetted Against Hope: A Man’s Triumphant Sunset
A powerful image of a man standing with arms outstretched on a rooftop, silhouetted against a vibrant sunset. The scene evokes feelings of hope, inspiration, and triumph, capturing a moment of resilience and possibility.
Prompt
facial-expressions Excitement: Victorious, powerful ; A hero standing triumphantly on a rooftop; high-angle; Hero; a cityscape with a dramatic storm in the background; cinematic
Characteristic
Shot : A man stands with arms raised in the air on a rooftop overlooking a city skyline during sunset.
Aesthetic Score : 0.6
Mood : hopeful, uplifting, triumphant
Quality
Entropy : 6.75
Noise : 61
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly grainy and suffers from a lack of sharpness. Some halos are visible around the man and the buildings.
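The prompts in this post all follow the same semicolon-separated layout. A minimal sketch of splitting one into its parts is shown here; the field names (`category`, `emotion`, `subject`, `camera`, `character`, `setting`, `style`) are assumptions made for illustration and are not names used by flux-dev or by the evaluation pipeline.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PromptParts:
    # All field names are hypothetical labels for the six prompt segments.
    category: str           # e.g. "facial-expressions"
    emotion: str            # e.g. "Excitement"
    descriptors: List[str]  # e.g. ["Victorious", "powerful"]
    subject: str
    camera: str
    character: str
    setting: str
    style: str


def parse_prompt(prompt: str) -> PromptParts:
    """Split a semicolon-separated prompt into its six segments."""
    fields = [part.strip() for part in prompt.split(";")]
    if len(fields) != 6:
        raise ValueError(f"expected 6 segments, got {len(fields)}")
    head, subject, camera, character, setting, style = fields

    # The first segment looks like "<category> <emotion>: <descriptor>, <descriptor>".
    label, _, descriptor_text = head.partition(":")
    category, _, emotion = label.strip().partition(" ")
    descriptors = [d.strip() for d in descriptor_text.split(",") if d.strip()]

    return PromptParts(category, emotion.strip(), descriptors,
                       subject, camera, character, setting, style)


example = parse_prompt(
    "facial-expressions Excitement: Victorious, powerful ; "
    "A hero standing triumphantly on a rooftop; high-angle; Hero; "
    "a cityscape with a dramatic storm in the background; cinematic"
)
print(example.camera, "|", example.setting)
```

Separating the camera and setting segments this way makes it easy to compare what was requested (a high-angle shot with a storm) against what the Shot description reports (a sunset skyline).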
Lost in the City Lights
A young person with dark curly hair stands alone on a bustling city street at night. The background is a blur of passing cars and streetlights, creating a sense of mystery and intrigue. Their pensive expression and the urban setting evoke a mood of quiet contemplation.
Prompt
facial-expressions Excitement: Thrilled, anticipation ; A lone figure; eye-level; Single Person; bustling city street at night; cinematic
Characteristic
Shot : A young person, likely female, walking down a city street at night. They are wearing a black suit and have a pensive expression on their face. The background is blurry, but it is clear that they are in an urban setting.
Aesthetic Score : 0.7
Mood : mysterious, urban, contemplative
Quality
Entropy : 6.71
Noise : 57
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors
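The post does not say how the Entropy value is computed. One common reading, assumed here, is the Shannon entropy of the image's 8-bit grayscale histogram, which ranges from 0 (a flat image) to 8 bits (all 256 levels equally likely); the values of roughly 6.4 to 6.8 reported in this post would fit that scale. The sketch below works on a plain list of pixel values, and the function name is hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a sequence of grayscale pixel values.

    Assumption: this mirrors a histogram-based image entropy; the post's
    own Entropy metric may be computed differently.
    """
    total = len(pixels)
    if total == 0:
        raise ValueError("pixels must not be empty")
    counts = Counter(pixels)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


flat = [128] * 64                # a single gray level carries no information
two_tone = [0, 255] * 32         # two equally likely levels
all_levels = list(range(256))    # every 8-bit level exactly once

print(shannon_entropy(flat), shannon_entropy(two_tone), shannon_entropy(all_levels))
```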
Silhouetted Hero Takes Flight Against a Fiery Sunset
A powerful silhouette of a man in a red cape soars through the sky, bathed in the warm glow of a setting sun. This image evokes a sense of hope, majesty, and the triumph of good over evil. The dramatic composition captures the essence of freedom and empowerment, leaving a lasting impression of strength and resilience.
Prompt
facial-expressions Excitement: Triumphant, exhilarating ; A superhero in mid-air; low-angle; Hero; cityscape with a dramatic sunset; cinematic
Characteristic
Shot : A man in a red cape is flying through the air against a backdrop of a city skyline and a sunset.
Aesthetic Score : 0.7
Mood : hopeful, powerful, adventurous
Quality
Entropy : 6.39
Noise : 47
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image has some artifacts around the edges of the man’s silhouette, suggesting it may have been edited or manipulated.
In the Zone: A Gamer’s Intense Focus
A close-up shot captures the raw intensity of a young gamer, headphones on, eyes glued to the screen. The low lighting and intimate framing draw you into their world, showcasing their unwavering determination as they navigate the digital battlefield.
Prompt
facial-expressions Excitement: Intense, focused ; A gamer’s hands furiously tapping on a keyboard; close-up; Gamer; a dimly lit room with glowing screens; cinematic
Characteristic
Shot : A young person wearing headphones is playing video games on their computer. The scene is dimly lit and features a keyboard and a monitor showing a video game.
Aesthetic Score : 0.6
Mood : focused, intense, digital
Quality
Entropy : 6.44
Noise : 68
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry and the lighting is uneven. There are some artifacts around the edges of the subject and the screen.
Man Flees Fiery Explosion in Desperate Dash
A lone figure, clad in brown, races through a desolate desert landscape, a weapon clutched in his hand. Behind him, a massive fiery explosion erupts, casting an ominous glow across the scene. The man’s determined expression and the dramatic backdrop create a sense of urgency and intensity, hinting at a desperate escape from a perilous situation.
Prompt
facial-expressions Excitement: Brave, adrenaline-fueled ; A hero charging into battle; low-angle; Hero; a chaotic battlefield with explosions and smoke; cinematic
Characteristic
Shot : A lone figure in a dusty, post-apocalyptic world, running through an explosion of fire
Aesthetic Score : 0.7
Mood : dramatic, adventurous, desolate
Quality
Entropy : 6.45
Noise : 84
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image suffers from some minor overexposure in the background and a slight lack of sharpness, especially in the subject’s face.
Sun-Kissed Joy: Girls Running Free in a Vibrant Field
Capture the essence of carefree joy with this vibrant image of girls running through a sun-drenched field. Their smiles and energetic movements create a sense of pure happiness and excitement, making this a perfect representation of youthful exuberance.
Prompt
facial-expressions Excitement: Joyful, carefree ; A group of friends laughing and running; eye-level; Normal People; a sunny park with a vibrant green lawn; cinematic
Characteristic
Shot : Five young girls are running through a green grassy field, bathed in sunlight, towards the camera with happy expressions on their faces. The background shows lush green trees and a bright blue sky.
Aesthetic Score : 0.7
Mood : joyful, carefree, youthful
Quality
Entropy : 6.72
Noise : 92
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible image errors.
Neon Nights: A Moment of Intensity Captured
A young man, bathed in vibrant pink and blue neon light, stares intently towards the camera, headphones on, a hint of surprise in his eyes. The blurred figure behind him suggests a lively social scene, adding a layer of intrigue to this captivating moment. The dramatic lighting and focused gaze create a sense of intensity and playfulness, capturing a fleeting emotion in a vibrant and engaging way.
Prompt
facial-expressions Excitement: Engrossed, focused ; A gamer’s face illuminated by the screen; close-up; Gamer; a dark room with neon lights reflecting on the screen; cinematic
Characteristic
Shot : A young man wearing headphones, lit with pink and blue light, is looking to the side, possibly listening to music or playing a game. Another person is in the background, partially visible.
Aesthetic Score : 0.6
Mood : intense, focused, digital
Quality
Entropy : 6.40
Noise : 61
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : Minor blurring around the edges and some grain in the image. The image lacks sharpness and detail, particularly in the background.
Silhouetted Against the Setting Sun: A Moment of Contemplation
A lone woman in a white dress stands on the precipice of a cliff, her silhouette stark against the fiery hues of the setting sun. The vast ocean stretches before her, mirroring the vastness of her thoughts. This image evokes a sense of serenity, contemplation, and a glimmer of hope.
Prompt
facial-expressions Excitement: Awe-inspiring, liberating ; A woman standing on a cliff overlooking a vast ocean; eye-level; Single Person; dramatic clouds and a setting sun; cinematic
Characteristic
Shot : A lone figure in a white dress stands on the edge of a cliff overlooking the ocean. The sun is setting in the distance, casting a warm glow over the scene.
Aesthetic Score : 0.7
Mood : tranquil, serene, hopeful
Quality
Entropy : 6.56
Noise : 81
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors in the image.
Party Time! Joyful Celebration Captured in a Snapshot
A young girl, beaming with excitement, lies on a colorful blanket, surrounded by the festive atmosphere of a party. Balloons float in the background, and the girl’s infectious joy is palpable, creating a sense of happiness and celebration.
Prompt
facial-expressions Excitement: Happy, celebratory ; A family celebrating a birthday; eye-level; Normal People; a brightly decorated living room with balloons and streamers; cinematic
Characteristic
Shot : A little girl with a party hat is lying on a bed with colorful confetti around her. She is smiling and looking at the camera. There is a man and a woman in the background, also smiling. There are balloons in the background.
Aesthetic Score : 0.8
Mood : joyful, playful, celebratory
Quality
Entropy : 6.79
Noise : 76
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are some slight artifacts in the image, particularly around the girl’s hair. The colors are slightly oversaturated.
The Thrill of the Ride: Capturing the Excitement of a Roller Coaster
This image captures the pure joy and adrenaline rush of a roller coaster ride. The man’s wide-open mouth and exaggerated expression perfectly convey the excitement of the moment, while the blurred background emphasizes the speed and intensity of the experience.
Prompt
facial-expressions Excitement: Thrilling, exhilarating ; A man riding a rollercoaster; POV shot; Single Person; a fast-paced ride with twists and turns; cinematic
Characteristic
Shot : A man is riding a roller coaster, looking up towards the sky. The roller coaster is in the background, with the man in the foreground.
Aesthetic Score : 0.6
Mood : exciting, joyful, carefree
Quality
Entropy : 6.65
Noise : 62
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is a slight blurring of the subject’s hand, likely due to movement or low light conditions.
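To see the ten images side by side, the per-image numbers listed above can be collected and averaged. This is a small sketch using only the values reported in this post; the shortened titles and variable names are chosen here for illustration.

```python
from statistics import mean

# (short title, aesthetic score, prompt CLIP score, likelihood of AI),
# copied from the ten entries in this post.
images = [
    ("Silhouetted Against Hope", 0.6, 0.23, 0.10),
    ("Lost in the City Lights", 0.7, 0.24, 0.20),
    ("Silhouetted Hero Takes Flight", 0.7, 0.25, 0.30),
    ("In the Zone", 0.6, 0.26, 0.20),
    ("Man Flees Fiery Explosion", 0.7, 0.22, 0.20),
    ("Sun-Kissed Joy", 0.7, 0.25, 0.20),
    ("Neon Nights", 0.6, 0.29, 0.10),
    ("Silhouetted Against the Setting Sun", 0.7, 0.28, 0.20),
    ("Party Time!", 0.8, 0.30, 0.10),
    ("The Thrill of the Ride", 0.6, 0.30, 0.20),
]

mean_aesthetic = mean(row[1] for row in images)
mean_clip = mean(row[2] for row in images)
mean_ai_likelihood = mean(row[3] for row in images)

# Image with the highest aesthetic score and image with the lowest CLIP score.
best_aesthetic = max(images, key=lambda row: row[1])[0]
lowest_clip = min(images, key=lambda row: row[2])[0]

print(f"mean aesthetic score:  {mean_aesthetic:.2f}")
print(f"mean prompt CLIP score: {mean_clip:.3f}")
print(f"mean likelihood of AI: {mean_ai_likelihood:.2f}")
print(f"highest aesthetic: {best_aesthetic}; lowest CLIP: {lowest_clip}")
```

These are simple averages over a sample of ten images, so they describe this gallery rather than the model in general.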
Conclusion
The generative AI model understood the described scenes well and responded moderately to camera positions, but fell short of the desired aesthetic. Here’s a breakdown:
- Camera Position: The model scored 0.35, indicating a moderate ability to react to camera positions in the prompt. This is considered okay: it falls below the 0.5 to 0.75 range considered good, and well below the 0.75 threshold for very good.
- Shot Analysis: The model scored 0.56, indicating a good ability to understand the scene described in the prompt. This falls within the 0.5 to 0.75 range considered good; a score above 0.75 is very good.
- Aesthetic Analysis: The model scored 0.15, indicating a slight deviation from the expected aesthetic. This falls just outside the -0.2 to 0.1 range considered very good.
Overall, the model shows promise in understanding the scene and, to a lesser degree, the camera position, but needs improvement in generating images that match the desired aesthetic.
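The score bands mentioned in the breakdown can be written down as a small sketch. The function names are hypothetical, and the label for scores below 0.5 is an assumption: the post only describes 0.35 as "okay" and gives no lower bound for that band. For the aesthetic score, only the "very good" range is stated, so the sketch checks membership in that range and nothing more.

```python
def rate_alignment(score: float) -> str:
    """Band for Camera Position and Shot Analysis scores.

    Stated in the post: 0.5 to 0.75 is good, above 0.75 is very good.
    Assumed: anything below 0.5 is labeled "okay".
    """
    if score > 0.75:
        return "very good"
    if score >= 0.5:
        return "good"
    return "okay"


def aesthetic_is_very_good(deviation: float) -> bool:
    """True if the aesthetic deviation lies in the stated -0.2 to 0.1 range."""
    return -0.2 <= deviation <= 0.1


print("Camera Position:", rate_alignment(0.35))
print("Shot Analysis:", rate_alignment(0.56))
print("Aesthetic within very good range:", aesthetic_is_very_good(0.15))
```

With the scores reported above, the camera position lands in the assumed "okay" band, the shot analysis lands in "good", and the aesthetic deviation of 0.15 sits just outside the stated range.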