AI's Facial Expressions: A Step Forward, But Still Room for Growth with Flux-pro
- 9 minutes read - 1797 wordsTable of Contents
Facial expressions are a powerful tool for conveying emotions and adding depth to visual storytelling. In the realm of generative AI, the ability to accurately depict these expressions is crucial for creating realistic and engaging images. This blog post examines the performance of a generative AI model in capturing facial expressions across various scenes, highlighting its strengths and weaknesses in this critical aspect of image generation.
Created with: flux-pro
Lost in the City Lights: A Moment of Contemplation
A young woman, her glasses reflecting the urban glow, walks through the night, her gaze fixed on the distant cityscape. The soft lighting casts a veil of mystery, inviting viewers to share in her contemplative mood.
Prompt
facial-expressions Excitement: Thrilled, anticipation ; A lone figure; eye-level; Single Person; bustling city street at night; cinematic
Characteristic
Shot : A young woman with glasses is standing on a street, looking at the lights of the city. She is wearing a brown jacket and a scarf.
Aesthetic Score : 0.7
Mood : melancholy, contemplative, urban
Quality
Entropy : 6.81
Noise : 81
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no noticeable artifacts or errors in the image.
Superman Soars into a Dramatic Sunset
A superhero, possibly Superman, takes flight against a breathtaking sunset backdrop. The scene evokes a sense of epic hope and dramatic excitement, capturing the essence of a superhero’s journey.
Prompt
facial-expressions Excitement: Triumphant, exhilarating ; A superhero in mid-air; low-angle; Hero; cityscape with a dramatic sunset; cinematic
Characteristic
Shot : A superhero, dressed in a red cape and blue suit, is flying through the air against a sunset backdrop with a cityscape in the distance.
Aesthetic Score : 0.7
Mood : dynamic, hopeful, powerful
Quality
Entropy : 6.81
Noise : 74
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.40
Image errors : No significant errors detected
Friends Run Free in Sunny Park
A vibrant snapshot of pure joy! This photo captures a group of friends running through a park on a sunny day, their laughter echoing through the green space. The shallow depth of field isolates them, emphasizing their carefree energy and the moment’s excitement.
Prompt
facial-expressions Excitement: Joyful, carefree ; A group of friends laughing and running; eye-level; Normal People; a sunny park with a vibrant green lawn; cinematic
Characteristic
Shot : A group of five young adults are running and laughing in a park on a sunny day.
Aesthetic Score : 0.6
Mood : joyful, carefree, energetic
Quality
Entropy : 6.53
Noise : 83
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has a slight blur, which could be due to motion blur or a shallow depth of field.
In the Zone: Gamer’s Focus Under Dramatic Lighting
A captivating image of a gamer immersed in their game, illuminated by dramatic lighting that highlights their focused hands and the keyboard. The scene features two monitors, one displaying a world map and the other showcasing the game itself, creating a sense of intensity and technological immersion.
Prompt
facial-expressions Excitement: Intense, focused ; A gamer’s hands furiously tapping on a keyboard; close-up; Gamer; a dimly lit room with glowing screens; cinematic
Characteristic
Shot : A person is sitting in front of a computer screen and playing a game. The person is wearing a headset and is looking at the screen. The screen is showing a map of the world. The person’s hands are on a keyboard.
Aesthetic Score : 0.6
Mood : focused, intense, competitive
Quality
Entropy : 6.74
Noise : 57
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has slight blurriness, especially around the edges. Some artifacts are visible on the screen, likely from compression.
Finding Serenity at Sunset
A woman embraces the tranquility of the moment, silhouetted against a breathtaking sunset over the vast ocean. The scene evokes feelings of peace, hope, and serenity.
Prompt
facial-expressions Excitement: Awe-inspiring, liberating ; A woman standing on a cliff overlooking a vast ocean; eye-level; Single Person; dramatic clouds and a setting sun; cinematic
Characteristic
Shot : A lone figure stands on a clifftop with her arms raised, overlooking a vast ocean with crashing waves. The sun is setting, casting golden light on the water, creating a serene and beautiful scene.
Aesthetic Score : 0.7
Mood : tranquil, contemplative, hopeful
Quality
Entropy : 6.85
Noise : 94
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly overexposed, particularly in the sky, which is losing some of its detail.
Running from the Ruins: A Soldier’s Desperate Escape
A powerful image captures the intensity of war as a soldier races through a devastated landscape. The scene is filled with a sense of urgency and drama, leaving the viewer breathless.
Prompt
facial-expressions Excitement: Brave, adrenaline-fueled ; A hero charging into battle; low-angle; Hero; a chaotic battlefield with explosions and smoke; cinematic
Characteristic
Shot : A man in a military-style jacket runs through a war-torn landscape, with smoke and debris in the background.
Aesthetic Score : 0.7
Mood : intense, action, dramatic
Quality
Entropy : 6.72
Noise : 100
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors or artifacts.
Birthday Wishes and Joyful Smiles
A heartwarming scene of a young girl celebrating her birthday, blowing out candles on a cake with a joyful smile. The warm lighting and soft colors capture the celebratory mood, while the adults in the background add a touch of family love and happiness.
Prompt
facial-expressions Excitement: Happy, celebratory ; A family celebrating a birthday; eye-level; Normal People; a brightly decorated living room with balloons and streamers; cinematic
Characteristic
Shot : A family celebrating a birthday with cake and balloons. The girl in the center is the birthday girl wearing a blue hat and smiling.
Aesthetic Score : 0.7
Mood : joyful, celebratory, loving
Quality
Entropy : 6.85
Noise : 71
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible image errors.
Lost in the Rhythm: A Moment of Contemplation
A young man, bathed in the ethereal glow of blue and red light, gazes off into the distance, lost in thought. His headphones suggest a world of sound, while the dramatic lighting adds an air of mystery and intrigue. This image captures a moment of quiet contemplation, leaving the viewer to wonder what thoughts are swirling in his mind.
Prompt
facial-expressions Excitement: Engrossed, focused ; A gamer’s face illuminated by the screen; close-up; Gamer; a dark room with neon lights reflecting on the screen; cinematic
Characteristic
Shot : A close-up portrait of a young man wearing headphones, illuminated by colorful lights, likely in a gaming or studio setting.
Aesthetic Score : 0.7
Mood : focused, intense, contemplative
Quality
Entropy : 6.63
Noise : 71
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.30
Image errors : Some minor noise is present, particularly in the darker areas of the image. The focus is sharp on the subject, but there’s a slight blur on the background.
Roller Coaster Ride of Joy: A Man’s Thrilling Smile
Capture the exhilaration of a roller coaster ride with this image. A man, beaming with happiness, looks directly at the camera, his smile amplified by the blur of speed. The scene evokes a sense of excitement and thrill, making you feel like you’re right there on the ride.
Prompt
facial-expressions Excitement: Thrilling, exhilarating ; A man riding a rollercoaster; POV shot; Single Person; a fast-paced ride with twists and turns; cinematic
Characteristic
Shot : A man on a roller coaster, looking towards the camera and smiling widely, capturing the thrill of the ride. The background features a bright sky and the track of the roller coaster.
Aesthetic Score : 0.6
Mood : joyful, exhilarating, adventurous
Quality
Entropy : 6.81
Noise : 60
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image appears to be slightly overexposed, causing some loss of detail in the highlights, particularly in the sky. The focus seems slightly off, resulting in a slightly blurry image.
Silhouetted Against the Setting Sun, a Symbol of Hope
A powerful image captures the essence of inspiration and hope. A man stands on a rooftop, arms outstretched, bathed in the golden light of the setting sun. His silhouette, sharp and dramatic, is framed against the city skyline, creating a sense of scale and power. The billowing cloak adds a dynamic element to the composition, further emphasizing the feeling of hope and possibility.
Prompt
facial-expressions Excitement: Victorious, powerful ; A hero standing triumphantly on a rooftop; high-angle; Hero; a cityscape with a dramatic storm in the background; cinematic
Characteristic
Shot : A man in a cape stands with his arms outstretched, overlooking a city skyline at sunset. The sky is filled with dramatic clouds, and the cityscape is bathed in a warm, golden light.
Aesthetic Score : 0.7
Mood : inspirational, hopeful, triumphant
Quality
Entropy : 6.43
Noise : 81
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some minor noise and grain, but it is not very noticeable.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.35, which is below the “good” range of 0.5 to 0.75. This suggests that the model didn’t fully capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.54, which falls within the “good” range. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.15, which is outside the “very good” range of -0.2 to 0.1. This suggests that the generated image’s aesthetic deviated from the expected aesthetic described in the prompt.
Overall, the model shows promise in understanding scene composition and camera positioning, but needs improvement in capturing the desired aesthetic.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://fal.ai/models/fal-ai/flux-pro/api