AI's Facial Expressions: A Mixed Bag with Leonardo-ai
- 9 minutes read - 1795 wordsTable of Contents
Facial expressions are a powerful tool in storytelling, conveying emotions and intentions without words. Generative AI is increasingly being used to create images with specific facial expressions, but how well does it capture the nuances of human emotion? This blog post explores the capabilities of a generative AI model in understanding and implementing facial expressions, camera angles, and aesthetics. We’ll analyze the results of a test, highlighting the model’s strengths and weaknesses, and discuss the potential of this technology in the future.
Created with: leonardo-ai
Lost in Thought: A Moment of Serenity in the Park
A solitary figure sits on a bench, bathed in the warm glow of the sun. The bare trees and soft lighting create a sense of peace and contemplation, as the man gazes into the distance, lost in his own thoughts. This image evokes a mood of pensive serenity, capturing the quiet beauty of a moment of introspection.
Prompt
facial-expressions Thoughtfulness: Melancholy, contemplative ; A lone figure sitting on a park bench; eye-level; Single Person; a bustling city park in the background; cinematic
Characteristic
Shot : A young man sits on a bench in a park, looking off into the distance. The scene is shot from a low angle, with the man’s figure dominating the frame. The background is a blur of trees and leaves, with the sun shining through the branches. The image has a melancholic, contemplative feel.
Aesthetic Score : 0.7
Mood : melancholic, contemplative, thoughtful
Quality
Entropy : 6.87
Noise : 103
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly overexposed, which washes out some of the detail in the background. There is also a slight chromatic aberration, which is visible around the edges of the subject’s figure.
Heroic Silhouette Against the Dusk
A superhero stands tall on a rooftop, their silhouette stark against the vibrant cityscape at dusk. The dramatic lighting and contemplative mood evoke a sense of heroism and power.
Prompt
facial-expressions Thoughtfulness: Reflective, introspective ; A superhero standing on a rooftop, looking out at the city; eye-level; Hero; a sprawling cityscape with twinkling lights; cinematic
Characteristic
Shot : A lone superhero figure stands on a rooftop overlooking a cityscape at dusk, looking out towards the horizon.
Aesthetic Score : 0.7
Mood : dramatic, heroic, contemplative
Quality
Entropy : 6.78
Noise : 94
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.30
Image errors : The background cityscape appears slightly blurred and lacks detail, which could be due to compression or editing. The lighting on the superhero’s face is somewhat harsh.
Lost in Thought: A Moment of Contemplation on the Train
A woman, bathed in soft light, gazes out the train window, her book forgotten in her lap. The passing scenery blurs, mirroring the quiet introspection in her eyes. The muted colors and calm atmosphere evoke a sense of longing and contemplation, capturing a fleeting moment of quiet reflection.
Prompt
facial-expressions Thoughtfulness: Peaceful, absorbed ; A woman reading a book on a train; eye-level; Normal Person; a blurry view of passing scenery outside the window; cinematic
Characteristic
Shot : A woman is sitting in a train looking out the window, reading a book. The train is moving through a rural landscape.
Aesthetic Score : 0.7
Mood : pensive, contemplative, peaceful
Quality
Entropy : 6.82
Noise : 91
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has no significant errors.
Lost in the Digital World: A Moment of Intense Focus
A young man, shrouded in the dim glow of his computer screen, is completely absorbed in his task. Headphones on, eyes fixed on the map displayed on his monitor, he exudes an air of intense concentration. The low lighting and his focused gaze create a sense of mystery and intrigue, leaving us to wonder what digital world he’s exploring.
Prompt
facial-expressions Thoughtfulness: Intense, focused ; A gamer sitting in a dimly lit room, staring intently at a computer screen; eye-level; Gamer; a cluttered desk with gaming peripherals; cinematic
Characteristic
Shot : A young man is sitting in a dimly lit room wearing headphones. He is looking intently at a computer screen, his face illuminated by the screen’s light. The image has a moody and contemplative feel.
Aesthetic Score : 0.6
Mood : intense, focused, dramatic
Quality
Entropy : 6.03
Noise : 87
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slightly grainy texture and some noise in the shadows. This could be due to poor lighting or low-resolution editing. The screen reflections could be distracting.
Solitude by the Stormy Sea
A lone figure walks along a wet, sandy beach, their path leading towards a choppy sea under a cloudy sky. The scene evokes a sense of melancholic reflection and tranquility, with the dramatic effect heightened by the figure’s isolation and the stormy backdrop.
Prompt
facial-expressions Thoughtfulness: Solitary, introspective ; A man walking alone on a deserted beach; eye-level; Single Person; the vast ocean stretching out before him; cinematic
Characteristic
Shot : A solitary figure walks along a wet sandy beach towards the ocean, under a cloudy sky at dusk.
Aesthetic Score : 0.7
Mood : melancholic, contemplative, lonely
Quality
Entropy : 6.47
Noise : 93
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant image errors observed.
Firefighter Stands Tall Amidst the Flames
A firefighter in full gear faces the inferno, his calm demeanor a stark contrast to the raging fire behind him. The scene evokes a sense of seriousness and dramatic tension, highlighting the bravery and resilience of those who fight fires.
Prompt
facial-expressions Thoughtfulness: Somber, reflective ; A firefighter standing amidst the ruins of a fire; eye-level; Hero; smoke and debris filling the air; cinematic
Characteristic
Shot : A firefighter in full gear stands in front of a burning building, the flames are intense and the smoke billows up from the scene.
Aesthetic Score : 0.6
Mood : dramatic, somber, intense
Quality
Entropy : 6.69
Noise : 99
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible artifacts or errors in the image.
A Moment of Intimacy: Sharing Laughter and Love over a Delicious Meal
Experience the warmth and happiness of a couple enjoying a delightful dinner in their cozy kitchen. With plates of food, wine glasses, and a bottle of wine on the table, they share stories and laughter, creating an intimate and memorable moment.
Prompt
facial-expressions Thoughtfulness: Intimate, connected ; A family gathered around a dinner table; eye-level; Normal People; a warm, inviting kitchen setting; cinematic
Characteristic
Shot : A couple is enjoying a meal together in a cozy kitchen with warm lighting. There is a window with a view of a natural landscape outside.
Aesthetic Score : 0.7
Mood : intimate, cozy, happy
Quality
Entropy : 6.79
Noise : 97
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible artifacts or errors in the image.
Lost in the Game: The Intensity of Digital Immersion
A man, headphones on, eyes glued to the screen, embodies the focused intensity of a gamer lost in the digital world. The dramatic lighting and close-up shot draw you into his experience, highlighting the emotional engagement of gaming.
Prompt
facial-expressions Thoughtfulness: Excited, immersed ; A gamer holding a controller, eyes glued to the screen; close-up; Gamer; a vibrant, colorful gaming world displayed on the monitor; cinematic
Characteristic
Shot : A young man is playing a video game on his computer. He is wearing headphones and looking intensely at the screen. The room is dimly lit and the image is focused on the man’s face and his hands on the keyboard.
Aesthetic Score : 0.6
Mood : intense, focused, engrossed
Quality
Entropy : 6.47
Noise : 93
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some blurriness on the background screens, particularly the one on the right.
Finding Serenity in the Blossoms
A young woman finds peace and tranquility amidst the beauty of a blooming park. The soft sunlight and her focused expression create a calming atmosphere, inviting viewers to share in the moment of serenity.
Prompt
facial-expressions Thoughtfulness: Peaceful, creative ; A woman sitting on a park bench, sketching in a notebook; eye-level; Single Person; a serene park setting with blooming flowers; cinematic
Characteristic
Shot : A young woman is sitting on a park bench under a tree with pink blossoms, reading a book.
Aesthetic Score : 0.7
Mood : calm, peaceful, thoughtful
Quality
Entropy : 6.95
Noise : 102
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors.
Hero Faces the Storm
A brooding superhero stares into a turbulent sky, hinting at an impending battle or a dark secret. The dramatic lighting and intense gaze create a sense of anticipation and danger.
Prompt
facial-expressions Thoughtfulness: Determined, resolute ; A superhero looking up at the sky, a determined expression on their face; eye-level; Hero; a dramatic sky with dark clouds gathering; cinematic
Characteristic
Shot : A man in a superhero costume, looking up towards the sky, with dark, stormy clouds in the background.
Aesthetic Score : 0.7
Mood : intense, dramatic, serious
Quality
Entropy : 6.91
Noise : 93
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors in the image.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.1, indicating a very low ability to accurately represent the camera position described in the prompt. This suggests the model may not be very good at understanding and implementing camera angles.
- Shot Analysis: The model scored 0.565, which is considered good. This means the model was able to understand the scene described in the prompt and create an image that reflects it fairly well.
- Aesthetic Analysis: The model scored 0.08, which is considered very good. This means the generated image closely matched the expected aesthetic style.
Overall, the model seems to be better at understanding the scene and achieving the desired aesthetic than it is at accurately representing the camera position.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://leonardo.ai