AI's Facial Expressions: A Step Forward, But Still Room for Growth with Leonardo-ai
The ability to generate realistic facial expressions is a crucial aspect of creating compelling and engaging visual content. This blog post examines the performance of a generative AI model in this area, focusing on its ability to capture the nuances of human emotion through facial expressions. We’ll explore how the model interprets prompts that include specific camera positions, shot compositions, and aesthetic styles, and analyze its strengths and weaknesses in each area. By understanding the model’s capabilities and limitations, we can gain valuable insights into the future of AI-generated imagery and its potential to revolutionize the creative process.
Created with: leonardo-ai
Lost in the Neon Glow: A Mysterious Figure Walks the Wet Streets
A lone figure disappears into the shadows of a neon-lit alleyway, their silhouette reflected in the puddles on the wet pavement. The scene evokes a sense of mystery and intrigue, leaving you wondering about their destination and the secrets they hold.
Prompt
facial-expressions Surprise: Eerie, suspenseful ; A lone figure walking down a deserted street; eye-level; Single Person; neon signs reflecting in puddles; cinematic
Characteristic
Shot : A lone figure walks down a narrow, wet alleyway in an Asian city. The alley is lined with buildings, some with brightly lit neon signs, and the street is slick with rain.
Aesthetic Score : 0.8
Mood : gloomy, mysterious, urban
Quality
Entropy : 6.38
Noise : 105
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has a slightly washed-out look, and there is some noise in the shadows.
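The post does not say how the Entropy figure in each Quality block is computed. One common reading, and it is only an assumption here, is the Shannon entropy of the image's 8-bit grayscale histogram, which ranges from 0 bits (a single flat tone) to 8 bits (every gray level equally frequent). Under that reading, values between 6 and 7 would point to images with a fairly wide tonal spread. A minimal sketch of that calculation, using toy pixel lists rather than the images shown in this post:

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a sequence of 8-bit grayscale values.

    This is an assumed definition; the post does not state which
    entropy measure its evaluation pipeline uses.
    """
    counts = Counter(pixels)
    total = len(pixels)
    # Sum p * log2(1/p) over every gray level that actually occurs.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


# Hypothetical toy inputs, not pixels from the images in this post.
flat = [128] * 64        # a single gray level
ramp = list(range(256))  # every gray level exactly once

print(shannon_entropy(flat))
print(shannon_entropy(ramp))
```

The flat list carries no tonal information and the ramp reaches the 8-bit maximum, so real images fall somewhere between the two.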
Heroic Silhouette: A Superhero Stands Guard at Dusk
A dramatic image captures a superhero in a blue and red costume, silhouetted against the city lights at dusk. The scene evokes a sense of hope and heroism, as the figure stands watch over the cityscape.
Prompt
facial-expressions Surprise: Triumphant, awe-inspiring ; A superhero standing on a rooftop, looking out over the city; eye-level; Hero; cityscape at night, with flashing lights and sirens in the distance; cinematic
Characteristic
Shot : A superhero stands on a rooftop overlooking a cityscape at dusk. The city lights twinkle in the distance, and the sky is a mix of purple and orange.
Aesthetic Score : 0.7
Mood : heroic, determined, dramatic
Quality
Entropy : 6.59
Noise : 95
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some minor artifacts in the background, particularly in the sky and the lights. The subject’s costume and skin texture appear overly smooth and artificial.
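The prompts in this post appear to share one semicolon-separated layout: a tag and emotion followed by a mood, then the scene, the camera position, the subject type, the setting, and the style. The sketch below splits a prompt into those parts so that each one can be compared against the generated image. The field names and the `PromptParts` class are hypothetical labels chosen for illustration; the post itself does not name the fields.

```python
from dataclasses import dataclass


@dataclass
class PromptParts:
    # Field names are assumptions; the post does not label these slots.
    tag: str
    emotion: str
    mood: str
    scene: str
    camera: str
    subject: str
    setting: str
    style: str


def parse_prompt(prompt: str) -> PromptParts:
    """Split a prompt of the form 'tag Emotion: mood ; scene; camera; subject; setting; style'."""
    fields = [part.strip() for part in prompt.split(";")]
    if len(fields) != 6:
        raise ValueError(f"expected 6 semicolon-separated fields, got {len(fields)}")
    # The first field holds the tag, the emotion, and the mood words.
    head, mood = fields[0].split(":", 1)
    tag, emotion = head.split(None, 1)
    return PromptParts(tag, emotion.strip(), mood.strip(), *fields[1:])


parts = parse_prompt(
    "facial-expressions Surprise: Eerie, suspenseful ; "
    "A lone figure walking down a deserted street; eye-level; Single Person; "
    "neon signs reflecting in puddles; cinematic"
)
print(parts.camera, "|", parts.style)
```

Separating the camera and style slots this way makes it easier to check, entry by entry, whether the requested camera position or aesthetic actually shows up in the result.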
Suspenseful Dining: A Tense Encounter
In a dimly lit dining room, a couple sits at a table filled with plates of food and drinks. The man’s direct gaze at the camera creates a sense of unease, while the woman’s distant look adds to the dramatic tension. The dark view from the window and the strategic camera position heighten the suspicious mood, making for a suspenseful and dramatic scene.
Prompt
facial-expressions Surprise: Innocent, unsettling ; A family having dinner together, unaware of the approaching danger; eye-level; Normal People; cozy kitchen, warm lighting; cinematic
Characteristic
Shot : A man and a woman are sitting at a dinner table in a dimly lit room. The woman is looking at something off-camera with a worried expression. The man is looking directly at the camera with a concerned expression.
Aesthetic Score : 0.6
Mood : tense, dramatic, anxious
Quality
Entropy : 6.63
Noise : 96
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : There is a slight blur in the background of the image, and the lighting is not evenly distributed. The image has a subtle but noticeable grain, which adds to the feeling of realism.
Lost in the Code: A Young Man’s Intense Focus Under Blue Light
A young man, bathed in blue light, is completely absorbed in his work. Headphones on, fingers flying across the keyboard, he’s lost in the digital world, his concentration palpable. The blue lighting adds a sense of mystery and intensity, highlighting the man’s unwavering focus.
Prompt
facial-expressions Surprise: Intense, focused ; A gamer sitting in a dimly lit room, eyes glued to the screen; close-up; Gamer; glowing monitor, keyboard, and mouse; cinematic
Characteristic
Shot : A young man wearing headphones is sitting in a dark room, lit by a blue screen and keyboard, and looking intently at his computer.
Aesthetic Score : 0.7
Mood : focused, intense, concentrated
Quality
Entropy : 6.29
Noise : 92
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible artifacts or errors in the image.
Caught in the Blur: A Woman’s Shocked Expression Amidst the Chaos
A woman stands frozen in shock on a bustling train platform, her expression a stark contrast to the blurry, anonymous crowd around her. The scene evokes a sense of tension and suspense, leaving the viewer wondering what has transpired.
Prompt
facial-expressions Surprise: Panic, frantic ; A woman standing in a crowded train station, suddenly realizing she’s lost her purse; eye-level; Single Person; bustling crowd, hurried footsteps; cinematic
Characteristic
Shot : A young woman is standing on a train platform, looking scared. She is surrounded by a crowd of people who are walking past her. A train is visible in the background. The scene is set in a public transportation setting.
Aesthetic Score : 0.6
Mood : tense, anxious, dramatic
Quality
Entropy : 6.88
Noise : 100
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors in the image.
Firefighter Braces Against Blazing Inferno
A lone firefighter stands defiant against a raging inferno, the flames casting an eerie glow on their protective gear. The scene is one of intense drama, with billowing smoke and scattered debris painting a picture of destruction and somber reflection.
Prompt
facial-expressions Surprise: Brave, heroic ; A hero emerging from a burning building, carrying a child; eye-level; Hero; smoke and flames, collapsing structure; cinematic
Characteristic
Shot : A firefighter in full gear stands in front of a burning building. The building is engulfed in flames and smoke. There is debris and rubble around the building.
Aesthetic Score : 0.5
Mood : dramatic, somber, intense
Quality
Entropy : 6.81
Noise : 103
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors in the image.
Sunny Day Picnic with Friends
A tranquil scene of three friends enjoying a relaxed picnic in a park on a sunny day. The image evokes a sense of calm and friendship, capturing the essence of a casual gathering.
Prompt
facial-expressions Surprise: Peaceful, ominous ; A group of friends enjoying a picnic in a park, unaware of the strange object falling from the sky; eye-level; Normal People; sunny day, green grass, blue sky; cinematic
Characteristic
Shot : Three young adults are having a picnic in a park. They are sitting on a blanket, and there is a basket of food in front of them. The scene is casual and relaxed, with a bright and sunny atmosphere.
Aesthetic Score : 0.6
Mood : relaxed, casual, happy
Quality
Entropy : 6.88
Noise : 107
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : No notable errors in the image.
Caught in the Act: A Shocking Close-Up
A man’s face, frozen in a moment of surprise, is illuminated by a stark blue light. His hands hover over a keyboard, hinting at a secret or a dangerous situation. The close-up shot and dramatic lighting create a sense of intense suspense, leaving the viewer wondering what has just transpired.
Prompt
facial-expressions Surprise: Disbelief, frustration ; A gamer’s hands frantically moving across the keyboard, as a sudden glitch appears on the screen; close-up; Gamer; distorted screen, flashing lights; cinematic
Characteristic
Shot : A man with a surprised expression is hunched over a keyboard, his hands poised over the keys. The image is shot from a low angle, giving a sense of urgency and tension.
Aesthetic Score : 0.5
Mood : tense, dramatic, suspenseful
Quality
Entropy : 6.23
Noise : 90
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no noticeable image errors or artifacts.
Sunlight Dappled Journey Through a Serene Forest
A man, lost in thought, walks through a lush forest bathed in golden sunlight. The peaceful atmosphere and the play of light create a sense of mystery and wonder, highlighting the solitary nature of his journey.
Prompt
facial-expressions Surprise: Mystical, awe-inspiring ; A man walking through a forest, suddenly finding himself face-to-face with a mythical creature; eye-level; Single Person; dense foliage, dappled sunlight; cinematic
Characteristic
Shot : A man walks through a dense forest as sunlight streams through the trees, creating a warm, hazy atmosphere. He is dressed in a plaid shirt and jeans and is walking on a path through the woods.
Aesthetic Score : 0.7
Mood : tranquil, peaceful, serene
Quality
Entropy : 6.70
Noise : 106
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image appears to be slightly blurry. There is some noise and compression artifacting, which is most noticeable in the shadow areas.
Soldier’s Grim Determination Amidst the Inferno
A lone soldier, clad in full combat gear, navigates a war-torn landscape, his face etched with tension as flames and smoke billow behind him. The dramatic lighting and the soldier’s somber expression create a powerful sense of danger and urgency, capturing the intensity of the battlefield.
Prompt
facial-expressions Surprise: Melancholy, reflective ; A hero standing on a battlefield, surrounded by fallen enemies, realizing the true cost of victory; eye-level; Hero; smoke and debris, wounded soldiers; cinematic
Characteristic
Shot : A soldier walks through a destroyed battlefield, smoke and fire in the background. The soldier looks concerned and possibly injured.
Aesthetic Score : 0.7
Mood : dramatic, tense, gritty
Quality
Entropy : 6.84
Noise : 98
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image appears to have a slight blur to the soldier’s face and the background seems a bit too smooth. This could be an artifact from the camera sensor or image processing.
Conclusion
The results show that the generative AI model performed well in understanding the scene and the aesthetic, but struggled with the camera position. Here’s a breakdown:
- Camera Position: The model scored 0.2, indicating that it responds poorly to camera positions given in the prompt. This means the generated image’s camera position deviates significantly from what was requested.
- Shot Analysis: The model scored 0.54, which is good. This means the generated image’s shot composition is fairly close to what was described in the prompt.
- Aesthetic Analysis: The model scored 0.15, which is very good. This means the generated image’s aesthetic is very close to what was expected.
Overall, the model seems to be better at understanding the scene and its aesthetic than it is at accurately capturing the camera position.
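The per-image figures listed in the entries above can also be summarized directly. The sketch below copies the Aesthetic Score, Prompt Clip Score, and Likelihood of AI values from the ten entries and averages them; the shortened titles, the tuple layout, and the `summarize` helper are illustrative assumptions. It does not reproduce the three conclusion scores, since the post does not describe how those were derived.

```python
from statistics import mean

# (short title, aesthetic score, prompt CLIP score, likelihood of AI),
# copied from the ten entries in this post.
RESULTS = [
    ("Lost in the Neon Glow", 0.8, 0.32, 0.10),
    ("Heroic Silhouette", 0.7, 0.26, 0.80),
    ("Suspenseful Dining", 0.6, 0.29, 0.10),
    ("Lost in the Code", 0.7, 0.28, 0.10),
    ("Caught in the Blur", 0.6, 0.31, 0.20),
    ("Firefighter Braces", 0.5, 0.22, 0.20),
    ("Sunny Day Picnic", 0.6, 0.26, 0.20),
    ("Caught in the Act", 0.5, 0.24, 0.20),
    ("Sunlight Dappled Journey", 0.7, 0.27, 0.10),
    ("Soldier's Grim Determination", 0.7, 0.30, 0.10),
]


def summarize(results):
    """Average each metric and pick out the extreme entries."""
    return {
        "aesthetic_mean": round(mean(r[1] for r in results), 3),
        "clip_mean": round(mean(r[2] for r in results), 3),
        "ai_likelihood_mean": round(mean(r[3] for r in results), 3),
        "lowest_clip": min(results, key=lambda r: r[2])[0],
        "highest_ai_likelihood": max(results, key=lambda r: r[3])[0],
    }


for name, value in summarize(RESULTS).items():
    print(f"{name}: {value}")
```

With the figures above, this works out to a mean aesthetic score of 0.64, a mean prompt CLIP score of 0.275, and a mean AI likelihood of 0.21. The firefighter image has the lowest prompt CLIP score (0.22), and the superhero image is the only one rated as likely AI-generated (0.80).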
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://leonardo.ai