AI's Facial Expressions: A Step Towards Realism, But Still a Work in Progress with Freepik
- 9 minutes read - 1882 wordsTable of Contents
The ability to generate realistic facial expressions is a crucial step towards creating truly immersive and engaging AI-generated content. This blog post examines the performance of a generative AI model in capturing facial expressions within various scenes. We’ll explore how the model excels in capturing the desired aesthetic style, but struggles with accurately representing the scene and camera position. Through this analysis, we gain insights into the current capabilities and limitations of AI in generating realistic facial expressions, paving the way for future advancements in this field.
Created with: freepik
Lost in the Neon Glow: A Solitary Figure Walks the City Streets
A lone figure, shrouded in mystery, navigates the vibrant, rain-slicked streets of a bustling city at night. The neon lights cast a colorful glow, reflecting off the wet pavement and creating a sense of nostalgia and intrigue. The silhouette of the figure adds to the dramatic effect, leaving their identity and purpose shrouded in secrecy.
Prompt
facial-expressions Skepticism: Melancholy, disillusioned ; A lone figure, back turned, walking away from a brightly lit city skyline; eye-level; Single Person; Urban, neon signs, bustling crowds; cinematic
Characteristic
Shot : A lone figure walks down a bustling city street at night. The street is lined with tall buildings and lit by bright neon signs.
Aesthetic Score : 0.7
Mood : gloomy, urban, lonely
Quality
Entropy : 6.65
Noise : 65
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some slight blurring in the background and some artifacts in the neon signs.
Superman Stands Against the Flames: A City’s Hope in the Face of Disaster
A dramatic image captures the essence of heroism as Superman, silhouetted against a fiery cityscape, prepares to face an unknown threat. The mood is intense, the stakes are high, and the promise of sacrifice hangs heavy in the air.
Prompt
facial-expressions Skepticism: Doubtful, conflicted ; A superhero, cape billowing, standing on a rooftop, looking down at a city in chaos; eye-level; Hero; Smoke, fire, destruction; cinematic
Characteristic
Shot : Superman stands on a rooftop overlooking a burning city. Smoke billows in the sky as the hero stares out at the destruction.
Aesthetic Score : 0.7
Mood : dramatic, heroic, somber
Quality
Entropy : 6.84
Noise : 51
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to be digitally altered. Some textures and details appear blurry and unrealistic.
Local Woman Intrigued by Shocking News
A young woman sits at a cafe, her brow furrowed in concentration as she reads a newspaper with a captivating headline. Her expression suggests a mix of intrigue, thoughtfulness, and curiosity, leaving the reader wondering what could have caught her attention so intensely.
Prompt
facial-expressions Skepticism: Cynical, disbelieving ; A woman, dressed in everyday clothes, holding a newspaper with a sensational headline; eye-level; Normal People; Coffee shop, people going about their day; cinematic
Characteristic
Shot : A young woman in a cafe is looking directly at the camera while holding a newspaper in front of her. The cafe is out of focus in the background.
Aesthetic Score : 0.7
Mood : serious, focused, contemplative
Quality
Entropy : 6.88
Noise : 65
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors or artifacts in the image.
Lost in Thought Amidst the Pizza Boxes
A young man sits surrounded by a mountain of pizza boxes, his gaze fixed on something beyond the frame. The scene evokes a sense of mundane boredom, yet the pensive expression on his face hints at a deeper thought process. The clutter of boxes adds a touch of dramatic effect, suggesting a moment of reflection amidst the chaos.
Prompt
facial-expressions Skepticism: Suspicious, wary ; A gamer, hunched over a computer screen, surrounded by empty pizza boxes and energy drink cans; close-up; Gamer; Dark room, flashing lights, gaming peripherals; cinematic
Characteristic
Shot : A young man, wearing a blue t-shirt with a pizza design, leans on a table piled high with cardboard boxes, cans, and pizza boxes. He looks slightly bored or contemplative.
Aesthetic Score : 0.4
Mood : bored, contemplative, mundane
Quality
Entropy : 6.83
Noise : 51
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors, but the image appears slightly underexposed and the lighting is uneven.
Lost in Thought at the Bar
A young man sits alone in a dimly lit bar, his contemplative gaze and half-empty glass of beer hinting at a story of loneliness and introspection. The scene evokes a sense of mystery and melancholic beauty.
Prompt
facial-expressions Skepticism: Doubtful, introspective ; A man, sitting alone in a dimly lit bar, staring into his drink; eye-level; Single Person; Empty bar, flickering neon lights, rain outside; cinematic
Characteristic
Shot : A young man is sitting alone at a bar, looking thoughtful. It’s a dimly lit, nighttime scene with a moody atmosphere.
Aesthetic Score : 0.7
Mood : melancholy, contemplative, lonely
Quality
Entropy : 6.45
Noise : 45
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is slight chromatic aberration present in the image, especially noticeable around the edges of objects.
Armed and Ready: A Moment of High Tension
A young man, weapon in hand, stands before a crowd, his gaze locked on the camera. The scene is charged with intensity, suspense, and a palpable sense of determination. The dramatic lighting and the man’s unwavering expression heighten the tension, leaving the viewer on the edge of their seat, anticipating what will unfold next.
Prompt
facial-expressions Skepticism: Uncertain, hesitant ; A hero, standing in front of a crowd, holding a weapon, but looking conflicted; eye-level; Hero; cheering crowd, bright lights, stage; cinematic
Characteristic
Shot : A young man with a weapon standing in front of a crowd of people in a dark room with some lights in the background.
Aesthetic Score : 0.7
Mood : tense, suspenseful, dramatic
Quality
Entropy : 6.73
Noise : 57
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant errors, the background could be improved with better definition.
Intrigued Gazes and Candlelight: A Mystery Unfolds
A group of young adults huddle around a table, illuminated by flickering candlelight. Their focused gazes, directed towards something unseen, create an atmosphere of suspense and anticipation. Plates of food sit untouched, hinting at a mystery that has captivated their attention. What secrets lie hidden in the shadows?
Prompt
facial-expressions Skepticism: Disbelieving, amused ; A group of friends, gathered around a table, listening to a story with skeptical expressions; eye-level; Normal People; Cozy living room, warm lighting, snacks; cinematic
Characteristic
Shot : A group of young adults sitting at a dinner table looking up, likely at something off-camera. There is a warm and inviting atmosphere, with candles lit on the table and a cozy, homey setting.
Aesthetic Score : 0.6
Mood : intrigued, suspenseful, cozy
Quality
Entropy : 6.88
Noise : 65
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Lost in the Code: A Moment of Intense Focus
A young man, bathed in dramatic lighting, sits hunched over his computer, headphones on, lost in a world of code. His serious expression speaks of intense concentration and deep contemplation. The scene evokes a sense of mystery and the power of focused dedication.
Prompt
facial-expressions Skepticism: Frustrated, doubtful ; A gamer, staring intently at a screen, but with a look of frustration; close-up; Gamer; Brightly lit room, gaming setup, controller in hand; cinematic
Characteristic
Shot : A young man is sitting at a desk in a dimly lit room, wearing headphones, looking off to the side. There are two monitors on the desk behind him, and a keyboard and mouse in front of him.
Aesthetic Score : 0.6
Mood : focused, pensive, concentrated
Quality
Entropy : 6.64
Noise : 47
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.30
Image errors : The lighting is a bit uneven, and there are some minor artifacts in the image.
Lost in the Crowd: A Moment of Solitude
A young woman stands alone in a bustling street, her gaze fixed directly on the viewer. The blurred background emphasizes her isolation, creating a poignant sense of melancholy and loneliness.
Prompt
facial-expressions Skepticism: Paranoid, distrustful ; A woman, walking through a crowded street, looking around with suspicion; eye-level; Single Person; Busy city street, people rushing by, street vendors; cinematic
Characteristic
Shot : A young woman is walking through a crowded street, looking directly at the camera. The background is blurred and out of focus, making the woman the focal point of the image.
Aesthetic Score : 0.6
Mood : melancholy, lonely, thoughtful
Quality
Entropy : 6.90
Noise : 61
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly grainy, especially in the background. There is also some slight chromatic aberration, particularly around the edges of the image.
Silhouetted Against the City Lights
A solitary figure stands on a rooftop, bathed in the glow of a distant cityscape. The stark contrast between the man’s silhouette and the twinkling lights evokes a sense of solitude and contemplation, capturing the essence of urban life.
Prompt
facial-expressions Skepticism: Isolated, disillusioned ; A hero, standing on a rooftop, looking out at a city skyline, but with a sense of loneliness; eye-level; Hero; City lights, distant sounds of the city; cinematic
Characteristic
Shot : A man stands on a rooftop overlooking a city skyline at night. The city is lit up by streetlights and the buildings are silhouetted against the dark sky.
Aesthetic Score : 0.7
Mood : melancholic, contemplative, urban
Quality
Entropy : 6.59
Noise : 45
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.50
Image errors : The city skyline appears somewhat artificial and the lighting is slightly too uniform. There are some minor artifacts in the background.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.15, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.49, which is considered below average. This indicates that the model didn’t fully understand the scene described in the prompt and didn’t create an image that accurately reflects it.
- Aesthetic Analysis: The model scored 0.07, which is considered very good. This means that the generated image closely matched the expected aesthetic style.
Overall, the model seems to be better at capturing the desired aesthetic than understanding the scene and camera position. This suggests that the model might need further training to improve its ability to interpret and translate prompts into accurate visual representations.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://www.freepik.com