AI's Facial Expressions: A Mixed Bag with Freepik

AI's Facial Expressions: A Deep Dive into Generative Model Performance with Freepik

Contents

Facial expressions are a powerful tool in storytelling, conveying emotions and intentions without words. In the realm of generative AI, the ability to accurately capture and generate these expressions is crucial for creating compelling and realistic imagery. This blog post explores the capabilities of a generative AI model in capturing facial expressions, analyzing its performance across various scenarios and highlighting its strengths and weaknesses.

Created with: freepik

Lost in the City Lights

A young woman stands alone in the urban night, her face bathed in the soft glow of streetlights. The blurred city lights create a sense of depth and mystery, reflecting her introspective mood and the loneliness of the urban landscape.

Lost in the City Lights

Prompt

facial-expressions Anxiety: Overwhelmed, isolated ; A lone figure; eye-level; Single Person; bustling city street at night; cinematic

Characteristic

Shot : A young woman with short dark hair is standing in a city street at night, looking directly at the camera. The city lights are blurred in the background.

Aesthetic Score : 0.8

Mood : melancholy, urban, contemplative

Quality

Entropy : 6.73

Noise : 50

Prompt Clip Score : 0.26

AI Evaluation

Likelihood of AI : 0.20

Image errors : The image appears to be slightly overexposed, especially in the background.

Superman: A Silhouette of Hope Against the City Lights

A dramatic and heroic image of Superman standing tall on a rooftop, his silhouette dominating the cityscape. The lighting highlights his muscular physique and the twinkling city lights below, creating a sense of power and hope.

Superman: A Silhouette of Hope Against the City Lights

Prompt

facial-expressions Anxiety: Pressure, responsibility ; A superhero standing on a rooftop; high angle; Hero; cityscape with flashing lights; cinematic

Characteristic

Shot : Superman standing on a rooftop overlooking a city skyline at night

Aesthetic Score : 0.7

Mood : heroic, dramatic, hopeful

Quality

Entropy : 6.76

Noise : 51

Prompt Clip Score : 0.24

AI Evaluation

Likelihood of AI : 0.60

Image errors : The subject’s muscles are somewhat exaggerated and the cityscape looks a bit artificial.

Drowning in Paperwork: The Crushing Weight of Stress

A man sits defeated at his desk, hands buried in his hair, surrounded by towering stacks of paperwork. The image captures the overwhelming feeling of stress and anxiety that can come with a heavy workload.

Drowning in Paperwork: The Crushing Weight of Stress

Prompt

facial-expressions Anxiety: Overwhelmed, stressed ; A person sitting at a desk, surrounded by paperwork; close-up; Normal Person; cluttered office; cinematic

Characteristic

Shot : A young man sits at a desk with his head in his hands surrounded by stacks of paper. He looks tired and overwhelmed.

Aesthetic Score : 0.5

Mood : stressed, overwhelmed, tired

Quality

Entropy : 6.72

Noise : 60

Prompt Clip Score : 0.28

AI Evaluation

Likelihood of AI : 0.10

Image errors : The image appears to be somewhat overexposed, particularly in the areas around the subject’s head and shoulders. This may be due to the lighting conditions.

Lost in the Code: A Moment of Intense Focus

A young man, bathed in the blue glow of his computer screen, is completely absorbed in his work. The darkness surrounding him amplifies the intensity of his focus, creating a sense of suspense and intrigue.

Lost in the Code: A Moment of Intense Focus

Prompt

facial-expressions Anxiety: Focused, intense ; A gamer hunched over a computer screen; close-up; Gamer; dimly lit room with flashing lights; cinematic

Characteristic

Shot : A young man is sitting at a computer desk, wearing headphones and looking at the monitor. The room is dimly lit, with a gaming monitor behind him showing a vibrant scene. There’s a keyboard and mouse in front of him, and a monitor to his left.

Aesthetic Score : 0.7

Mood : focused, intense, techy

Quality

Entropy : 6.33

Noise : 47

Prompt Clip Score : 0.29

AI Evaluation

Likelihood of AI : 0.20

Image errors : There’s a slight blurring of the subject’s left arm and some minor noise visible in the shadows.

Lost in the City: A Moment of Melancholy

A young woman with short brown hair stands alone in a bustling city street, her gaze fixed directly on the viewer. The background blurs into a sea of activity, highlighting her isolation and vulnerability. Her pensive expression invites contemplation, leaving us to wonder about her story and the emotions she carries.

Lost in the City: A Moment of Melancholy

Prompt

facial-expressions Anxiety: Anxious, uncomfortable ; A woman walking down a crowded street; eye-level; Single Person; blurred background of people; cinematic

Characteristic

Shot : A young woman stands in a crowded city street, looking directly at the camera with a serious expression. The background is blurred, creating a sense of isolation.

Aesthetic Score : 0.7

Mood : melancholy, contemplative, urban

Quality

Entropy : 6.80

Noise : 59

Prompt Clip Score : 0.27

AI Evaluation

Likelihood of AI : 0.20

Image errors : No significant artifacts or errors are visible.

Intense Gaze, Mysterious Figure: A Moment of Suspense

A man with a serious expression stares directly at the viewer, his face illuminated by a single, harsh light. Behind him, a shadowy figure in a hooded cloak blurs into the darkness, adding to the sense of mystery and danger. The scene is both captivating and unsettling, leaving the viewer wondering what secrets lie hidden in the shadows.

Intense Gaze, Mysterious Figure: A Moment of Suspense

Prompt

facial-expressions Anxiety: Fear, anticipation ; A hero facing a menacing villain; medium shot; Hero; dark and ominous setting; cinematic

Characteristic

Shot : A man with a determined expression looks directly at the camera, with another person in the background wearing a hooded cloak.

Aesthetic Score : 0.7

Mood : intense, mysterious, suspenseful

Quality

Entropy : 5.97

Noise : 46

Prompt Clip Score : 0.19

AI Evaluation

Likelihood of AI : 0.20

Image errors : The image has a slight blurring effect, which may be intentional or a result of post-processing. The shadows are somewhat harsh and the lighting seems unnatural.

Lost in the Crowd, Found in Thought

A young woman sits amidst a bustling crowd, her gaze fixed on something unseen. The shallow depth of field isolates her, drawing the viewer into her world of intrigue and contemplation. Her expression speaks volumes, hinting at a story waiting to be told.

Lost in the Crowd, Found in Thought

Prompt

facial-expressions Anxiety: Impatient, restless ; A person waiting in a long line; eye-level; Normal Person; crowded waiting room; cinematic

Characteristic

Shot : A young woman with dark hair is sitting in an audience, looking away. Her face is in focus, and the rest of the image is blurred.

Aesthetic Score : 0.7

Mood : thoughtful, mysterious, contemplative

Quality

Entropy : 6.76

Noise : 55

Prompt Clip Score : 0.21

AI Evaluation

Likelihood of AI : 0.20

Image errors : No visible errors.

Focused on the Task at Hand

A close-up shot captures the hands of a person typing on a backlit keyboard, their concentration evident in the focused posture. The glow of the computer screen in the background adds a techy feel to the image, highlighting the digital nature of the task at hand.

Focused on the Task at Hand

Prompt

facial-expressions Anxiety: Adrenaline, pressure ; A gamer’s hands frantically moving across a keyboard; close-up; Gamer; glowing computer screen; cinematic

Characteristic

Shot : A person’s hands are typing on a keyboard in a dimly lit room.

Aesthetic Score : 0.3

Mood : focused, intense, serious

Quality

Entropy : 6.44

Noise : 41

Prompt Clip Score : 0.19

AI Evaluation

Likelihood of AI : 0.20

Image errors : The image is a bit blurry and the lighting is not very flattering. There is a slight noise in the image, particularly in the background.

A Stormy Silhouette: A Man Contemplates the Unforeseen

A solitary figure stands amidst a field of tall grass, his gaze fixed on the horizon. A dramatic storm cloud looms overhead, casting a sense of melancholy and anticipation. The man’s silhouetted form against the stormy sky creates a powerful image of contemplation and the unknown.

A Stormy Silhouette: A Man Contemplates the Unforeseen

Prompt

facial-expressions Anxiety: Loneliness, despair ; A man standing alone in a vast field; wide shot; Single Person; open sky with dark clouds; cinematic

Characteristic

Shot : A man stands in a field, looking out towards a dramatic, stormy sky.

Aesthetic Score : 0.7

Mood : melancholy, contemplative, ominous

Quality

Entropy : 6.42

Noise : 51

Prompt Clip Score : 0.23

AI Evaluation

Likelihood of AI : 0.20

Image errors : No significant errors.

A City in Flames, A Man’s Resolve

A young man stands silhouetted against a fiery cityscape, his expression a mix of sadness and determination. The scene is one of war and devastation, a powerful visual of the aftermath of conflict.

A City in Flames, A Man’s Resolve

Prompt

facial-expressions Anxiety: Guilt, responsibility ; A hero looking out over a devastated city; high angle; Hero; destroyed buildings and smoke; cinematic

Characteristic

Shot : A man stands in profile on a rooftop, looking out at a city devastated by fire and smoke. The scene is dramatic and evocative of war or disaster.

Aesthetic Score : 0.6

Mood : desolate, somber, grim

Quality

Entropy : 6.80

Noise : 57

Prompt Clip Score : 0.28

AI Evaluation

Likelihood of AI : 0.30

Image errors : The image has some minor artifacts, particularly in the smoke and fire.

Conclusion

The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis. Here’s a breakdown:

Camera Position:

  • Score: 0.35
  • Interpretation: This score falls below the “good” range of 0.5 to 0.75. It suggests that the model didn’t quite capture the intended camera position as described in the prompt.

Shot Analysis:

  • Score: 0.475
  • Interpretation: This score also falls below the “good” range. It indicates that the model had some difficulty understanding the scene and creating the desired shot composition.

Aesthetic Analysis:

  • Score: 0.13
  • Interpretation: This score is within the “very good” range of -0.2 to 0.1. It means that the generated image’s aesthetic closely matched the expected aesthetic described in the prompt.

Overall:

While the model excelled in capturing the desired aesthetic, it struggled with accurately interpreting the camera position and shot composition. This suggests that the model might need further training to better understand and respond to these aspects of the prompt.

Sources: