AI Captures the Emotion, But Misses the Angle: A Look at Facial Expressions in AI-Generated Images with Stability-ai-ultra
- 9 minutes read - 1894 wordsTable of Contents
Facial expressions are a powerful tool for conveying emotions and storytelling. In the realm of AI-generated imagery, capturing these expressions accurately is crucial for creating compelling and engaging visuals. This blog post explores the capabilities of a generative AI model in understanding and generating facial expressions, focusing on the model’s performance in capturing the intended camera position and aesthetic style. We’ll examine the results of a specific analysis and discuss the implications for the future of AI-generated imagery.
Created with: stability-ai-ultra
Lost in the Neon Labyrinth: A Solitary Figure in a Futuristic Cityscape
A lone figure sits in the heart of a wet, neon-lit street, shrouded in darkness and surrounded by an empty cityscape. The vibrant colors and distant figures create a sense of loneliness and mystery, transporting viewers to a futuristic world where isolation reigns.
Prompt
facial-expressions Contempt: Alienation, isolation, detachment ; A lone figure, back turned to the camera; eye-level; Single Person; A bustling city street at night, neon signs reflecting in puddles; cinematic
Characteristic
Shot : A hooded figure sits alone in a puddle on a wet, neon-lit street in a bustling city. The background is blurred with people walking and neon signs reflecting in the puddles.
Aesthetic Score : 0.8
Mood : lonely, mysterious, urban
Quality
Entropy : 6.83
Noise : 89
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.80
Image errors : No noticeable errors.
Superman’s Silhouette: A Heroic Sunset Over the City
A nostalgic and hopeful scene captures Superman standing tall on a rooftop, his silhouette against the setting sun. The dramatic effect evokes a sense of grandeur and heroism, reminding us of the power and hope that Superman represents.
Prompt
facial-expressions Contempt: Disillusionment, weariness, cynicism ; A superhero, standing on a rooftop, looking down at the city; eye-level; Hero; A cityscape bathed in the golden light of sunset; cinematic
Characteristic
Shot : Superman standing on a rooftop, looking out at a cityscape at sunset. The Empire State Building is visible in the distance.
Aesthetic Score : 0.7
Mood : heroic, hopeful, nostalgic
Quality
Entropy : 6.48
Noise : 94
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are some minor artifacts in the image, particularly around the edges of the buildings and Superman’s cape.
Power Moves in the Shadows
A businessman strides confidently through a bustling office hallway, his presence commanding attention despite the blurred background. The play of light and shadow adds a layer of intrigue, hinting at the secrets and ambitions that drive him.
Prompt
facial-expressions Contempt: Apathy, boredom, resignation ; A man in a suit, walking through a crowded office; eye-level; Normal People; A sterile, corporate office environment, fluorescent lights casting harsh shadows; cinematic
Characteristic
Shot : A businessman in a suit walks through a corporate office with other people working in the background.
Aesthetic Score : 0.6
Mood : professional, serious, determined
Quality
Entropy : 6.64
Noise : 69
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.30
Image errors : Some artifacts and blurriness are noticeable in the background, particularly around the figures.
Contempt in Two Scenes: A Young Man’s Struggle with Isolation
This image captures the duality of a young man’s experience. The bright, energetic top scene contrasts sharply with the dark, somber bottom scene, both featuring him staring at the word ‘contempt’ on his computer screen. The juxtaposition highlights his feelings of loneliness and contemplation, leaving the viewer to ponder the nature of his struggle.
Prompt
facial-expressions Contempt: Obsessive, detached, nihilistic ; A gamer, hunched over a computer screen, eyes glued to the monitor; eye-level; Gamer; A dimly lit room, cluttered with gaming paraphernalia; cinematic
Characteristic
Shot : A man sitting in front of a computer, in a dimly lit room, it seems he is playing a video game
Aesthetic Score : 0.6
Mood : lonely, bored, frustration
Quality
Entropy : 6.42
Noise : 68
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image seems to have some light blur around the man’s face, this blur is not very noticeable but it gives the image a bit of an unfinished look
Lost in the Rain: A Moment of Melancholy
A solitary woman finds herself lost in thought at a cafe table, the rain blurring the city lights outside. The wet windowpane reflects her introspective mood, creating a poignant image of loneliness and contemplation.
Prompt
facial-expressions Contempt: Melancholy, loneliness, disillusionment ; A woman, sitting alone in a cafe, staring out the window; eye-level; Single Person; A rainy day, the cafe filled with the sound of rain and chatter; cinematic
Characteristic
Shot : A woman sits alone at a table in a cafe, looking out at the rainy street. There are two coffee cups on the table.
Aesthetic Score : 0.7
Mood : melancholy, contemplative, lonely
Quality
Entropy : 6.21
Noise : 76
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has a slight blurriness, which may be intentional but could be improved.
Lost in the Shadows: A Man’s Solitary Journey
A lone figure, clad in futuristic black, sits shrouded in the darkness of an urban alley, illuminated only by the flickering glow of street lamps. The scene evokes a sense of loneliness, mystery, and dramatic tension, leaving the viewer to ponder the man’s story and the secrets he holds.
Prompt
facial-expressions Contempt: Superiority, arrogance, disdain ; A hero, standing over a defeated villain, looking down with disdain; not too close; Hero; A dark, gritty alleyway, lit by flickering streetlights; cinematic
Characteristic
Shot : A man in a black suit, possibly a superhero or a soldier, is sitting on the ground in a dark alleyway with warm lighting. The alleyway is slightly cluttered, with visible cracks in the pavement.
Aesthetic Score : 0.6
Mood : dark, brooding, somber
Quality
Entropy : 6.41
Noise : 90
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has slightly grainy texture, which could be due to low light or post-processing. There are some slight artifacts in the darker areas.
Lost in the Crowd: A Glimpse of Urban Anonymity
A sea of faces converge in a bustling shopping mall, their gaze drawn to a sign reading ‘Conterpt’. The image captures the anonymity and overwhelming feeling of being lost in a crowd, a common experience in the urban landscape.
Prompt
facial-expressions Contempt: Indifference, apathy, boredom ; A group of people, standing in a queue, looking bored and apathetic; eye-level; Normal People; A sterile, modern shopping mall, filled with the sounds of chatter and music; cinematic
Characteristic
Shot : A crowded indoor space, likely a mall or shopping center, with people walking and looking at store displays. The image has a sense of movement and energy.
Aesthetic Score : 0.6
Mood : busy, urban, commercial
Quality
Entropy : 6.55
Noise : 78
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.90
Image errors : Some of the people in the background are blurry, especially those further away. This could be a stylistic choice or an artifact of the rendering process.
A Collage of Violence and Unease
This dark and chaotic collage features four distinct scenes, each contributing to a sense of tension and discomfort. A man brandishing a gun, another screaming in anguish, a solitary figure by a fire, and an empty room all combine to create a haunting and unsettling atmosphere.
Prompt
facial-expressions Contempt: Desensitization, aggression, detachment ; A gamer, playing a violent video game, his face contorted in a grimace; not too close; Gamer; A dimly lit room, filled with the sounds of explosions and gunfire; cinematic
Characteristic
Shot : The image is a collage of four scenes: a man holding a gun, a man screaming while holding a gun, a man sitting in front of an explosion, and a room with a computer screen displaying a game.
Aesthetic Score : 0.3
Mood : intense, dark, violent
Quality
Entropy : 6.41
Noise : 75
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.50
Image errors : The image has some visible compression artifacts and the lighting appears a bit unrealistic.
Golden Hour Solitude
A lone figure walks towards the setting sun, casting a long shadow on a path lined with trees. The scene evokes a sense of peace and contemplation, with a touch of mystery in the silhouette against the golden light.
Prompt
facial-expressions Contempt: Despair, loneliness, isolation ; A man, walking through a deserted park, his face etched with sadness; eye-level; Single Person; A park at dusk, the trees casting long shadows; cinematic
Characteristic
Shot : A lone figure walks down a path lined with trees in a foggy, sunlit autumnal setting.
Aesthetic Score : 0.7
Mood : tranquil, serene, contemplative
Quality
Entropy : 6.79
Noise : 112
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image is slightly blurry. Some leaves look unnatural and slightly faded.
One Stands Against the Tide: A Hero’s Burden in a Battlefield of Loss
A solitary superhero stands amidst a field of fallen soldiers, their presence a stark contrast to the somber march of survivors in the distance. The image captures the dramatic weight of their heroism, highlighting the cost of victory and the burden of responsibility in a world ravaged by conflict.
Prompt
facial-expressions Contempt: Disillusionment, cynicism, weariness ; A hero, standing on a battlefield, surrounded by the carnage of war; not too close; Hero; A battlefield, littered with the bodies of fallen soldiers; cinematic
Characteristic
Shot : A superhero, dressed in a red and blue costume, stands in the foreground, overlooking a battlefield. The background shows a group of soldiers in military uniform advancing toward the camera.
Aesthetic Score : 0.7
Mood : intense, dramatic, somber
Quality
Entropy : 6.56
Noise : 84
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image exhibits a minor level of image artifacting, particularly in the background where the soldiers are located. This is mostly seen as a slight blurring and texture alteration.
Conclusion
The results of the analysis show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.19, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.58, which is considered good. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.16, which is considered very good. This means that the generated image closely matched the expected aesthetic style.
Overall, the model demonstrates a good understanding of the scene and shot composition, but needs improvement in accurately capturing the intended camera position. The aesthetic quality of the generated image is very good.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://stability.ai