AI Captures the Emotion, But Misses the Angle with Stability-ai-ultra
- 10 minutes read - 1984 wordsTable of Contents
In the realm of artificial intelligence, the ability to generate realistic and emotionally evocative images is a coveted skill. This new AI model takes a step towards achieving this goal by focusing on capturing facial expressions within specific scenes. While the model demonstrates a strong understanding of scene composition and aesthetic style, it falls short in accurately capturing the intended camera position. This blog post delves into the model’s performance, analyzing its strengths and weaknesses, and discussing the implications for future development. We’ll explore how the model excels at capturing the emotional nuances of facial expressions, but struggles with the technical aspects of camera angles. Through examples and analysis, we’ll gain insights into the potential and limitations of this exciting new technology.
Created with: stability-ai-ultra
One Woman’s Surprise Amidst the Laughter
A playful scene unfolds with a crowd of women, all beaming and laughing, except for one who appears startled and confused. Her unexpected reaction creates a sense of intrigue, leaving viewers wondering what she’s seen or heard.
Prompt
facial-expressions Jealousy: Lonely and envious ; A single woman; eye-level; Single Persons; A crowded party with couples dancing and laughing; cinematic
Characteristic
Shot : A group of people, mostly women, are looking at something off screen, most are laughing, but one woman is expressing disgust and fear
Aesthetic Score : 0.6
Mood : humorous, surprised, nervous
Quality
Entropy : 4.28
Noise : 59
Prompt Clip Score : 0.19
AI Evaluation
Likelihood of AI : 0.60
Image errors : Some lines are too thick, some facial features have noticeable errors. The lines are a bit too bold, and the overall style is a bit too graphic.
Love and Hope in the City’s Embrace
A superhero couple stands silhouetted against a breathtaking sunset, their love story unfolding against the backdrop of a sprawling cityscape. The dramatic lighting and their poignant poses evoke a sense of romance, hope, and the enduring power of their bond.
Prompt
facial-expressions Jealousy: Bitter and isolated ; A superhero standing alone on a rooftop; eye-level; Heroes; A city skyline with a couple holding hands in the distance; cinematic
Characteristic
Shot : A couple in superhero costumes standing on a rooftop overlooking a city skyline at sunrise. The top image shows them holding hands and looking out at the city. The bottom image shows the male superhero standing alone, looking back at the city. The bottom image features a bloodied edge suggesting a possible dramatic event.
Aesthetic Score : 0.6
Mood : romantic, dramatic, hopeful
Quality
Entropy : 6.71
Noise : 76
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.80
Image errors : The city skyline appears slightly blurry and lacks detail, especially in the bottom image. The blood splatters on the bottom image seem overly defined and unrealistic. The overall image appears to have been processed with a filter, making it somewhat artificial.
Three Scenes of Loneliness and Jealousy
This image tells a story of heartbreak and isolation through three distinct scenes. Two men engage in conversation, then sit together, while a woman sits alone, her sadness palpable. The juxtaposition of these scenes suggests a narrative of jealousy and loneliness, leaving the viewer to ponder the unspoken emotions.
Prompt
facial-expressions Jealousy: Heartbroken and resentful ; A man watching his ex-girlfriend laughing with another man; eye-level; Normal People; A bustling cafe with people chatting and enjoying coffee; cinematic
Characteristic
Shot : The image depicts a series of comic panels set in a cafe setting. The panels follow a narrative of jealousy and heartbreak.
Aesthetic Score : 0.5
Mood : sad, dramatic, melancholic
Quality
Entropy : 6.06
Noise : 62
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image contains misspellings in the text, such as “Healosey” and “Hearththem”. There are also some minor inconsistencies in the line art, particularly in the background.
Lost in the Game: A Moment of Intense Focus
A young man is completely absorbed in his gaming session, the dramatic lighting and his intense focus creating a palpable sense of immersion and excitement. The dimly lit room, decorated with posters and a large speaker, adds to the atmosphere of a dedicated gamer’s sanctuary.
Prompt
facial-expressions Jealousy: Obsessive and competitive ; A gamer staring intently at his computer screen; eye-level; Gamer; A dimly lit room with posters of video game characters on the walls; cinematic
Characteristic
Shot : A young man is sitting at a computer, wearing headphones and looking intensely at the screen. The room is dimly lit with pink and blue lights, creating a dramatic and slightly futuristic atmosphere.
Aesthetic Score : 0.6
Mood : focused, intense, futuristic
Quality
Entropy : 6.68
Noise : 65
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image appears to have some slight noise, especially in the darker areas. The blurriness in the background is also somewhat distracting.
A Family’s Sunny Stroll: Capturing Joy and Mystery in a Lush Park
This heartwarming image captures a family walking through a vibrant park on a sunny day. The warm lighting and lush greenery create a sense of peace and happiness, while the family walking away from the viewer adds a touch of mystery. The image is beautifully composed, with the family in the foreground and the trees in the background, creating a sense of depth and distance.
Prompt
facial-expressions Jealousy: Yearning and wistful ; A woman looking at a couple holding hands in the park; eye-level; Single Persons; A sunny park with children playing and couples strolling; cinematic
Characteristic
Shot : A family of four walking away from the camera in a park, bathed in the warm glow of the setting sun. There are other people in the background, also walking.
Aesthetic Score : 0.7
Mood : tranquil, happy, carefree
Quality
Entropy : 6.57
Noise : 74
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant image errors. There might be some minor compression artifacts, but they are not noticeable.
Champion’s Triumph: A Moment of Glory Under the Spotlight
A solitary figure stands bathed in the glow of victory, silhouetted against a sea of cheering faces. Confetti rains down as the stadium erupts in celebration, capturing the raw emotion of a hard-earned triumph.
Prompt
facial-expressions Jealousy: Disgruntled and envious ; A hero watching another hero receive accolades; eye-level; Heroes; A crowded stadium with cheering fans and flashing lights; cinematic
Characteristic
Shot : A person standing in the center of a packed stadium under bright stage lights, being cheered on by the crowd
Aesthetic Score : 0.6
Mood : excitement, anticipation, triumph
Quality
Entropy : 6.77
Noise : 84
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some minor artifacts, particularly around the edges of the figures and the lights. There is also some visible aliasing in the crowd.
Caught in the Moment: Joy and Excitement at the Party
A vibrant snapshot of a party in full swing, capturing the joy and excitement of the moment. Three figures stand out in sharp focus, their expressions radiating happiness, while the blurred background hints at the lively atmosphere and the energy of the crowd.
Prompt
facial-expressions Jealousy: Angry and betrayed ; A man watching his wife dancing with another man at a party; eye-level; Normal People; A brightly lit party with people dancing and laughing; cinematic
Characteristic
Shot : A man and a woman are dancing at a party, surrounded by a crowd of people. The background is blurred and the lights are colorful, creating a festive atmosphere. The man is laughing loudly and the woman is smiling, enjoying themselves.
Aesthetic Score : 0.6
Mood : festive, joyful, playful
Quality
Entropy : 6.64
Noise : 73
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors in the image.
Lost in the Neon Glow: A Gamer’s Focus Under a Futuristic Light
A dimly lit room bathed in neon pink light reveals a gamer engrossed in a cartoon-style video game. The silhouette of the gamer, back to the camera, creates a sense of mystery and intrigue, capturing the focused intensity of the gaming experience.
Prompt
facial-expressions Jealousy: Frustrated and envious ; A gamer watching a livestream of another player achieving a high score; eye-level; Gamer; A dimly lit room with a computer screen displaying the livestream; cinematic
Characteristic
Shot : A gamer in a dimly lit room, playing a video game. The image features two monitors, a keyboard, a mouse, and a small toy figure.
Aesthetic Score : 0.6
Mood : intense, playful, focused
Quality
Entropy : 6.37
Noise : 63
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image appears to be slightly blurry and the edges are a bit pixelated.
A Timeless Embrace: Love in the Rain-Soaked City
Experience the enchanting allure of a romantic cityscape, where a couple shares a tender kiss beneath the night sky. Amidst the gentle patter of raindrops, the city lights twinkle, casting a warm glow on the wet pavement. This intimate scene, filled with nostalgia and drama, captures the essence of love’s enduring power.
Prompt
facial-expressions Jealousy: Melancholy and longing ; looking at a couple kissing in the rain; eye-level; Single Persons; A rainy street with puddles reflecting the city lights; cinematic
Characteristic
Shot : A couple kissing in the rain on a city street at night. The street is wet and there are reflections of the streetlights in the puddles.
Aesthetic Score : 0.7
Mood : romantic, dreamy, nostalgic
Quality
Entropy : 6.90
Noise : 106
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.30
Image errors : There are some minor artifacts in the rain effect.
Superman Stands Tall Amidst City Chaos
A dramatic image captures Superman facing a massive explosion, his silhouette illuminated against the fiery backdrop. The hero’s stoic stance and the fleeing civilians create a sense of urgency and heroism, highlighting the action-packed scene.
Prompt
facial-expressions Jealousy: Frustrated and envious ; A hero watching another hero save the day; eye-level; Heroes; A chaotic scene with explosions and people running for safety; cinematic
Characteristic
Shot : A superhero in a red cape stands in front of a large explosion in a city, people are running away from the blast.
Aesthetic Score : 0.7
Mood : dramatic, intense, action
Quality
Entropy : 6.83
Noise : 83
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.50
Image errors : The image has some minor artifacts around the edges of the explosion, but these are not overly noticeable.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.15, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.635, which is considered good. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.15, which is considered very good. This means that the generated image closely matched the expected aesthetic style.
Overall, the model demonstrates a good understanding of the scene and shot composition, but needs improvement in accurately capturing the intended camera position. The aesthetic analysis suggests that the model is capable of producing visually appealing images.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://stability.ai