AI Struggles to Capture Pride's Vibrant Spirit with Scenario
- 9 minutes read - 1707 wordsTable of Contents
Generative AI has made significant strides in creating realistic and visually appealing images. However, when it comes to capturing the complex emotions and cultural nuances of events like Pride celebrations, the technology still faces challenges. This blog post delves into a case study where an AI model was tasked with generating images based on specific prompts related to Pride, revealing its limitations in accurately translating camera positions, aesthetics, and facial expressions.
Created with: scenario
Radiant Joy: Capturing the Spirit of Celebration
A young woman beams with infectious joy, her glitter makeup and wide smile radiating energy. The blurred background hints at a vibrant, festive atmosphere, capturing the essence of celebration.
Prompt
facial-expressions Pride: Joyful, confident, celebratory ; A single person; eye-level; Single Persons; A bustling Pride parade with rainbow flags and confetti; cinematic
Characteristic
Shot : A young woman is smiling and laughing with a rainbow flag in the background
Aesthetic Score : 0.8
Mood : joyful, happy, celebratory
Quality
Entropy : 6.82
Noise : 93
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible errors
Pride Parade: A Celebration of Joy and Hope
A woman with short brown hair beams with joy as she waves a rainbow flag at a vibrant Pride parade, surrounded by others celebrating diversity and inclusion. The scene captures the spirit of hope and optimism that fills the air.
Prompt
facial-expressions Pride: Empowered, defiant, hopeful ; A person holding a rainbow flag high; eye-level; Single Persons; A crowd of people at a Pride rally; cinematic
Characteristic
Shot : A woman is holding a rainbow flag and smiling joyfully in a crowd of people during a pride parade.
Aesthetic Score : 0.8
Mood : joyful, celebratory, proud
Quality
Entropy : 6.78
Noise : 96
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some slight noise is visible, particularly in the background, indicating a possible lack of optimal lighting or processing.
Hopeful Hues Against the City Skyline
A young woman, draped in a rainbow flag, stands against a vibrant sunset backdrop, her silhouette a symbol of optimism and celebration. The dramatic play of light and shadow adds depth to the scene, capturing a moment of hope and joy.
Prompt
facial-expressions Pride: Powerful, inspiring, hopeful ; A superhero in a rainbow costume; eye-level; Heroes; A cityscape with a Pride flag flying in the background; cinematic
Characteristic
Shot : A woman with blonde hair is wearing a rainbow flag as a cape and looking to the right, with an out-of-focus cityscape behind her.
Aesthetic Score : 0.7
Mood : confident, hopeful, optimistic
Quality
Entropy : 6.71
Noise : 98
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.50
Image errors : There are no significant artifacts or errors in the image.
Laughter in the Spotlight: Capturing Joy at a Vibrant Event
A woman’s infectious laughter fills the air, her joy radiating amidst the blurred lights and bustling crowd of a concert or party. The scene exudes a carefree and festive mood, with the light reflecting off her jacket adding to the sense of excitement and happiness.
Prompt
facial-expressions Pride: Joyful, carefree, celebratory ; A group of people dancing in a club; eye-level; Normal People; A brightly lit dance floor with rainbow lights; cinematic
Characteristic
Shot : A woman in a glittery jacket is laughing and looking up. A man is next to her and a third person in the background is looking over the woman’s shoulder.
Aesthetic Score : 0.7
Mood : joyful, festive, celebratory
Quality
Entropy : 6.74
Noise : 97
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.60
Image errors : The image has some blur in the background, and the woman’s hair looks a bit unnatural.
Love and Pride Shine Bright on This Colorful Street
A couple strolls hand-in-hand down a vibrant street, their happiness amplified by a large rainbow flag waving overhead. The scene radiates optimism and joy, capturing the spirit of celebration and acceptance.
Prompt
facial-expressions Pride: Loving, peaceful, accepting ; A couple holding hands and walking down the street; eye-level; Normal People; A quiet, residential street with rainbow flags on display; cinematic
Characteristic
Shot : A couple walking down a street lined with colorful houses, holding hands with a rainbow flag in the background.
Aesthetic Score : 0.7
Mood : romantic, hopeful, happy
Quality
Entropy : 6.56
Noise : 105
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has a slight digital painting style and the colors are a bit saturated.
A Rainbow of Faces: Celebrating Diversity and Unity
This vibrant collage captures the spirit of a diverse community, showcasing a playful and inclusive atmosphere. The fragmented portraits, while adding a unique visual element, might leave viewers wanting a more cohesive narrative.
Prompt
facial-expressions Pride: Fun, playful, inclusive ; A gamer playing a video game with rainbow-themed characters; eye-level; Gamer; A brightly lit gaming room with posters of LGBTQ+ characters; cinematic
Characteristic
Shot : A collage of images featuring diverse individuals with bright, colorful backgrounds and LGBTQ+ symbols
Aesthetic Score : 0.7
Mood : joyful, celebratory, inclusive
Quality
Entropy : 6.53
Noise : 93
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image has a slight AI-generated look, especially in the skin textures and hair. Some of the figures are slightly blurry and the borders between the panels are not seamless.
Pride Parade Filled with Joy and Celebration
A woman radiates happiness as she holds a sign proclaiming ‘Dhaiton A Eage Pride’ at a vibrant pride event. The scene is filled with colorful costumes, joyous laughter, and a sense of hope and community.
Prompt
facial-expressions Pride: Determined, hopeful, powerful ; A person holding a sign with a message of acceptance; eye-level; Single Persons; A crowd of people at a Pride protest; cinematic
Characteristic
Shot : A woman is smiling and holding a sign that says “Dhaiton a Eage Pride” in front of a rainbow flag. Other people can be seen in the background, but they are blurred.
Aesthetic Score : 0.7
Mood : joyful, celebratory, vibrant
Quality
Entropy : 6.65
Noise : 95
Prompt Clip Score : 0.19
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image is slightly blurry, especially in the background. The woman’s face is slightly distorted, with an unnatural smile. There are some minor artifacts in the background.
Laughter and Joy: Friends Celebrate in Style
A close-up shot captures the genuine happiness of four young women laughing together. Colorful decorations in the background suggest a celebratory atmosphere, making this a heartwarming scene of friendship and joy.
Prompt
facial-expressions Pride: Joyful, celebratory, inclusive ; A group of friends celebrating at a Pride party; eye-level; Normal People; A brightly decorated room with rainbow decorations; cinematic
Characteristic
Shot : A group of four young women are laughing together, possibly celebrating a birthday or other special occasion. They are positioned close together and seem to be enjoying each other’s company.
Aesthetic Score : 0.8
Mood : joyful, celebratory, carefree
Quality
Entropy : 6.81
Noise : 94
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors in the image.
Rainbow Sparkle: A Bold Statement in Confidence
This close-up shot captures a woman’s radiant face adorned with a vibrant rainbow glitter design. The blurred rainbow background adds to the whimsical and bold mood, highlighting her confident gaze and unique style.
Prompt
facial-expressions Pride: Awe, inspiration, hope ; A person looking out at a Pride parade with a sense of wonder; eye-level; Single Persons; A vibrant parade with colorful floats and music; cinematic
Characteristic
Shot : Close-up portrait of a woman’s face with a rainbow makeup design
Aesthetic Score : 0.7
Mood : playful, bold, colorful
Quality
Entropy : 6.80
Noise : 99
Prompt Clip Score : 0.17
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some slight artifacts visible in the rainbow design, especially in the glitter. There’s a very subtle focus issue around the edge of the right eye.
Rainbow Hair, Happy Vibes: A Playful Portrait
This vibrant portrait captures a woman with bright, rainbow-colored hair, radiating happiness and playful energy. Her headphones and faint smile add to the trendy and cheerful mood, while the soft pink background complements the colors beautifully.
Prompt
facial-expressions Pride: Creative, playful, inclusive ; A gamer creating a rainbow-themed character in a video game; eye-level; Gamer; A computer screen with a character creation menu; cinematic
Characteristic
Shot : A young woman with brightly colored hair is wearing headphones and smiling gently. The background is a soft pink.
Aesthetic Score : 0.7
Mood : happy, playful, youthful
Quality
Entropy : 6.65
Noise : 88
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.60
Image errors : The image is slightly blurry around the edges, but overall it’s of good quality.
Conclusion
The results show that the generative AI model performed okay in terms of camera position and shot analysis, but not so well in terms of aesthetic analysis. Here’s a breakdown:
- Camera Position: The model scored 0.3, which is below the “good” range of 0.5 to 0.75. This suggests that the model didn’t quite capture the intended camera positions as described in the prompt.
- Shot Analysis: The model scored 0.63, which falls within the “good” range. This indicates that the model was able to understand the scene in the prompt reasonably well, but could still be improved.
- Aesthetic Analysis: The model scored 0.03, which is far from the “very good” range of -0.2 to 0.1. This means the generated image’s aesthetic deviated significantly from the expected aesthetic described in the prompt.
Overall, the model needs improvement in its ability to accurately interpret and translate camera positions and aesthetic preferences from the prompt into the generated image.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://www.scenario.com