AI Captures Pride's Joy, But Struggles with Camera Angles with Midjourney
- 9 minutes read - 1892 wordsTable of Contents
In the realm of artificial intelligence, the ability to generate realistic and emotionally evocative images is a rapidly evolving field. This blog post examines a recent experiment where an AI model was tasked with creating images depicting Pride celebrations. The model’s performance was impressive in capturing the vibrant spirit of these events, showcasing the joy, acceptance, and celebration that define Pride. However, the model struggled with accurately representing camera angles, highlighting the ongoing challenges in achieving perfect visual fidelity. We’ll explore the model’s strengths and weaknesses, analyzing its performance in capturing the essence of Pride and its limitations in replicating specific camera perspectives.
Created with: midjourney
Confetti Dreams: A Moment of Joy and Celebration
A young person, radiating happiness, gazes up at the sky as confetti dances around them. The vibrant colors and playful atmosphere evoke a sense of carefree joy, with a subtle hint of pride represented by the rainbow flag. This image captures the essence of celebration and the beauty of simple moments of happiness.
Prompt
Pride Pride, happiness, liberation: Joyful, confident, celebratory ; A single person; eye-level; Single Persons; A bustling Pride parade with rainbow flags and confetti; cinematic
Characteristic
Shot : A young person with short hair and sunglasses looks up at the sky, surrounded by colorful confetti, with a blue sky and white clouds in the background
Aesthetic Score : 0.8
Mood : joyful, celebratory, whimsical
Quality
Entropy : 6.56
Noise : 101
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image is slightly blurry. The confetti is a bit too uniform in shape and size. There are some slight artifacts in the background.
Pride Flag Flies High, Symbol of Hope and Joy
A woman, silhouetted against the crowd, holds a pride flag aloft, capturing the spirit of celebration and hope at a pride march. The flag’s movement in the wind adds a dynamic energy to the scene, while the backlighting creates a sense of mystery and intrigue.
Prompt
Pride Determination, pride, solidarity: Empowered, defiant, hopeful ; A person holding a rainbow flag high; eye-level; Single Persons; A crowd of people at a Pride rally; cinematic
Characteristic
Shot : A person holding up a rainbow pride flag in a crowd of people
Aesthetic Score : 0.8
Mood : joyful, hopeful, celebratory
Quality
Entropy : 6.31
Noise : 87
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.10
Image errors : No major artifacts or errors are visible.
Hope Flies High: Superheroine Stands for Equality
A powerful image of hope and empowerment. A superheroine stands proudly before a vibrant rainbow flag, its colors dancing in the wind. The cityscape behind her reflects the vastness of the cause she champions, while the dramatic lighting and her determined gaze speak volumes about her unwavering commitment.
Prompt
Pride Confidence, determination, strength: Powerful, inspiring, hopeful ; A superhero in a rainbow costume; eye-level; Heroes; A cityscape with a Pride flag flying in the background; cinematic
Characteristic
Shot : A superheroine standing in front of a rainbow pride flag with a cityscape in the background. The sun is setting.
Aesthetic Score : 0.8
Mood : powerful, hopeful, inspiring
Quality
Entropy : 6.58
Noise : 76
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.80
Image errors : No noticeable artifacts or errors
Lost in the Neon Glow: Dancing Under the Spotlight
Capture the energy of a vibrant club scene with this image. Dimly lit with colorful lights and fog, the scene is alive with dancing figures, creating a sense of mystery and excitement. The play of light and shadow adds a dramatic touch, making this a perfect shot for capturing the thrill of the night.
Prompt
Pride Happiness, excitement, liberation: Joyful, carefree, celebratory ; A group of people dancing in a club; eye-level; Normal People; A brightly lit dance floor with rainbow lights; cinematic
Characteristic
Shot : A group of young people are dancing in a club or party. There is a lot of colored light and fog creating a hazy atmosphere.
Aesthetic Score : 0.6
Mood : fun, energetic, vibrant
Quality
Entropy : 6.23
Noise : 100
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some noise and slight blurring, especially in the background. Some of the details are lost due to the low lighting and haze.
Love and Hope Shine Bright on a Sunny Day
A couple, hand in hand and waving rainbow flags, walks towards a radiant future. The sun’s glow illuminates their path, symbolizing a hopeful and celebratory mood.
Prompt
Pride Love, happiness, contentment: Loving, peaceful, accepting ; A couple holding hands and walking down the street; eye-level; Normal People; A quiet, residential street with rainbow flags on display; cinematic
Characteristic
Shot : A couple is walking down a street holding hands, holding pride flags, the sun is shining and the street is lined with trees.
Aesthetic Score : 0.7
Mood : joyful, hopeful, celebratory
Quality
Entropy : 6.70
Noise : 112
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : No obvious artifacts or errors.
Immersed in the Game: A Gamer’s Focused Intensity
A young man, headphones on, sits in a gaming chair bathed in colorful light. His focused expression and the dramatic lighting capture the intensity of his gaming experience. This image evokes a sense of energy and playful concentration.
Prompt
Pride Concentration, excitement, joy: Fun, playful, inclusive ; A gamer playing a video game with rainbow-themed characters; eye-level; Gamer; A brightly lit gaming room with posters of LGBTQ+ characters; cinematic
Characteristic
Shot : A young man is sitting in a gaming chair, wearing headphones, and playing video games in a dimly lit room. The room has colorful string lights and various posters on the walls.
Aesthetic Score : 0.6
Mood : focused, energetic, playful
Quality
Entropy : 6.87
Noise : 78
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : The lighting is a bit uneven, with the subject’s face being slightly underexposed compared to the rest of the scene.
A Moment of Pride: Finding Joy in the Crowd
A young woman, adorned with colorful sunglasses and a rainbow scarf, stands out amidst the vibrant energy of a pride parade. The blurred background creates a sense of mystery, drawing attention to her solitary figure and the hopeful spirit she embodies.
Prompt
Pride Determination, conviction, solidarity: Determined, hopeful, powerful ; A person holding a sign with a message of acceptance; eye-level; Single Persons; A crowd of people at a Pride protest; cinematic
Characteristic
Shot : A young woman with a rainbow scarf stands in a crowd of people. The background is blurry and the woman is in focus. The woman is looking to the left of the frame.
Aesthetic Score : 0.7
Mood : bright, hopeful, celebratory
Quality
Entropy : 6.36
Noise : 70
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some noise and slight blurriness. The colors are a bit muted.
Joyful Celebration: A Night of Dancing and Laughter
Capture the vibrant energy of a party filled with laughter and dancing. A woman in a white crop top, surrounded by colorful streamers, radiates joy as she celebrates with friends. The image evokes a sense of festivity and happiness, perfect for capturing the spirit of a memorable night.
Prompt
Pride Happiness, laughter, connection: Joyful, celebratory, inclusive ; A group of friends celebrating at a Pride party; eye-level; Normal People; A brightly decorated room with rainbow decorations; cinematic
Characteristic
Shot : A young woman with curly hair is dancing and laughing at a party decorated with rainbow flags and lights. She’s wearing a white tank top and has her arms raised in the air. The party appears to be lively and fun.
Aesthetic Score : 0.7
Mood : joyful, celebratory, carefree
Quality
Entropy : 6.85
Noise : 99
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no significant errors in the image.
Lost in Thought, Reaching for the Sky
A young woman with a vibrant afro, shielded by sunglasses, gazes upwards, lost in contemplation. The shallow depth of field isolates her against a blurred backdrop, hinting at a world of possibilities and unspoken dreams.
Prompt
Pride Wonder, joy, acceptance: Awe, inspiration, hope ; A person looking out at a Pride parade with a sense of wonder; eye-level; Single Persons; A vibrant parade with colorful floats and music; cinematic
Characteristic
Shot : A young woman with short curly hair and sunglasses is looking up at a rainbow flag in the background, there is a crowd of people in the background.
Aesthetic Score : 0.7
Mood : optimistic, hopeful, joyful
Quality
Entropy : 6.81
Noise : 76
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major errors detected, some minor color noise is present.
Cyberpunk Focus: A Woman Lost in a Digital World
A woman, bathed in neon light, sits intently at her computer, headphones on, eyes fixed on a vibrant digital image. The cyberpunk aesthetic and her focused gaze create a sense of intrigue and mystery, leaving you wondering what secrets lie within the screen.
Prompt
Pride Concentration, excitement, joy: Creative, playful, inclusive ; A gamer creating a rainbow-themed character in a video game; eye-level; Gamer; A computer screen with a character creation menu; cinematic
Characteristic
Shot : A young woman wearing headphones and glasses is sitting at a desk in a dimly lit room, looking at a computer screen with a colorful image on it. The room is lit with a variety of colored lights, giving the scene an energetic and vibrant feel.
Aesthetic Score : 0.6
Mood : energetic, vibrant, focused
Quality
Entropy : 6.54
Noise : 87
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slightly blurry quality, particularly around the edges, and the lighting is a bit uneven. The colors are oversaturated, and the rainbow image on the screen appears somewhat unrealistic.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.15, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.635, which is considered good. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.04, which is considered very good. This means that the generated image’s aesthetic closely matched the expected aesthetic described in the prompt.
Overall, the model demonstrates a good understanding of the scene and shot composition, but needs improvement in accurately capturing the intended camera position. The aesthetic quality of the generated image is very good.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://midjourney.com