AI's Artistic Eye: Capturing Emotion, Missing the Shot with Midjourney
In the realm of artificial intelligence, generative models are pushing the boundaries of creativity. These models can generate images, text, and even music based on user prompts. One area of particular interest is the ability of these models to capture and express human emotions through facial expressions. This analysis explores the performance of a generative AI model in creating images that depict a variety of facial expressions within specific scene contexts. The results reveal a fascinating interplay between the model’s strengths and weaknesses, highlighting the ongoing challenges in developing AI models that can fully understand and translate complex visual concepts.
Created with: Midjourney
Lost in the Neon Glow: A Solitary Figure Walks the City Streets
A lone figure navigates the bustling city at night, their silhouette stark against the vibrant backdrop of neon signs and wet pavement. The scene evokes a sense of urban loneliness and melancholic intrigue, highlighting the contrast between the bright lights and the solitary figure.
Prompt
Excitement Excited, determined: Thrilled, anticipation ; A lone figure; eye-level; Single Person; bustling city street at night; cinematic
Characteristic
Shot : A lone figure walking down a street in a city at night, the city lights reflecting in the wet pavement.
Aesthetic Score : 0.7
Mood : melancholy, urban, contemplative
Quality
Entropy : 6.42
Noise : 117
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slightly blurry and grainy quality, which is likely due to the painting style. The lighting is also somewhat uneven, with some areas being brighter than others.
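The Entropy value listed under Quality is not defined in this post. One common reading, and only an assumption here, is the Shannon entropy of the image's 8-bit grayscale histogram, which ranges from 0 bits (a flat image) to 8 bits (all 256 levels equally likely). The values reported in this post (5.09 to 6.83) fall inside that range. A minimal Python sketch under that assumption, with `shannon_entropy` as a hypothetical helper name:

```python
import math
from collections import Counter
from typing import Iterable


def shannon_entropy(pixels: Iterable[int]) -> float:
    """Shannon entropy, in bits, of a sequence of 8-bit grayscale values.

    Assumption: this is one plausible definition of the "Entropy" metric;
    the post does not say how its value was actually computed.
    """
    counts = Counter(pixels)          # occurrences of each intensity level
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no pixels given")
    # Sum p * log2(1/p) over the intensity levels that actually occur.
    return sum((c / total) * math.log2(total / c) for c in counts.values())


if __name__ == "__main__":
    # Toy inputs instead of a real image, so the sketch runs anywhere.
    print(shannon_entropy([128] * 100))            # flat image
    print(shannon_entropy([0] * 50 + [255] * 50))  # two equally likely levels
    print(shannon_entropy(list(range(256))))       # every level once
```

For a real file, the bytes returned by Pillow's `Image.open(path).convert("L").tobytes()` could be passed in directly, since iterating over `bytes` yields integers from 0 to 255.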
Hope Takes Flight: Superhero Silhouette at Sunset
A dramatic silhouette of a superhero soaring above a city at sunset evokes a powerful sense of hope and empowerment. The scene captures the essence of heroism and the promise of a brighter future.
Prompt
Excitement Confident, focused: Triumphant, exhilarating ; A superhero in mid-air; low-angle; Hero; cityscape with a dramatic sunset; cinematic
Characteristic
Shot : Silhouette of a superhero flying over a city at sunset.
Aesthetic Score : 0.7
Mood : epic, hopeful, powerful
Quality
Entropy : 6.41
Noise : 118
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some slight blurriness and pixelation.
Sun-Kissed Joy: Friends Embrace the Open Field
A group of young adults revels in the beauty of a sunny day, capturing the essence of carefree happiness. Their laughter and smiles radiate pure joy, while the vibrant green landscape adds a touch of serenity to this energetic scene.
Prompt
Excitement Smiling, laughing: Joyful, carefree ; A group of friends laughing and running; eye-level; Normal People; a sunny park with a vibrant green lawn; cinematic
Characteristic
Shot : A group of young people running through a park on a sunny day. The focus is on the young woman in the center of the image, who is laughing and has her hair blowing in the wind.
Aesthetic Score : 0.7
Mood : joyful, carefree, youthful
Quality
Entropy : 6.83
Noise : 98
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors in the image.
The Glow of Focus: Hands Typing in a Digital Oasis
A close-up shot captures the intensity of a person typing on a keyboard bathed in red and blue light. The dimly lit room and focus on the hands create a sense of focused energy, highlighting the digital world’s allure.
Prompt
Excitement Concentrated, determined: Intense, focused ; A gamer’s hands furiously tapping on a keyboard; close-up; Gamer; a dimly lit room with glowing screens; cinematic
Characteristic
Shot : A person’s hands are typing on a keyboard in a dimly lit room. The keyboard is illuminated with red and blue lights, and the background is blurry. The scene is likely in a home or office.
Aesthetic Score : 0.7
Mood : focused, dark, mysterious
Quality
Entropy : 5.09
Noise : 55
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible errors in the image.
Silhouetted Serenity: A Woman Contemplates the Sunset
A lone woman stands on the edge of a cliff, her silhouette stark against the fiery hues of a breathtaking sunset over the ocean. The scene evokes a sense of serene contemplation and dramatic solitude, capturing the beauty of the moment in a powerful image.
Prompt
Excitement Amazed, exhilarated: Awe-inspiring, liberating ; A woman standing on a cliff overlooking a vast ocean; eye-level; Single Person; dramatic clouds and a setting sun; cinematic
Characteristic
Shot : A woman stands silhouetted on a cliff edge overlooking an ocean sunset.
Aesthetic Score : 0.7
Mood : dramatic, peaceful, contemplative
Quality
Entropy : 6.71
Noise : 90
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors in the image.
Into the Fire: A Soldier’s Run Through Chaos
A dramatic image captures the intensity of a battlefield, with a soldier running through smoke and fire. The silhouette against the chaos creates a powerful sense of danger and urgency.
Prompt
Excitement Determined, fierce: Brave, adrenaline-fueled ; A hero charging into battle; low-angle; Hero; a chaotic battlefield with explosions and smoke; cinematic
Characteristic
Shot : A soldier in camouflage runs away from an explosion. There is a lot of smoke and debris in the air.
Aesthetic Score : 0.7
Mood : dramatic, intense, chaotic
Quality
Entropy : 6.79
Noise : 107
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has a few artifacts, mainly some blurry spots in the background. The smoke and debris are also slightly unrealistic, and the soldier’s motion is a bit unnatural.
Birthday Joy: A Family Celebrates with Confetti and Laughter
A family of three revels in confetti and laughter, capturing the pure joy of a birthday celebration. The father holds a gift, the mother beams with happiness, and the child gazes up at the falling confetti, creating a heartwarming and festive scene.
Prompt
Excitement Smiling, laughing: Happy, celebratory ; A family celebrating a birthday; eye-level; Normal People; a brightly decorated living room with balloons and streamers; cinematic
Characteristic
Shot : A family is celebrating a birthday or another special occasion with confetti falling around them. They are all laughing and smiling, and the atmosphere is very joyful.
Aesthetic Score : 0.7
Mood : joyful, celebratory, happy
Quality
Entropy : 6.77
Noise : 89
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry, and the colors are not very vibrant. The confetti is also a bit pixelated.
Pink Light, Intense Gaze: A Moment of Suspense
A close-up shot captures a man’s face bathed in vibrant pink light, his gaze piercing and unsettling. The dramatic lighting and intense expression create a palpable sense of mystery and menace, leaving the viewer questioning what lies ahead.
Prompt
Excitement Concentrated, intense: Engrossed, focused ; A gamer’s face illuminated by the screen; close-up; Gamer; a dark room with neon lights reflecting on the screen; cinematic
Characteristic
Shot : A close-up portrait of a man’s face illuminated by pink light, looking directly at the camera.
Aesthetic Score : 0.6
Mood : intense, dark, mysterious
Quality
Entropy : 6.36
Noise : 93
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image quality is high, but there are some minor artifacts and blurring in the background. The man’s eyes are slightly out of focus, which may be intentional but detracts from the overall sharpness of the image.
Screaming for Joy (and Maybe a Little Fear) on the Roller Coaster
A man lets out a primal scream as he hurtles through the twists and turns of a roller coaster ride. The motion blur captures the exhilarating speed and intensity of the experience, leaving you feeling the thrill vicariously.
Prompt
Excitement Screaming, laughing: Thrilling, exhilarating ; A man riding a rollercoaster; POV shot; Single Person; a fast-paced ride with twists and turns; cinematic
Characteristic
Shot : A man is screaming while riding a roller coaster, captured in a selfie-style photo. The image is blurred to create a sense of speed and movement.
Aesthetic Score : 0.5
Mood : exciting, adrenaline-fueled, exhilarating
Quality
Entropy : 6.46
Noise : 103
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is overly blurred and the background is a bit messy due to the blur effect. The blur might be intentional to represent speed, but it could be toned down a bit.
Man Defies the Storm, Embracing Hope Amidst the Lightning
A solitary figure stands triumphant on a rooftop, arms raised in defiance against a raging storm. The dramatic backdrop of a lightning strike illuminates the city below, creating a powerful and hopeful scene.
Prompt
Excitement Confident, determined: Victorious, powerful ; A hero standing triumphantly on a rooftop; high-angle; Hero; a cityscape with a dramatic storm in the background; cinematic
Characteristic
Shot : A man standing on a rooftop in a city, with his arms raised in triumph, during a heavy rainstorm with lightning in the background.
Aesthetic Score : 0.6
Mood : dramatic, hopeful, powerful
Quality
Entropy : 6.60
Noise : 77
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.50
Image errors : The image has some noticeable blurring and pixelation, particularly around the edges. The cityscape is also somewhat blurry and lacks detail. The lighting is uneven, with some areas being too dark and others too bright.
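Every prompt above appears to follow the same semicolon-separated layout: an emotion block, a subject, a camera angle, a character type, a setting, and a style. The sketch below splits a prompt into those six parts so the requested camera angle can be read off and compared with the Shot description by hand. The field names and the `parse_prompt` helper are assumptions made for illustration; the post does not name the fields or describe its tooling.

```python
from dataclasses import dataclass


@dataclass
class PromptFields:
    # Field names are hypothetical labels for the six prompt segments.
    emotion: str    # e.g. "Excitement Excited, determined: Thrilled, anticipation"
    subject: str    # e.g. "A lone figure"
    camera: str     # e.g. "eye-level"
    character: str  # e.g. "Single Person"
    setting: str    # e.g. "bustling city street at night"
    style: str      # e.g. "cinematic"


def parse_prompt(prompt: str) -> PromptFields:
    """Split a prompt on ';' into its six assumed fields."""
    parts = [part.strip() for part in prompt.split(";")]
    if len(parts) != 6:
        raise ValueError(f"expected 6 fields, got {len(parts)}")
    return PromptFields(*parts)


if __name__ == "__main__":
    fields = parse_prompt(
        "Excitement Excited, determined: Thrilled, anticipation ; "
        "A lone figure; eye-level; Single Person; "
        "bustling city street at night; cinematic"
    )
    print(fields.camera)   # eye-level
    print(fields.setting)  # bustling city street at night
```

Keeping the emotion block as a single string is a deliberate simplification: its internal layout (category, expressions, mood words) is less regular than the semicolon-separated part.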
Conclusion
The results show that the generative AI model captured the intended aesthetic well, but struggled to understand the scene and to reproduce the requested camera position. Here’s a breakdown:
- Camera Position: The model scored 0.3, which is considered below average. This suggests that the model didn’t accurately capture the camera position described in the prompts.
- Shot Analysis: The model scored 0.4, which is also below average. This indicates that the model didn’t fully understand the scenes described in the prompts and didn’t create images that accurately reflect them.
- Aesthetic Analysis: The model scored 0.14, which is considered very good. This means that the generated images closely matched the expected aesthetic style, despite the issues with camera position and scene understanding.
Overall, the model seems to be better at capturing the desired aesthetic than understanding the scene and camera position. This suggests that the model might need further training to improve its ability to interpret and translate prompts into accurate visual representations.
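For a quick overview of the per-image numbers, the sketch below averages three of the metrics reported for the ten images above: Aesthetic Score, Prompt Clip Score, and Likelihood of AI. The values are copied from the image cards in order. The variable names are illustrative, and these averages are not the three scores in the breakdown above, which come from an evaluation this post does not detail.

```python
from statistics import mean

# Values transcribed from the ten image cards, in order of appearance.
aesthetic = [0.7, 0.7, 0.7, 0.7, 0.7, 0.7, 0.7, 0.6, 0.5, 0.6]
clip_score = [0.21, 0.25, 0.26, 0.22, 0.27, 0.28, 0.28, 0.23, 0.31, 0.27]
ai_likelihood = [0.20, 0.80, 0.20, 0.20, 0.20, 0.80, 0.20, 0.10, 0.20, 0.50]

summary = {
    "aesthetic": round(mean(aesthetic), 3),
    "clip_score": round(mean(clip_score), 3),
    "ai_likelihood": round(mean(ai_likelihood), 3),
}

if __name__ == "__main__":
    for name, value in summary.items():
        print(f"{name}: {value}")
    # aesthetic: 0.66
    # clip_score: 0.258
    # ai_likelihood: 0.34
```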