AI's Artistic Eye: Capturing Emotion, Not Camera Angles with Midjourney
- 10 minutes read - 2028 wordsTable of Contents
In the realm of artificial intelligence, generative models are pushing the boundaries of creativity. These models can generate images, text, and even music based on user prompts. However, their ability to accurately capture complex scene descriptions remains a challenge. This blog post examines the performance of a generative AI model in creating images based on detailed prompts, focusing on its strengths and weaknesses in capturing camera positions, shot analysis, and aesthetic analysis. We’ll explore how the model excels in capturing the desired aesthetic style while struggling with accurately representing camera angles and scene details. Through examples and analysis, we’ll gain insights into the current capabilities and limitations of generative AI in image creation.
Created with: midjourney
Lost in the Neon Glow: A Lonely Figure Walks the Wet Streets
A solitary figure traverses a rain-slicked city street, bathed in the vibrant glow of neon signs. Towering buildings cast long shadows, amplifying the sense of isolation and mystery. The figure’s retreating back adds to the feeling of loneliness, leaving the viewer to ponder their story.
Prompt
Realization A look of sudden understanding, perhaps tinged with sadness: Melancholy, introspective ; A lone figure; eye-level; Single Person; a bustling city street at night, with neon signs and rain reflecting on the wet pavement; cinematic
Characteristic
Shot : A lonely figure walks down a rainy city street at night, surrounded by glowing neon signs, bustling with activity.
Aesthetic Score : 0.7
Mood : lonely, melancholic, urban
Quality
Entropy : 6.32
Noise : 123
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image suffers from some digital art artifacts, mainly noticeable on the person, giving it a slightly unrealistic look, and the reflections on the water surface seem unnatural.
A Moment of Contemplation at Sunset’s Embrace
A solitary figure in a red suit sits on the edge of a skyscraper, gazing out at a sprawling cityscape bathed in the golden hues of sunset. The scene evokes a sense of tranquility and epic grandeur, as the figure contemplates the vastness of the city below.
Prompt
Realization A determined gaze, a sense of responsibility dawning on their face: Triumphant, awe-inspiring ; A superhero, standing atop a skyscraper; wide shot; Hero; a sprawling cityscape bathed in the golden light of sunset; cinematic
Characteristic
Shot : A superhero in a red and blue suit is sitting on the edge of a building overlooking a sprawling cityscape. The sun is setting in the background, casting a warm glow over the scene.
Aesthetic Score : 0.7
Mood : epic, heroic, powerful
Quality
Entropy : 5.66
Noise : 104
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are some minor artifacts visible in the image, particularly around the edges of the buildings.
Lost in the Shadows of a Messy Kitchen
A young woman, her dark hair framing a contemplative gaze, sits amidst the clutter of a neglected kitchen. The image, intentionally dark and moody, evokes a sense of melancholy and isolation, leaving the viewer to ponder the unspoken story behind her somber expression.
Prompt
Realization A look of weary acceptance, a realization of the mundane nature of life: Disillusioned, resigned ; A young woman, sitting at a kitchen table; close-up; Normal People; a cluttered kitchen, with dishes piled in the sink and a half-eaten meal on the table; cinematic
Characteristic
Shot : A young woman sits at a table in a kitchen, staring directly at the camera. The table is cluttered with dishes and food scraps, creating a sense of disarray. The kitchen itself is dimly lit, with a stove and cabinets visible in the background.
Aesthetic Score : 0.7
Mood : melancholy, lonely, introspective
Quality
Entropy : 6.52
Noise : 106
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some graininess and noise is present, especially in the darker areas of the image, but it’s not overly distracting.
The Hacker’s Focus
A young man, bathed in the glow of his computer screen, is locked in a moment of intense concentration. The low lighting and close-up framing heighten the suspense, leaving the viewer wondering what secrets he’s uncovering.
Prompt
Realization A look of sudden insight, a realization of a strategy or a solution: Intense, focused ; A gamer, hunched over a computer screen; close-up; Gamer; a dimly lit room, with flashing lights from the monitor and empty pizza boxes scattered around; cinematic
Characteristic
Shot : A young man is looking intently at a computer screen, illuminated by the blue light. A pizza box is in the foreground, suggesting a late-night gaming or coding session.
Aesthetic Score : 0.6
Mood : intense, focused, techy
Quality
Entropy : 6.18
Noise : 66
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant image errors; minor artifacts are present but insignificant
Lost in the Crowd: A Man’s Solitary Journey
A lone figure stands amidst the bustling chaos of a train station, his isolation emphasized by the blurred movement of the surrounding crowd. The scene evokes a sense of loneliness, mystery, and melancholic reflection, with the motion blur adding a touch of urgency and highlighting the man’s detachment from the world around him.
Prompt
Realization A look of confusion, a realization of his own insignificance in the grand scheme of things: Lost, alienated ; A man, walking through a crowded train station; eye-level; Single Person; a sea of faces, all rushing in different directions; cinematic
Characteristic
Shot : A man stands in a train station, surrounded by blurred figures of people walking past. The scene is black and white, with a sense of isolation and loneliness.
Aesthetic Score : 0.7
Mood : melancholy, contemplative, lonely
Quality
Entropy : 6.04
Noise : 113
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant image errors
Captain America: A Hero Amidst the Flames
A powerful image captures Captain America standing resolute in a war-torn landscape, the fiery explosion behind him highlighting his unwavering determination. The dramatic contrast between hero and destruction creates a sense of urgency and intensity, showcasing the hero’s unwavering spirit in the face of chaos.
Prompt
Realization A look of fierce determination, a realization of the gravity of the situation: Determined, resolute ; A superhero, standing in the middle of a battle; wide shot; Hero; a chaotic scene of destruction and explosions, with enemies closing in; cinematic
Characteristic
Shot : Captain America standing in a war-torn environment with fire and debris in the background
Aesthetic Score : 0.7
Mood : intense, dramatic, heroic
Quality
Entropy : 6.97
Noise : 97
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.80
Image errors : Some artifacts and blurring are visible in the background, particularly around the fire and debris.
The Warmth of Family Gatherings
A dimly lit dining room, rustic charm, and a family gathered around a table enjoying a meal. This image captures the intimate and nostalgic feeling of shared moments, evoking a sense of warmth and connection.
Prompt
Realization A look of shared understanding, a realization of the importance of family and connection: Nostalgic, heartwarming ; A family, gathered around a dinner table; medium shot; Normal People; a warm and inviting kitchen, with the aroma of home-cooked food filling the air; cinematic
Characteristic
Shot : A family is gathered around a table in a dimly lit dining room, eating a meal.
Aesthetic Score : 0.6
Mood : rustic, nostalgic, somber
Quality
Entropy : 6.40
Noise : 104
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly overexposed, especially in the background. The lighting is a bit harsh and there is some noise in the shadows.
Intense Focus in the Shadows
A man, bathed in the dim glow of a computer screen, is consumed by his task. The low light and close-up shot create a palpable sense of suspense, drawing you into the mystery unfolding before your eyes.
Prompt
Realization A look of disappointment, a realization of failure and the need to try again: Defeated, frustrated ; A gamer, staring at a blank screen; close-up; Gamer; a dimly lit room, with the only light coming from the monitor, which is now displaying a game over message; cinematic
Characteristic
Shot : A man is looking intently at a laptop screen in a dimly lit room. The focus is on his face, which is illuminated by the laptop screen.
Aesthetic Score : 0.6
Mood : intense, focused, mysterious
Quality
Entropy : 6.21
Noise : 97
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable artifacts or errors.
Silhouetted Solitude: A Woman Finds Peace at Sunset
A lone figure stands on a cliff, bathed in the golden light of the setting sun. The vast ocean stretches out before her, reflecting the tranquility of the moment. This image captures the essence of solitude and contemplation, offering a glimpse into a peaceful escape from the world.
Prompt
Realization A look of peace, a realization of the vastness of the world and the smallness of her own problems: Reflective, contemplative ; A woman, standing on a cliff overlooking the ocean; eye-level; Single Person; a vast expanse of blue water stretching out to the horizon, with the sun setting in the distance; cinematic
Characteristic
Shot : A woman standing on a cliff overlooking the ocean, with the sun shining in the distance.
Aesthetic Score : 0.7
Mood : peaceful, tranquil, contemplative
Quality
Entropy : 6.79
Noise : 118
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Hope Amidst the Ruins: A Lone Figure Stands Tall
A solitary figure, cloaked in red, stands defiant amidst a field of rubble, the remnants of a shattered city. The setting sun casts a warm glow on the scene, offering a glimmer of hope against the backdrop of dark clouds and smoke. This powerful image captures the resilience of the human spirit in the face of destruction.
Prompt
Realization A look of resolve, a realization of the need to rebuild and create a better future: Hopeful, determined ; A superhero, standing in the ruins of a city; wide shot; Hero; a desolate landscape, with smoke rising from the rubble and the sun breaking through the clouds; cinematic
Characteristic
Shot : A lone figure, possibly a superhero, stands amidst the ruins of a city. The sky is filled with smoke and a golden light, suggesting a recent disaster.
Aesthetic Score : 0.6
Mood : dramatic, hopeful, melancholic
Quality
Entropy : 6.26
Noise : 118
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to have some minor artifacts, particularly in the smoke and the debris.
Conclusion
The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis.
Here’s a breakdown:
- Camera Position: The model scored 0.3, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.45, which is also below average. This indicates that the model didn’t fully understand the scene described in the prompt and didn’t create an image that accurately reflected it.
- Aesthetic Analysis: The model scored 0.14, which is considered very good. This means that the generated image closely matched the expected aesthetic style described in the prompt.
Overall, the model seems to be better at understanding and capturing the desired aesthetic style than it is at accurately representing camera positions and scene descriptions.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://midjourney.com