AI's Artistic Eye: Capturing Emotion, Not Camera Angles with Midjourney

Generative AI models are pushing the boundaries of creativity: they can produce images, text, and even music from user prompts. Accurately rendering a complex scene description, however, remains a challenge. This blog post examines how Midjourney handles ten detailed prompts, looking at three things: camera position, shot analysis (how well the image reflects the scene described), and aesthetic analysis. The model captures the requested aesthetic style well but struggles to reproduce camera angles and scene details. The examples and metrics below give a sense of the current capabilities and limitations of generative AI in image creation.

Created with: Midjourney

Lost in the Neon Glow: A Lonely Figure Walks the Wet Streets

A solitary figure traverses a rain-slicked city street, bathed in the vibrant glow of neon signs. Towering buildings cast long shadows, amplifying the sense of isolation and mystery. The figure’s retreating back adds to the feeling of loneliness, leaving the viewer to ponder their story.

Prompt

Realization A look of sudden understanding, perhaps tinged with sadness: Melancholy, introspective ; A lone figure; eye-level; Single Person; a bustling city street at night, with neon signs and rain reflecting on the wet pavement; cinematic
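All ten prompts in this post follow the same semicolon-separated template. The sketch below splits one into its parts, which makes it easier to compare what was requested (for example the shot type) with what the image shows. The field names (`expression`, `mood`, `subject`, `shot`, `category`, `setting`, `style`) are assumptions chosen for illustration; the post itself does not name them.

```python
from dataclasses import dataclass


@dataclass
class PromptParts:
    # Field names are hypothetical labels for the template slots.
    expression: str
    mood: str
    subject: str
    shot: str
    category: str
    setting: str
    style: str


def parse_prompt(prompt: str) -> PromptParts:
    """Split a 'Realization ...: mood ; subject; shot; category; setting; style' prompt."""
    fields = [part.strip() for part in prompt.split(";")]
    if len(fields) != 6:
        raise ValueError(f"expected 6 ';'-separated fields, got {len(fields)}")
    head, subject, shot, category, setting, style = fields

    # The first field holds the facial expression and the mood, separated by ':'.
    expression, sep, mood = head.partition(":")
    if not sep:
        raise ValueError("first field has no ':' between expression and mood")
    if expression.startswith("Realization"):
        expression = expression[len("Realization"):]

    return PromptParts(expression.strip(), mood.strip(), subject, shot,
                       category, setting, style)


example = parse_prompt(
    "Realization A look of sudden understanding, perhaps tinged with sadness: "
    "Melancholy, introspective ; A lone figure; eye-level; Single Person; "
    "a bustling city street at night, with neon signs and rain reflecting "
    "on the wet pavement; cinematic"
)
print(example.shot)  # eye-level
print(example.mood)  # Melancholy, introspective
```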

Characteristic

Shot : A lonely figure walks down a rainy city street at night, surrounded by glowing neon signs, bustling with activity.

Aesthetic Score : 0.7

Mood : lonely, melancholic, urban

Quality

Entropy : 6.32

Noise : 123

Prompt Clip Score : 0.26

AI Evaluation

Likelihood of AI : 0.90

Image errors : The image suffers from some digital art artifacts, mainly noticeable on the person, giving it a slightly unrealistic look, and the reflections on the water surface seem unnatural.
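The post does not say how the Entropy figure is computed. The reported values (between roughly 5.6 and 7.0) are consistent with the Shannon entropy of an 8-bit grayscale histogram, which ranges from 0 to 8 bits, so the sketch below assumes that definition. The function name `shannon_entropy` and the toy pixel lists are illustrative only.

```python
import math
from collections import Counter
from typing import Iterable


def shannon_entropy(pixels: Iterable[int]) -> float:
    """Shannon entropy, in bits, of a sequence of grayscale pixel values."""
    counts = Counter(pixels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no pixels given")
    # H = sum over observed values of p * log2(1 / p), with p = count / total.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


flat_gray = [128] * 64           # a single flat tone carries no information
full_range = list(range(256))    # every 8-bit level used equally often

print(shannon_entropy(flat_gray))   # 0.0
print(shannon_entropy(full_range))  # 8.0
```

Under this assumption, higher entropy means the image uses a wider and more even spread of tones, which fits the busy, detailed scenes in this post scoring above 6.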

A Moment of Contemplation at Sunset’s Embrace

A solitary figure in a red suit sits on the edge of a skyscraper, gazing out at a sprawling cityscape bathed in the golden hues of sunset. The scene evokes a sense of tranquility and epic grandeur, as the figure contemplates the vastness of the city below.

Prompt

Realization A determined gaze, a sense of responsibility dawning on their face: Triumphant, awe-inspiring ; A superhero, standing atop a skyscraper; wide shot; Hero; a sprawling cityscape bathed in the golden light of sunset; cinematic

Characteristic

Shot : A superhero in a red and blue suit is sitting on the edge of a building overlooking a sprawling cityscape. The sun is setting in the background, casting a warm glow over the scene.

Aesthetic Score : 0.7

Mood : epic, heroic, powerful

Quality

Entropy : 5.66

Noise : 104

Prompt Clip Score : 0.25

AI Evaluation

Likelihood of AI : 0.80

Image errors : There are some minor artifacts visible in the image, particularly around the edges of the buildings.

Lost in the Shadows of a Messy Kitchen

A young woman, her dark hair framing a contemplative gaze, sits amidst the clutter of a neglected kitchen. The image, intentionally dark and moody, evokes a sense of melancholy and isolation, leaving the viewer to ponder the unspoken story behind her somber expression.

Prompt

Realization A look of weary acceptance, a realization of the mundane nature of life: Disillusioned, resigned ; A young woman, sitting at a kitchen table; close-up; Normal People; a cluttered kitchen, with dishes piled in the sink and a half-eaten meal on the table; cinematic

Characteristic

Shot : A young woman sits at a table in a kitchen, staring directly at the camera. The table is cluttered with dishes and food scraps, creating a sense of disarray. The kitchen itself is dimly lit, with a stove and cabinets visible in the background.

Aesthetic Score : 0.7

Mood : melancholy, lonely, introspective

Quality

Entropy : 6.52

Noise : 106

Prompt Clip Score : 0.29

AI Evaluation

Likelihood of AI : 0.10

Image errors : Some graininess and noise are present, especially in the darker areas of the image, but they are not overly distracting.

The Hacker’s Focus

A young man, bathed in the glow of his computer screen, is locked in a moment of intense concentration. The low lighting and close-up framing heighten the suspense, leaving the viewer wondering what secrets he’s uncovering.

Prompt

Realization A look of sudden insight, a realization of a strategy or a solution: Intense, focused ; A gamer, hunched over a computer screen; close-up; Gamer; a dimly lit room, with flashing lights from the monitor and empty pizza boxes scattered around; cinematic

Characteristic

Shot : A young man is looking intently at a computer screen, illuminated by the blue light. A pizza box is in the foreground, suggesting a late-night gaming or coding session.

Aesthetic Score : 0.6

Mood : intense, focused, techy

Quality

Entropy : 6.18

Noise : 66

Prompt Clip Score : 0.21

AI Evaluation

Likelihood of AI : 0.10

Image errors : No significant image errors; minor artifacts are present but insignificant.

Lost in the Crowd: A Man’s Solitary Journey

A lone figure stands amidst the bustling chaos of a train station, his isolation emphasized by the blurred movement of the surrounding crowd. The scene evokes a sense of loneliness, mystery, and melancholic reflection, with the motion blur adding a touch of urgency and highlighting the man’s detachment from the world around him.

Prompt

Realization A look of confusion, a realization of his own insignificance in the grand scheme of things: Lost, alienated ; A man, walking through a crowded train station; eye-level; Single Person; a sea of faces, all rushing in different directions; cinematic

Characteristic

Shot : A man stands in a train station, surrounded by blurred figures of people walking past. The scene is black and white, with a sense of isolation and loneliness.

Aesthetic Score : 0.7

Mood : melancholy, contemplative, lonely

Quality

Entropy : 6.04

Noise : 113

Prompt Clip Score : 0.26

AI Evaluation

Likelihood of AI : 0.10

Image errors : No significant image errors

Captain America: A Hero Amidst the Flames

A powerful image captures Captain America standing resolute in a war-torn landscape, the fiery explosion behind him highlighting his determination. The dramatic contrast between hero and destruction creates a sense of urgency and intensity, showcasing his unwavering spirit in the face of chaos.

Prompt

Realization A look of fierce determination, a realization of the gravity of the situation: Determined, resolute ; A superhero, standing in the middle of a battle; wide shot; Hero; a chaotic scene of destruction and explosions, with enemies closing in; cinematic

Characteristic

Shot : Captain America standing in a war-torn environment with fire and debris in the background

Aesthetic Score : 0.7

Mood : intense, dramatic, heroic

Quality

Entropy : 6.97

Noise : 97

Prompt Clip Score : 0.22

AI Evaluation

Likelihood of AI : 0.80

Image errors : Some artifacts and blurring are visible in the background, particularly around the fire and debris.

The Warmth of Family Gatherings

A dimly lit dining room, rustic charm, and a family gathered around a table enjoying a meal. This image captures the intimate and nostalgic feeling of shared moments, evoking a sense of warmth and connection.

Prompt

Realization A look of shared understanding, a realization of the importance of family and connection: Nostalgic, heartwarming ; A family, gathered around a dinner table; medium shot; Normal People; a warm and inviting kitchen, with the aroma of home-cooked food filling the air; cinematic

Characteristic

Shot : A family is gathered around a table in a dimly lit dining room, eating a meal.

Aesthetic Score : 0.6

Mood : rustic, nostalgic, somber

Quality

Entropy : 6.40

Noise : 104

Prompt Clip Score : 0.25

AI Evaluation

Likelihood of AI : 0.10

Image errors : The image is slightly overexposed, especially in the background. The lighting is a bit harsh and there is some noise in the shadows.

Intense Focus in the Shadows

A man, bathed in the dim glow of a computer screen, is consumed by his task. The low light and close-up shot create a palpable sense of suspense, drawing you into the mystery unfolding before your eyes.

Prompt

Realization A look of disappointment, a realization of failure and the need to try again: Defeated, frustrated ; A gamer, staring at a blank screen; close-up; Gamer; a dimly lit room, with the only light coming from the monitor, which is now displaying a game over message; cinematic

Characteristic

Shot : A man is looking intently at a laptop screen in a dimly lit room. The focus is on his face, which is illuminated by the laptop screen.

Aesthetic Score : 0.6

Mood : intense, focused, mysterious

Quality

Entropy : 6.21

Noise : 97

Prompt Clip Score : 0.21

AI Evaluation

Likelihood of AI : 0.20

Image errors : No noticeable artifacts or errors.

Silhouetted Solitude: A Woman Finds Peace at Sunset

A lone figure stands on a cliff, bathed in the golden light of the setting sun. The vast ocean stretches out before her, reflecting the tranquility of the moment. This image captures the essence of solitude and contemplation, offering a glimpse into a peaceful escape from the world.

Prompt

Realization A look of peace, a realization of the vastness of the world and the smallness of her own problems: Reflective, contemplative ; A woman, standing on a cliff overlooking the ocean; eye-level; Single Person; a vast expanse of blue water stretching out to the horizon, with the sun setting in the distance; cinematic

Characteristic

Shot : A woman standing on a cliff overlooking the ocean, with the sun shining in the distance.

Aesthetic Score : 0.7

Mood : peaceful, tranquil, contemplative

Quality

Entropy : 6.79

Noise : 118

Prompt Clip Score : 0.23

AI Evaluation

Likelihood of AI : 0.10

Image errors : No noticeable artifacts or errors.

Hope Amidst the Ruins: A Lone Figure Stands Tall

A solitary figure, cloaked in red, stands defiant amidst a field of rubble, the remnants of a shattered city. The setting sun casts a warm glow on the scene, offering a glimmer of hope against the backdrop of dark clouds and smoke. This powerful image captures the resilience of the human spirit in the face of destruction.

Prompt

Realization A look of resolve, a realization of the need to rebuild and create a better future: Hopeful, determined ; A superhero, standing in the ruins of a city; wide shot; Hero; a desolate landscape, with smoke rising from the rubble and the sun breaking through the clouds; cinematic

Characteristic

Shot : A lone figure, possibly a superhero, stands amidst the ruins of a city. The sky is filled with smoke and a golden light, suggesting a recent disaster.

Aesthetic Score : 0.6

Mood : dramatic, hopeful, melancholic

Quality

Entropy : 6.26

Noise : 118

Prompt Clip Score : 0.30

AI Evaluation

Likelihood of AI : 0.80

Image errors : The image appears to have some minor artifacts, particularly in the smoke and the debris.

Conclusion

The results show that the generative AI model performed well on aesthetic analysis but struggled with camera position and shot analysis.

Here’s a breakdown (the ratings suggest that a lower score indicates a closer match to the prompt):

  • Camera Position: The model scored 0.3, which is considered below average. The generated images often did not match the camera position requested in the prompt.
  • Shot Analysis: The model scored 0.45, which is also below average. The model did not fully capture the scene described in the prompt, so the images did not accurately reflect it.
  • Aesthetic Analysis: The model scored 0.14, which is considered very good. The generated images closely matched the aesthetic style requested in the prompt.

Overall, the model seems to be better at understanding and capturing the desired aesthetic style than it is at accurately representing camera positions and scene descriptions.
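As a complementary view, the per-image metrics listed above can be averaged by the shot type requested in each prompt. This is a minimal sketch, not the method behind the three scores in the breakdown (the post does not say how those were derived); the tuple layout and the `summarize` helper are assumptions made for illustration, while the numbers are copied from the ten examples.

```python
from collections import defaultdict
from statistics import mean

# (requested shot type, aesthetic score, prompt CLIP score, likelihood of AI),
# in the order the images appear in this post.
RESULTS = [
    ("eye-level", 0.7, 0.26, 0.90),
    ("wide shot", 0.7, 0.25, 0.80),
    ("close-up", 0.7, 0.29, 0.10),
    ("close-up", 0.6, 0.21, 0.10),
    ("eye-level", 0.7, 0.26, 0.10),
    ("wide shot", 0.7, 0.22, 0.80),
    ("medium shot", 0.6, 0.25, 0.10),
    ("close-up", 0.6, 0.21, 0.20),
    ("eye-level", 0.7, 0.23, 0.10),
    ("wide shot", 0.6, 0.30, 0.80),
]


def summarize(results):
    """Return {shot type: (mean aesthetic, mean CLIP, mean AI likelihood)}."""
    grouped = defaultdict(list)
    for shot, aesthetic, clip, ai in results:
        grouped[shot].append((aesthetic, clip, ai))
    return {
        shot: tuple(round(mean(column), 3) for column in zip(*rows))
        for shot, rows in grouped.items()
    }


by_shot = summarize(RESULTS)
overall = tuple(
    round(mean(column), 3) for column in zip(*(row[1:] for row in RESULTS))
)

for shot, averages in by_shot.items():
    print(shot, averages)
print("overall", overall)  # overall (0.66, 0.248, 0.4)
```

On this data the three wide-shot superhero images each receive an AI likelihood of 0.80, while the close-ups stay at 0.20 or below, and the mean aesthetic score across all ten images is 0.66.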
