AI's Artistic Eye: Capturing Aesthetics, But Struggling with Camera Shots with Midjourney

AI Image Generation: A Tale of Two Shots with Midjourney

Contents

In the realm of AI image generation, the ability to accurately interpret camera positions and shot descriptions is crucial for creating realistic and engaging visuals. However, recent experiments have revealed a fascinating dichotomy: while AI excels at capturing the desired aesthetic, it struggles with accurately translating camera positions into the generated images. This blog post explores this phenomenon, examining the strengths and weaknesses of AI in this domain.

Created with: midjourney

Silhouetted Against the Setting Sun: A Lone Figure in a Desolate Landscape

A solitary figure walks across a barren, dusty landscape, silhouetted against a vibrant yellow sunset. The dramatic composition evokes a sense of epic loneliness and isolation, capturing the raw beauty of a desolate world.

Silhouetted Against the Setting Sun: A Lone Figure in a Desolate Landscape

Prompt

Two-shot Two-shot: Epic, hopeful, determined ; A lone hero, silhouetted against the setting sun; Two-shot; Heroism; A vast, desolate landscape; cinematic

Characteristic

Shot : A lone figure in a dark silhouette stands on a rocky hill with a large sun in the background, against a bright red sky

Aesthetic Score : 0.7

Mood : dramatic, epic, lonely

Quality

Entropy : 5.94

Noise : 94

Prompt Clip Score : 0.23

AI Evaluation

Likelihood of AI : 0.80

Image errors : The image is slightly blurry, and the figure’s silhouette is not very detailed.

Awe-Inspiring Waterfall: A Couple’s Serene Escape

A couple stands mesmerized before a majestic waterfall, enveloped by lush greenery. The misty spray and vibrant foliage create a sense of mystery and wonder, capturing the essence of peace and tranquility.

Awe-Inspiring Waterfall: A Couple’s Serene Escape

Prompt

Two-shot Two-shot: Wonder, excitement, awe ; Two adventurers, gazing in awe at a towering waterfall; Two-shot; Adventure; Lush, tropical rainforest; cinematic

Characteristic

Shot : Two people stand in front of a tall waterfall in a lush forest. The waterfall is the main focal point, with the people providing scale and a sense of human presence. The surrounding forest is green and dense, suggesting a secluded and tranquil environment.

Aesthetic Score : 0.8

Mood : serene, mysterious, adventurous

Quality

Entropy : 6.64

Noise : 120

Prompt Clip Score : 0.33

AI Evaluation

Likelihood of AI : 0.20

Image errors : No visible artifacts or errors in the image.

Lost in the Neon Glow: A Gamer’s Focus

A young man, bathed in the vibrant hues of pink and blue neon, is completely engrossed in his video game. The dimly lit room and blurred background create a sense of immersion, highlighting the intensity and futuristic aesthetic of the scene.

Lost in the Neon Glow: A Gamer’s Focus

Prompt

Two-shot Two-shot: Intense, focused, competitive ; Two gamers, intensely focused on a screen, controllers in hand; Two-shot; Gaming; A dimly lit room with neon lights; cinematic

Characteristic

Shot : A young man in a hoodie and headphones is playing a video game in a dimly lit room. There are neon lights casting a purple and blue glow over the scene. Another man is visible in the background, also playing a video game.

Aesthetic Score : 0.7

Mood : intense, focused, futuristic

Quality

Entropy : 6.59

Noise : 77

Prompt Clip Score : 0.27

AI Evaluation

Likelihood of AI : 0.20

Image errors : Slight noise and graininess are noticeable in the image. The sharpness could be improved, particularly in the background.

Love in Venice: A Selfie with St. Mark’s Basilica

A couple captures their romantic adventure in Venice with a selfie in front of the iconic St. Mark’s Basilica. Their wide smiles and the beautiful setting radiate joy and carefree happiness.

Love in Venice: A Selfie with St. Mark’s Basilica

Prompt

Two-shot Two-shot: Happy, carefree, celebratory ; Two tourists, smiling and taking a selfie in front of a famous landmark; Two-shot; Tourism; A bustling city square; cinematic

Characteristic

Shot : A young couple taking a selfie in front of a large building in Venice, Italy.

Aesthetic Score : 0.7

Mood : happy, romantic, adventurous

Quality

Entropy : 6.96

Noise : 77

Prompt Clip Score : 0.28

AI Evaluation

Likelihood of AI : 0.10

Image errors : There are minor artifacts on the man’s jacket. The focus is slightly off, with the man being more in focus than the woman.

Laughter and Lanterns: A Night Market Moment

Two young women share a joyous laugh amidst the vibrant glow of a bustling night market. Warm lighting from the lanterns creates an intimate and captivating atmosphere, highlighting their infectious smiles and the festive spirit of the scene.

Laughter and Lanterns: A Night Market Moment

Prompt

Two-shot Two-shot: Joyful, adventurous, curious ; Two friends, sharing a laugh as they explore a foreign city; Two-shot; Travel; A vibrant, colorful street market; cinematic

Characteristic

Shot : Two young women are laughing at each other in a night market setting, lit by colorful lanterns.

Aesthetic Score : 0.75

Mood : happy, carefree, joyful

Quality

Entropy : 6.43

Noise : 84

Prompt Clip Score : 0.31

AI Evaluation

Likelihood of AI : 0.20

Image errors : No significant errors.

Cheers to Friendship: A Toast in the Warm Glow of a Pub

Capture the joy and intimacy of a group of friends raising a toast in a dimly lit pub. The warm lighting and cozy atmosphere create a sense of camaraderie, while the focus on the hands holding drinks adds a touch of anticipation and celebration.

Cheers to Friendship: A Toast in the Warm Glow of a Pub

Prompt

Two-shot Two-shot: Warm, celebratory, intimate ; A group of friends, raising their glasses in a toast; Two-shot; Groups; A cozy, dimly lit pub; cinematic

Characteristic

Shot : A group of friends toasting with drinks in a dimly lit bar or restaurant.

Aesthetic Score : 0.7

Mood : warm, intimate, celebratory

Quality

Entropy : 6.06

Noise : 110

Prompt Clip Score : 0.26

AI Evaluation

Likelihood of AI : 0.10

Image errors : None

Two Astronauts, One Unseen Threat: A Glimpse into a Tense Space Mission

A low-angle shot captures two astronauts in their spacesuits, their faces etched with seriousness, as they navigate a futuristic spaceship corridor. The imposing size of the astronauts and their intense focus create a palpable sense of suspense, hinting at a hidden danger lurking within the vastness of space.

Two Astronauts, One Unseen Threat: A Glimpse into a Tense Space Mission

Prompt

Two-shot Two-shot: Serious, focused, determined ; Two astronauts, working together in a space station; Two-shot; Heroism; The vast emptiness of space; cinematic

Characteristic

Shot : Two astronauts in white space suits are standing in a spacecraft looking out a window. The spacecraft interior is lit with a soft blue and orange light, creating a futuristic atmosphere.

Aesthetic Score : 0.7

Mood : serious, suspenseful, futuristic

Quality

Entropy : 6.44

Noise : 112

Prompt Clip Score : 0.25

AI Evaluation

Likelihood of AI : 0.80

Image errors : Slight blurring in the background and some artifacts around the astronaut’s helmets.

Lost in the Verdant Labyrinth

Two figures, shrouded in mystery, navigate a dense jungle path. Backlighting and shadowy silhouettes create an atmosphere of suspense and intrigue, hinting at an adventurous journey through the unknown.

Lost in the Verdant Labyrinth

Prompt

Two-shot Two-shot: Suspenseful, adventurous, determined ; Two explorers, navigating a treacherous jungle path; Two-shot; Adventure; Dense, overgrown jungle; cinematic

Characteristic

Shot : Two people are walking on a muddy path in a lush green forest. The path is narrow and surrounded by dense foliage. The people are wearing backpacks and hats, and they are walking in the same direction.

Aesthetic Score : 0.8

Mood : mysterious, adventurous, tranquil

Quality

Entropy : 6.62

Noise : 117

Prompt Clip Score : 0.25

AI Evaluation

Likelihood of AI : 0.20

Image errors : No noticeable errors

Victory High Five: Neon Lights and Gaming Glory

Two young gamers celebrate a hard-fought victory with a high five, bathed in the glow of neon lights. The energy is palpable, capturing the thrill of competition and the joy of success.

Victory High Five: Neon Lights and Gaming Glory

Prompt

Two-shot Two-shot: Excited, triumphant, celebratory ; Two gamers, celebrating a victory with a high-five; Two-shot; Gaming; A brightly lit gaming room with colorful lights; cinematic

Characteristic

Shot : Two young men, wearing headphones, in a gaming setup with colorful LED lights, are high-fiving each other, a video game screen is visible in the background, one of them is looking directly at the camera

Aesthetic Score : 0.7

Mood : joyful, excited, celebratory

Quality

Entropy : 6.86

Noise : 106

Prompt Clip Score : 0.32

AI Evaluation

Likelihood of AI : 0.10

Image errors : No significant errors are visible, the image is well-exposed and the colors are vibrant.

A Timeless Romance: Silhouettes at Sunset

Experience the serenity of a romantic evening as a couple stands silhouetted against a vibrant sunset on a peaceful beach. The crashing waves add to the tranquility, while the dramatic effect of the silhouettes creates an intimate and mysterious atmosphere.

A Timeless Romance: Silhouettes at Sunset

Prompt

Two-shot Two-shot: Peaceful, romantic, contemplative ; Two travelers, gazing out at a breathtaking sunset over the ocean; Two-shot; Travel; A serene beach with golden sand; cinematic

Characteristic

Shot : A couple sits on a beach, watching the sunset over the ocean.

Aesthetic Score : 0.7

Mood : romantic, peaceful, serene

Quality

Entropy : 6.83

Noise : 101

Prompt Clip Score : 0.28

AI Evaluation

Likelihood of AI : 0.20

Image errors : There is a slight amount of noise in the image, particularly in the shadows, which detracts from the overall sharpness.

Conclusion

The results show that the generative AI model performed okay in terms of camera position and shot analysis, but very well in terms of aesthetic analysis. Here’s a breakdown:

  • Camera Position Analysis: The score of 0.3 indicates that the model’s ability to react to camera positions in the prompt is below average. A score between 0.5 and 0.75 would be considered good, and above 0.75 very good.
  • Shot Analysis: The score of 0.52 indicates that the model’s ability to understand the scene in a prompt is average. A score between 0.5 and 0.75 would be considered good, and above 0.75 very good.
  • Aesthetic Analysis: The score of 0.03 indicates that the model is very good at producing images that match the expected aesthetic. A score between -0.2 and 0.1 is considered very good.

Overall, the model seems to be better at capturing the desired aesthetic than accurately interpreting camera positions and shot descriptions.

Sources: