AI's Artistic Eye: Capturing Aesthetics, But Struggling with Camera Shots with Midjourney
In AI image generation, a model’s ability to interpret camera positions and shot descriptions is crucial for producing realistic, engaging visuals. The experiments in this post, however, reveal a split: the model captures the desired aesthetic well but struggles to translate the requested camera position into the generated image. This post examines that gap using ten Midjourney images, each generated from a prompt that asks for a two-shot.
Created with: Midjourney
Silhouetted Against the Setting Sun: A Lone Figure in a Desolate Landscape
A solitary figure walks across a barren, dusty landscape, silhouetted against a vibrant yellow sunset. The dramatic composition evokes a sense of epic loneliness and isolation, capturing the raw beauty of a desolate world.
Prompt
Two-shot Two-shot: Epic, hopeful, determined ; A lone hero, silhouetted against the setting sun; Two-shot; Heroism; A vast, desolate landscape; cinematic
Characteristic
Shot : A lone figure in a dark silhouette stands on a rocky hill with a large sun in the background, against a bright red sky
Aesthetic Score : 0.7
Mood : dramatic, epic, lonely
Quality
Entropy : 5.94
Noise : 94
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image is slightly blurry, and the figure’s silhouette is not very detailed.
Awe-Inspiring Waterfall: A Couple’s Serene Escape
A couple stands mesmerized before a majestic waterfall, enveloped by lush greenery. The misty spray and vibrant foliage create a sense of mystery and wonder, capturing the essence of peace and tranquility.
Prompt
Two-shot Two-shot: Wonder, excitement, awe ; Two adventurers, gazing in awe at a towering waterfall; Two-shot; Adventure; Lush, tropical rainforest; cinematic
Characteristic
Shot : Two people stand in front of a tall waterfall in a lush forest. The waterfall is the main focal point, with the people providing scale and a sense of human presence. The surrounding forest is green and dense, suggesting a secluded and tranquil environment.
Aesthetic Score : 0.8
Mood : serene, mysterious, adventurous
Quality
Entropy : 6.64
Noise : 120
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors in the image.
Lost in the Neon Glow: A Gamer’s Focus
A young man, bathed in the vibrant hues of pink and blue neon, is completely engrossed in his video game. The dimly lit room and blurred background create a sense of immersion, highlighting the intensity and futuristic aesthetic of the scene.
Prompt
Two-shot Two-shot: Intense, focused, competitive ; Two gamers, intensely focused on a screen, controllers in hand; Two-shot; Gaming; A dimly lit room with neon lights; cinematic
Characteristic
Shot : A young man in a hoodie and headphones is playing a video game in a dimly lit room. There are neon lights casting a purple and blue glow over the scene. Another man is visible in the background, also playing a video game.
Aesthetic Score : 0.7
Mood : intense, focused, futuristic
Quality
Entropy : 6.59
Noise : 77
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : Slight noise and graininess are noticeable in the image. The sharpness could be improved, particularly in the background.
Love in Venice: A Selfie with St. Mark’s Basilica
A couple captures their romantic adventure in Venice with a selfie in front of the iconic St. Mark’s Basilica. Their wide smiles and the beautiful setting radiate joy and carefree happiness.
Prompt
Two-shot Two-shot: Happy, carefree, celebratory ; Two tourists, smiling and taking a selfie in front of a famous landmark; Two-shot; Tourism; A bustling city square; cinematic
Characteristic
Shot : A young couple taking a selfie in front of a large building in Venice, Italy.
Aesthetic Score : 0.7
Mood : happy, romantic, adventurous
Quality
Entropy : 6.96
Noise : 77
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are minor artifacts on the man’s jacket. The focus is slightly off, with the man being more in focus than the woman.
Laughter and Lanterns: A Night Market Moment
Two young women share a joyous laugh amidst the vibrant glow of a bustling night market. Warm lighting from the lanterns creates an intimate and captivating atmosphere, highlighting their infectious smiles and the festive spirit of the scene.
Prompt
Two-shot Two-shot: Joyful, adventurous, curious ; Two friends, sharing a laugh as they explore a foreign city; Two-shot; Travel; A vibrant, colorful street market; cinematic
Characteristic
Shot : Two young women are laughing at each other in a night market setting, lit by colorful lanterns.
Aesthetic Score : 0.75
Mood : happy, carefree, joyful
Quality
Entropy : 6.43
Noise : 84
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant errors.
Cheers to Friendship: A Toast in the Warm Glow of a Pub
Capture the joy and intimacy of a group of friends raising a toast in a dimly lit pub. The warm lighting and cozy atmosphere create a sense of camaraderie, while the focus on the hands holding drinks adds a touch of anticipation and celebration.
Prompt
Two-shot Two-shot: Warm, celebratory, intimate ; A group of friends, raising their glasses in a toast; Two-shot; Groups; A cozy, dimly lit pub; cinematic
Characteristic
Shot : A group of friends toasting with drinks in a dimly lit bar or restaurant.
Aesthetic Score : 0.7
Mood : warm, intimate, celebratory
Quality
Entropy : 6.06
Noise : 110
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : None
Two Astronauts, One Unseen Threat: A Glimpse into a Tense Space Mission
A low-angle shot captures two astronauts in their spacesuits, their faces etched with seriousness, as they navigate a futuristic spaceship corridor. The imposing size of the astronauts and their intense focus create a palpable sense of suspense, hinting at a hidden danger lurking within the vastness of space.
Prompt
Two-shot Two-shot: Serious, focused, determined ; Two astronauts, working together in a space station; Two-shot; Heroism; The vast emptiness of space; cinematic
Characteristic
Shot : Two astronauts in white space suits are standing in a spacecraft looking out a window. The spacecraft interior is lit with a soft blue and orange light, creating a futuristic atmosphere.
Aesthetic Score : 0.7
Mood : serious, suspenseful, futuristic
Quality
Entropy : 6.44
Noise : 112
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.80
Image errors : Slight blurring in the background and some artifacts around the astronauts’ helmets.
Lost in the Verdant Labyrinth
Two figures, shrouded in mystery, navigate a dense jungle path. Backlighting and shadowy silhouettes create an atmosphere of suspense and intrigue, hinting at an adventurous journey through the unknown.
Prompt
Two-shot Two-shot: Suspenseful, adventurous, determined ; Two explorers, navigating a treacherous jungle path; Two-shot; Adventure; Dense, overgrown jungle; cinematic
Characteristic
Shot : Two people are walking on a muddy path in a lush green forest. The path is narrow and surrounded by dense foliage. The people are wearing backpacks and hats, and they are walking in the same direction.
Aesthetic Score : 0.8
Mood : mysterious, adventurous, tranquil
Quality
Entropy : 6.62
Noise : 117
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors
Victory High Five: Neon Lights and Gaming Glory
Two young gamers celebrate a hard-fought victory with a high five, bathed in the glow of neon lights. The energy is palpable, capturing the thrill of competition and the joy of success.
Prompt
Two-shot Two-shot: Excited, triumphant, celebratory ; Two gamers, celebrating a victory with a high-five; Two-shot; Gaming; A brightly lit gaming room with colorful lights; cinematic
Characteristic
Shot : Two young men wearing headphones high-five each other in a gaming setup with colorful LED lights. A video game screen is visible in the background, and one of the men is looking directly at the camera.
Aesthetic Score : 0.7
Mood : joyful, excited, celebratory
Quality
Entropy : 6.86
Noise : 106
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant errors are visible; the image is well-exposed and the colors are vibrant.
A Timeless Romance: Silhouettes at Sunset
Experience the serenity of a romantic evening as a couple stands silhouetted against a vibrant sunset on a peaceful beach. The crashing waves add to the tranquility, while the dramatic effect of the silhouettes creates an intimate and mysterious atmosphere.
Prompt
Two-shot Two-shot: Peaceful, romantic, contemplative ; Two travelers, gazing out at a breathtaking sunset over the ocean; Two-shot; Travel; A serene beach with golden sand; cinematic
Characteristic
Shot : A couple sits on a beach, watching the sunset over the ocean.
Aesthetic Score : 0.7
Mood : romantic, peaceful, serene
Quality
Entropy : 6.83
Noise : 101
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is a slight amount of noise in the image, particularly in the shadows, which detracts from the overall sharpness.
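The per-image numbers listed above can be summarized with a few lines of Python. This is only a sketch: the values are copied from the ten entries in the order they appear, and `summarize` is a hypothetical helper, not part of any tool used to produce this post. The averages it prints are plain descriptive statistics and are not the analysis scores reported in the conclusion, whose derivation is not described here.

```python
from statistics import mean

# Values copied from the ten image entries above, in the order they appear.
aesthetic_scores = [0.7, 0.8, 0.7, 0.7, 0.75, 0.7, 0.7, 0.8, 0.7, 0.7]
prompt_clip_scores = [0.23, 0.33, 0.27, 0.28, 0.31, 0.26, 0.25, 0.25, 0.32, 0.28]
ai_likelihoods = [0.80, 0.20, 0.20, 0.10, 0.20, 0.10, 0.80, 0.20, 0.10, 0.20]


def summarize(values):
    """Return min, mean (rounded to 3 places), and max of a list of scores.

    Hypothetical helper for illustration only.
    """
    return {
        "min": min(values),
        "mean": round(mean(values), 3),
        "max": max(values),
    }


summary = {
    "aesthetic": summarize(aesthetic_scores),
    "prompt_clip": summarize(prompt_clip_scores),
    "ai_likelihood": summarize(ai_likelihoods),
}

for name, stats in summary.items():
    print(f"{name}: {stats}")
```

The per-image Aesthetic Score stays in a narrow band (0.7 to 0.8), while the Likelihood of AI values vary much more (0.10 to 0.80).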
Conclusion
The results show that the generative AI model performed below average to average on camera position and shot analysis, but very well on aesthetic analysis. Here’s a breakdown:
- Camera Position Analysis: The score of 0.3 indicates that the model’s ability to respond to camera positions in the prompt is below average. A score between 0.5 and 0.75 would be considered good, and above 0.75 very good.
- Shot Analysis: The score of 0.52 indicates that the model’s ability to understand the scene described in a prompt is average: it sits at the very bottom of the 0.5 to 0.75 range considered good, well short of the 0.75 needed for very good.
- Aesthetic Analysis: The score of 0.03 indicates that the model is very good at producing images that match the expected aesthetic. This metric uses a different scale, on which a score between -0.2 and 0.1 is considered very good.
Overall, the model is better at capturing the desired aesthetic than at accurately interpreting camera positions and shot descriptions.
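The score bands quoted in the breakdown can be written down as a small Python sketch. The function names and the "below good" label are assumptions made for illustration; the post only defines the "good" (0.5 to 0.75) and "very good" (above 0.75) bands for camera position and shot analysis, and the "very good" range (-0.2 to 0.1) for aesthetic analysis.

```python
def band_for_prompt_score(score):
    """Band a camera-position or shot-analysis score.

    Thresholds are taken from the breakdown above; the label for
    scores under 0.5 is a hypothetical placeholder.
    """
    if score > 0.75:
        return "very good"
    if score >= 0.5:
        return "good"
    return "below good"


def aesthetic_is_very_good(score):
    """True when an aesthetic-analysis score lies in the -0.2 to 0.1 range."""
    return -0.2 <= score <= 0.1


# Scores reported in the conclusion.
print("camera position:", band_for_prompt_score(0.3))
print("shot:", band_for_prompt_score(0.52))
print("aesthetic very good:", aesthetic_is_very_good(0.03))
```

Read literally, the thresholds place the shot score of 0.52 just inside the "good" band, which is why it is best described as average rather than strong.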