AI's Artistic Eye: Capturing the Scene, But Missing the Shot with Midjourney
- 10 minutes read - 1920 wordsTable of Contents
In the realm of visual storytelling, capturing the essence of a scene is paramount. This involves not only the aesthetic elements but also the precise positioning of the camera and the composition of the shot. Recently, we conducted an experiment using a generative AI model to create images based on detailed scene descriptions. The results revealed an intriguing dichotomy: while the model excelled at capturing the desired aesthetic, it struggled with accurately translating camera positions and shot compositions. This begs the question: can AI truly understand and replicate the nuances of visual storytelling, or are there limitations that need to be addressed?
Created with: midjourney
Silhouetted in the City’s Glow: A Moment of Melancholy Contemplation
A lone figure stands on a rooftop, their silhouette stark against the vibrant cityscape. The futuristic metropolis, shrouded in a hazy glow, evokes a sense of serenity and isolation. This image captures a moment of quiet contemplation, a feeling of being both connected to and detached from the bustling world below.
Prompt
high-key-lighting High-key lighting with strong backlighting from the city lights: Hopeful, introspective, slightly melancholic ; A lone figure standing on a rooftop overlooking a bustling city; medium-shot; Single Person; Neon-lit cityscape; cinematic
Characteristic
Shot : A lone figure stands on a rooftop overlooking a sprawling cityscape at night. The city is bathed in warm, orange light from streetlights and buildings. The air is thick with fog, adding to the mysterious and atmospheric quality of the scene.
Aesthetic Score : 0.7
Mood : melancholy, futuristic, contemplative
Quality
Entropy : 6.86
Noise : 96
Prompt Clip Score : 0.20
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image is somewhat blurry, especially in the distance, and lacks detail in some areas. The figure is rather generic, lacking personality. The fog appears a bit artificial and overdone.
Silhouette of Hope: A Man’s Torch Against the Sunset
A powerful silhouette of a man holding a flaming torch against a vibrant sunset sky. The dramatic lighting and composition evoke a sense of hope and strength, making this image both visually striking and emotionally resonant.
Prompt
high-key-lighting High-key lighting with strong rim lighting on the hero: Triumphant, inspiring, hopeful ; A superhero silhouetted against a bright sunrise, holding a burning torch aloft; medium-shot; Hero; Golden sky with clouds; cinematic
Characteristic
Shot : A silhouette of a man in a cape holding a flaming torch with a sunset in the background.
Aesthetic Score : 0.6
Mood : dramatic, heroic, hopeful
Quality
Entropy : 6.52
Noise : 88
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry and the edges of the silhouette are not perfectly defined.
Laughter in the Sun: A Moment of Joy in the Park
A young woman basks in the warmth of a sunny day, her laughter echoing through the park. The scene is filled with a contagious joy, enhanced by the natural beauty of the surroundings. The large tree in the background and the distant figures add a sense of peace and tranquility to this heartwarming moment.
Prompt
high-key-lighting High-key lighting with soft, diffused sunlight: Joyful, carefree, lighthearted ; A young woman laughing with friends at a picnic in a sun-drenched park; medium-shot; Normal People; Lush green grass and trees; cinematic
Characteristic
Shot : A young woman with red hair is laughing in a park, surrounded by green grass and trees. The sun is shining and there are other people in the background, out of focus.
Aesthetic Score : 0.8
Mood : happy, carefree, warm
Quality
Entropy : 6.67
Noise : 98
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slight blur in the background, but it is probably intentional to create a soft focus effect.
A Scientist’s Focus: Where Light and Shadow Unveil Mystery
A clinical, sterile laboratory setting comes alive with the focused work of a scientist. Intricate glassware and equipment are bathed in a play of light and shadow, creating a sense of depth and mystery. The aesthetic score of 0.6 suggests a visually compelling scene.
Prompt
high-key-lighting High-key lighting with strong overhead lights: Focused, determined, optimistic ; A scientist working intently in a brightly lit laboratory, surrounded by complex machinery; medium-shot; Single Person; White walls and gleaming equipment; cinematic
Characteristic
Shot : A laboratory with a scientist working at a bench with various scientific equipment
Aesthetic Score : 0.6
Mood : clinical, sterile, professional
Quality
Entropy : 6.87
Noise : 108
Prompt Clip Score : 0.20
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors.
Joyful Playground Moments Captured in a Playful Composition
This vibrant photograph captures the essence of childhood joy, showcasing children playing on a colorful playground. The composition creates a sense of depth and scale, with the playground in the foreground and the sky in the background. Natural light and shadow enhance the visual appeal, creating a carefree and playful mood.
Prompt
high-key-lighting High-key lighting with bright, sunny skies: Playful, innocent, carefree ; A group of children playing in a brightly colored playground; wide-shot; Normal People; Colorful slides, swings, and climbing structures; cinematic
Characteristic
Shot : Children playing on a colorful playground with slides and climbing structures
Aesthetic Score : 0.6
Mood : playful, carefree, sunny
Quality
Entropy : 6.75
Noise : 99
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image appears slightly grainy and has a somewhat faded color palette, possibly due to age or processing.
Silhouetted in the Spotlight: A Figure of Mystery
A single figure stands bathed in light on a darkened stage, their silhouette slightly obscured, creating an air of intrigue. The dramatic use of light and shadow emphasizes their isolation, leaving the viewer to ponder their story.
Prompt
high-key-lighting High-key lighting with a single, intense spotlight: Dramatic, powerful, confident ; A lone figure standing on a stage, bathed in spotlight, about to deliver a speech; studio; Single Person; Dark stage with a single spotlight; cinematic
Characteristic
Shot : A single person silhouetted in a spotlight on a stage
Aesthetic Score : 0.7
Mood : dramatic, mysterious, hopeful
Quality
Entropy : 3.97
Noise : 65
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has slight graininess and a slightly unnatural lighting effect.
Silhouettes of Joy: Friends Celebrate in a Dazzling Glow
Capture the spirit of celebration with this image, where a group of friends revel in a dimly lit room, bathed in the glow of balloons and confetti. The use of silhouettes and backlighting adds a touch of mystery and intrigue, making this a visually captivating scene.
Prompt
high-key-lighting High-key lighting with warm, inviting light: Joyful, celebratory, festive ; A group of friends celebrating a birthday party in a brightly decorated room; medium-shot; Normal People; Balloons, streamers, and festive decorations; cinematic
Characteristic
Shot : A group of friends are celebrating in a dimly lit room, confetti is falling down from the ceiling, there are balloons and drinks on a table, and the friends are silhouetted against the warm light
Aesthetic Score : 0.6
Mood : joyful, celebratory, intimate
Quality
Entropy : 6.14
Noise : 86
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some graininess in the image, especially in the darker areas. The lighting appears a bit uneven and creates some shadows.
A Solitary Figure Contemplates the Majesty of Nature
A lone figure stands on a mountain peak, silhouetted against a breathtaking sunset and a sea of clouds. The scene evokes a sense of serenity, contemplation, and the overwhelming vastness of nature.
Prompt
high-key-lighting High-key lighting with strong backlighting from the sun: Serene, contemplative, awe-inspiring ; A lone figure standing on a mountain peak, bathed in golden sunlight, with a breathtaking view below; medium-shot; Single Person; Majestic mountain range with clouds; cinematic
Characteristic
Shot : A lone figure stands on the peak of a mountain, overlooking a sea of clouds with a warm sunset in the background.
Aesthetic Score : 0.8
Mood : serene, contemplative, inspiring
Quality
Entropy : 6.68
Noise : 80
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.80
Image errors : The clouds are somewhat repetitive and lacking in detail, and the figure’s silhouette is a bit blurry.
Blurry Lights and Energetic Dance: A Night of Vibrant Fun
Capture the energy and excitement of a night out with this image. A group of women dance under colorful, blurry lights, creating a playful and vibrant atmosphere.
Prompt
high-key-lighting High-key lighting with colorful spotlights: Energetic, expressive, joyful ; A group of dancers performing in a brightly lit studio, their movements fluid and graceful; medium-shot; Normal People; Mirrors and dance floor with colorful lighting; cinematic
Characteristic
Shot : A group of women dancing in a brightly lit room with colorful lights.
Aesthetic Score : 0.6
Mood : energetic, vibrant, fun
Quality
Entropy : 6.79
Noise : 109
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is some noticeable blur in the background. The lighting appears artificial and uneven, resulting in some blown-out highlights.
Silhouetted Hope in a Field of Gold
A young woman stands amidst a vibrant field of sunflowers, bathed in the golden glow of the setting sun. Her silhouette against the bright sky evokes a sense of mystery and hope, capturing a moment of peaceful serenity.
Prompt
high-key-lighting High-key lighting with soft, diffused sunlight: Peaceful, serene, hopeful ; A lone figure standing in a field of sunflowers, bathed in warm sunlight, with a gentle breeze blowing through their hair; medium-shot; Single Person; Field of sunflowers with a blue sky; cinematic
Characteristic
Shot : A young woman stands in a field of sunflowers, her back to the camera. The sun is setting, casting a warm glow over the scene.
Aesthetic Score : 0.7
Mood : peaceful, nostalgic, dreamy
Quality
Entropy : 6.40
Noise : 105
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slightly grainy texture.
Conclusion
The results of the analysis show that the generative AI model performed well in understanding the camera position and shot composition, but struggled with the aesthetic aspect.
Here’s a breakdown:
- Camera Position: The model scored 0.1, indicating a poor performance. This means there’s a significant difference between the intended camera position in the prompt and the actual camera position in the generated image.
- Shot Analysis: The model scored 0.34, indicating a fair performance. This suggests that the model was able to capture some aspects of the intended shot composition, but there were still noticeable discrepancies.
- Aesthetic Analysis: The model scored 0.12, indicating a very good performance. This means the generated image closely matched the expected aesthetic, suggesting the model was able to capture the desired visual style.
Overall, the model seems to be better at understanding the aesthetic aspects of the prompt than the camera position and shot composition. This suggests that the model might need further training to improve its ability to accurately interpret and translate camera positions and shot descriptions into visual representations.