AI's Artistic Eye: Capturing the Scene, But Missing the Shot with Midjourney

In visual storytelling, capturing the essence of a scene is paramount. That means getting not only the aesthetic elements right but also the position of the camera and the composition of the shot. We recently ran an experiment in which a generative AI model created images from detailed scene descriptions. The results revealed an intriguing dichotomy: the model excelled at capturing the desired aesthetic but struggled to translate camera positions and shot compositions accurately. This raises the question: can AI truly understand and replicate the nuances of visual storytelling, or are there limitations that need to be addressed?

Created with: Midjourney

Silhouetted in the City’s Glow: A Moment of Melancholy Contemplation

A lone figure stands on a rooftop, their silhouette stark against the vibrant cityscape. The futuristic metropolis, shrouded in a hazy glow, evokes a sense of serenity and isolation. This image captures a moment of quiet contemplation, a feeling of being both connected to and detached from the bustling world below.

Prompt

high-key-lighting High-key lighting with strong backlighting from the city lights: Hopeful, introspective, slightly melancholic ; A lone figure standing on a rooftop overlooking a bustling city; medium-shot; Single Person; Neon-lit cityscape; cinematic

Characteristic

Shot : A lone figure stands on a rooftop overlooking a sprawling cityscape at night. The city is bathed in warm, orange light from streetlights and buildings. The air is thick with fog, adding to the mysterious and atmospheric quality of the scene.

Aesthetic Score : 0.7

Mood : melancholy, futuristic, contemplative

Quality

Entropy : 6.86

Noise : 96

Prompt Clip Score : 0.20

AI Evaluation

Likelihood of AI : 0.90

Image errors : The image is somewhat blurry, especially in the distance, and lacks detail in some areas. The figure is rather generic, lacking personality. The fog appears a bit artificial and overdone.
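
The Entropy value reported for each image is not defined in this article. One common reading, assumed here, is the Shannon entropy of the grayscale histogram, which for an 8-bit image ranges from 0 bits (a single flat tone) to 8 bits (all 256 levels used equally). The sketch below illustrates that definition only; the `shannon_entropy` helper is hypothetical and may not match how the scores in this post were produced.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a flat sequence of grayscale values.

    Assumption: `pixels` holds one intensity per pixel (0-255 for 8-bit images).
    """
    if not pixels:
        raise ValueError("pixels must not be empty")
    counts = Counter(pixels)   # histogram: intensity -> number of pixels
    total = len(pixels)
    # H = -sum(p * log2(p)) over every intensity that actually occurs
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


if __name__ == "__main__":
    flat = [128] * 100            # one tone everywhere
    two_tone = [0, 255] * 50      # half black, half white
    full_range = list(range(256)) # every level used exactly once
    for name, data in [("flat", flat), ("two_tone", two_tone), ("full_range", full_range)]:
        print(name, shannon_entropy(data))
```

Under this definition, an image dominated by a few tones scores low, while an image with a wide spread of tones scores closer to 8.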

Silhouette of Hope: A Man’s Torch Against the Sunset

A powerful silhouette of a man holding a flaming torch against a vibrant sunset sky. The dramatic lighting and composition evoke a sense of hope and strength, making this image both visually striking and emotionally resonant.

Prompt

high-key-lighting High-key lighting with strong rim lighting on the hero: Triumphant, inspiring, hopeful ; A superhero silhouetted against a bright sunrise, holding a burning torch aloft; medium-shot; Hero; Golden sky with clouds; cinematic

Characteristic

Shot : A silhouette of a man in a cape holding a flaming torch with a sunset in the background.

Aesthetic Score : 0.6

Mood : dramatic, heroic, hopeful

Quality

Entropy : 6.52

Noise : 88

Prompt Clip Score : 0.26

AI Evaluation

Likelihood of AI : 0.10

Image errors : The image is slightly blurry and the edges of the silhouette are not perfectly defined.

Laughter in the Sun: A Moment of Joy in the Park

A young woman basks in the warmth of a sunny day, her laughter echoing through the park. The scene is filled with a contagious joy, enhanced by the natural beauty of the surroundings. The large tree in the background and the distant figures add a sense of peace and tranquility to this heartwarming moment.

Prompt

high-key-lighting High-key lighting with soft, diffused sunlight: Joyful, carefree, lighthearted ; A young woman laughing with friends at a picnic in a sun-drenched park; medium-shot; Normal People; Lush green grass and trees; cinematic

Characteristic

Shot : A young woman with red hair is laughing in a park, surrounded by green grass and trees. The sun is shining and there are other people in the background, out of focus.

Aesthetic Score : 0.8

Mood : happy, carefree, warm

Quality

Entropy : 6.67

Noise : 98

Prompt Clip Score : 0.25

AI Evaluation

Likelihood of AI : 0.20

Image errors : The image has a slight blur in the background, but it is probably intentional to create a soft focus effect.

A Scientist’s Focus: Where Light and Shadow Unveil Mystery

A clinical, sterile laboratory setting comes alive with the focused work of a scientist. Intricate glassware and equipment are bathed in a play of light and shadow, creating a sense of depth and mystery. The aesthetic score of 0.6 suggests a visually compelling scene.

Prompt

high-key-lighting High-key lighting with strong overhead lights: Focused, determined, optimistic ; A scientist working intently in a brightly lit laboratory, surrounded by complex machinery; medium-shot; Single Person; White walls and gleaming equipment; cinematic

Characteristic

Shot : A laboratory with a scientist working at a bench with various scientific equipment

Aesthetic Score : 0.6

Mood : clinical, sterile, professional

Quality

Entropy : 6.87

Noise : 108

Prompt Clip Score : 0.20

AI Evaluation

Likelihood of AI : 0.20

Image errors : There are no visible artifacts or errors.

Joyful Playground Moments Captured in a Playful Composition

This vibrant photograph captures the essence of childhood joy, showcasing children playing on a colorful playground. The composition creates a sense of depth and scale, with the playground in the foreground and the sky in the background. Natural light and shadow enhance the visual appeal, creating a carefree and playful mood.

Prompt

high-key-lighting High-key lighting with bright, sunny skies: Playful, innocent, carefree ; A group of children playing in a brightly colored playground; wide-shot; Normal People; Colorful slides, swings, and climbing structures; cinematic

Characteristic

Shot : Children playing on a colorful playground with slides and climbing structures

Aesthetic Score : 0.6

Mood : playful, carefree, sunny

Quality

Entropy : 6.75

Noise : 99

Prompt Clip Score : 0.25

AI Evaluation

Likelihood of AI : 0.10

Image errors : The image appears slightly grainy and has a somewhat faded color palette, possibly due to age or processing.

Silhouetted in the Spotlight: A Figure of Mystery

A single figure stands bathed in light on a darkened stage, their silhouette slightly obscured, creating an air of intrigue. The dramatic use of light and shadow emphasizes their isolation, leaving the viewer to ponder their story.

Prompt

high-key-lighting High-key lighting with a single, intense spotlight: Dramatic, powerful, confident ; A lone figure standing on a stage, bathed in spotlight, about to deliver a speech; studio; Single Person; Dark stage with a single spotlight; cinematic

Characteristic

Shot : A single person silhouetted in a spotlight on a stage

Aesthetic Score : 0.7

Mood : dramatic, mysterious, hopeful

Quality

Entropy : 3.97

Noise : 65

Prompt Clip Score : 0.28

AI Evaluation

Likelihood of AI : 0.20

Image errors : The image has slight graininess and a slightly unnatural lighting effect.

Silhouettes of Joy: Friends Celebrate in a Dazzling Glow

Capture the spirit of celebration with this image, where a group of friends revel in a dimly lit room, bathed in the glow of balloons and confetti. The use of silhouettes and backlighting adds a touch of mystery and intrigue, making this a visually captivating scene.

Prompt

high-key-lighting High-key lighting with warm, inviting light: Joyful, celebratory, festive ; A group of friends celebrating a birthday party in a brightly decorated room; medium-shot; Normal People; Balloons, streamers, and festive decorations; cinematic

Characteristic

Shot : A group of friends are celebrating in a dimly lit room, confetti is falling down from the ceiling, there are balloons and drinks on a table, and the friends are silhouetted against the warm light

Aesthetic Score : 0.6

Mood : joyful, celebratory, intimate

Quality

Entropy : 6.14

Noise : 86

Prompt Clip Score : 0.27

AI Evaluation

Likelihood of AI : 0.10

Image errors : Some graininess in the image, especially in the darker areas. The lighting appears a bit uneven and creates some shadows.

A Solitary Figure Contemplates the Majesty of Nature

A lone figure stands on a mountain peak, silhouetted against a breathtaking sunset and a sea of clouds. The scene evokes a sense of serenity, contemplation, and the overwhelming vastness of nature.

Prompt

high-key-lighting High-key lighting with strong backlighting from the sun: Serene, contemplative, awe-inspiring ; A lone figure standing on a mountain peak, bathed in golden sunlight, with a breathtaking view below; medium-shot; Single Person; Majestic mountain range with clouds; cinematic

Characteristic

Shot : A lone figure stands on the peak of a mountain, overlooking a sea of clouds with a warm sunset in the background.

Aesthetic Score : 0.8

Mood : serene, contemplative, inspiring

Quality

Entropy : 6.68

Noise : 80

Prompt Clip Score : 0.25

AI Evaluation

Likelihood of AI : 0.80

Image errors : The clouds are somewhat repetitive and lacking in detail, and the figure’s silhouette is a bit blurry.

Blurry Lights and Energetic Dance: A Night of Vibrant Fun

Capture the energy and excitement of a night out with this image. A group of women dance under colorful, blurry lights, creating a playful and vibrant atmosphere.

Prompt

high-key-lighting High-key lighting with colorful spotlights: Energetic, expressive, joyful ; A group of dancers performing in a brightly lit studio, their movements fluid and graceful; medium-shot; Normal People; Mirrors and dance floor with colorful lighting; cinematic

Characteristic

Shot : A group of women dancing in a brightly lit room with colorful lights.

Aesthetic Score : 0.6

Mood : energetic, vibrant, fun

Quality

Entropy : 6.79

Noise : 109

Prompt Clip Score : 0.26

AI Evaluation

Likelihood of AI : 0.20

Image errors : There is some noticeable blur in the background. The lighting appears artificial and uneven, resulting in some blown-out highlights.

Silhouetted Hope in a Field of Gold

A young woman stands amidst a vibrant field of sunflowers, bathed in the golden glow of the setting sun. Her silhouette against the bright sky evokes a sense of mystery and hope, capturing a moment of peaceful serenity.

Prompt

high-key-lighting High-key lighting with soft, diffused sunlight: Peaceful, serene, hopeful ; A lone figure standing in a field of sunflowers, bathed in warm sunlight, with a gentle breeze blowing through their hair; medium-shot; Single Person; Field of sunflowers with a blue sky; cinematic

Characteristic

Shot : A young woman stands in a field of sunflowers, her back to the camera. The sun is setting, casting a warm glow over the scene.

Aesthetic Score : 0.7

Mood : peaceful, nostalgic, dreamy

Quality

Entropy : 6.40

Noise : 105

Prompt Clip Score : 0.22

AI Evaluation

Likelihood of AI : 0.20

Image errors : The image has a slightly grainy texture.

Conclusion

The results of the analysis show that the generative AI model performed well at capturing the desired aesthetic but struggled with camera position and shot composition.

Here’s a breakdown. Note that the three scores are not read on a single scale: for camera position and shot analysis a low score indicates a poor match, while for aesthetics a low score indicates a close match.

  • Camera Position: The model scored 0.1, indicating poor performance. There is a significant difference between the camera position requested in the prompt and the camera position in the generated image.
  • Shot Analysis: The model scored 0.34, indicating fair performance. The model captured some aspects of the intended shot composition, but noticeable discrepancies remained.
  • Aesthetic Analysis: The model scored 0.12, indicating very good performance. The generated image closely matched the expected aesthetic, suggesting the model captured the desired visual style.

Overall, the model seems to be better at understanding the aesthetic aspects of the prompt than the camera position and shot composition. This suggests that the model might need further training to improve its ability to accurately interpret and translate camera positions and shot descriptions into visual representations.
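
To put the per-image figures side by side, here is a minimal sketch that averages the metrics listed for the ten images above and counts the shot types requested in the prompts. The numbers are copied from the entries in this post; the short titles, the tuple layout, and the `summarize` helper are illustrative assumptions, not part of the tool that produced the scores.

```python
from collections import Counter
from statistics import mean

# One row per image, copied from the entries above:
# (short title, requested shot, aesthetic, entropy, noise, prompt CLIP score, likelihood of AI)
IMAGES = [
    ("City rooftop",         "medium-shot", 0.7, 6.86,  96, 0.20, 0.90),
    ("Torch at sunset",      "medium-shot", 0.6, 6.52,  88, 0.26, 0.10),
    ("Laughter in the park", "medium-shot", 0.8, 6.67,  98, 0.25, 0.20),
    ("Scientist",            "medium-shot", 0.6, 6.87, 108, 0.20, 0.20),
    ("Playground",           "wide-shot",   0.6, 6.75,  99, 0.25, 0.10),
    ("Spotlight",            "studio",      0.7, 3.97,  65, 0.28, 0.20),
    ("Birthday party",       "medium-shot", 0.6, 6.14,  86, 0.27, 0.10),
    ("Mountain peak",        "medium-shot", 0.8, 6.68,  80, 0.25, 0.80),
    ("Dancers",              "medium-shot", 0.6, 6.79, 109, 0.26, 0.20),
    ("Sunflower field",      "medium-shot", 0.7, 6.40, 105, 0.22, 0.20),
]


def summarize(rows):
    """Return the mean of each numeric metric and a count of requested shot types."""
    return {
        "aesthetic": mean(r[2] for r in rows),
        "entropy": mean(r[3] for r in rows),
        "noise": mean(r[4] for r in rows),
        "clip": mean(r[5] for r in rows),
        "ai_likelihood": mean(r[6] for r in rows),
        "shots": Counter(r[1] for r in rows),
    }


if __name__ == "__main__":
    for name, value in summarize(IMAGES).items():
        print(name, value)
```

Across the ten images the mean aesthetic score is 0.67 and the mean Prompt Clip Score is about 0.24. Eight of the ten prompts ask for a medium shot, so the sample covers few other framings.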
