AI's Artistic Journey: Capturing the Essence of Scenes, But Missing the Mark on Camera Angles with DALL-E 3
Generating images from textual descriptions is a rapidly evolving area of artificial intelligence. This blog post examines how well a generative AI model captures the essence of various scenes, focusing on its strengths and weaknesses in understanding camera positions and aesthetic styles. We’ll explore how the model excels at capturing the overall feel and visual elements of a scene, but struggles to reproduce the intended camera angles. Through a series of examples, we’ll look at the model’s capabilities and limitations, providing insights into the ongoing development of AI-powered image generation.
Created with: dall-e-3
A Solitary Figure Contemplates the Stormy Sea
A lone figure stands on a windswept cliff, gazing out at a turbulent sea. The dramatic scene evokes a sense of both awe and melancholy, as the figure seems to be contemplating the vastness and power of nature. The image is filled with dramatic elements, from the crashing waves to the towering cliffs, creating a sense of foreboding and adventure.
Prompt
poses rule-of-thirds: Epic, determined, hopeful ; A lone hero standing on a cliff overlooking a vast, stormy sea; Wide shot; Heroism; Dramatic sky with crashing waves; cinematic
Characteristic
Shot : A lone figure stands on a cliff edge overlooking a vast, turbulent sea. In the distance, towering cliffs with waterfalls cascade down into the swirling waters below. The sky is dramatic, with large, stormy clouds and a warm glow of the setting sun.
Aesthetic Score : 0.8
Mood : dramatic, epic, contemplative
Quality
Entropy : 6.89
Noise : 105
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.90
Image errors : Some parts of the image appear slightly blurry, particularly the water and the details on the cliff face.
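The post does not say how the Entropy figure under Quality is computed. One common reading, and only an assumption here, is the Shannon entropy of the image's 8-bit grayscale histogram, which ranges from 0 bits (a single flat tone) to 8 bits (all 256 levels equally likely). Under that reading, a value such as 6.89 would indicate a tonally rich image. A minimal sketch, using a hypothetical helper `shannon_entropy` and synthetic pixel data rather than any image from this post:

```python
import math
from collections import Counter

def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a flat sequence of 8-bit grayscale values."""
    counts = Counter(pixels)
    total = len(pixels)
    # Sum p * log2(1/p) over every gray level that actually occurs.
    return sum((n / total) * math.log2(total / n) for n in counts.values())

# Synthetic inputs for illustration only.
flat = [128] * 1024             # one gray level everywhere
uniform = list(range(256)) * 4  # every gray level equally often

print(shannon_entropy(flat))     # 0.0
print(shannon_entropy(uniform))  # 8.0
```

A real image would first need to be converted to grayscale and flattened into such a sequence; that step is left out because it depends on the imaging library used.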
Whispers in the Mist: A Gathering Under a Shadowed Sky
A chilling scene unfolds in a dark and misty forest. A group cloaked in shadows huddles around a flickering campfire, their faces obscured by the gloom. A wolf-like creature lurks nearby, its presence adding to the palpable sense of mystery and impending danger. This image evokes a mood of eerie suspense, leaving the viewer wondering what secrets lie hidden within the shadows.
Prompt
poses rule-of-thirds: Intriguing, mysterious, suspenseful ; A group of adventurers huddled around a campfire in a dense forest; Medium shot; Adventure; Shadows and flickering flames; cinematic
Characteristic
Shot : A group of people are gathered around a campfire in a dark and misty forest. There is a tent in the background and a mysterious figure in the distance. The scene is very atmospheric and mysterious.
Aesthetic Score : 0.7
Mood : dark, mysterious, suspenseful
Quality
Entropy : 6.23
Noise : 87
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some minor artifacts, such as slight blurriness around the edges of the figures. The lighting is also a bit uneven, with some areas being overexposed.
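The prompts in this post all follow the same semicolon-separated layout: a composition hint with a list of moods, then the subject, the requested shot type, a category, scene details, and a style keyword. Splitting a prompt into those parts makes it easier to compare the requested shot type with what the model actually produced. The sketch below is one way to do that; the field names and the `parse_prompt` helper are hypothetical labels chosen for illustration and are not defined anywhere in the post.

```python
from dataclasses import dataclass

@dataclass
class ParsedPrompt:
    composition: str  # e.g. "poses rule-of-thirds"
    moods: list       # e.g. ["Intriguing", "mysterious", "suspenseful"]
    subject: str
    shot: str         # e.g. "Medium shot"
    category: str
    details: str
    style: str

def parse_prompt(prompt: str) -> ParsedPrompt:
    """Split a prompt of the form 'hint: moods ; subject; shot; category; details; style'."""
    parts = [p.strip() for p in prompt.split(";")]
    if len(parts) != 6:
        raise ValueError(f"expected 6 ';'-separated fields, got {len(parts)}")
    # The first field carries both the composition hint and the mood list.
    composition, _, mood_text = parts[0].partition(":")
    moods = [m.strip() for m in mood_text.split(",") if m.strip()]
    return ParsedPrompt(composition.strip(), moods, *parts[1:])

campfire = parse_prompt(
    "poses rule-of-thirds: Intriguing, mysterious, suspenseful ; "
    "A group of adventurers huddled around a campfire in a dense forest; "
    "Medium shot; Adventure; Shadows and flickering flames; cinematic"
)
print(campfire.shot)   # Medium shot
print(campfire.moods)  # ['Intriguing', 'mysterious', 'suspenseful']
```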
The Sword is Drawn: A Moment of Intense Focus in the Game
A player is locked in a moment of intense focus, their character poised with a sword, ready for action. The scene is charged with anticipation, promising a thrilling encounter in the game.
Prompt
poses rule-of-thirds: Focused, intense, exhilarating ; A gamer’s hands intensely gripping a controller, the screen displaying a thrilling moment in a video game; Close-up; Gaming; Blurred background of the game’s visuals; cinematic
Characteristic
Shot : A person plays a video game with a controller; a character from the game is visible on the screen in the background.
Aesthetic Score : 0.6
Mood : intense, focused, immersive
Quality
Entropy : 6.62
Noise : 72
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image has some minor artifacts around the edges of the controller.
Solitude and Majesty: A Hiker Finds Peace in the Mountain’s Embrace
A lone hiker stands on a rocky outcropping, dwarfed by the majestic peaks reflected in the serene mountain lake. The sun bathes the scene in a golden glow, creating a moment of tranquility and awe-inspiring beauty.
Prompt
poses rule-of-thirds: Tranquil, awe-inspiring, peaceful ; A majestic mountain range reflected in a still lake, with a lone hiker standing on a rocky outcrop; Wide shot; Tourism; Clear blue sky and vibrant green foliage; cinematic
Characteristic
Shot : A lone hiker stands on a rocky outcropping overlooking a serene alpine lake with majestic mountain peaks in the background. The water reflects the surrounding scenery, creating a mirror-like effect. The sky is a vibrant blue, and the sun casts a warm glow over the landscape.
Aesthetic Score : 0.8
Mood : peaceful, serene, awe-inspiring
Quality
Entropy : 6.60
Noise : 108
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no noticeable artifacts or errors in the image.
A Journey Through Time: Vintage Train Carries Nostalgia Across the Landscape
Step back in time aboard a vintage train, where the rhythmic clatter of wheels and the gentle sway of the carriage create a sense of peace. Gaze out the window at the breathtaking scenery: a vast field stretching towards a majestic mountain range, a timeless vista that evokes a sense of wonder and nostalgia. The juxtaposition of the train and the landscape creates a dramatic effect, highlighting the grandeur of nature and the enduring power of travel.
Prompt
poses rule-of-thirds: Nostalgic, romantic, adventurous ; A vintage train speeding through a picturesque countryside, with a lone traveler gazing out the window; Medium shot; Travel; Rolling hills and vibrant fields; cinematic
Characteristic
Shot : A view from inside a train, looking out the window. A long train stretches out in front of the viewer, traveling through a valley of lush green fields and misty mountains, with a single person standing at the back of the last car, looking ahead.
Aesthetic Score : 0.8
Mood : nostalgic, adventurous, peaceful
Quality
Entropy : 6.65
Noise : 105
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some minor artifacts, particularly around the edges of the window and the train, and the train is not perfectly aligned with the tracks.
Friends Gather for a Festive Feast Under String Lights
A group of friends share a joyful meal at a vibrant outdoor market, bathed in warm string lights and surrounded by colorful flags. The intimate framing and warm lighting create a celebratory and engaging atmosphere.
Prompt
poses rule-of-thirds: Joyful, lively, celebratory ; A group of friends laughing and enjoying a meal together at a bustling outdoor market; Medium shot; Groups; Colorful stalls and vibrant street life; cinematic
Characteristic
Shot : A group of friends are enjoying a meal at an outdoor market, with the sun setting in the background.
Aesthetic Score : 0.7
Mood : happy, joyful, vibrant
Quality
Entropy : 6.82
Noise : 109
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some slight artifacts, especially in the background.
Silhouette of Hope: A Woman Walks Towards the Sunset
A solitary figure walks along a sandy beach, their silhouette framed against the vibrant hues of a setting sun. The scene evokes a sense of tranquility and hope, inviting viewers to contemplate the beauty of solitude and the promise of a new beginning.
Prompt
poses rule-of-thirds: Melancholy, reflective, hopeful ; A lone figure standing on a deserted beach, watching the sun setting over the horizon; Wide shot; Heroism; Golden light illuminating the sky and water; cinematic
Characteristic
Shot : A woman walks towards the setting sun on a beach, with islands in the distance
Aesthetic Score : 0.8
Mood : serene, peaceful, contemplative
Quality
Entropy : 6.72
Noise : 102
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors
Sunlight Pierces the Jungle, Illuminating a Tense Adventure
A group of adventurers navigate a dense jungle, their path illuminated by a dramatic beam of sunlight. The scene is charged with intensity, mystery, and a sense of urgency, hinting at the dangers that lie ahead.
Prompt
poses rule-of-thirds: Intriguing, suspenseful, adventurous ; A group of explorers navigating a treacherous jungle path, with dense foliage surrounding them; Medium shot; Adventure; Lush greenery and dappled sunlight; cinematic
Characteristic
Shot : A group of four adventurers, two men and two women, are walking through a dense jungle. The light from the sun is shining through the trees, and there is a sense of mystery and danger in the air. The figures are blurred, but their faces are clearly visible.
Aesthetic Score : 0.7
Mood : mysterious, adventurous, tense
Quality
Entropy : 6.61
Noise : 116
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some minor artifacts, including a slight blur around the edges. The lighting is slightly unnatural and has a strong cinematic feel.
Lost in the Game: A Moment of Intense Focus
A young man is completely engrossed in a video game, his face illuminated by the screen’s glow. The blurry background and dramatic lighting create a sense of suspense and intensity, drawing the viewer into the player’s world.
Prompt
poses rule-of-thirds: Focused, intense, determined ; A close-up of a gamer’s face, eyes glued to the screen, as they navigate a challenging level in a video game; Close-up; Gaming; Blurred background of the game’s visuals; cinematic
Characteristic
Shot : A young man is playing video games. He is intensely focused on the game, with his eyes locked on the screen. The blurry background suggests that the game is a first-person shooter. He is holding a controller in his hands, and his expression is determined. The lighting is dramatic, with shadows playing across his face. The overall mood is one of intense concentration and excitement.
Aesthetic Score : 0.7
Mood : intense, focused, dramatic
Quality
Entropy : 6.36
Noise : 86
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible image errors
Lost in the City of Lights
A solitary figure stands on a rooftop, gazing out at a sprawling cityscape bathed in the glow of countless lights. The night sky, alive with stars, reflects the sense of wonder and isolation in this futuristic scene.
Prompt
poses rule-of-thirds: Energetic, exciting, awe-inspiring ; A panoramic view of a bustling city skyline, with a lone tourist standing on a rooftop overlooking the scene; Wide shot; Tourism; Vibrant lights and towering buildings; cinematic
Characteristic
Shot : A lone figure stands on a rooftop overlooking a sprawling city at night, with tall skyscrapers illuminated by bright lights and a sense of futuristic urbanism.
Aesthetic Score : 0.7
Mood : futuristic, urban, solitary
Quality
Entropy : 6.69
Noise : 132
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some minor artifacts, particularly around the edges of the buildings. There are also some areas of blur that are likely due to the image processing.
Conclusion
The results show that the generative AI model performed well in understanding the scene and matching the aesthetic style, but struggled with camera position. Here’s a breakdown:
- Camera Position: The model scored 0.25, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.52, which is considered good. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
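- Aesthetic Analysis: The model scored 0.01, which is considered very good. Unlike the two scores above, lower appears to be better here: the value reads as a distance from the expected aesthetic style, so a result near zero means the generated image closely matched it.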
Overall, the model demonstrated a good understanding of the scene and its aesthetic, but struggled with accurately capturing the intended camera position.
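The post does not state how the summary scores were derived from the per-image values. As a rough cross-check, the Prompt Clip Score values listed for the ten images can be averaged directly, both overall and per requested shot type (taken from each prompt). The grouping and the names `records` and `mean_by_shot` are illustrative choices, not the method behind the figures above.

```python
from collections import defaultdict
from statistics import mean

# (requested shot type, Prompt Clip Score) for each image, in the order shown in this post.
records = [
    ("Wide shot", 0.25), ("Medium shot", 0.27), ("Close-up", 0.25),
    ("Wide shot", 0.28), ("Medium shot", 0.24), ("Medium shot", 0.23),
    ("Wide shot", 0.22), ("Medium shot", 0.21), ("Close-up", 0.27),
    ("Wide shot", 0.22),
]

def mean_by_shot(rows):
    """Return the mean score for each shot type."""
    groups = defaultdict(list)
    for shot, score in rows:
        groups[shot].append(score)
    return {shot: mean(scores) for shot, scores in groups.items()}

overall = mean(score for _, score in records)
print(f"overall: {overall:.2f}")
for shot, avg in mean_by_shot(records).items():
    print(f"{shot}: {avg:.2f} (n={sum(1 for s, _ in records if s == shot)})")
```

On these figures the overall mean is about 0.24, with close-ups at 0.26 and wide and medium shots both near 0.24. Whether that relates to the 0.25 reported for camera position is not stated, and with only ten images the per-type differences are too small to support firm conclusions.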
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://openai.com/index/dall-e-3/