AI Captures the Scene, But Struggles with the Shot with Stable-diffusion
- 9 minutes read - 1903 wordsTable of Contents
The world of AI image generation is rapidly evolving, with models capable of creating stunning visuals from simple text prompts. However, achieving a perfect balance between aesthetic appeal and technical accuracy remains a challenge. This blog post examines the results of an AI model tasked with generating images based on specific scene descriptions, highlighting its strengths and weaknesses in capturing the intended camera position, shot analysis, and overall aesthetic.
Created with: stability-ai-core
Silhouetted Serenity: A Moment of Tranquility on the Mountaintop
A lone figure stands at the edge of the world, bathed in the golden light of a setting sun. The vast landscape stretches out before them, offering a moment of peace and inspiration. The silhouette against the fiery sky creates a sense of drama and isolation, reminding us of the beauty and power of nature.
Prompt
poses high-angle: epic, triumphant ; A lone figure standing on a mountain peak, silhouetted against the setting sun; wide shot; heroism; vast, rugged mountain range; cinematic
Characteristic
Shot : A lone figure stands on a mountaintop, silhouetted against a stunning sunset over a vast expanse of mountains and valleys. The scene is filled with drama and a sense of solitude.
Aesthetic Score : 0.8
Mood : dramatic, contemplative, peaceful
Quality
Entropy : 6.26
Noise : 63
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no noticeable artifacts or errors in the image. The image appears to be well-exposed and sharp.
Sunbeams Illuminate Adventurous Hikers in a Mysterious Jungle
A group of hikers venture through a dense jungle, bathed in a single, ethereal sunbeam. The scene evokes a sense of mystery, adventure, and serenity, with the dramatic lighting highlighting the hikers and creating a sense of depth.
Prompt
poses high-angle: adventurous, suspenseful ; A group of explorers navigating a dense jungle, their path illuminated by the sun filtering through the canopy; medium shot; adventure; lush, green jungle; cinematic
Characteristic
Shot : A group of hikers walks along a path through a lush, tropical forest. The sun shines through the trees, creating a dappled light effect. The scene is calm and serene, with a sense of adventure.
Aesthetic Score : 0.75
Mood : serene, adventurous, mysterious
Quality
Entropy : 6.56
Noise : 99
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image has some noise and grain, particularly in the shadows. The hikers appear slightly blurry, particularly in their hands and faces.
Lost in the Neon Glow: A Gamer’s Intense Focus
A low-angle shot captures a gamer immersed in a futuristic video game, their hands gripping the controller as the city lights and screen glow illuminate the scene. The image exudes an intense, focused energy, highlighting the dramatic immersion of the gaming experience.
Prompt
poses high-angle: intense, focused ; A gamer’s hands manipulating a controller, the screen displaying a vibrant, futuristic cityscape; close-up; gaming; a dimly lit room with gaming peripherals; cinematic
Characteristic
Shot : A person is playing a video game with a controller in their hands, the scene is set in a dark room with a computer monitor displaying a futuristic city skyline in the background.
Aesthetic Score : 0.6
Mood : intense, futuristic, focused
Quality
Entropy : 6.31
Noise : 73
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no noticeable errors in the image.
A Tranquil European Square: Where History Meets Bustling Life
Capture the essence of a vibrant European city with this wide shot of a grand square. A majestic church, a charming fountain, and a sea of people create a scene brimming with history and life. The perspective from above emphasizes the architectural details and the bustling activity below, while the strong sunlight adds depth and vibrancy to the scene.
Prompt
poses high-angle: lively, energetic ; A bustling city square filled with tourists, capturing the iconic landmarks and vibrant street life; wide shot; tourism; a vibrant, bustling city with historical architecture; cinematic
Characteristic
Shot : A high-angle view of a large, European-style city square with a church, a fountain, and many people walking around. The sky is blue with some white clouds.
Aesthetic Score : 0.8
Mood : tranquil, bustling, historic
Quality
Entropy : 6.82
Noise : 87
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible artifacts or errors in the image.
Solitude in the Desert Sunrise
A lone figure contemplates the vastness of the desert at sunrise, silhouetted against the hazy mountains and the bright, distant sun. The scene evokes a sense of peace, tranquility, and isolation.
Prompt
poses high-angle: reflective, contemplative ; A lone traveler gazing out at a vast desert landscape, the setting sun casting long shadows; medium shot; travel; a vast, desolate desert with sand dunes; cinematic
Characteristic
Shot : A lone figure sits on a sand dune, looking out at a vast desert landscape, with the sun setting in the background.
Aesthetic Score : 0.7
Mood : serene, vast, contemplative
Quality
Entropy : 6.73
Noise : 72
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable errors.
Campfire Tales Under a Starry Sky
A group of friends gather around a crackling campfire, their faces illuminated by the warm glow. The serene forest and vast, star-filled sky create a cozy and adventurous atmosphere, perfect for sharing stories and forging memories.
Prompt
poses high-angle: warm, intimate ; A group of friends gathered around a campfire, sharing stories and laughter under a starry night sky; medium shot; groups; a serene campsite with a campfire and a starry sky; cinematic
Characteristic
Shot : A group of friends are gathered around a campfire in a forest under a starry night sky, with a tent in the background.
Aesthetic Score : 0.75
Mood : cozy, peaceful, adventurous
Quality
Entropy : 6.09
Noise : 80
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : The stars in the sky are slightly pixelated, likely due to over-sharpening or compression artifacts.
Superman Soars Above the City at Sunset
A heroic and epic scene of Superman flying over a cityscape, likely New York City, with the sun setting in the background. The dramatic effect is achieved through the use of light, shadow, and Superman’s powerful flying pose, making him appear both small and mighty against the vast cityscape.
Prompt
poses high-angle: powerful, awe-inspiring ; A superhero soaring through the air, the city sprawling beneath them; wide shot; heroism; a sprawling cityscape with towering buildings; cinematic
Characteristic
Shot : A superhero, possibly Superman, is flying over a city skyline. The city is likely New York City, based on the iconic skyline. The sun is setting and casting a warm glow over the cityscape.
Aesthetic Score : 0.6
Mood : heroic, powerful, dramatic
Quality
Entropy : 6.82
Noise : 97
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.80
Image errors : The city skyline is a bit blurry, and the buildings lack detail. The superhero’s costume looks a bit plastic and unrealistic.
Daredevils Conquer the Cliffside: A Breathtaking View Awaits
Witness the thrill of rock climbing as four adventurers scale a towering cliff face, overlooking a lush valley and winding river. The climbers’ precarious position and the vastness of the landscape create a sense of awe and danger, making this a truly breathtaking scene.
Prompt
poses high-angle: thrilling, dangerous ; A group of adventurers rappelling down a steep cliff face, their ropes dangling against the rock; medium shot; adventure; a dramatic cliff face with a breathtaking view; cinematic
Characteristic
Shot : Four climbers are ascending a cliff face with a view of a river canyon in the background. The climbers are wearing safety gear and appear to be experienced. The scene is visually impressive and captures the beauty of nature and the thrill of rock climbing.
Aesthetic Score : 0.7
Mood : adventurous, daring, awe-inspiring
Quality
Entropy : 6.91
Noise : 102
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : Minor image artifacts are visible on the cliff face and in the background. There is a slight chromatic aberration along the edges of the image.
Lost in the Game: A Moment of Intense Focus
A young man, headphones on, sits in a dimly lit room, his gaze fixed on the computer screen. The image captures the intensity of his focus as he engages in a game, creating a sense of immersion and dedication.
Prompt
poses high-angle: immersive, captivating ; A gamer’s face illuminated by the screen, their eyes focused on the intense action unfolding in the virtual world; close-up; gaming; a dimly lit room with a gaming setup; cinematic
Characteristic
Shot : A young man wearing headphones is sitting at a desk in front of a computer screen, gaming. The scene is set in a dimly lit room, with a desk lamp providing some light.
Aesthetic Score : 0.6
Mood : focused, intense, concentrated
Quality
Entropy : 6.24
Noise : 68
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slight amount of grain, and some of the edges are slightly blurry.
Golden Hour Adventure on the Mountaintop
Five hikers stand on a majestic peak, bathed in the warm glow of the setting sun. The vast valley below, with its winding river and distant mountains, creates a breathtaking panorama. This serene and adventurous scene evokes a sense of hope and wonder, capturing the beauty of nature at its most dramatic.
Prompt
poses high-angle: inspiring, hopeful ; A group of travelers standing on a mountaintop, their faces lit by the sunrise, gazing out at the breathtaking panorama; medium shot; travel; a majestic mountain range with a panoramic view; cinematic
Characteristic
Shot : A group of five hikers standing on a mountain top looking at a sunrise over a valley with snow capped mountains in the distance.
Aesthetic Score : 0.8
Mood : serene, adventurous, inspiring
Quality
Entropy : 6.75
Noise : 72
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors in the image.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.3, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.53, which is considered good. This indicates that the model was able to understand the scene and create a shot that was relatively close to what was described in the prompt.
- Aesthetic Analysis: The model scored 0.28, which is considered very good. This means that the generated image’s aesthetic was very close to the expected aesthetic described in the prompt.
Overall, the model seems to be better at understanding the scene and creating a visually appealing image than accurately capturing the intended camera position.