AI's Artistic Vision: Capturing the Mood, Missing the Mark with Stable Diffusion
AI image generation has become a captivating field: models can now create striking visuals that often mimic the styles of human artists. Replicating the full human artistic experience, however, remains an open problem. This post looks at both the strengths and the limitations of AI-generated imagery through a case study of ten images, in which the model captures the desired aesthetic well but struggles with the technical aspects of camera positioning and shot analysis. The results shed light on the current state of AI image generation and on where further development is needed.
Created with: stability-ai-core
Silhouetted Against the Storm
A solitary figure stands in a darkened room, their silhouette stark against the flashes of lightning illuminating a stormy cityscape. The scene evokes a sense of melancholy and isolation, with the dramatic interplay of light and shadow creating a powerful and atmospheric image.
Prompt
lightning practical-lighting: Melancholy, isolation ; A lone figure, silhouetted against a window; medium-shot; Single Person; A dimly lit room with only the glow of a streetlamp outside; cinematic
Characteristic
Shot : A man stands silhouetted in a dark room, looking out a window at a stormy night with lightning
Aesthetic Score : 0.7
Mood : dark, dramatic, lonely
Quality
Entropy : 5.50
Noise : 67
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors
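The Entropy value in the Quality block is not defined in this post. One common reading is the Shannon entropy of the image's grayscale histogram, which runs from 0 bits (a flat, single-tone image) to 8 bits (all 256 levels used equally). The sketch below assumes that definition; `shannon_entropy` is a hypothetical helper for illustration, not the tool that produced the numbers listed here.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of an iterable of grayscale pixel values.

    Assumption: the post's "Entropy" metric is a histogram entropy like this
    one. The post itself does not say how the value was computed.
    """
    counts = Counter(pixels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("cannot compute the entropy of an empty image")
    # H = sum over grey levels of p * log2(1 / p)
    return sum((c / total) * math.log2(total / c) for c in counts.values())


# Toy inputs (not taken from the generated images):
flat = [128] * 1000            # a single tone carries no information
two_tone = [0, 255] * 500      # two equally likely tones
full_range = list(range(256))  # every 8-bit level exactly once

print(shannon_entropy(flat), shannon_entropy(two_tone), shannon_entropy(full_range))
```

Under this reading, a mostly dark silhouette scene uses few grey levels and would score lower than a busy, evenly lit one, which would fit the low values reported for the darkest scenes in this set.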
Heroic Silhouette: A Superhero Stands Guard at Sunset
A powerful superhero, clad in blue and gold, stands on a rooftop overlooking a sprawling cityscape at sunset. His red cape billows dramatically behind him as he gazes out over the city, embodying heroism and strength. The dramatic lighting and his commanding pose create a sense of awe and power.
Prompt
lightning practical-lighting: Hopeful, triumphant ; A superhero standing on a rooftop, bathed in the warm light of a setting sun; medium-shot; Hero; A cityscape with skyscrapers in the background; cinematic
Characteristic
Shot : A superhero stands on a rooftop overlooking a cityscape at sunset. The hero is wearing a blue and gold costume with a red cape.
Aesthetic Score : 0.7
Mood : heroic, dramatic, powerful
Quality
Entropy : 6.88
Noise : 69
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.30
Image errors : No noticeable artifacts or errors
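The Prompt Clip Score is likewise left undefined in this post. A CLIP-style score is usually the cosine similarity between an embedding of the prompt text and an embedding of the generated image, so the sketch below assumes that reading. The vectors are toy placeholders, not real CLIP embeddings, and `cosine_similarity` is a hypothetical helper.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Placeholder embeddings; a real pipeline would get these from a CLIP model.
text_embedding = [1.0, 1.0, 0.0]
image_embedding = [1.0, 0.0, 0.0]

print(round(cosine_similarity(text_embedding, image_embedding), 4))
```

If the scores here are raw cosine similarities, values in the 0.2 to 0.3 range are typical for a reasonably matched prompt and image, so the figures in this post should not be read as percentages.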
Intimate Dinner Under the Chandelier
A group of friends gather around a table in a warm, well-lit kitchen. The soft glow of the chandelier and candlelight creates a cozy atmosphere, highlighting the intimate connections between them.
Prompt
lightning practical-lighting: Intimate, heartwarming ; A family gathered around a dinner table, illuminated by the soft glow of a chandelier; studio; Normal People; A cozy kitchen with warm, inviting colors; cinematic
Characteristic
Shot : A family dinner gathering in a well-lit dining room. The scene is warm and inviting with a large chandelier hanging overhead, candles on the table, and a beautiful flower arrangement.
Aesthetic Score : 0.7
Mood : warm, cozy, intimate
Quality
Entropy : 6.71
Noise : 70
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : Slight blurriness in the background due to low light, particularly on the faces of the people in the back.
Stormy Night, Heavy Decisions: A Man Grapples with Pressure
A dimly lit office, a man in a suit hunched over paperwork, and a raging storm outside. The scene is charged with tension, hinting at a critical moment where decisions are made under immense pressure.
Prompt
lightning practical-lighting: Intense, suspenseful ; A detective hunched over a desk, illuminated by a single desk lamp; medium-shot; Single Person; A cluttered office with stacks of papers and a flickering neon sign outside the window; cinematic
Characteristic
Shot : A man in a suit sits at a desk in a dimly lit office at night. Papers are scattered across the desk and a lightning storm rages outside the window.
Aesthetic Score : 0.7
Mood : dark, intense, suspenseful
Quality
Entropy : 6.17
Noise : 62
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no significant errors in the image.
A Knight’s Vigil in the Misty Forest
A lone knight, bathed in the ethereal glow of a campfire, stands guard in a misty forest. His sword lies at his feet, a silent testament to the dangers that lurk in the shadows. The scene is both dramatic and mysterious, hinting at a story waiting to unfold.
Prompt
lightning practical-lighting: Mysterious, adventurous ; A warrior standing in a dark forest, illuminated by the flickering flames of a campfire; medium-shot; Hero; A dense forest with towering trees and shadows; cinematic
Characteristic
Shot : A lone knight stands in a dark, misty forest, holding a sword. There is a fire in the background.
Aesthetic Score : 0.7
Mood : mysterious, dramatic, epic
Quality
Entropy : 6.60
Noise : 72
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : The lighting is a bit flat, and the contrast is a bit low.
Lost in Thought: A Moment of Solitude in the Night
A young woman finds solace in the quiet of a moonlit park, her contemplative expression and the soft glow of streetlights painting a picture of introspection and quiet reflection.
Prompt
lightning practical-lighting: Peaceful, contemplative ; A young woman sitting on a park bench, bathed in the soft light of a streetlamp; medium-shot; Normal People; A quiet park with trees and benches; cinematic
Characteristic
Shot : A young woman sits on a bench in a park at night, surrounded by trees and streetlights.
Aesthetic Score : 0.7
Mood : melancholy, contemplative, introspective
Quality
Entropy : 6.53
Noise : 68
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors
Unveiling the Secrets: A Scientist’s Focused Pursuit
A scientist, bathed in the cool light of a laboratory, leans intently over a microscope, their white coat a stark contrast to the shadowed equipment surrounding them. The scene exudes a sense of seriousness and focus, hinting at the critical nature of their work.
Prompt
lightning practical-lighting: Focused, analytical ; A scientist working in a laboratory, illuminated by the bright light of a microscope; medium-shot; Single Person; A sterile laboratory with rows of equipment; cinematic
Characteristic
Shot : A scientist is working in a laboratory with a microscope. The scientist is wearing a white lab coat and is looking intently at the microscope.
Aesthetic Score : 0.6
Mood : serious, focused, scientific
Quality
Entropy : 6.85
Noise : 56
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some minor noise and compression artifacts.
Bonfire Bliss: Friends Gather Under a Starry Sky
A group of friends share laughter and warmth around a crackling bonfire on a serene beach. The fire illuminates their faces, creating a joyful and relaxed atmosphere under a dark, cloudy sky.
Prompt
lightning practical-lighting: Joyful, celebratory ; A group of friends gathered around a bonfire, their faces illuminated by the dancing flames; studio; Normal People; A beach with sand and waves; cinematic
Characteristic
Shot : A group of friends gathered around a bonfire on a beach at dusk.
Aesthetic Score : 0.7
Mood : joyful, warm, social
Quality
Entropy : 6.53
Noise : 74
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors
Lost in the Shadows: A Lone Figure Walks a Mysterious Alley
A single street lamp casts an eerie glow on a dark and narrow alleyway, revealing a lone figure walking through the shadows. Graffiti covers the brick walls, adding to the sense of mystery and intrigue. The scene evokes a feeling of isolation and loneliness, leaving the viewer wondering what secrets lie hidden in the darkness.
Prompt
lightning practical-lighting: Eerie, suspenseful ; A lone figure walking down a dark alleyway, illuminated by the flickering light of a streetlamp; medium-shot; Single Person; A dark alleyway with graffiti and shadows; cinematic
Characteristic
Shot : A lone figure walks down a dark alleyway, illuminated by a single streetlight at the end. The walls are brick, with some graffiti on them.
Aesthetic Score : 0.7
Mood : mysterious, eerie, suspenseful
Quality
Entropy : 5.22
Noise : 81
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slight bit of noise and graininess. The light from the street lamp creates a bit of haloing around the figure’s head.
Superhero Silhouettes Against a Stormy Cityscape
A powerful superhero stands poised on a rooftop, silhouetted against a dramatic cityscape. Lightning strikes in the distance, creating a sense of impending action and danger. The mood is dark, dramatic, and powerful, hinting at a thrilling story unfolding.
Prompt
lightning practical-lighting: Powerful, dramatic ; A superhero standing on a rooftop, silhouetted against the bright lights of a city skyline; medium-shot; Hero; A cityscape with skyscrapers and neon signs; cinematic
Characteristic
Shot : A superhero standing on a rooftop, overlooking a city skyline at night with lightning in the background.
Aesthetic Score : 0.6
Mood : dark, dramatic, powerful
Quality
Entropy : 6.78
Noise : 68
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some minor artifacts, particularly in the lighting and the background.
Conclusion
The results show that the generative AI model performed well on aesthetic analysis but struggled with camera position and shot analysis.
Here’s a breakdown:
- Camera Position: The model scored 0.15, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.43, which is considered below average. This indicates that the model didn’t fully understand the scene described in the prompt and didn’t create an image that accurately reflected the intended shot.
- Aesthetic Analysis: The model scored 0.12, which is considered very good. This means that the generated image closely matched the expected aesthetic style, despite the issues with camera position and shot analysis.
Overall, the model seems to be struggling with understanding the spatial aspects of the prompt, but it’s able to generate images with the desired aesthetic.
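The per-image numbers listed for the ten images can also be averaged for a quick overview of the set. The sketch below only aggregates values copied from this post; it is not the evaluation that produced the camera position, shot analysis, and aesthetic analysis scores in the conclusion, which is not described here. The shortened titles and the `summarize` helper are illustrative.

```python
from statistics import mean

# (title, aesthetic, entropy, noise, prompt_clip, ai_likelihood), copied from the post
RESULTS = [
    ("Silhouetted Against the Storm", 0.7, 5.50, 67, 0.28, 0.20),
    ("Heroic Silhouette", 0.7, 6.88, 69, 0.23, 0.30),
    ("Intimate Dinner Under the Chandelier", 0.7, 6.71, 70, 0.27, 0.20),
    ("Stormy Night, Heavy Decisions", 0.7, 6.17, 62, 0.33, 0.10),
    ("A Knight's Vigil in the Misty Forest", 0.7, 6.60, 72, 0.25, 0.20),
    ("Lost in Thought", 0.7, 6.53, 68, 0.28, 0.20),
    ("Unveiling the Secrets", 0.6, 6.85, 56, 0.22, 0.10),
    ("Bonfire Bliss", 0.7, 6.53, 74, 0.29, 0.10),
    ("Lost in the Shadows", 0.7, 5.22, 81, 0.23, 0.20),
    ("Superhero Silhouettes", 0.6, 6.78, 68, 0.29, 0.90),
]

FIELDS = ("aesthetic", "entropy", "noise", "prompt_clip", "ai_likelihood")


def summarize(rows):
    """Return the mean of each metric column across all rows."""
    return {name: mean(row[i + 1] for row in rows) for i, name in enumerate(FIELDS)}


summary = summarize(RESULTS)
best_match = max(RESULTS, key=lambda row: row[4])   # highest Prompt Clip Score
worst_match = min(RESULTS, key=lambda row: row[4])  # lowest Prompt Clip Score

for name, value in summary.items():
    print(f"mean {name}: {value:.3f}")
print("closest to its prompt:", best_match[0])
print("furthest from its prompt:", worst_match[0])
```

Averages like these hide outliers, such as the single image with an AI likelihood of 0.90, so they are best read alongside the individual entries.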