AI's Artistic Struggle: Capturing the Essence of Dramatic Style with Stable-diffusion
- 9 minutes read - 1871 wordsTable of Contents
The dramatic aesthetic, characterized by its use of strong contrasts, intense emotions, and captivating visuals, is a powerful tool in storytelling and visual art. It evokes a sense of awe, suspense, and wonder, drawing viewers into the heart of the narrative. But can AI truly capture the essence of this aesthetic? In this blog post, we explore the challenges and successes of using AI to generate images with a dramatic style. We analyze a case study where an AI model was tasked with creating images based on specific scenes and aesthetics, revealing both the model’s strengths and weaknesses in capturing the desired mood and visual elements. By examining these results, we gain insights into the current capabilities of AI in artistic expression and explore potential solutions for improving its understanding of dramatic aesthetics.
Created with: stability-ai-core
Silhouettes of Solitude: A Figure Walks into the Setting Sun
A lone figure traverses a dusty desert road, their silhouette stark against the fiery sunset. The scene evokes a sense of melancholy and contemplation, leaving the viewer to ponder the figure’s journey and the mysteries that lie ahead.
Prompt
French New Wave: epic, melancholic ; A lone figure, silhouetted against a setting sun; long shot; heroism; a vast, empty desert landscape; cinematic
Characteristic
Shot : A lone figure walks away from the viewer on a dirt road in the desert, with the sun setting behind them.
Aesthetic Score : 0.7
Mood : melancholy, contemplative, lonely
Quality
Entropy : 6.66
Noise : 97
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant errors.
Where Will Your Next Adventure Take You?
A close-up shot of a hand pointing at a vintage map, hinting at a journey filled with mystery and nostalgia. The compass and ruler in the background add to the sense of adventure and anticipation.
Prompt
French New Wave: intriguing, suspenseful ; A close-up of a weathered map, with a finger tracing a route; medium shot; adventure; a cluttered, dimly lit room; cinematic
Characteristic
Shot : A hand resting on an old map, suggesting exploration and adventure. The lighting is dim and mysterious.
Aesthetic Score : 0.6
Mood : intriguing, mysterious, historical
Quality
Entropy : 6.64
Noise : 88
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some slight noise is visible in the image, particularly in the darker areas.
Neon Dreams: A Gamer’s Paradise
Step back in time to a dimly lit arcade, where a young man is lost in the thrill of a classic racing game. The vibrant neon lights and his focused expression capture the nostalgic excitement of a bygone era of gaming.
Prompt
French New Wave: intense, energetic ; A hand holding a joystick, fingers moving rapidly; close-up; gaming; a neon-lit arcade with flashing screens; cinematic
Characteristic
Shot : A young man is playing an arcade game, the scene is dimly lit with neon lights.
Aesthetic Score : 0.7
Mood : nostalgic, retro, playful
Quality
Entropy : 6.00
Noise : 86
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry and has some noise, particularly in the background.
Lost in Parisian Dreams: A Moment of Longing at the Eiffel Tower
A young woman stands before the iconic Eiffel Tower, her pensive gaze capturing a moment of quiet reflection. The shallow depth of field draws attention to her thoughtful expression, creating a sense of romance and longing against the backdrop of the Parisian cityscape.
Prompt
French New Wave: romantic, nostalgic ; A young woman, her face filled with wonder, gazing at the Eiffel Tower; medium shot; tourism; a bustling Parisian street; cinematic
Characteristic
Shot : A young woman stands in front of the Eiffel Tower in Paris, looking thoughtfully towards the viewer. She is dressed in a beige plaid coat and a dark blue shirt. The background is slightly blurred, emphasizing the subject.
Aesthetic Score : 0.7
Mood : pensive, urban, romantic
Quality
Entropy : 6.84
Noise : 83
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors
Contemplation on the Rails: A Man Finds Peace in the Vastness of the Field
A man gazes out the window of a train, his contemplative pose reflecting the calm and peaceful mood of the scene. A vast yellow field stretches out before him, with a distant train adding a touch of perspective. The bright, sunny day enhances the sense of tranquility, creating a moment of quiet reflection.
Prompt
French New Wave: reflective, contemplative ; A train speeding through a countryside landscape, with a lone figure looking out the window; long shot; travel; a vibrant, sun-drenched field; cinematic
Characteristic
Shot : A man is looking out of a train window at a beautiful countryside landscape. The field is full of bright yellow flowers, and the train is moving along the track.
Aesthetic Score : 0.7
Mood : calm, peaceful, contemplative
Quality
Entropy : 6.57
Noise : 100
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.10
Image errors : No major errors in the image
Warmth and Togetherness: A Family Meal
This heartwarming image captures the essence of family bonding. A family sits around a kitchen table, sharing a meal and enjoying each other’s company. The scene radiates warmth and intimacy, creating a sense of happiness and togetherness.
Prompt
French New Wave: intimate, heartwarming ; A family gathered around a table, sharing a meal, with laughter and conversation; medium shot; family; a warm, inviting kitchen; cinematic
Characteristic
Shot : A family is gathered around a table for a meal in a cozy kitchen setting. A woman is serving food while the others are chatting and enjoying the company.
Aesthetic Score : 0.7
Mood : warm, intimate, happy
Quality
Entropy : 6.76
Noise : 99
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.00
Image errors : No visible artifacts or errors in the image.
Chasing Shadows: A Moment of Urgency in Paris
A black and white photograph captures the intensity of a chase through the narrow streets of Paris. The low angle and blurred background create a sense of urgency and motion, while the subject’s focused expression adds to the dramatic tension.
Prompt
French New Wave: urgent, dramatic ; A young man, his face etched with determination, running through a crowded marketplace; medium shot; heroism; a chaotic, bustling market; cinematic
Characteristic
Shot : A black and white photograph of a man running through a crowded street in Paris. The man is wearing a suit and tie, and he is surrounded by other people, some of whom are running as well.
Aesthetic Score : 0.8
Mood : tense, dramatic, urgent
Quality
Entropy : 6.16
Noise : 98
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.00
Image errors : No significant errors, slight graininess
Timeless Elegance: A Compass’s Shadowed Tale
A close-up shot captures the intricate details of a compass resting on a textured surface. Soft lighting casts a dramatic shadow, adding depth and intrigue to the minimal, classic composition. This timeless image evokes a sense of mystery and invites contemplation.
Prompt
French New Wave: mysterious, suspenseful ; A close-up of a compass needle spinning, pointing towards an unknown destination; close-up; adventure; a dimly lit, mysterious room; cinematic
Characteristic
Shot : A close-up image of a compass lying on a textured surface, the compass is in focus, the background is out of focus and slightly blurred.
Aesthetic Score : 0.7
Mood : classic, nostalgic, elegant
Quality
Entropy : 5.99
Noise : 81
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly grainy and could benefit from sharpening.
Secrets in the Shadows: A Tense Gathering Unfolds
A group of young men huddle around a laptop in a dimly lit room, their faces illuminated by the screen’s glow. The atmosphere is thick with tension and mystery, hinting at a secret they are about to uncover. The dramatic lighting and close-up shot of the laptop create a sense of intrigue, leaving viewers eager to know what lies behind the screen.
Prompt
French New Wave: intense, focused ; A group of friends huddled around a computer screen, their faces illuminated by the glow; medium shot; gaming; a dimly lit, cluttered room; cinematic
Characteristic
Shot : A group of young people are gathered around a laptop, looking intently at the screen. The room is dimly lit, creating a sense of mystery and suspense.
Aesthetic Score : 0.7
Mood : intense, suspenseful, mysterious
Quality
Entropy : 6.04
Noise : 81
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some noise present in the shadows and edges of the image. No major artifacts are present.
Sunset Stroll: Two Figures Disappear into the Golden Hour
A pair of men walk away from the camera, their silhouettes fading into the warm glow of a setting sun. The cobblestone street stretches before them, leading into a European city bathed in the golden light of dusk. The scene evokes a sense of romance, nostalgia, and peaceful mystery.
Prompt
French New Wave: romantic, nostalgic ; A couple walking hand-in-hand along a cobblestone street, their silhouettes framed by the setting sun; long shot; tourism; a romantic, picturesque town; cinematic
Characteristic
Shot : Two men in coats and hats walk down a cobblestone street towards the setting sun. The street is lined with old buildings and there is a sense of quiet solitude.
Aesthetic Score : 0.8
Mood : romantic, nostalgic, calm
Quality
Entropy : 6.69
Noise : 100
Prompt Clip Score : 0.37
AI Evaluation
Likelihood of AI : 0.10
Image errors : None
Conclusion
The results indicate that the generative AI model performed well in understanding and executing camera positions and shot composition, but struggled with achieving the desired aesthetic. Here’s a breakdown:
- Camera Position: The model scored a 0.42, which falls slightly below the “good” range of 0.5 to 0.75. This suggests that while the model generally understood the camera positions described in the prompt, there were some discrepancies between the intended and actual camera angles in the generated image.
- Shot Analysis: The model scored a 0.58, placing it within the “good” range. This indicates that the model was able to successfully translate the shot descriptions in the prompt into the generated image, demonstrating a good understanding of scene composition.
- Aesthetic Analysis: The model scored a 0.07, which is significantly lower than the “very good” range of -0.2 to 0.1. This suggests that the generated image’s aesthetic deviated considerably from the expected aesthetic described in the prompt. The model may have struggled to capture the desired mood, style, or visual elements.
Overall, the model shows promise in understanding and executing camera positions and shot composition, but needs improvement in achieving the desired aesthetic.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://stability.ai