AI's Artistic Struggle: Capturing the 'style-aesthetic' with Scenario
The ‘style-aesthetic’ is a powerful tool in visual storytelling, allowing artists to evoke specific emotions and moods through carefully chosen visual elements. This style often involves dramatic lighting, striking compositions, and evocative color palettes. It’s commonly used in film, photography, and digital art to create a sense of grandeur, mystery, or emotional depth. This blog post explores the challenges of using AI to generate images that capture this ‘style-aesthetic’ effectively.
Created with: Scenario
Silhouetted Against Hope: A Moment of Contemplation
A lone woman, cloaked in black, stands on a hilltop, her silhouette stark against the fiery orange sunset. The vast, rolling landscape stretches before her, mirroring the vastness of her thoughts. A sense of melancholy and contemplation hangs in the air, yet a glimmer of hope shines through the dramatic scene.
Prompt
Avant-garde: Epic, melancholic ; A lone figure, silhouetted against a blazing sunset; long shot; Heroism; A vast, desolate landscape; cinematic
Characteristic
Shot : A lone woman in a long black coat stands on a hilltop, looking out over a vast, rolling landscape. The sun is setting behind her, casting a warm glow over the scene.
Aesthetic Score : 0.8
Mood : melancholy, contemplative, serene
Quality
Entropy : 6.39
Noise : 106
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to be slightly overexposed, and there are some minor artifacts around the edges of the woman’s figure.
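The post does not say how the Entropy value under Quality is computed. Values between 6 and 7 are consistent with the Shannon entropy, in bits, of an 8-bit grayscale histogram (maximum 8), so the sketch below assumes that definition. The function name `shannon_entropy` and the toy pixel lists are illustrative only and are not taken from Scenario or from the evaluation pipeline used here.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a sequence of 8-bit grayscale values.

    Assumption: the post's "Entropy" metric is a histogram entropy like this one.
    """
    counts = Counter(pixels)
    total = len(pixels)
    # Sum p * log2(1/p) over every gray level that actually occurs.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


# A flat image has a single gray level, so its entropy is 0 bits.
flat = [128] * 1024
# An even spread over all 256 levels reaches the 8-bit maximum.
uniform = list(range(256)) * 4

print(shannon_entropy(flat))
print(shannon_entropy(uniform))
```

Under this reading, the scores of 6.3 to 6.8 reported in this post would describe images that use most of the tonal range without being evenly spread across it.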
Reaching for the Cosmic Dream
A solitary hand, adorned with a ring, stretches towards a swirling nebula of vibrant colors. The ethereal scene evokes a sense of wonder and mystery, blurring the lines between reality and dreams.
Prompt
Avant-garde: Surreal, mysterious ; A hand reaching out from a swirling vortex of light; close-up; Adventure; A kaleidoscope of colors and abstract shapes; cinematic
Characteristic
Shot : A hand with a ring on it reaches into a swirling vortex of colors. The background is a psychedelic abstract pattern of swirling lines and shapes.
Aesthetic Score : 0.6
Mood : psychedelic, surreal, mysterious
Quality
Entropy : 6.50
Noise : 120
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.80
Image errors : The background shows noticeable pixelation, which suggests it might have been upscaled from a lower-resolution image.
Lost in the Neon Labyrinth: A Cyberpunk Cityscape
A solitary figure stands on a rooftop, gazing out at a futuristic cityscape bathed in vibrant neon hues. The dramatic interplay of light and shadow evokes a sense of isolation and contemplation, capturing the essence of cyberpunk loneliness.
Prompt
Avant-garde: Nostalgic, futuristic ; A pixelated character, rendered in a retro 8-bit style, standing on a precipice overlooking a digital cityscape; medium shot; Gaming; A neon-lit, futuristic cityscape; cinematic
Characteristic
Shot : A lone woman in a futuristic cityscape, looking out over a sprawling metropolis at sunset. The scene is dominated by a tall skyscraper in the distance, which is lit up in vibrant colors.
Aesthetic Score : 0.8
Mood : futuristic, cyberpunk, ethereal
Quality
Entropy : 6.83
Noise : 111
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.90
Image errors : No major errors. Some details are slightly blurry, but this is typical of a digital painting style.
Lost in the Fog: A Moment of Melancholy on the Platform
A solitary figure, shrouded in mist, waits patiently on a train platform. The fog adds a sense of mystery and isolation, highlighting the woman’s melancholic mood and creating a poignant scene of quiet contemplation.
Prompt
Avant-garde: Lonely, evocative ; A single, weathered suitcase, abandoned on a deserted train platform; close-up; Tourism; A misty, atmospheric train station; cinematic
Characteristic
Shot : A woman in a vintage-style coat and hat stands on a train platform with a suitcase in her hand. There is a train behind her and the platform is empty, with fog in the background.
Aesthetic Score : 0.7
Mood : melancholy, pensive, nostalgic
Quality
Entropy : 6.76
Noise : 85
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.30
Image errors : There are no significant errors in the image. The fog is slightly overexposed, but not distracting.
A Crack in the Pavement, A Reflection of Hope
A solitary woman walks down a cracked street, her reflection in a puddle mirroring her melancholy. The broken pavement evokes a sense of isolation, yet a glimmer of hope shines through in the reflection, suggesting a path forward.
Prompt
Avant-garde: Disorienting, dreamlike ; A pair of feet walking on a cracked, abstract pavement; low-angle shot; Travel; A distorted, surreal cityscape; cinematic
Characteristic
Shot : A person is walking down a cracked city street with their reflection in a puddle in the foreground.
Aesthetic Score : 0.6
Mood : lonely, reflective, surreal
Quality
Entropy : 6.79
Noise : 105
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.80
Image errors : The reflection in the puddle is slightly distorted and unrealistic. The background buildings seem blurry and poorly defined.
A Moment of Shared Mystery: Three Women Gather Around a Candlelit Book
In a dimly lit room, three women huddle around a table, their faces illuminated by the warm glow of candles. They are engrossed in a large open book, their expressions hinting at a shared secret or a captivating story. The soft light creates an atmosphere of intimacy and mystery, while the blurred background draws attention to the figures and their connection. This image evokes a sense of calm and intrigue, inviting viewers to imagine the tale unfolding before them.
Prompt
Avant-garde: Intimate, mysterious ; A family gathered around a flickering candle, their faces obscured by shadows; close-up; Family; A dimly lit, antique room; cinematic
Characteristic
Shot : Three young women sitting around a table lit by candles, with an open book in front of them. The scene is set in a dimly lit room, with warm, inviting lighting. The women are dressed in simple, elegant clothing.
Aesthetic Score : 0.9
Mood : serene, intimate, studious
Quality
Entropy : 6.75
Noise : 98
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.30
Image errors : No noticeable artifacts or errors.
A Moment of Letting Go
A woman in a beige coat stands in a white room, her gaze fixed on a red balloon drifting away. The contrast between her stillness and the balloon’s ascent evokes a sense of longing and wistful contemplation. The dangling string serves as a poignant reminder of what has been lost.
Prompt
Avant-garde: Hopeful, symbolic ; A single, red balloon floating against a stark, white background; close-up; Heroism; A minimalist, abstract setting; cinematic
Characteristic
Shot : A woman in a beige coat is standing in a white room, looking at a red balloon floating against the ceiling.
Aesthetic Score : 0.6
Mood : minimalistic, contemplative, somber
Quality
Entropy : 6.30
Noise : 60
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors
Nostalgia on the Couch: A Retro Gaming Moment
A cozy scene of a person enjoying a classic video game console on a bed, with a TV in the background. The image evokes a sense of nostalgia and captures the simple joy of retro gaming.
Prompt
Avant-garde: Nostalgic, introspective ; A hand holding a vintage game controller, the screen reflecting a distorted, pixelated world; close-up; Gaming; A dimly lit, retro-themed room; cinematic
Characteristic
Shot : A person is playing a vintage video game console with a retro-style game on the screen. The console is positioned on a bed.
Aesthetic Score : 0.7
Mood : nostalgic, retro, cozy
Quality
Entropy : 6.79
Noise : 86
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors
Lost in Wonder: A Woman Gazes at a Swirling Sky
A solitary figure in a blue dress stands on a rocky precipice, her gaze drawn upwards to a mesmerizing cloud formation. The vastness of the mountains and the ethereal beauty of the sky create a sense of awe and mystery, leaving the viewer to ponder the woman’s thoughts and the secrets held within the swirling clouds.
Prompt
Avant-garde: Sublime, awe-inspiring ; A lone figure standing on a mountain peak, their silhouette framed by a swirling vortex of clouds; long shot; Adventure; A dramatic, mountainous landscape; cinematic
Characteristic
Shot : A lone woman stands on a mountain peak overlooking a vast sea of clouds, with a dramatic cloud formation swirling in the sky above.
Aesthetic Score : 0.8
Mood : dreamy, ethereal, mystical
Quality
Entropy : 6.79
Noise : 96
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.90
Image errors : No visible artifacts or errors.
Lost in Wanderlust: A Face Framed by Dreams of Travel
A woman’s enigmatic gaze is surrounded by evocative imagery of travel, hinting at a yearning for adventure and a longing for distant lands. The dreamy atmosphere and nostalgic elements create a captivating scene that whispers of journeys yet to be taken.
Prompt
Avant-garde: Energetic, disorienting ; A series of fragmented, overlapping images, depicting different aspects of travel and tourism; montage; Tourism; A chaotic, abstract collage; cinematic
Characteristic
Shot : A collage-style image with a woman’s face as the central element, surrounded by various travel-related images such as planes, ships, maps, and buildings.
Aesthetic Score : 0.6
Mood : dreamy, nostalgic, adventurous
Quality
Entropy : 6.76
Noise : 107
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some visible artifacts and inconsistencies, particularly around the edges of the different elements. The image also appears to be somewhat over-sharpened.
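As a rough cross-check before the conclusion, the per-image numbers listed above can be averaged. This is only a sketch: the values are copied from the ten entries in this post, while the tuple layout and variable names are illustrative choices, not part of any tool used here.

```python
from statistics import mean

# (title, aesthetic, entropy, noise, prompt CLIP score, likelihood of AI),
# copied from the ten entries above in the order they appear.
results = [
    ("Silhouetted Against Hope", 0.8, 6.39, 106, 0.29, 0.80),
    ("Reaching for the Cosmic Dream", 0.6, 6.50, 120, 0.27, 0.80),
    ("Lost in the Neon Labyrinth", 0.8, 6.83, 111, 0.30, 0.90),
    ("Lost in the Fog", 0.7, 6.76, 85, 0.33, 0.30),
    ("A Crack in the Pavement", 0.6, 6.79, 105, 0.31, 0.80),
    ("A Moment of Shared Mystery", 0.9, 6.75, 98, 0.28, 0.30),
    ("A Moment of Letting Go", 0.6, 6.30, 60, 0.34, 0.20),
    ("Nostalgia on the Couch", 0.7, 6.79, 86, 0.32, 0.10),
    ("Lost in Wonder", 0.8, 6.79, 96, 0.31, 0.90),
    ("Lost in Wanderlust", 0.6, 6.76, 107, 0.28, 0.80),
]

# Column-wise averages across all ten images.
avg_aesthetic = mean(r[1] for r in results)
avg_entropy = mean(r[2] for r in results)
avg_noise = mean(r[3] for r in results)
avg_clip = mean(r[4] for r in results)
avg_ai = mean(r[5] for r in results)

print(f"Aesthetic Score  : {avg_aesthetic:.2f}")
print(f"Entropy          : {avg_entropy:.2f}")
print(f"Noise            : {avg_noise:.1f}")
print(f"Prompt Clip Score: {avg_clip:.2f}")
print(f"Likelihood of AI : {avg_ai:.2f}")
```

The averages come out at roughly 0.71 for Aesthetic Score, 6.67 for Entropy, 97.4 for Noise, 0.30 for Prompt Clip Score, and 0.59 for Likelihood of AI.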
Conclusion
The results indicate that the generative AI model handled the shot instructions reasonably well, was somewhat weaker on camera position, and struggled most with achieving the desired aesthetic. Here’s a breakdown:
Camera Position:
- Score: 0.45
- Interpretation: This score falls below the “good” range (0.5-0.75). It suggests that the model didn’t perfectly capture the intended camera position described in the prompt.
Shot Analysis:
- Score: 0.66
- Interpretation: This score falls within the “good” range. It indicates that the model was able to understand and execute the shot instructions reasonably well, but there might be some minor discrepancies between the prompt and the generated image.
Aesthetic Analysis:
- Score: 0.09
- Interpretation: This is the lowest of the three scores. It suggests a considerable difference between the expected aesthetic and the actual aesthetic of the generated images. The model might have struggled to capture the desired mood, style, or visual elements. The quoted “very good” range (-0.2 to 0.1) would nominally include 0.09, so the range itself may be misstated.
Overall:
While the model showed a reasonable understanding of the shot instructions and came close to the “good” range on camera position, it fell short in achieving the desired aesthetic. This suggests that the model might need further training or optimization to better understand and execute aesthetic prompts.
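The range checks used in the breakdown above can be sketched in a few lines. Only the “good” range (0.5-0.75) and the camera position and shot scores come from this post. The helper name `position_in_range` is hypothetical, and the aesthetic score is left out because it is reported against a different range.

```python
def position_in_range(score, low, high):
    """Say whether a score sits below, within, or above an inclusive range."""
    if score < low:
        return "below"
    if score > high:
        return "above"
    return "within"


# The "good" range quoted in the conclusion.
GOOD_RANGE = (0.5, 0.75)

# Scores reported in the conclusion.
scores = {"camera position": 0.45, "shot analysis": 0.66}

for name, score in scores.items():
    print(f"{name}: {score} is {position_in_range(score, *GOOD_RANGE)} the good range")
```

With these inputs, camera position lands below the range and shot analysis within it, matching the interpretations given in the breakdown.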
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://www.scenario.com