AI's Artistic Struggle: Capturing the 'Dramatic' Aesthetic with Stability-ai-ultra
- 9 minutes read - 1892 wordsTable of Contents
The ‘dramatic’ aesthetic, characterized by high contrast, strong lighting, and a sense of tension, is a powerful tool in visual storytelling. It’s often used to evoke emotions like suspense, awe, and heroism. But can AI truly capture this aesthetic? In this article, we explore the challenges and successes of AI in generating images with a ‘dramatic’ style. We’ll analyze the results of a test, examining the AI’s ability to understand scene composition, camera position, and the nuances of visual style. Through this analysis, we’ll gain insights into the current capabilities of AI in artistic expression and its potential for future development.
Created with: stability-ai-ultra
A Soldier’s Solitude in the Aftermath of War
A lone soldier stands amidst the wreckage of a battlefield, the setting sun casting long shadows on the debris. The image evokes a sense of melancholy and isolation, highlighting the human cost of conflict.
Prompt
Gritty realism: Melancholy, determined ; A lone soldier, silhouetted against the setting sun; wide shot; Heroism; a war-torn battlefield littered with debris and the wreckage of tanks; cinematic
Characteristic
Shot : A soldier in camouflage stands amidst the rubble of a war zone, with two tanks in the distance. The sun is setting in the background, casting a warm glow on the scene.
Aesthetic Score : 0.6
Mood : somber, dramatic, melancholic
Quality
Entropy : 6.16
Noise : 80
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.60
Image errors : Some of the rubble appears somewhat unrealistic, and the textures of the tanks and soldier are a bit blurry.
The Old Man’s Gaze: A Mystery in the Woods
A close-up portrait of an elderly man with a long white beard, his eyes piercing the viewer. The intensity of his gaze and the blurred forest background create a sense of mystery and intrigue. What secrets does he hold?
Prompt
Gritty realism: Intrigued, apprehensive ; A weathered explorer, their face etched with lines of hardship, peering through a dense jungle canopy; close-up; Adventure; overgrown ruins of an ancient temple; cinematic
Characteristic
Shot : A close-up portrait of an elderly man with a long white beard, he is wearing a bandana, and his face is covered in dirt. He is looking directly at the camera with a serious expression. The background is blurred and out of focus.
Aesthetic Score : 0.7
Mood : intense, rugged, mysterious
Quality
Entropy : 6.76
Noise : 108
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no noticeable artifacts or errors in the image.
In the Zone: A Gamer’s Focus
A close-up shot captures the intensity of a gamer, their hand gripping the controller, eyes fixed on the screen. The lighting and composition create a sense of depth and focus, highlighting the controller and the player’s dedication to the game.
Prompt
Gritty realism: Focused, intense ; A gamer’s hands, gripping a worn controller, illuminated by the flickering glow of a monitor; close-up; Gaming; a dimly lit room filled with empty pizza boxes and energy drink cans; cinematic
Characteristic
Shot : A person is holding a gaming controller in a dimly lit room, with a computer monitor in the background and various objects on the desk.
Aesthetic Score : 0.6
Mood : dark, intense, focused
Quality
Entropy : 6.31
Noise : 80
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some noise and grain, especially in the darker areas. The focus is slightly off on the controller’s buttons.
Lost in the Desert: A Moment of Solitude at an Abandoned Diner
A lone figure stands silhouetted against the setting sun, their back turned to a dilapidated diner in the heart of the desert. The scene evokes a sense of loneliness and nostalgia, with the weathered diner and desolate landscape serving as a poignant backdrop to the figure’s isolation.
Prompt
Gritty realism: Lonely, contemplative ; A weary traveler, their backpack slung over their shoulder, gazing out at a desolate, dusty landscape; medium shot; Tourism; a crumbling roadside diner with faded neon signs; cinematic
Characteristic
Shot : A lone traveler stands in front of a dilapidated diner in a desert landscape. The diner has a bright neon sign that reads “Diner” and the windows are broken.
Aesthetic Score : 0.7
Mood : desolate, lonely, nostalgic
Quality
Entropy : 6.85
Noise : 95
Prompt Clip Score : 0.39
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no noticeable errors in the image.
Lost in the Rain: A Moment of Contemplation on a Dimly Lit Train
A group of passengers, bathed in the soft glow of the train’s interior lights, gaze out the window at the pouring rain. The scene evokes a sense of cozy contemplation, tinged with a touch of mystery and suspense.
Prompt
Gritty realism: Intimate, hopeful ; A family huddled together in a cramped train compartment, their faces illuminated by the flickering light of a single overhead bulb; medium shot; Travel; a train rattling through a dark, rain-soaked countryside; cinematic
Characteristic
Shot : A group of people are riding a train, the window is showing a rainy landscape outside
Aesthetic Score : 0.7
Mood : gloomy, melancholic, contemplative
Quality
Entropy : 6.42
Noise : 95
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has slight noise and grain, especially in the darker areas, but it is not significant.
Tiny Wonder in the City That Never Sleeps
A young boy stands in awe amidst the towering skyscrapers and dazzling lights of Times Square, a poignant reminder of the wonder and scale of city life.
Prompt
Gritty realism: Awe, curiosity ; A young boy, his eyes wide with wonder, staring up at a towering skyscraper; low angle shot; Family; a bustling city street filled with people and traffic; cinematic
Characteristic
Shot : A young boy is standing in Times Square, looking up at the towering skyscrapers and bright lights. The scene is bustling with activity, and the boy’s expression is one of wonder and curiosity.
Aesthetic Score : 0.7
Mood : awe, wonder, city life
Quality
Entropy : 6.96
Noise : 71
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.20
Image errors : Slight blurring, some areas are too dark
Firefighter Faces the Blaze with Unwavering Determination
A powerful image captures the intensity of a fire as a firefighter, clad in full gear, stands resolute in front of a burning building. The scene evokes a sense of danger and determination, highlighting the bravery of those who face such perilous situations.
Prompt
Gritty realism: Brave, determined ; A firefighter, their face obscured by smoke, battling a raging inferno; close-up; Heroism; a burning building with flames licking at the sky; cinematic
Characteristic
Shot : A firefighter in full gear stands in front of a burning house. The fire is intense and the smoke is billowing. The firefighter is looking off to the side with a serious expression on his face.
Aesthetic Score : 0.7
Mood : dramatic, intense, somber
Quality
Entropy : 6.78
Noise : 85
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant errors were found.
Conquering the Summit: A Hiker’s Journey Through Snowy Peaks
Experience the thrill of adventure as a lone hiker ascends a majestic mountain range, the snowy peaks and dramatic depth of field creating a sense of awe and inspiration. The blurred figures in the background add to the sense of isolation and the challenge of the journey.
Prompt
Gritty realism: Exhausted, determined ; A group of adventurers, their faces grimy and exhausted, navigating a treacherous mountain pass; wide shot; Adventure; a snow-covered mountain range with jagged peaks; cinematic
Characteristic
Shot : A group of three hikers are ascending a snow-covered mountain. The main subject is the hiker in the foreground, and the others are behind him, providing a sense of scale. The mountain in the background is large and imposing, providing a dramatic backdrop for the scene.
Aesthetic Score : 0.7
Mood : dramatic, adventurous, determined
Quality
Entropy : 6.76
Noise : 82
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.90
Image errors : Some of the details of the mountains and the hikers’ faces are blurry. The lighting is a little bit uneven, and the composition is slightly off-center.
In the Zone: Gamer’s Intensity Illuminated
A young man is fully immersed in his game, the colorful lighting casting dramatic shadows as he focuses intently on the screen. His determination is palpable, captured in the intensity of his gaze and the poised hand on the keyboard.
Prompt
Gritty realism: Focused, competitive ; A gamer, their eyes glued to the screen, their fingers flying across the keyboard; close-up; Gaming; a dimly lit room filled with computer monitors and gaming peripherals; cinematic
Characteristic
Shot : A young man wearing a headset is focused on playing a game on a computer. The image is lit with blue and red hues creating a dramatic effect.
Aesthetic Score : 0.6
Mood : intense, focused, gamer
Quality
Entropy : 6.54
Noise : 73
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is some noise in the image, particularly in the shadows. There’s slight motion blur in the hand, which detracts from the sharp focus.
Lost in the Neon Glow: A Lonely Figure in a Cyberpunk Cityscape
A solitary figure walks through a rain-slicked, neon-drenched street in a futuristic city. The vibrant lights and empty streets create a sense of melancholy and isolation, capturing the essence of cyberpunk urban life.
Prompt
Gritty realism: Lonely, introspective ; A lone traveler, their suitcase in hand, walking down a deserted street; medium shot; Tourism; a city skyline at night, with neon lights reflecting off the wet pavement; cinematic
Characteristic
Shot : A lone figure walks down a wet, neon-lit street in a city at night. The street is lined with tall buildings, and the signs are glowing brightly in the dark.
Aesthetic Score : 0.8
Mood : mysterious, futuristic, lonely
Quality
Entropy : 6.87
Noise : 105
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.90
Image errors : No visible artifacts or errors.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic. Here’s a breakdown:
- Camera Position: The model scored 0.4, which is considered okay. This means the generated image’s camera position was somewhat different from what was requested in the prompt.
- Shot Analysis: The model scored 0.49, which is considered good. This indicates the generated image’s shot composition was fairly close to what was described in the prompt.
- Aesthetic Analysis: The model scored 0.05, which is considered very good. This means the generated image’s aesthetic was very close to the expected aesthetic.
Overall, the model seems to be better at understanding the scene and shot composition than it is at capturing the desired aesthetic.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://stability.ai