AI's Artistic Struggle: Capturing the 'Dramatic' Aesthetic with Flux-pro
The ‘dramatic’ aesthetic, characterized by high contrast, strong lighting, and a sense of tension, is a powerful tool in visual storytelling. It’s often used to evoke emotions like suspense, awe, and heroism. But can AI truly capture this aesthetic? In this article, we explore how well flux-pro generates images in a ‘dramatic’ style. We analyze ten generated images, examining the model’s ability to understand scene composition, camera position, and the nuances of visual style. The analysis offers insight into what AI can currently do in artistic expression and where it still has room to develop.
Created with: flux-pro
Silhouetted Soldier Faces the Setting Sun
A lone soldier stands in silhouette against a vibrant sunset, facing a powerful tank in a desolate field. The scene evokes a sense of melancholy and hope, highlighting the soldier’s isolation and the dramatic contrast between vulnerability and power.
Prompt
Gritty realism: Melancholy, determined ; A lone soldier, silhouetted against the setting sun; wide shot; Heroism; a war-torn battlefield littered with debris and the wreckage of tanks; cinematic
Characteristic
Shot : A lone soldier stands in a field at sunset, facing a tank in the distance. The background is a hazy sunset sky.
Aesthetic Score : 0.6
Mood : melancholy, solitude, war
Quality
Entropy : 6.51
Noise : 72
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image has some minor artifacts, such as the halo effect around the sun.
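The prompt for this image, like every prompt in this test, follows the same semicolon-separated pattern: a style and mood, a subject, a shot type, a theme, a setting, and a closing "cinematic" tag. As a minimal sketch of how such prompts could be assembled, the Python below rebuilds the soldier prompt from its parts. The `PromptSpec` class, its field names, and `build_prompt` are hypothetical names chosen for illustration; the article does not say how the prompts were actually produced.

```python
from dataclasses import dataclass


@dataclass
class PromptSpec:
    """Hypothetical container for the fields visible in each prompt."""
    style: str    # e.g. "Gritty realism"
    mood: str     # e.g. "Melancholy, determined"
    subject: str  # who or what the image is about
    shot: str     # e.g. "wide shot", "close-up"
    theme: str    # e.g. "Heroism", "Adventure"
    setting: str  # where the scene takes place
    finish: str = "cinematic"


def build_prompt(spec: PromptSpec) -> str:
    """Join the fields in the order the prompts in this article use."""
    head = f"{spec.style}: {spec.mood} ; {spec.subject}"
    return "; ".join([head, spec.shot, spec.theme, spec.setting, spec.finish])


soldier = PromptSpec(
    style="Gritty realism",
    mood="Melancholy, determined",
    subject="A lone soldier, silhouetted against the setting sun",
    shot="wide shot",
    theme="Heroism",
    setting="a war-torn battlefield littered with debris and the wreckage of tanks",
)
print(build_prompt(soldier))
```

Keeping the shot type as its own field makes it easy to compare what was requested ("wide shot", "close-up", "low angle shot") with what the generated image actually shows.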
A Weathered Face, A Distant Temple, and a Mystery Unfolding
A man with a weathered face, his gaze fixed on a distant temple, stands amidst lush greenery. The scene evokes a sense of mystery and contemplation, with the blurred temple adding to the enigmatic atmosphere. The contrast between the sharp focus on his face and the soft blur of the temple creates a dramatic effect, leaving the viewer wondering what secrets lie ahead.
Prompt
Gritty realism: Intrigued, apprehensive ; A weathered explorer, their face etched with lines of hardship, peering through a dense jungle canopy; close-up; Adventure; overgrown ruins of an ancient temple; cinematic
Characteristic
Shot : A close-up of an older man’s face, looking over his shoulder at a temple in the background. The man is wearing a brown jacket.
Aesthetic Score : 0.7
Mood : mysterious, thoughtful, intriguing
Quality
Entropy : 6.79
Noise : 80
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry, especially in the background.
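The Entropy value reported for each image is not defined in this article. One common reading, assumed here, is the Shannon entropy of the image's grayscale histogram, which ranges from 0 bits for a flat image to 8 bits when all 256 gray levels are equally likely. The sketch below illustrates that assumption on plain lists of pixel values; `shannon_entropy` is a hypothetical helper, not the tool used for the test.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a flat sequence of 8-bit gray values."""
    total = len(pixels)
    if total == 0:
        raise ValueError("pixels must not be empty")
    counts = Counter(pixels)
    # Sum p * log2(1 / p) over every gray level that occurs.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


flat = [128] * 1024             # one gray level only
uniform = list(range(256)) * 4  # every gray level equally likely

print(shannon_entropy(flat))     # 0.0
print(shannon_entropy(uniform))  # 8.0
```

Under this reading, a higher value means the image uses a wider spread of tones, which fits scenes with varied lighting and detail.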
Focused Fun: Gamer’s Paradise
A relaxed figure engrossed in a video game, controller in hand, with the promise of pizza in the background. The intimate lighting highlights the player’s focus and playful energy.
Prompt
Gritty realism: Focused, intense ; A gamer’s hands, gripping a worn controller, illuminated by the flickering glow of a monitor; close-up; Gaming; a dimly lit room filled with empty pizza boxes and energy drink cans; cinematic
Characteristic
Shot : A person is playing a video game with a controller. There is pizza in the background.
Aesthetic Score : 0.6
Mood : relaxed, playful, focused
Quality
Entropy : 6.91
Noise : 58
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is some noise in the image, especially in the dark areas.
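The Prompt Clip Score measures how closely an image matches its prompt. CLIP-based scores are built on the cosine similarity between a text embedding and an image embedding, so a minimal sketch of that core step may help when reading the numbers. The vectors below are made-up toy values standing in for real embeddings, and the article does not state which CLIP model or scaling was used; `cosine_similarity` is an illustrative helper.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("vectors must be non-zero")
    return dot / (norm_a * norm_b)


# Toy 4-dimensional stand-ins; real CLIP embeddings have hundreds of dimensions.
text_embedding = [0.2, 0.1, 0.9, 0.3]
image_embedding = [0.1, 0.8, 0.3, 0.4]

print(round(cosine_similarity(text_embedding, image_embedding), 2))
```

Identical directions give 1.0 and unrelated (orthogonal) directions give 0.0, so a higher score means the image sits closer to the prompt in the embedding space.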
Lost in the Dust: A Figure Walks Towards the Unknown
A solitary figure traverses a desolate landscape, their smallness against the vastness of the sky emphasizing a profound sense of loneliness. The overcast sky and the sign reading ‘Dirly’ add to the melancholic mood, hinting at a journey towards an uncertain destination.
Prompt
Gritty realism: Lonely, contemplative ; A weary traveler, their backpack slung over their shoulder, gazing out at a desolate, dusty landscape; medium shot; Tourism; a crumbling roadside diner with faded neon signs; cinematic
Characteristic
Shot : A lone figure walks away from a roadside diner on a dusty road. The sky is overcast and the lighting is muted.
Aesthetic Score : 0.6
Mood : melancholy, lonely, contemplative
Quality
Entropy : 6.79
Noise : 86
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image is slightly overexposed and the color grading is a bit flat.
Secrets in the Shadows: A Train Ride Filled with Mystery
Three figures shrouded in darkness, their faces etched with unspoken emotions. A train journey unfolds under the cloak of night, hinting at a story of intrigue and hidden truths. The intimate setting and low-light atmosphere create a sense of suspense, leaving viewers to wonder what secrets lie beneath the surface.
Prompt
Gritty realism: Intimate, hopeful ; A family huddled together in a cramped train compartment, their faces illuminated by the flickering light of a single overhead bulb; medium shot; Travel; a train rattling through a dark, rain-soaked countryside; cinematic
Characteristic
Shot : Three people are sitting in a train carriage. The lighting is dim and the atmosphere is moody. The focus is on the people, with the train car acting as a background.
Aesthetic Score : 0.6
Mood : mysterious, intimate, contemplative
Quality
Entropy : 6.39
Noise : 71
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly grainy and has some noise. There’s a slight color cast towards the blues, making the image appear slightly cold.
A City of Dreams: A Boy’s Hopeful Gaze
A young boy, his eyes wide with wonder, looks up at a towering city building. The image captures a sense of playful curiosity and hopeful aspiration, as the boy’s gaze suggests a world of possibilities waiting to be explored.
Prompt
Gritty realism: Awe, curiosity ; A young boy, his eyes wide with wonder, staring up at a towering skyscraper; low angle shot; Family; a bustling city street filled with people and traffic; cinematic
Characteristic
Shot : A young boy looks up at the Freedom Tower in New York City.
Aesthetic Score : 0.6
Mood : awe, wonder, childhood
Quality
Entropy : 6.94
Noise : 71
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some noise and artifacts, especially in the background.
Firefighter Bravely Faces Inferno
A dramatic image captures a firefighter in full gear standing before a blazing building, the flames casting an orange glow on his face. The contrast between the dark figure and the bright flames emphasizes the danger and intensity of the situation, highlighting the heroism of those who fight fires.
Prompt
Gritty realism: Brave, determined ; A firefighter, their face obscured by smoke, battling a raging inferno; close-up; Heroism; a burning building with flames licking at the sky; cinematic
Characteristic
Shot : A firefighter in full gear stands in front of a burning building, with flames visible in the background.
Aesthetic Score : 0.7
Mood : dramatic, heroic, intense
Quality
Entropy : 6.66
Noise : 61
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : Slight graininess and a bit of blur in the background, but not distracting.
Adventure Awaits: Hikers Brave the Snowy Peaks
A group of determined hikers navigate a snow-covered mountain trail, bathed in dramatic light and shadow. Their journey captures the essence of adventure and the thrill of pushing boundaries.
Prompt
Gritty realism: Exhausted, determined ; A group of adventurers, their faces grimy and exhausted, navigating a treacherous mountain pass; wide shot; Adventure; a snow-covered mountain range with jagged peaks; cinematic
Characteristic
Shot : Four people hike up a snowy mountain range under a dramatic sky, conveying a sense of adventure.
Aesthetic Score : 0.7
Mood : adventure, determination, nature
Quality
Entropy : 6.97
Noise : 93
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some minor noise in the shadows, especially in the sky.
Lost in the Code: A Moment of Focused Intensity
A young man sits hunched over his keyboard, eyes glued to the screen in a dimly lit room. The low lighting and close-up shot capture the intensity of his focus, highlighting the quiet power of concentration in a tech-filled environment.
Prompt
Gritty realism: Focused, competitive ; A gamer, their eyes glued to the screen, their fingers flying across the keyboard; close-up; Gaming; a dimly lit room filled with computer monitors and gaming peripherals; cinematic
Characteristic
Shot : A young man is sitting at a computer desk in a dimly lit room, concentrating on his keyboard. He is wearing a black hoodie. There is another person in the background, out of focus, also using a computer.
Aesthetic Score : 0.6
Mood : focused, intense, technological
Quality
Entropy : 6.72
Noise : 70
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly grainy and some of the colors are a bit oversaturated. There are no visible artifacts in the image.
Lost in the City Lights: A Figure Walks Alone in the Night
A solitary figure, shrouded in shadows, traverses a rain-slicked street under the glow of city lights. The scene evokes a sense of melancholy and loneliness, with the dramatic interplay of light and dark highlighting the figure’s isolation and vulnerability.
Prompt
Gritty realism: Lonely, introspective ; A lone traveler, their suitcase in hand, walking down a deserted street; medium shot; Tourism; a city skyline at night, with neon lights reflecting off the wet pavement; cinematic
Characteristic
Shot : A lone figure walks down a city street at night carrying a suitcase. The city lights create a warm, inviting glow, and the figure is silhouetted against the light.
Aesthetic Score : 0.6
Mood : lonely, hopeful, urban
Quality
Entropy : 6.92
Noise : 81
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry, especially in the background. The lighting is uneven.
Conclusion
The results show that the generative AI model matched the requested aesthetic most closely, handled shot composition well, and was weakest on camera position. Here’s a breakdown, with each score listed alongside the rating the test assigned to it:
- Camera Position: The model scored 0.4, which is considered okay. This means the camera position in the generated images was somewhat different from what was requested in the prompts.
- Shot Analysis: The model scored 0.49, which is considered good. This indicates the shot composition of the generated images was fairly close to what was described in the prompts.
- Aesthetic Analysis: The model scored 0.05, which is considered very good. This means the aesthetic of the generated images was very close to the expected aesthetic.
Overall, the model seems to be better at capturing the desired aesthetic and shot composition than at placing the camera where the prompt asks.
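To put the per-image numbers side by side, the sketch below averages the metrics reported for the ten images above and picks out the strongest and weakest prompt match. The values are copied from this article; the `ImageResult` type, the shortened titles, and the choice of a plain mean are illustrative assumptions, and this is not the method that produced the camera, shot, and aesthetic scores in the conclusion.

```python
from statistics import mean
from typing import NamedTuple


class ImageResult(NamedTuple):
    title: str
    aesthetic: float
    entropy: float
    noise: int
    clip: float
    ai_likelihood: float


# Values copied from the per-image blocks in this article.
RESULTS = [
    ImageResult("Silhouetted Soldier", 0.6, 6.51, 72, 0.27, 0.30),
    ImageResult("A Weathered Face", 0.7, 6.79, 80, 0.29, 0.10),
    ImageResult("Gamer's Paradise", 0.6, 6.91, 58, 0.26, 0.20),
    ImageResult("Lost in the Dust", 0.6, 6.79, 86, 0.30, 0.30),
    ImageResult("Secrets in the Shadows", 0.6, 6.39, 71, 0.28, 0.20),
    ImageResult("A City of Dreams", 0.6, 6.94, 71, 0.27, 0.10),
    ImageResult("Firefighter", 0.7, 6.66, 61, 0.27, 0.20),
    ImageResult("Adventure Awaits", 0.7, 6.97, 93, 0.28, 0.10),
    ImageResult("Lost in the Code", 0.6, 6.72, 70, 0.24, 0.20),
    ImageResult("Lost in the City Lights", 0.6, 6.92, 81, 0.31, 0.10),
]

METRICS = ("aesthetic", "entropy", "noise", "clip", "ai_likelihood")


def summarize(results):
    """Return the mean of each metric, rounded to three decimals."""
    return {m: round(mean(getattr(r, m) for r in results), 3) for m in METRICS}


summary = summarize(RESULTS)
best = max(RESULTS, key=lambda r: r.clip)
worst = min(RESULTS, key=lambda r: r.clip)

for name, value in summary.items():
    print(f"{name}: {value}")
print("Closest prompt match:", best.title)
print("Weakest prompt match:", worst.title)
```

Averages like these hide per-image variation, so they are best read next to the individual blocks rather than in place of them.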
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://fal.ai/models/fal-ai/flux-pro/api