AI's Artistic Journey: Capturing the Dramatic Aesthetic with Flux-schnell
- 10 minutes read - 1924 wordsTable of Contents
The dramatic style, often characterized by its use of strong contrasts, dramatic lighting, and impactful compositions, is a popular aesthetic in visual media. This style is frequently employed in film, photography, and even video games to create a sense of intensity, suspense, or grandeur. However, replicating this style with AI image generation presents unique challenges, particularly when it comes to accurately translating camera positions. This blog post explores the results of an experiment that aimed to generate images with a dramatic aesthetic, highlighting the model’s strengths and weaknesses in capturing the desired style.
Created with: flux-schnell
Silhouetted Against the Setting Sun: A Soldier’s Solitary Vigil
A lone soldier stands in a field, their silhouette stark against the fiery orange sunset. Military vehicles loom in the background, adding to the sense of isolation and power. The scene evokes a feeling of melancholy and solitude, highlighting the weight of the soldier’s duty.
Prompt
style-aesthetic Gritty realism: Melancholy, determined ; A lone soldier, silhouetted against the setting sun; wide shot; Heroism; a war-torn battlefield littered with debris and the wreckage of tanks; cinematic
Characteristic
Shot : A lone soldier stands silhouetted against a bright orange sunset, with the sun partially obscured by his head. He is standing on a hill with military vehicles in the background.
Aesthetic Score : 0.6
Mood : dramatic, somber, contemplative
Quality
Entropy : 5.96
Noise : 49
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable image errors, although the silhouettes are a bit soft and the background could use more detail.
Lost in the Jungle: A Man’s Intense Gaze
A man with a gray beard, framed by lush greenery, stares intently at something just out of view. The selective focus draws the viewer’s eye to his face, creating a sense of mystery and contemplation. The jungle setting and natural lighting add to the dramatic effect, leaving the viewer wondering what lies beyond the frame.
Prompt
style-aesthetic Gritty realism: Intrigued, apprehensive ; A weathered explorer, their face etched with lines of hardship, peering through a dense jungle canopy; close-up; Adventure; overgrown ruins of an ancient temple; cinematic
Characteristic
Shot : A man with a serious expression is looking out from behind a thick curtain of foliage, with a temple in the background. The image has a moody, cinematic feel.
Aesthetic Score : 0.7
Mood : mysterious, dramatic, contemplative
Quality
Entropy : 6.53
Noise : 96
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : There is some noise in the image, particularly in the shadows. The image also has a slight green tint.
Lost in the Game: A Moment of Focused Intensity
A player’s hands grip the controller, eyes fixed on the blurry TV screen. The dimly lit room adds an air of mystery, highlighting the intense focus and relaxed enjoyment of the gaming experience.
Prompt
style-aesthetic Gritty realism: Focused, intense ; A gamer’s hands, gripping a worn controller, illuminated by the flickering glow of a monitor; close-up; Gaming; a dimly lit room filled with empty pizza boxes and energy drink cans; cinematic
Characteristic
Shot : A person’s hand holding a game controller in a dimly lit room with a TV in the background.
Aesthetic Score : 0.4
Mood : dark, concentrated, focused
Quality
Entropy : 5.92
Noise : 41
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : Slight noise and graininess in the image, especially in the darker areas.
Lost in the Landscape: A Man, a Diner, and a Haunting Sense of Isolation
A solitary figure stands before a faded diner sign, his backpack a testament to a journey through a desolate landscape. The scene evokes a sense of loneliness, contemplation, and nostalgia, leaving viewers to ponder the story behind the man and his destination.
Prompt
style-aesthetic Gritty realism: Lonely, contemplative ; A weary traveler, their backpack slung over their shoulder, gazing out at a desolate, dusty landscape; medium shot; Tourism; a crumbling roadside diner with faded neon signs; cinematic
Characteristic
Shot : A man in a blue shirt and a backpack stands in front of a diner sign, looking at something off-screen. It appears to be a deserted road on a sunny day. The building behind the man looks to be abandoned.
Aesthetic Score : 0.6
Mood : nostalgic, lonely, contemplative
Quality
Entropy : 6.64
Noise : 72
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some minor artifacts in the background, particularly around the trees and the diner sign. The image is also slightly overexposed, causing some loss of detail in the highlights.
Four Children Gaze into the Fog, Their Thoughts Unseen
A group of children, their faces etched with a mix of melancholy and mystery, sit on a train traversing a foggy landscape. The subdued lighting and ethereal mist create an atmosphere of suspense, leaving the viewer to ponder the thoughts behind their thoughtful expressions.
Prompt
style-aesthetic Gritty realism: Intimate, hopeful ; A family huddled together in a cramped train compartment, their faces illuminated by the flickering light of a single overhead bulb; medium shot; Travel; a train rattling through a dark, rain-soaked countryside; cinematic
Characteristic
Shot : A group of young people sit inside a train, the mood is somber, the lighting is dim and the interior is old and worn
Aesthetic Score : 0.6
Mood : somber, mysterious, tense
Quality
Entropy : 6.15
Noise : 75
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image has some noise and grain, there is slight chromatic aberration in the edges
A Child’s Wonder: Gazing Up at the City’s Majesty
A young boy stands in awe, his eyes fixed on the towering skyscrapers of a bustling city. The upward angle of the shot captures his childlike wonder and the immense scale of the urban landscape, evoking a sense of curiosity and awe.
Prompt
style-aesthetic Gritty realism: Awe, curiosity ; A young boy, his eyes wide with wonder, staring up at a towering skyscraper; low angle shot; Family; a bustling city street filled with people and traffic; cinematic
Characteristic
Shot : A young child looking up at tall buildings in a city setting.
Aesthetic Score : 0.7
Mood : reflective, hopeful, urban
Quality
Entropy : 6.85
Noise : 86
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.30
Image errors : Slight overexposure in the highlights, particularly in the sky.
Heroic Firefighter Faces Down Blazing Inferno
A dramatic scene unfolds as a firefighter, clad in full gear, stands defiantly in front of a burning building. The chaotic and dangerous situation is captured in this intense image, highlighting the bravery and heroism of those who risk their lives to protect others.
Prompt
style-aesthetic Gritty realism: Brave, determined ; A firefighter, their face obscured by smoke, battling a raging inferno; close-up; Heroism; a burning building with flames licking at the sky; cinematic
Characteristic
Shot : A firefighter wearing a helmet and respirator mask stands in front of a burning building, smoke and fire are visible in the background.
Aesthetic Score : 0.7
Mood : dramatic, intense, heroic
Quality
Entropy : 6.76
Noise : 77
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.30
Image errors : There is a noticeable amount of noise in the image, particularly in the smoke and fire areas. The image also appears slightly oversharpened, which can create an artificial look.
Contemplating the Peaks: A Man Finds Solitude in the Mountains
A bearded man, clad in a blue jacket and carrying a backpack, stands amidst a breathtaking mountainous landscape. His intense gaze suggests a moment of deep contemplation, capturing the adventurous spirit and serious nature of his journey. The rugged terrain and the partially visible figure in the background add to the dramatic effect of this solitary scene.
Prompt
style-aesthetic Gritty realism: Exhausted, determined ; A group of adventurers, their faces grimy and exhausted, navigating a treacherous mountain pass; wide shot; Adventure; a snow-covered mountain range with jagged peaks; cinematic
Characteristic
Shot : A man with a beard and a backpack is hiking in the mountains. A second person is partially visible in the background.
Aesthetic Score : 0.75
Mood : adventurous, rugged, determined
Quality
Entropy : 6.77
Noise : 89
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has a slight blur in the facial features, especially around the eyes and mouth.
The Focus of a Gamer
A young man, lost in the world of gaming, his face illuminated by the glow of the screen. The low-light and close-up shot emphasize his intense concentration as he navigates the digital landscape.
Prompt
style-aesthetic Gritty realism: Focused, competitive ; A gamer, their eyes glued to the screen, their fingers flying across the keyboard; close-up; Gaming; a dimly lit room filled with computer monitors and gaming peripherals; cinematic
Characteristic
Shot : A young man, wearing a headset, is focused on a computer screen in a dimly lit room.
Aesthetic Score : 0.7
Mood : focused, intense, serious
Quality
Entropy : 5.93
Noise : 59
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image is slightly blurry in the background.
Silhouette of Mystery: A Lonely Figure in the Urban Night
A man, shrouded in darkness, walks down a deserted city street at night, his suitcase a silent companion. The brightly lit storefront casts his silhouette against the pavement, creating an air of mystery and intrigue. The scene evokes a sense of loneliness and anticipation, leaving the viewer wondering who he’s meeting and what secrets lie ahead.
Prompt
style-aesthetic Gritty realism: Lonely, introspective ; A lone traveler, their suitcase in hand, walking down a deserted street; medium shot; Tourism; a city skyline at night, with neon lights reflecting off the wet pavement; cinematic
Characteristic
Shot : A man walking down a city street at night, carrying a suitcase. There are buildings and streetlights on both sides, and a tower in the distance.
Aesthetic Score : 0.6
Mood : gloomy, urban, lonely
Quality
Entropy : 5.65
Noise : 84
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is some noise in the image, particularly in the shadows.
Conclusion
The results indicate that the generative AI model performed well in terms of understanding the scene and camera position, but struggled with achieving the desired aesthetic. Here’s a breakdown:
- Camera Position: The model scored 0.4, which is considered below average. This suggests that the model didn’t accurately translate the camera position described in the prompt into the generated image.
- Shot Analysis: The model scored 0.59, which is considered good. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.09, which is considered very good. This means that the generated image’s aesthetic closely matched the expected aesthetic, despite the model’s struggles with camera position.
Overall, the model shows promise in understanding the scene and achieving the desired aesthetic, but needs improvement in accurately translating camera positions.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://fal.ai/models/fal-ai/flux/schnell/api