AI's Dramatic Style: A Tale of Two Worlds with Flux-dev
- 9 minutes read - 1806 wordsTable of Contents
The dramatic style, often characterized by strong contrasts, heightened emotions, and impactful visuals, is a powerful tool in storytelling and visual art. In the realm of AI-generated imagery, the ability to capture this style presents both exciting possibilities and intriguing challenges. This blog post explores the results of a test that examined an AI model’s capacity to generate images with a dramatic aesthetic, analyzing its strengths and weaknesses in understanding scene composition, camera position, and overall visual impact.
Created with: flux-dev
Into the Darkness: A Man’s Determined Journey
A lone figure, illuminated by a burning torch, ventures into the unknown. The mysterious setting, whether a cave or a dense forest, adds to the intensity and sense of adventure. The contrast between the darkness and the bright flame creates a dramatic effect, highlighting the man’s unwavering purpose.
Prompt
dramatic-styles Rule of Thirds: Dramatic, suspenseful ; A hero’s face, illuminated by a flickering torch, as they make a crucial decision; Close-up; Heroism; A dark, mysterious cave; cinematic
Characteristic
Shot : A man with dark hair and a beard holds a burning torch, the light of which illuminates his face and a bit of his surroundings. He is wearing a dark jacket and stands in a dark, smoky environment.
Aesthetic Score : 0.6
Mood : mysterious, suspenseful, intense
Quality
Entropy : 5.45
Noise : 44
Prompt Clip Score : 0.19
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some noise and artifacts, especially in the darker areas. The torch flame also seems a bit unnatural.
A Father and Daughter’s Peaceful Stroll Through Rolling Hills
A heartwarming scene of a father and daughter walking along a path in a grassy field, with rolling hills creating a sense of depth and perspective. The peaceful and serene mood is captured in the composition, evoking a feeling of tranquility and connection.
Prompt
dramatic-styles Rule of Thirds: Peaceful, heartwarming ; A family holding hands, walking along a scenic path, with a breathtaking view behind them; Medium shot; Family; A lush, green valley with rolling hills; cinematic
Characteristic
Shot : A father and daughter are walking down a path in a grassy field. The sun is shining and the sky is blue. There are trees and hills in the distance.
Aesthetic Score : 0.7
Mood : tranquil, wholesome, happy
Quality
Entropy : 6.76
Noise : 79
Prompt Clip Score : 0.20
AI Evaluation
Likelihood of AI : 0.10
Image errors : None
Shadowed Figure Confronts a Sword-Wielding Crowd
A lone figure, shrouded in mystery, stands defiant before a throng of armed individuals. The hazy atmosphere and dramatic silhouette create a sense of suspense and intrigue, leaving the viewer to wonder what secrets lie hidden within the fog.
Prompt
dramatic-styles Rule of Thirds: Intense, suspenseful ; A determined hero raising their sword, ready to face a horde of enemies; Medium shot; Heroism; A dark, foreboding battlefield; cinematic
Characteristic
Shot : A lone figure, shrouded in a dark cloak, stands before a throng of shadowy figures, their swords held high, in a misty, medieval setting.
Aesthetic Score : 0.7
Mood : mysterious, ominous, foreboding
Quality
Entropy : 6.69
Noise : 50
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are some minor artifacts visible in the image, such as slight blurriness and noise. These are not overly distracting.
Campfire Tales: Friends Gather in the Forest’s Embrace
A cozy gathering of friends around a crackling campfire in a dark forest. The low light and warm glow create an inviting atmosphere, while the surrounding darkness hints at adventure and mystery.
Prompt
dramatic-styles Rule of Thirds: Warm, intimate ; A group of friends huddled around a campfire, sharing stories and laughter; Medium shot; Adventure; A dense forest with stars twinkling above; cinematic
Characteristic
Shot : A group of young adults sit around a campfire in a wooded area. The fire is glowing brightly, casting warm light on the faces of the people. The trees behind them are silhouetted against the dark sky.
Aesthetic Score : 0.7
Mood : cozy, intimate, adventurous
Quality
Entropy : 6.34
Noise : 70
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant image errors, minor blur in the background
Lost in the Game: A Moment of Intense Focus
A player is fully immersed in their game, their focused expression and the dramatic lighting creating a sense of mystery and intrigue. The scene captures the intensity and dedication of a gamer lost in the digital world.
Prompt
dramatic-styles Rule of Thirds: Focused, immersive ; A gamer’s hands intensely navigating a virtual world on a glowing screen; Close-up; Gaming; A dimly lit room with gaming peripherals; cinematic
Characteristic
Shot : A person sitting in a dimly lit room, focused on a computer screen, possibly gaming, with a headset on and a keyboard in front of them.
Aesthetic Score : 0.6
Mood : focused, intense, determined
Quality
Entropy : 6.32
Noise : 52
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has slight blurriness around the edges and some noise in the darker areas.
Triumphant Silhouette: A Moment of Hope and Majesty
A lone figure stands tall on a mountain peak, their silhouette stark against the setting sun. This inspirational scene evokes feelings of accomplishment, hope, and the vastness of the world. The dramatic contrast between the figure and the sun emphasizes the feeling of scale and the power of human spirit.
Prompt
dramatic-styles Rule of Thirds: Triumphant, exhilarating ; A gamer’s avatar, a powerful warrior, standing triumphantly on a virtual mountain peak; Wide shot; Gaming; A vibrant, fantastical game world; cinematic
Characteristic
Shot : A lone figure stands on a mountain peak with their arms raised in triumph as the sun sets behind them.
Aesthetic Score : 0.6
Mood : inspirational, hopeful, dramatic
Quality
Entropy : 6.37
Noise : 57
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are no noticeable errors in the image.
Solitude by the Sea: A Moment of Contemplation
A lone figure walks along a sandy beach, their silhouette fading into the hazy horizon. The vast ocean stretches behind, creating a sense of peace and serenity. The image evokes a mood of contemplation and solitude, leaving the viewer to ponder the figure’s journey and the mysteries of the sea.
Prompt
dramatic-styles Rule of Thirds: Tranquil, contemplative ; A lone figure walking along a deserted beach, with the ocean stretching out before them; Wide shot; Travel; A vast, empty beach with crashing waves; cinematic
Characteristic
Shot : A lone figure walks along a sandy beach towards the horizon, the ocean stretches out before them.
Aesthetic Score : 0.7
Mood : solitude, contemplative, peaceful
Quality
Entropy : 5.15
Noise : 30
Prompt Clip Score : 0.20
AI Evaluation
Likelihood of AI : 0.20
Image errors : None
Lost in the Shadows of a European City
A solitary figure walks down a narrow, overcast street, shrouded in mystery and melancholy. The towering buildings create a sense of enclosure, drawing the viewer’s eye to the man’s silhouette against the bright sky. This urban scene evokes a mood of intrigue and contemplation, leaving the viewer wondering about the man’s journey and the secrets hidden within the city’s depths.
Prompt
dramatic-styles Rule of Thirds: Awe-inspiring, nostalgic ; A traveler standing on a cobblestone street, looking up at a towering cathedral; Medium shot; Tourism; A charming, historic European city; cinematic
Characteristic
Shot : A lone figure walks down a narrow, cobblestone street lined with tall buildings, silhouetted against the hazy cityscape.
Aesthetic Score : 0.6
Mood : mysterious, urban, solitary
Quality
Entropy : 6.90
Noise : 92
Prompt Clip Score : 0.20
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Silhouetted Against the Sunset, a Moment of Contemplation
A solitary figure stands on a cliff edge, their back turned to the camera, gazing out at a fiery sunset over a mountain range. The silhouette against the vibrant sky evokes a sense of loneliness and contemplation, capturing a dramatic yet peaceful moment.
Prompt
dramatic-styles Rule of Thirds: Epic, hopeful ; A lone adventurer standing on a cliff edge, silhouetted against a breathtaking sunset; Wide shot; Adventure; Majestic mountain range; cinematic
Characteristic
Shot : A lone figure stands on the edge of a cliff, overlooking a vast mountain range, as the sun sets in the distance.
Aesthetic Score : 0.7
Mood : serene, contemplative, majestic
Quality
Entropy : 6.59
Noise : 45
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major errors, but the sky appears a bit oversaturated and the figure’s silhouette is slightly blurry.
Golden Hour Serenity: A Rooftop View of Tranquility
Three figures stand silhouetted against a breathtaking sunset, bathed in the warm glow of the city skyline. The scene evokes a sense of calm contemplation and peaceful serenity, capturing the beauty of a moment in time.
Prompt
dramatic-styles Rule of Thirds: Joyful, connected ; A family gazing out at a stunning cityscape from a rooftop terrace; Medium shot; Family; A vibrant, bustling city skyline; cinematic
Characteristic
Shot : A family of three standing on a rooftop, looking out at a cityscape, with the sun setting in the background.
Aesthetic Score : 0.6
Mood : tranquil, hopeful, romantic
Quality
Entropy : 6.60
Noise : 64
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly overexposed, resulting in some loss of detail in the highlights.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.35, which is below the “good” range of 0.5 to 0.75. This suggests that the model didn’t fully capture the intended camera position in the prompt.
- Shot Analysis: The model scored 0.59, which falls within the “good” range. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.37, which is far from the “very good” range of -0.2 to 0.1. This suggests that the generated image’s aesthetic deviated significantly from the expected aesthetic described in the prompt.
Overall, the model shows promise in understanding the scene and camera position, but needs improvement in generating images that match the desired aesthetic.
Sources:
- https://www.swiff.org/article/crafting-the-tone-and-style-of-a-film
- https://digital-photography-school.com/backlighting-in-photography/
- https://www.studiobinder.com/blog/what-is-chiaroscuro-definition-examples/
- https://infocusfilmschool.com/4-wildly-different-movie-styles-youll-explore-filmmaking-college/
- https://cinepunked.com/2022/09/23/a-quick-guide-to-visual-style/
- https://cinematography.com/index.php?/forums/topic/184-desaturation-techniques/
- https://www.reddit.com/r/Filmmakers/comments/1452afb/colour_grading_an_underrated_factor_in_the/
- https://digital-photography-school.com/rule-of-thirds/
- https://fal.ai/models/fal-ai/flux/dev/api