AI's Dramatic Vision: A Mixed Bag of Shots and Aesthetics with Flux-schnell
The ‘dramatic style’ is a powerful tool in visual storytelling, evoking emotion and immersing viewers in a scene. This style often involves dramatic lighting, dynamic camera angles, and a focus on capturing the essence of a moment. In this blog post, we explore the capabilities of AI in generating images that embody this dramatic style. We analyze the results of an experiment where AI was tasked with creating images based on specific scene descriptions, focusing on its ability to capture camera position, shot analysis, and aesthetic elements.
Created with: flux-schnell
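Every prompt in this post follows the same semicolon-separated pattern: a style tag, a composition rule, a mood, a subject, a shot type, a theme, a setting, and a closing "cinematic" keyword. As a minimal sketch, the pattern could be captured in a small template like the one below. The class name `ScenePrompt` and the field names are assumptions made for illustration; the post does not describe how its prompts were actually assembled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScenePrompt:
    """Hypothetical template for the prompt pattern used in this post."""

    mood: str      # e.g. "Epic, hopeful"
    subject: str   # what the image should show
    shot: str      # e.g. "Wide shot", "Medium shot", "Close-up"
    theme: str     # e.g. "Adventure", "Gaming", "Family"
    setting: str   # where the scene takes place
    composition: str = "Rule of Thirds"
    style_tag: str = "dramatic-styles"
    finish: str = "cinematic"

    def render(self) -> str:
        # Reproduces the separators seen in the prompts, including the
        # space before the first semicolon.
        return (
            f"{self.style_tag} {self.composition}: {self.mood} ; "
            f"{self.subject}; {self.shot}; {self.theme}; "
            f"{self.setting}; {self.finish}"
        )


if __name__ == "__main__":
    prompt = ScenePrompt(
        mood="Epic, hopeful",
        subject=(
            "A lone adventurer standing on a cliff edge, "
            "silhouetted against a breathtaking sunset"
        ),
        shot="Wide shot",
        theme="Adventure",
        setting="Majestic mountain range",
    )
    print(prompt.render())
```

Keeping the shot type as its own field makes it easy to compare what was requested (for example "Medium shot") with what the generated image actually shows.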
Silhouetted Against the Sunset: A Moment of Solitude on the Mountaintop
A lone hiker stands on a cliff, bathed in the warm glow of the setting sun. The silhouette against the fiery sky evokes a sense of peace and contemplation, highlighting the vastness of the mountain range and the smallness of humanity in its presence.
Prompt
dramatic-styles Rule of Thirds: Epic, hopeful ; A lone adventurer standing on a cliff edge, silhouetted against a breathtaking sunset; Wide shot; Adventure; Majestic mountain range; cinematic
Characteristic
Shot : A lone hiker stands on a cliff, silhouetted against a bright orange sunset over a mountain range.
Aesthetic Score : 0.8
Mood : tranquil, inspirational, serene
Quality
Entropy : 6.48
Noise : 38
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible errors
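The post does not say how the Entropy and Prompt Clip Score fields are computed. One common reading, assumed here, is that Entropy is the Shannon entropy (in bits) of the 8-bit grayscale pixel histogram, which ranges from 0 to 8, and that Prompt Clip Score is the cosine similarity between the CLIP embeddings of the prompt and the image. The sketch below shows only the arithmetic under those assumptions; loading pixels from an image and producing the embeddings with a CLIP model are left out, and the function names are hypothetical.

```python
import math
from collections import Counter
from typing import Sequence


def shannon_entropy(pixels: Sequence[int]) -> float:
    """Shannon entropy in bits of a sequence of pixel intensities."""
    if not pixels:
        raise ValueError("pixels must not be empty")
    total = len(pixels)
    counts = Counter(pixels)
    # Sum of -p * log2(p) over every intensity that actually occurs.
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two embedding vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("vectors must be non-zero")
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # A flat image carries no information; a uniform spread over all
    # 256 intensities reaches the 8-bit maximum.
    print(shannon_entropy([128] * 100))
    print(shannon_entropy(list(range(256))))
    # Toy vectors standing in for real text and image embeddings.
    print(cosine_similarity([1.0, 1.0], [1.0, 0.0]))
```

Under this reading, an entropy around 6.5 points to an image with a wide tonal range, while values near 5 (as in the darker scenes later in the post) point to histograms concentrated in fewer intensities.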
One Against Many: A Lone Warrior Faces the Shadows
A dramatic scene unfolds on a foggy battlefield. A lone warrior, silhouetted against the darkness, stands defiant, sword raised high, facing an army of shadowy figures. The lighting creates an atmosphere of mystery and tension, emphasizing the warrior’s courage and the epic battle ahead.
Prompt
dramatic-styles Rule of Thirds: Intense, suspenseful ; A determined hero raising their sword, ready to face a horde of enemies; Medium shot; Heroism; A dark, foreboding battlefield; cinematic
Characteristic
Shot : A lone warrior stands in the middle of a battlefield, holding a sword aloft, with a blur of figures surrounding him. The scene is set in a dark and cloudy environment, giving an impression of a battle taking place in the twilight hours.
Aesthetic Score : 0.7
Mood : dramatic, intense, heroic
Quality
Entropy : 6.56
Noise : 70
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no notable artifacts or errors in the image.
Lost in the Shadows: A Gamer’s Intense Focus
A dimly lit room, a flickering monitor, and a player completely absorbed in the digital world. This image captures the intensity and focus of gaming, with a dark and mysterious atmosphere adding to the intrigue.
Prompt
dramatic-styles Rule of Thirds: Focused, immersive ; A gamer’s hands intensely navigating a virtual world on a glowing screen; Close-up; Gaming; A dimly lit room with gaming peripherals; cinematic
Characteristic
Shot : A person is playing a video game in a dark room. The screen is showing a game with green hills. The person’s hands are holding a controller.
Aesthetic Score : 0.3
Mood : focused, intense, dark
Quality
Entropy : 5.36
Noise : 32
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry and the lighting is uneven.
Family Finds Peace Amidst the Cityscape
A family of four enjoys a contemplative moment on an observation deck, silhouetted against a cloudy cityscape. The scene evokes a sense of peace and happiness, capturing the beauty of the city from a unique perspective.
Prompt
dramatic-styles Rule of Thirds: Joyful, connected ; A family gazing out at a stunning cityscape from a rooftop terrace; Medium shot; Family; A vibrant, bustling city skyline; cinematic
Characteristic
Shot : A family of four is standing on a rooftop overlooking a city skyline. The parents are in the foreground with their arms around each other, looking at the skyline. The two children are standing behind them, also looking at the skyline.
Aesthetic Score : 0.6
Mood : happy, content, hopeful
Quality
Entropy : 6.75
Noise : 85
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant errors in the image.
Tranquil Urban Journey Towards a Majestic Cathedral
A solitary figure walks down a historic street, the towering cathedral in the distance creating a sense of scale and mystery. The scene evokes a tranquil urban atmosphere, inviting viewers to explore the hidden stories within the city’s walls.
Prompt
dramatic-styles Rule of Thirds: Awe-inspiring, nostalgic ; A traveler standing on a cobblestone street, looking up at a towering cathedral; Medium shot; Tourism; A charming, historic European city; cinematic
Characteristic
Shot : A man walking towards a large stone cathedral, with old European-style buildings on either side. The man has a backpack on, and he’s looking towards the cathedral.
Aesthetic Score : 0.6
Mood : tranquil, urban, historic
Quality
Entropy : 6.83
Noise : 106
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major artifacts or errors, but the image could benefit from a bit more sharpness and contrast.
Lost in the Vastness: A Figure Contemplates the Sea
A solitary figure walks along a sandy beach, dwarfed by the immensity of the ocean. The cloudy sky above reflects the mood of contemplation and melancholy, highlighting the figure’s sense of isolation and insignificance.
Prompt
dramatic-styles Rule of Thirds: Tranquil, contemplative ; A lone figure walking along a deserted beach, with the ocean stretching out before them; Wide shot; Travel; A vast, empty beach with crashing waves; cinematic
Characteristic
Shot : A solitary figure walks along a sandy beach towards the ocean, under a cloudy sky. The sky is a muted grey, and the waves are calm and gentle.
Aesthetic Score : 0.6
Mood : serene, lonely, contemplative
Quality
Entropy : 6.05
Noise : 71
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.30
Image errors : No visible artifacts or errors.
A Flickering Flame, A Brooding Gaze
A man with long hair and a beard, clad in dark, ornate garments, stares intently at a flickering torch. The play of light and shadow creates a mysterious and intense atmosphere, highlighting his features and the dancing flames.
Prompt
dramatic-styles Rule of Thirds: Dramatic, suspenseful ; A hero’s face, illuminated by a flickering torch, as they make a crucial decision; Close-up; Heroism; A dark, mysterious cave; cinematic
Characteristic
Shot : A man with a beard and wearing ornate clothing is looking intently at a torch. The image is dark, but the man’s face is illuminated by the light of the torch.
Aesthetic Score : 0.6
Mood : mysterious, dramatic, intense
Quality
Entropy : 5.48
Noise : 63
Prompt Clip Score : 0.18
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some noise and grain, likely due to low-light conditions.
Campfire Cozy: Friends Gather Around the Flames
A group of friends share laughter and warmth around a crackling campfire in the woods. The inviting glow of the fire draws you into the scene, creating a cozy and friendly atmosphere.
Prompt
dramatic-styles Rule of Thirds: Warm, intimate ; A group of friends huddled around a campfire, sharing stories and laughter; Medium shot; Adventure; A dense forest with stars twinkling above; cinematic
Characteristic
Shot : A group of friends are gathered around a campfire in a forest at night. The fire is bright and warm, and the friends are laughing and talking. The forest is dark and mysterious, and the stars are twinkling above.
Aesthetic Score : 0.7
Mood : cozy, friendly, adventurous
Quality
Entropy : 5.09
Noise : 81
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some noise and grain, and the lighting is uneven. The people in the image are also a little blurry, but this could be a stylistic choice.
A Silhouette of Triumph: Witnessing Hope Against the Setting Sun
A lone figure, silhouetted against a breathtaking sunset, stands atop a mountain, sword in hand. The scene evokes a sense of epic triumph and hopeful anticipation, as the figure gazes towards the horizon, their silhouette amplified against the dramatic sky.
Prompt
dramatic-styles Rule of Thirds: Triumphant, exhilarating ; A gamer’s avatar, a powerful warrior, standing triumphantly on a virtual mountain peak; Wide shot; Gaming; A vibrant, fantastical game world; cinematic
Characteristic
Shot : A lone figure stands atop a mountain peak, looking out at a vast landscape of rolling hills and a dramatic sunset.
Aesthetic Score : 0.7
Mood : epic, inspiring, dramatic
Quality
Entropy : 6.63
Noise : 84
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.80
Image errors : Slight blurriness in the background, likely due to the sunset’s glare.
Sunset Serenity: A Family’s Walk Through Mountain Majesty
A heartwarming scene unfolds as a family strolls along a path amidst breathtaking mountain scenery. The golden hues of the setting sun bathe the landscape in a peaceful glow, creating a sense of tranquility and nostalgia. Silhouetted against the distant peaks, the family’s figures evoke a sense of wonder and the vastness of nature’s beauty.
Prompt
dramatic-styles Rule of Thirds: Peaceful, heartwarming ; A family holding hands, walking along a scenic path, with a breathtaking view behind them; Medium shot; Family; A lush, green valley with rolling hills; cinematic
Characteristic
Shot : A family of three is walking on a dirt path through a grassy field with rolling hills in the background. The sun is setting, casting a warm glow on the scene.
Aesthetic Score : 0.7
Mood : serene, hopeful, nostalgic
Quality
Entropy : 6.81
Noise : 95
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors
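To compare the ten images at a glance, the per-image figures listed above can be collected and averaged. The sketch below does this with the standard library only. The short labels are shorthand for the section titles, and the tuple layout is an assumption made for illustration; the numbers themselves are copied from the listings above. These averages are simple summaries and do not appear to be the same measures as the scores reported in the Conclusion.

```python
from statistics import mean

# (label, aesthetic, entropy, noise, clip_score, ai_likelihood)
# Labels are shorthand for the section titles in this post.
IMAGES = [
    ("mountaintop silhouette", 0.8, 6.48, 38, 0.24, 0.10),
    ("lone warrior", 0.7, 6.56, 70, 0.21, 0.20),
    ("gamer", 0.3, 5.36, 32, 0.23, 0.10),
    ("family cityscape", 0.6, 6.75, 85, 0.25, 0.20),
    ("cathedral", 0.6, 6.83, 106, 0.23, 0.20),
    ("beach", 0.6, 6.05, 71, 0.23, 0.30),
    ("torch", 0.6, 5.48, 63, 0.18, 0.10),
    ("campfire", 0.7, 5.09, 81, 0.22, 0.20),
    ("sunset triumph", 0.7, 6.63, 84, 0.23, 0.80),
    ("family walk", 0.7, 6.81, 95, 0.21, 0.20),
]

FIELDS = ("aesthetic", "entropy", "noise", "clip_score", "ai_likelihood")


def summarize(rows):
    """Return the mean of each metric column, keyed by field name."""
    columns = list(zip(*rows))[1:]  # drop the label column
    return {name: mean(col) for name, col in zip(FIELDS, columns)}


def extreme(rows, field, pick=max):
    """Return the label of the row with the largest (or smallest) value."""
    index = FIELDS.index(field) + 1  # +1 skips the label
    return pick(rows, key=lambda row: row[index])[0]


if __name__ == "__main__":
    for name, value in summarize(IMAGES).items():
        print(f"{name}: {value:.3f}")
    print("lowest aesthetic:", extreme(IMAGES, "aesthetic", pick=min))
    print("highest AI likelihood:", extreme(IMAGES, "ai_likelihood"))
```

Run on the figures above, the mean Aesthetic Score comes out at about 0.63 and the mean Prompt Clip Score at about 0.22. The gamer close-up has the lowest Aesthetic Score (0.3) and the sunset warrior is the only image with a high Likelihood of AI (0.80).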
Conclusion
The results show that the generative AI model understood the scenes reasonably well, but struggled with camera position and aesthetics. Here’s a breakdown:
- Camera Position: The model scored 0.4, which is considered below average. This suggests that the model often did not capture the camera position described in the prompt.
- Shot Analysis: The model scored 0.585, which is considered good. This indicates that the model was generally able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.39, which is considered below average. This means that the aesthetic of the generated images deviated significantly from the aesthetic described in the prompt.
Overall, the model demonstrated a good understanding of the scene and shot composition, but struggled to accurately capture the intended camera position and aesthetic.
Sources:
- https://www.swiff.org/article/crafting-the-tone-and-style-of-a-film
- https://digital-photography-school.com/backlighting-in-photography/
- https://www.studiobinder.com/blog/what-is-chiaroscuro-definition-examples/
- https://infocusfilmschool.com/4-wildly-different-movie-styles-youll-explore-filmmaking-college/
- https://cinepunked.com/2022/09/23/a-quick-guide-to-visual-style/
- https://cinematography.com/index.php?/forums/topic/184-desaturation-techniques/
- https://www.reddit.com/r/Filmmakers/comments/1452afb/colour_grading_an_underrated_factor_in_the/
- https://digital-photography-school.com/rule-of-thirds/
- https://fal.ai/models/fal-ai/flux/schnell/api