AI's Struggle with Dramatic Symmetry: A Visual Analysis with Flux-dev
- 9 minutes read - 1865 wordsTable of Contents
The ‘dramatic-styles’ aesthetic often employs symmetrical compositions to create a sense of balance, grandeur, and visual impact. This style is commonly found in various media, including film, photography, and video games, to enhance the narrative and evoke specific emotions. In this blog post, we explore the capabilities of a generative AI model in capturing this dramatic style, specifically focusing on its ability to generate images with symmetrical compositions.
Created with: flux-dev
Sun-Dappled Path Through a Mystical Forest
A solitary figure walks along a path bathed in sunlight, the trees casting long shadows and creating a misty, ethereal atmosphere. The scene evokes a sense of tranquility and mystery, inviting you to explore the secrets hidden within the forest.
Prompt
dramatic-styles Symmetry and Patterns: Tranquil, contemplative ; A lone traveler walking along a winding road through a symmetrical forest; medium shot; Travel; Trees forming a natural archway with sunlight filtering through the leaves; cinematic
Characteristic
Shot : A single person walks down a path through a forest with trees forming a tunnel overhead and a bright light at the end of the path. The sun shines through the leaves and creates a soft, ethereal glow.
Aesthetic Score : 0.8
Mood : peaceful, serene, hopeful
Quality
Entropy : 6.61
Noise : 113
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.30
Image errors : No noticeable artifacts or errors are present in the image.
Lost in the Glow: A Solitary Journey Through a Futuristic Labyrinth
A lone figure traverses a mesmerizing futuristic corridor, bathed in the ethereal glow of luminous pillars and a celestial orb. The scene evokes a sense of mystery and isolation, hinting at the weight of the character’s journey through this enigmatic world.
Prompt
dramatic-styles Symmetry and Patterns: Surreal, fantastical ; A player’s avatar standing in a symmetrical, virtual world, surrounded by glowing orbs; medium shot; Gaming; Vibrant colors and geometric patterns in the background; cinematic
Characteristic
Shot : A lone figure walks down a futuristic corridor lined with red pillars. The corridor is illuminated by glowing orbs on the floor, and there is a large, glowing orb in the distance.
Aesthetic Score : 0.8
Mood : futuristic, mysterious, otherworldly
Quality
Entropy : 6.60
Noise : 108
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.90
Image errors : There are some minor artifacts visible in the image, particularly around the edges of the glowing orbs.
Silhouettes of Hope: A Family’s Sunset Moment
A tranquil and hopeful scene unfolds as a family of four stands silhouetted against a vibrant sunset over a cityscape. The dramatic effect of their figures against the fiery sky evokes a sense of romance and optimism, capturing a precious moment of togetherness.
Prompt
dramatic-styles Symmetry and Patterns: Joyful, nostalgic ; A family standing on a balcony overlooking a symmetrical cityscape; medium shot; Travel; Golden hour light casting long shadows and creating geometric patterns; cinematic
Characteristic
Shot : A family of four silhouettes standing on a balcony overlooking a city skyline at sunset. The cityscape is hazy and the sun is setting behind the family, creating a warm glow.
Aesthetic Score : 0.7
Mood : peaceful, serene, romantic
Quality
Entropy : 6.56
Noise : 67
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some slight blurring around the edges of the silhouettes, likely due to lens flare from the setting sun
A Serene Journey Through Time
Sunlight bathes a group of figures as they walk through an ancient structure, its tall columns casting long shadows. The light, streaming through the opening at the end, creates a hazy, ethereal atmosphere, inviting contemplation and wonder.
Prompt
dramatic-styles Symmetry and Patterns: Mysterious, awe-inspiring ; A group of explorers entering a symmetrical, ancient temple; wide shot; Adventure; Intricate carvings and patterns on the walls and ceiling; cinematic
Characteristic
Shot : A group of people walking through an ancient stone corridor with a bright light at the end.
Aesthetic Score : 0.6
Mood : mysterious, contemplative, dramatic
Quality
Entropy : 6.75
Noise : 99
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry. Some of the details in the stonework are not as sharp as they could be.
Whispers in the Mist: Two Figures Walk Towards the Unknown
A sense of mystery hangs heavy in the air as two figures navigate a misty, stone corridor. The light at the end casts long shadows, hinting at secrets waiting to be revealed. The atmosphere is both eerie and captivating, leaving you wondering what lies ahead.
Prompt
dramatic-styles Symmetry and Patterns: Mysterious, suspenseful ; A group of adventurers navigating a symmetrical maze of ancient ruins; medium shot; Adventure; Intricate stone carvings and patterns on the walls; cinematic
Characteristic
Shot : Two figures walk down a long, stone corridor with a misty, atmospheric light. The corridor is narrow and the walls are tall and imposing.
Aesthetic Score : 0.6
Mood : mysterious, eerie, isolated
Quality
Entropy : 6.73
Noise : 104
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors
Solitude Amidst the Clouds
A lone figure stands on a rocky peak, bathed in soft light, overlooking a vast, misty mountain range. The scene evokes a sense of tranquility and contemplation, with the isolation of the figure adding a touch of drama to the ethereal beauty.
Prompt
dramatic-styles Symmetry and Patterns: Epic, inspiring ; A lone hero standing on a mountain peak; wide shot; Heroism; Dramatic cloudscape with a symmetrical mountain range in the background; cinematic
Characteristic
Shot : A lone figure stands on a mountain peak, overlooking a vast, misty landscape. The sky is a soft, pale blue, and the clouds are swirling around the mountain.
Aesthetic Score : 0.7
Mood : mysterious, serene, contemplative
Quality
Entropy : 6.31
Noise : 80
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.60
Image errors : Slight color banding in the sky, some noise in the shadows, the mountain textures seem a bit repetitive and too perfect
Lost in the Neon Maze: A Gamer’s Dream
A futuristic cityscape bursts with vibrant neon light, blurring into a mesmerizing backdrop for a gamer holding a controller. The scene captures the thrill and immersion of a cyberpunk world, where reality and virtual reality collide.
Prompt
dramatic-styles Symmetry and Patterns: Futuristic, vibrant ; A player’s hands manipulating a controller, with the screen displaying a symmetrical, pixelated cityscape; close-up; Gaming; Neon lights and geometric patterns reflecting on the screen; cinematic
Characteristic
Shot : A person’s hands holding a video game controller with a blurry background of a city street in a video game.
Aesthetic Score : 0.7
Mood : nostalgic, gaming, futuristic
Quality
Entropy : 6.58
Noise : 65
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant image errors.
Golden Hour Skyline: A Majestic Panorama
Capture the serene beauty of a city skyline bathed in the warm glow of sunset. The dramatic contrast of light and shadow creates a contemplative mood, perfect for a moment of reflection.
Prompt
dramatic-styles Symmetry and Patterns: Awe-inspiring, majestic ; A panoramic view of a city skyline with symmetrical skyscrapers; wide shot; Tourism; Golden hour light casting long shadows and creating geometric patterns; cinematic
Characteristic
Shot : A city skyline at sunset, with the sun setting in the distance behind the tall buildings, casting a golden glow over the scene.
Aesthetic Score : 0.8
Mood : serene, peaceful, majestic
Quality
Entropy : 6.76
Noise : 99
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some minor noise and banding in the sky, particularly around the sun.
One Figure Stands Alone in a Field of Blood
A lone figure, cloaked in mystery, walks through a field of fallen figures, their blood staining the ground. The misty backdrop adds to the eerie atmosphere, creating a sense of isolation and power. This dark and grim scene evokes a sense of unease and leaves the viewer questioning the figure’s purpose.
Prompt
dramatic-styles Symmetry and Patterns: Grim, powerful ; A lone warrior standing in the center of a symmetrical battlefield, surrounded by fallen enemies; medium shot; Heroism; Bloodstains forming a pattern on the ground; cinematic
Characteristic
Shot : A lone figure in a black coat walks through a field of dead bodies, lit by a hazy sun. Crosses stand along the path, marking the end of the field.
Aesthetic Score : 0.7
Mood : eerie, desolate, dramatic
Quality
Entropy : 6.72
Noise : 85
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.80
Image errors : Some of the dead bodies look slightly pixelated, and the blood on the ground is a little too bright.
Hands United: A Symbol of Unity and Connection
This close-up image captures the essence of togetherness, showcasing several hands placed together on a warm wooden surface. The image evokes a sense of unity and connection, highlighting the power of human interaction.
Prompt
dramatic-styles Symmetry and Patterns: Intimate, heartwarming ; A family gathered around a table, their hands forming a symmetrical pattern as they share a meal; close-up; Family; Warm, inviting lighting and a rustic wooden table; cinematic
Characteristic
Shot : Close-up shot of multiple hands clasped together on a wooden surface. The hands are arranged in a circular pattern.
Aesthetic Score : 0.7
Mood : warm, intimate, connected
Quality
Entropy : 6.58
Noise : 72
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : No obvious errors, but the lighting is a little flat.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.36, which is below the “good” range of 0.5 to 0.75. This indicates that the model didn’t fully capture the intended camera positions described in the prompts.
- Shot Analysis: The model scored 0.63, falling within the “good” range. This suggests that the model was able to understand the scene and create shots that were generally aligned with the prompt.
- Aesthetic Analysis: The model scored 0.28, which is significantly higher than the “very good” range of -0.2 to 0.1. This indicates a significant difference between the expected aesthetic and the actual aesthetic of the generated images. The model likely struggled to create images that matched the desired aesthetic style.
Overall, the model shows promise in understanding scene composition and camera positions, but needs improvement in generating images that meet the desired aesthetic criteria.
Sources:
- https://www.swiff.org/article/crafting-the-tone-and-style-of-a-film
- https://digital-photography-school.com/backlighting-in-photography/
- https://www.studiobinder.com/blog/what-is-chiaroscuro-definition-examples/
- https://infocusfilmschool.com/4-wildly-different-movie-styles-youll-explore-filmmaking-college/
- https://cinepunked.com/2022/09/23/a-quick-guide-to-visual-style/
- https://cinematography.com/index.php?/forums/topic/184-desaturation-techniques/
- https://www.reddit.com/r/Filmmakers/comments/1452afb/colour_grading_an_underrated_factor_in_the/
- https://digital-photography-school.com/rule-of-thirds/
- https://fal.ai/models/fal-ai/flux/dev/api