AI's Artistic Struggle: Capturing the Essence of 'Dramatic' Aesthetics with Flux-pro
- 9 minutes read - 1884 wordsTable of Contents
The ‘dramatic’ aesthetic is a powerful tool in visual storytelling, evoking emotions and creating a sense of grandeur. It often involves striking contrasts, dramatic lighting, and carefully composed shots that draw the viewer’s attention to specific elements. This style is commonly used in film, photography, and even video games to enhance the narrative and create a memorable experience. However, can AI effectively translate these artistic intentions into visual outputs? This blog post explores the challenges and successes of using AI to generate images with a ‘dramatic’ aesthetic, analyzing the results of a prompt experiment and highlighting the areas where AI excels and where it still needs improvement.
Created with: flux-pro
Silhouetted Mystery in a City of Fire
A lone figure, shrouded in shadow, stands against a breathtaking sunset, their presence a stark contrast to the towering futuristic cityscape behind them. The scene evokes a sense of eerie mystery and desolation, leaving the viewer to ponder the secrets hidden within the fading light.
Prompt
Hyper-realistic: Epic, hopeful ; A lone figure, silhouetted against the setting sun; wide shot; Heroism; A vast, desolate landscape with a lone, crumbling tower in the distance; cinematic
Characteristic
Shot : A lone figure stands in the foreground, gazing towards a futuristic city silhouetted against a fiery sunset.
Aesthetic Score : 0.7
Mood : solitude, mystery, futuristic
Quality
Entropy : 6.43
Noise : 65
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.90
Image errors : There are some minor artifacts and blurring in the background, particularly around the city silhouette.
A Gaze Through the Foliage: A Portrait of Mystery
This close-up portrait captures the intensity of an older man’s gaze, partially obscured by foliage. The framing creates a sense of intimacy and intrigue, leaving the viewer wondering about the story behind his thoughtful expression.
Prompt
Hyper-realistic: Intrigued, adventurous ; A weathered explorer, eyes wide with wonder, peering into a dense jungle; close-up; Adventure; Lush, vibrant foliage, sunlight filtering through the canopy; cinematic
Characteristic
Shot : Close-up portrait of a weathered man with a beard, wearing a hat, peeking through foliage.
Aesthetic Score : 0.6
Mood : mysterious, rugged, intense
Quality
Entropy : 6.80
Noise : 84
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry. The leaves in the foreground have some artifacts and appear somewhat out of focus.
In the Zone: A Gamer’s Hand Grips the Controller
A close-up shot captures the intensity of a gaming session, with a hand gripping a controller against a backdrop of vibrant, blurred computer monitors and neon lights. The composition emphasizes the focus and dedication of the player, creating a sense of immersion in the virtual world.
Prompt
Hyper-realistic: Focused, intense ; A gamer’s hands, deftly manipulating a controller, fingers flying across buttons; close-up; Gaming; A brightly lit gaming setup with a high-definition monitor displaying a vibrant, immersive game world; cinematic
Characteristic
Shot : A person’s hand is holding a game controller in front of a computer screen with a blurred background, there are colorful lights in the background and they cast a glow on the person’s hand.
Aesthetic Score : 0.6
Mood : focused, intense, dark
Quality
Entropy : 6.86
Noise : 64
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : There is a slight blurriness to the image, especially the background.
Immerse Yourself in the Vibrant Energy of a Chinese Street Market
Experience the bustling atmosphere of a historic Chinese city through the lens of a street market. Witness the vibrant colors, the lively energy, and the exotic charm of this captivating scene.
Prompt
Hyper-realistic: Energetic, vibrant ; A bustling marketplace in a foreign city, filled with vibrant colors and exotic goods; wide shot; Tourism; A bustling, vibrant city street with traditional architecture and people from all walks of life; cinematic
Characteristic
Shot : A bustling street market in a Chinese city, with vendors selling their wares and people walking by. The buildings are old and traditional, with red roofs and lanterns hanging from the eaves. The street is narrow and lined with stalls and shops, and the air is filled with the sounds of chatter and haggling.
Aesthetic Score : 0.7
Mood : lively, vibrant, bustling
Quality
Entropy : 6.79
Noise : 111
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : Minor artifacts in some areas, particularly in the shadows
A Moment of Solitude on the Mountaintop
A lone hiker stands on a breathtaking mountain ridge, dwarfed by the majestic snow-capped peaks. The serene landscape evokes a sense of adventure and contemplation, highlighting the beauty and solitude of nature.
Prompt
Hyper-realistic: Tranquil, awe-inspiring ; A lone traveler, gazing out at a breathtaking mountain range, a sense of peace washing over them; medium shot; Travel; Majestic mountains, snow-capped peaks, and a clear blue sky; cinematic
Characteristic
Shot : A lone hiker stands on a rocky ridge, overlooking a vast, snow-capped mountain range in the distance. The sky is a clear, bright blue, and the air is crisp and clean.
Aesthetic Score : 0.8
Mood : serene, contemplative, adventurous
Quality
Entropy : 6.67
Noise : 85
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant artifacts or errors
Campfire Companionship Under a Starry Sky
A heartwarming scene of three friends gathered around a crackling campfire, bathed in the warm glow of the flames and the twinkling light of a million stars. Their smiles and relaxed postures speak volumes about the joy and intimacy they share in this cozy setting.
Prompt
Hyper-realistic: Warm, nostalgic ; A family gathered around a campfire, sharing stories and laughter; medium shot; Family; A cozy campsite under a starry night sky, with a crackling fire and the smell of roasting marshmallows; cinematic
Characteristic
Shot : Three people are gathered around a campfire at night, with a starry sky overhead.
Aesthetic Score : 0.7
Mood : cozy, nostalgic, relaxed
Quality
Entropy : 6.57
Noise : 77
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some minor artifacts, such as noise in the shadows. The fire is overly sharpened.
Heroic Silhouette Against the Cityscape
A lone figure, cloaked in red, stands atop a rooftop, gazing out at the sprawling city below. The hazy sky and distant cityscape create a dramatic backdrop, emphasizing the man’s sense of power and hope.
Prompt
Hyper-realistic: Powerful, inspiring ; A superhero, soaring through the air, cape billowing behind them; wide shot; Heroism; A sprawling cityscape with towering skyscrapers and bustling streets below; cinematic
Characteristic
Shot : A superhero in a red cape stands on a tall building overlooking a city skyline. The city is mostly obscured by fog and haze, creating a sense of mystery and grandeur. The sun is shining brightly in the distance, casting a warm glow on the scene.
Aesthetic Score : 0.7
Mood : dramatic, hopeful, epic
Quality
Entropy : 6.60
Noise : 82
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some slight artifacts around the edges of the cape and the superhero’s figure. These artifacts are likely due to the image being digitally altered or compressed.
Conquering the Summit: Climbers Brave the Snowy Peaks
A breathtaking scene of climbers ascending a snow-covered mountain, their bright orange jackets a stark contrast against the vast, dramatic landscape. The image captures the thrill and danger of their adventure, emphasizing the scale of the mountain and the climbers’ small size.
Prompt
Hyper-realistic: Thrilling, dangerous ; A group of adventurers, navigating a treacherous mountain path, ropes and ice axes in hand; medium shot; Adventure; A rugged, snow-covered mountain range with steep cliffs and icy crevasses; cinematic
Characteristic
Shot : A group of hikers are ascending a snowy mountain peak. The image captures the climbers from a distance, with the majestic mountain range providing a dramatic backdrop.
Aesthetic Score : 0.8
Mood : adventurous, serene, inspiring
Quality
Entropy : 6.68
Noise : 110
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : No notable artifacts or errors
Lost in the Neon Glow: A Glimpse into a Futuristic World
A young woman, her face illuminated by vibrant neon lights, explores a world of wonder and possibility through a VR headset. The scene evokes a sense of futuristic mystery and hopeful anticipation, leaving viewers eager to discover what lies beyond the digital horizon.
Prompt
Hyper-realistic: Engrossed, surreal ; A player, immersed in a virtual reality game, their face contorted in concentration; close-up; Gaming; A futuristic, immersive virtual reality environment with vibrant colors and intricate details; cinematic
Characteristic
Shot : A young woman is wearing a VR headset and looking up, illuminated by blue and red neon lights.
Aesthetic Score : 0.6
Mood : futuristic, cyberpunk, awe
Quality
Entropy : 6.85
Noise : 69
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible errors in the image.
Silhouettes of Love: A Family’s Sunset Stroll
A heartwarming scene of a family walking on a beach at sunset, their silhouettes painted against the fiery sky. The image evokes a sense of peace, serenity, and the enduring bond of family.
Prompt
Hyper-realistic: Peaceful, heartwarming ; A family, standing on a beach, watching the sunset over the ocean; wide shot; Family; A serene beach with golden sand, turquoise water, and a fiery sunset; cinematic
Characteristic
Shot : A family of three silhouetted against a sunset on a beach
Aesthetic Score : 0.7
Mood : romantic, peaceful, tranquil
Quality
Entropy : 6.73
Noise : 87
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible artifacts or errors.
Conclusion
This analysis shows that the generative AI model performed well in terms of understanding camera positions and shot composition, but struggled with achieving the desired aesthetic. Here’s a breakdown:
Camera Position:
- Score: 0.4
- Interpretation: This score falls below the “good” range of 0.5 to 0.75. It suggests that the model didn’t perfectly capture the intended camera positions described in the prompt.
Shot Analysis:
- Score: 0.495
- Interpretation: This score is close to the “good” range, indicating that the model generally understood the shot composition described in the prompt. However, it wasn’t quite able to perfectly recreate the intended scene.
Aesthetic Analysis:
- Score: 0.3
- Interpretation: This score is significantly below the “very good” range of -0.2 to 0.1. It indicates that the generated image’s aesthetic deviated considerably from the desired aesthetic described in the prompt.
Overall:
The model demonstrates a decent understanding of camera positions and shot composition, but struggles to achieve the desired aesthetic. This suggests that the model might need further training to better understand and translate aesthetic preferences into visual outputs.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://fal.ai/models/fal-ai/flux-pro/api