AI Captures the Essence, But Misses the Shot: A Study in Aesthetic Style with Flux-schnell
- 9 minutes read - 1881 wordsTable of Contents
The ‘style-aesthetic’ is a powerful tool for artists and designers, allowing them to evoke specific emotions and create a desired visual impact. This style often involves dramatic lighting, bold colors, and striking compositions, aiming to capture the essence of a scene or emotion. Examples of this style can be found in cinematic photography, where the camera angle and lighting are carefully chosen to create a sense of drama and grandeur. In this blog post, we explore the results of an experiment using a generative AI model to create images based on specific ‘style-aesthetic’ prompts. The results reveal both the strengths and limitations of AI in capturing the nuances of this artistic style.
Created with: flux-schnell
Solitude and Majesty: A Figure Stands Tall Against the Setting Sun
A lone figure silhouetted against a breathtaking sunset, perched atop a mountain peak overlooking a sea of clouds. The scene evokes a sense of serenity, majesty, and hope, with the vastness of the landscape emphasizing the figure’s solitude and the dramatic effect of the golden light.
Prompt
style-aesthetic Naturalistic: Epic, triumphant ; A lone figure, silhouetted against the setting sun, standing atop a mountain peak; wide shot; Heroism; Majestic mountain range with clouds swirling around the peak; cinematic
Characteristic
Shot : A lone figure stands on a mountain peak, silhouetted against a bright sunrise over a vast, cloud-covered landscape.
Aesthetic Score : 0.8
Mood : serene, hopeful, contemplative
Quality
Entropy : 6.72
Noise : 61
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible errors or artifacts in the image.
Hidden in Plain Sight: A Man Peeking Through the Leaves
A mysterious figure, partially obscured by lush foliage, creates a sense of suspense and intrigue. The soft focus on the man’s face adds to the enigmatic atmosphere, leaving the viewer wondering what secrets lie behind the leaves.
Prompt
style-aesthetic Naturalistic: Intriguing, adventurous ; A weathered explorer, their face etched with determination, peering through dense jungle foliage; close-up; Adventure; Lush, vibrant rainforest with sunlight filtering through the canopy; cinematic
Characteristic
Shot : A man with a beard is hiding behind a large leafy plant, looking out with a serious expression.
Aesthetic Score : 0.6
Mood : mysterious, intense, contemplative
Quality
Entropy : 6.68
Noise : 85
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.30
Image errors : Slight blurring around the edges of the image, possibly due to compression or editing.
Lost in the Game: A Moment of Focused Play
A dimly lit room, a glowing monitor, and a player completely immersed in the digital world. This image captures the focused intensity and playful energy of a dedicated gamer, with the low lighting adding a sense of intimacy and drawing attention to the hands expertly navigating the game.
Prompt
style-aesthetic Naturalistic: Focused, intense ; A gamer’s hands, illuminated by the glow of a monitor, rapidly manipulating a controller; close-up; Gaming; A dimly lit room with gaming posters and peripherals scattered around; cinematic
Characteristic
Shot : A person is playing video games in a dimly lit room. The room is cluttered with electronic equipment, and the person is holding a controller in their hands. The person’s face is not visible, but their hands are illuminated by the light of the screen. The image has a dark and mysterious feel, with hints of blue and green lighting.
Aesthetic Score : 0.6
Mood : dark, mysterious, focused
Quality
Entropy : 6.35
Noise : 50
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some minor artifacts in the image, particularly in the shadows and highlights. The image also appears to be slightly blurry.
A Symphony of Colors and Life: A Bustling Asian Market
Immerse yourself in the vibrant energy of an Asian market, where colorful lanterns dance in the breeze and the air is thick with the aroma of fresh produce. The interplay of light and shadow adds a touch of mystery to this bustling scene, capturing the essence of daily life in a vibrant and crowded marketplace.
Prompt
style-aesthetic Naturalistic: Energetic, vibrant ; A bustling marketplace in a foreign city, filled with vibrant colors and exotic goods; wide shot; Tourism; A bustling street with traditional architecture and locals going about their day; cinematic
Characteristic
Shot : A bustling street market in Asia with colorful lanterns and vibrant produce on display
Aesthetic Score : 0.7
Mood : lively, vibrant, bustling
Quality
Entropy : 6.88
Noise : 118
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant image errors
Silhouetted Against the Setting Sun: A Moment of Solitude in the Desert
A lone traveler stands in the heart of a vast desert, the setting sun casting a warm glow over the sand dunes. His silhouette against the horizon evokes a sense of peace, contemplation, and adventure. The scene captures the beauty and solitude of the desert landscape, inviting viewers to imagine their own journeys.
Prompt
style-aesthetic Naturalistic: Solitude, contemplative ; A lone traveler, gazing out at a vast, open desert landscape; medium shot; Travel; A desolate desert with sand dunes stretching as far as the eye can see; cinematic
Characteristic
Shot : A man with a backpack is standing in a desert looking at the horizon.
Aesthetic Score : 0.6
Mood : solitude, contemplative, adventurous
Quality
Entropy : 6.69
Noise : 64
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : No artifacts or errors are present in the image.
Campfire Nights: Cozy Gatherings Under a Starry Sky
A group of friends huddle around a crackling campfire, bathed in the warm glow of the flames and the soft light of the moon. The scene exudes a sense of cozy intimacy and relaxation, perfect for a night of shared stories and laughter.
Prompt
style-aesthetic Naturalistic: Warm, nostalgic ; A family gathered around a campfire, sharing stories and laughter; medium shot; Family; A cozy campsite under a starry night sky with a crackling fire in the foreground; cinematic
Characteristic
Shot : A group of friends gathered around a campfire under a starry night sky, sharing stories and enjoying each other’s company.
Aesthetic Score : 0.7
Mood : cozy, heartwarming, nostalgic
Quality
Entropy : 6.15
Noise : 84
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some slight artifacts in the background, particularly around the trees. The fire itself appears slightly blurry and lacks detail.
A Hiker’s Solitude Amidst Majestic Peaks
A lone figure traverses a narrow path on a cliff edge, dwarfed by the grandeur of the surrounding mountains. The scene evokes a sense of serenity, adventure, and isolation, highlighting the potential danger of the hiker’s journey.
Prompt
style-aesthetic Naturalistic: Challenging, determined ; A lone hiker, navigating a treacherous mountain path; medium shot; Heroism; A rugged mountain trail with steep cliffs and breathtaking views; cinematic
Characteristic
Shot : A lone hiker walks on a narrow path along the edge of a steep mountain cliff with a stunning valley vista below.
Aesthetic Score : 0.75
Mood : epic, adventurous, serene
Quality
Entropy : 6.65
Noise : 109
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant errors detected.
Lost in the Metaverse: VR Takes Over
A close-up shot captures the excitement and immersion of a group of people experiencing a virtual reality adventure. Their playful expressions and futuristic headsets paint a vivid picture of the future of entertainment.
Prompt
style-aesthetic Naturalistic: Excited, immersive ; A group of friends, their faces lit by the screen of a VR headset, immersed in a virtual world; close-up; Gaming; A dimly lit room with VR headsets and controllers scattered around; cinematic
Characteristic
Shot : A group of people wearing VR headsets are gathered in a dimly lit room, some are smiling and appear to be engaged in the virtual experience
Aesthetic Score : 0.6
Mood : excited, playful, futuristic
Quality
Entropy : 6.12
Noise : 56
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image appears to have some noise and compression artifacts, particularly in the background. Some of the colors are a bit unnatural.
Cityscape Serenity: A Majestic Aerial View
Experience the grandeur of a bustling city from above. This aerial shot captures the towering skyscrapers reaching for the clear blue sky, punctuated by fluffy white clouds. The scene evokes a sense of urban peace and majesty, offering a breathtaking perspective on the city’s scale and beauty.
Prompt
style-aesthetic Naturalistic: Energetic, cosmopolitan ; A panoramic view of a bustling city skyline, captured from a rooftop; wide shot; Tourism; A vibrant city with towering skyscrapers and bustling streets below; cinematic
Characteristic
Shot : A panoramic view of a city skyline, with tall buildings and a clear blue sky. The buildings are mostly skyscrapers, and there are some clouds in the sky.
Aesthetic Score : 0.7
Mood : calm, peaceful, urban
Quality
Entropy : 6.70
Noise : 99
Prompt Clip Score : 0.20
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some of the buildings in the distance are slightly blurred.
A Family’s Road Trip: Tranquility and Adventure Await
A family car winds its way through a picturesque rural landscape, the open road and distant hills promising a journey filled with tranquility, hope, and adventure. The scene evokes a sense of freedom and the anticipation of exciting experiences to come.
Prompt
style-aesthetic Naturalistic: Peaceful, nostalgic ; A family driving down a scenic highway, with rolling hills and fields passing by; medium shot; Travel; A winding highway with lush green fields and distant mountains in the background; cinematic
Characteristic
Shot : A family traveling in a car, looking out at a winding road in a hilly landscape.
Aesthetic Score : 0.7
Mood : serene, tranquil, adventurous
Quality
Entropy : 6.63
Noise : 86
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no noticeable artifacts or errors in the image.
Conclusion
The results indicate that the generative AI model performed well in terms of understanding the scene and camera position, but struggled with achieving the desired aesthetic. Here’s a breakdown:
- Camera Position: The model scored 0.3, which is considered below average. This suggests that the generated image didn’t accurately reflect the camera position described in the prompt.
- Shot Analysis: The model scored 0.5, which is considered average. This means the generated image somewhat matched the shot described in the prompt, but there were some discrepancies.
- Aesthetic Analysis: The model scored 0.09, which is considered very good. This indicates that the generated image closely matched the expected aesthetic, despite the issues with camera position and shot analysis.
Overall, the model seems to be capable of understanding the general scene and achieving the desired aesthetic, but it struggles with accurately translating the camera position into the generated image.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://fal.ai/models/fal-ai/flux/schnell/api