AI's Artistic Eye: Capturing the 'style-aesthetic' with Mixed Results with Flux-schnell
- 10 minutes read - 1962 wordsTable of Contents
The ‘style-aesthetic’ is a powerful tool in visual storytelling, evoking specific emotions and creating a distinct visual identity. This style often utilizes dramatic lighting, striking compositions, and evocative color palettes to immerse viewers in a particular mood. Examples of this style can be found in film noir, epic fantasy, and even contemporary photography. In this blog post, we explore the capabilities of AI in capturing this style, analyzing the results of an experiment that tasked an AI model with generating images based on a specific ‘style-aesthetic’ prompt.
Created with: flux-schnell
Silhouetted Solitude: A Moment of Tranquility at Sunset
A lone figure stands in contemplation against the backdrop of a setting sun, casting a warm, golden glow over a vast, flat landscape. The image evokes a sense of peace, hope, and the quiet beauty of solitude.
Prompt
style-aesthetic Dogme 95: Epic, hopeful ; A lone figure, silhouetted against a setting sun; long shot; Heroism; A vast, desolate landscape; cinematic
Characteristic
Shot : A solitary figure stands silhouetted against a brilliant sunset, facing the sun in a vast, empty landscape.
Aesthetic Score : 0.6
Mood : serene, contemplative, hopeful
Quality
Entropy : 4.54
Noise : 52
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors
Precarious Grip: A Hand Reaches for Hope on a Cliffside
A lone hand stretches out towards a rope, clinging to a sheer cliff face. The vast valley below speaks of the danger and the thrill of this adventurous climb. The dramatic scene evokes a sense of suspense, leaving you wondering what lies ahead.
Prompt
style-aesthetic Dogme 95: Suspenseful, thrilling ; A hand reaching out to grasp a rope ladder dangling from a cliff face; close-up; Adventure; A rocky, treacherous mountainside; cinematic
Characteristic
Shot : A person’s hand is reaching out towards a mountain ridge. The background is a blurry landscape of mountains and rock.
Aesthetic Score : 0.5
Mood : adventure, suspense, precarious
Quality
Entropy : 6.88
Noise : 82
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major errors. There may be some minor noise and compression artifacts but they are barely noticeable.
Lost in the Game: A Moment of Intense Focus
A dimly lit room, the glow of computer screens illuminating a figure engrossed in a video game. The low-key lighting creates a sense of mystery and intrigue, highlighting the player’s focused hands gripping the controller. This image captures the intensity and immersion of gaming, transporting the viewer into a world of digital escape.
Prompt
style-aesthetic Dogme 95: Intense, focused ; A player’s hands frantically manipulating a joystick, their face illuminated by the screen; medium shot; Gaming; A dimly lit room with a computer monitor glowing brightly; cinematic
Characteristic
Shot : A person is sitting in a dimly lit room, likely playing a video game, with their face partially obscured by their glasses and their hand holding a controller in front of a computer monitor, the scene has a mysterious, intimate feeling to it. The room is lit in warm tones, and the person is wearing a gray sweater.
Aesthetic Score : 0.4
Mood : mysterious, intimate, focused
Quality
Entropy : 4.90
Noise : 34
Prompt Clip Score : 0.18
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some noise in the background, possibly due to low light conditions. There’s also slight blurring around the edges of the person’s hair and glasses.
A Symphony of Colors: Immerse Yourself in a Bustling Asian Market
Experience the vibrant energy of a bustling street market in a foreign country, likely in Asia. The scene is alive with colorful decorations, enticing food stalls, and the lively hum of daily life. The use of color and light creates a sense of depth and energy, drawing you into the heart of the action.
Prompt
style-aesthetic Dogme 95: Energetic, lively ; A bustling marketplace, filled with vibrant colors and exotic goods; wide shot; Tourism; A crowded street in a foreign city; cinematic
Characteristic
Shot : A bustling street market in a foreign country, likely Asia, with colorful flags and lanterns hanging overhead. The market is filled with vendors and shoppers, and there are various goods on display.
Aesthetic Score : 0.6
Mood : vibrant, lively, exotic
Quality
Entropy : 6.87
Noise : 112
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some minor noise and compression artifacts are visible in the image. The lighting is uneven, with some areas being overexposed and others underexposed.
Tranquil Journey: Capturing the Beauty of Rural Motion
A train glides through a picturesque countryside, its window reflecting the fleeting scenery. The motion blur evokes a sense of peaceful nostalgia, capturing the dynamism of travel and the tranquility of the landscape.
Prompt
style-aesthetic Dogme 95: Nostalgic, contemplative ; A train speeding through a countryside landscape, blurring the scenery; long shot; Travel; Rolling hills and fields passing by; cinematic
Characteristic
Shot : A view from the inside of a train moving through the countryside. The window reflects the scenery, and the motion blur creates a sense of speed.
Aesthetic Score : 0.6
Mood : tranquil, nostalgic, journey
Quality
Entropy : 6.49
Noise : 57
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some minor artifacts, such as the graininess of the image. The focus is soft, particularly on the landscape outside the train window. The window reflection is a bit distracting.
Warmth and Connection: A Family Gathering Under Candlelight
A rustic dining room bathed in the glow of a chandelier and candlelight. A family or group of friends share a meal, their laughter and conversation filling the air. The intimate setting and warm lighting create a sense of closeness and shared joy.
Prompt
style-aesthetic Dogme 95: Warm, intimate ; A family gathered around a dinner table, sharing a meal and laughter; medium shot; Family; A cozy, well-worn kitchen; cinematic
Characteristic
Shot : A family dinner setting with four people seated around a table with food on it. It is lit by a chandelier and a candle. The table is set with plates, cutlery, and glasses.
Aesthetic Score : 0.6
Mood : warm, cozy, intimate
Quality
Entropy : 6.22
Noise : 83
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible artifacts or errors in the image. The lighting is slightly uneven, but this is likely due to the intentional use of warm lighting.
Lost in Thought: A Child’s Moment of Solitude
A young child sits alone in the foreground, their gaze fixed on something unseen. The dim lighting and the blurred figure of an adult in the background create a sense of melancholy and isolation, leaving the viewer to ponder the child’s thoughts and emotions.
Prompt
style-aesthetic Dogme 95: Sad, poignant ; A single tear rolling down a child’s cheek as they watch their parents argue; close-up; Family; A dimly lit living room; cinematic
Characteristic
Shot : A young child, possibly a boy, is looking away from the camera, with a woman out of focus in the background. It appears to be a dimly lit interior scene, likely a living room or family setting.
Aesthetic Score : 0.6
Mood : pensive, melancholic, quiet
Quality
Entropy : 5.93
Noise : 37
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image exhibits some noise and graininess, particularly in the background, indicating the use of high ISO or low lighting conditions. The depth of field is also shallow, with a noticeable blur in the background.
Campfire Tales: Friends, Flames, and Forest Magic
A cozy gathering of friends around a crackling campfire in the heart of the forest. The soft glow of the flames creates an intimate atmosphere, while the surrounding trees whisper tales of adventure. This scene captures the essence of warmth, connection, and the thrill of the unknown.
Prompt
style-aesthetic Dogme 95: Joyful, communal ; A group of friends huddled together around a campfire, sharing stories and laughter; medium shot; Adventure; A dark forest with flickering flames; cinematic
Characteristic
Shot : Four friends are gathered around a campfire in a forest, enjoying each other’s company and the warmth of the fire.
Aesthetic Score : 0.7
Mood : cozy, friendly, intimate
Quality
Entropy : 5.76
Noise : 72
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some noise and grain, particularly in the darker areas, which can be attributed to low light conditions or post-processing. There are also some minor color shifts and artifacts, which are not very noticeable.
Contemplating the Vastness: A Moment of Peace on the Cliffside
A young man stands on a cliff, dwarfed by the expansive ocean. The scene evokes a sense of contemplation, peace, and adventure, with the contrasting colors of the blue water and gray sky adding to the dramatic effect.
Prompt
style-aesthetic Dogme 95: Awe-inspiring, contemplative ; A lone traveler gazing out at a vast ocean, their face filled with wonder; long shot; Travel; A dramatic coastline with crashing waves; cinematic
Characteristic
Shot : A man standing on a cliff overlooking the ocean, he is looking out at the view and seems to be lost in thought
Aesthetic Score : 0.7
Mood : pensive, contemplative, serene
Quality
Entropy : 6.75
Noise : 73
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some noise in the background and slightly blurry.
A Glimpse into the Past: A Vintage Snapshot of Familiar Faces
This faded photograph captures a group of people standing before a doorway, likely in a home. The soft lighting and vignette create a nostalgic mood, transporting us back to a time of simpler moments and cherished memories.
Prompt
style-aesthetic Dogme 95: Melancholy, nostalgic ; A hand holding a worn photograph, the image blurred and faded; close-up; Family; A cluttered attic filled with old memories; cinematic
Characteristic
Shot : A group photo of four people standing in front of a brick wall. The photo is printed on a piece of paper and being held by a hand.
Aesthetic Score : 0.6
Mood : casual, nostalgic, intimate
Quality
Entropy : 6.83
Noise : 72
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.30
Image errors : No visible artifacts or errors in the image.
Conclusion
The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis.
Here’s a breakdown:
- Camera Position: The model scored 0.35, which is considered below average. This suggests that the model didn’t accurately capture the intended camera positions described in the prompt.
- Shot Analysis: The model scored 0.49, which is also below average. This indicates that the model didn’t fully understand the scene described in the prompt and didn’t create the expected shot composition.
- Aesthetic Analysis: The model scored 0.17, which is considered very good. This means that the generated image closely matched the desired aesthetic style.
Overall, the model seems to be better at capturing the desired aesthetic style than understanding the camera positions and shot composition. This suggests that the model might need further training to improve its ability to interpret and translate complex visual descriptions.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://fal.ai/models/fal-ai/flux/schnell/api