AI's Artistic Journey: Capturing Scenes, Missing the Vibe with Flux-pro
- 10 minutes read - 1954 wordsTable of Contents
The world of AI image generation is rapidly evolving, with models capable of creating stunning visuals based on text prompts. However, capturing the essence of a desired aesthetic style remains a challenge. This blog post explores the results of testing an AI model’s ability to generate images based on specific scenes and aesthetics. We’ll delve into the model’s performance in capturing camera positions, shot types, and overall aesthetic style, highlighting areas where it excels and where it needs improvement.
For example, imagine a scene described as ‘a lone knight, silhouetted against a setting sun; wide shot; Heroism; A vast, desolate battlefield littered with fallen soldiers.’ While the AI might accurately depict a knight and a battlefield, it might struggle to convey the dramatic, heroic tone intended by the ‘Heroism’ aesthetic. This highlights the need for further development in AI’s understanding of artistic nuances and emotional impact.
Created with: flux-pro
The Last Stand: A Silhouette of Solitude
A lone warrior, silhouetted against a fiery sunset, stands amidst a field of fallen comrades. The scene evokes a sense of dramatic loneliness and somber reflection, highlighting the weight of battle and the fragility of life.
Prompt
Baroque: Epic, melancholic ; A lone knight, silhouetted against a setting sun; wide shot; Heroism; A vast, desolate battlefield littered with fallen soldiers.; cinematic
Characteristic
Shot : A lone figure, a warrior with a spear, stands in silhouette against a sunset. The ground is littered with fallen figures, suggesting a battlefield.
Aesthetic Score : 0.6
Mood : melancholy, somber, heroic
Quality
Entropy : 6.27
Noise : 65
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.70
Image errors : Some blurring around the fallen figures and slight artifacts around the warrior’s silhouette.
Ship Battling Fury: A Stormy Sea and a Dramatic Showdown
A large wooden sailing ship faces a raging storm, battling high waves and strong winds. Lightning strikes illuminate the scene, adding to the dramatic and ominous mood. The ship’s silhouette against the stormy sky creates a powerful image of danger and intrigue.
Prompt
Baroque: Dramatic, thrilling ; A pirate ship, sails billowing in the wind, crashing through stormy waves; dynamic, close-up; Adventure; A raging sea with lightning illuminating the sky.; cinematic
Characteristic
Shot : A ship with sails is sailing through a storm. The sky is dark and there are lightning strikes. The waves are large and choppy. There is a lot of white foam.
Aesthetic Score : 0.7
Mood : dramatic, adventurous, powerful
Quality
Entropy : 6.71
Noise : 88
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.70
Image errors : The ship is a little blurry. The lighting is uneven. The waves are not very realistic.
On the Verge of Victory: A Futuristic Cityscape Awaits
A lone figure, controller in hand, stands poised against a backdrop of vibrant city lights and a fading sunset. The blurred background evokes a sense of vastness and anticipation, hinting at the thrilling action that lies ahead in this futuristic urban landscape.
Prompt
Baroque: Intense, focused ; A player’s hand, gripping a controller, illuminated by the glow of a screen; close-up; Gaming; A chaotic, pixelated cityscape on the screen.; cinematic
Characteristic
Shot : A person is holding a game controller in front of a blurry background of a city at sunset.
Aesthetic Score : 0.6
Mood : futuristic, digital, calm
Quality
Entropy : 6.89
Noise : 80
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are no visible artifacts or errors in the image.
A Grand Hallway Beckons with Warm Light and Intricate Detail
Step into a world of elegance and grandeur in this wide, ornate hallway. High ceilings and intricate decorations lead the eye towards a majestic archway, bathed in warm sunlight streaming through the windows. The bustling scene, with fruit stands and people passing through, adds a touch of life and vibrancy to this inviting space.
Prompt
Baroque: Opulent, vibrant ; A grand, ornate palace, bathed in golden sunlight; wide shot; Tourism; A bustling marketplace with vibrant colors and exotic goods.; cinematic
Characteristic
Shot : A grand, ornate hallway with arched doorways and a view of a building at the end. The hallway is lined with fruit stands and a few people are walking through.
Aesthetic Score : 0.7
Mood : grand, opulent, ethereal
Quality
Entropy : 6.86
Noise : 119
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible image errors
A Hiker’s Perspective: Tranquility Amidst Majestic Peaks
Experience the awe-inspiring beauty of a lone hiker standing on a mountain ridge, overlooking a vast valley and snow-capped peaks. The scene evokes a sense of tranquility, adventure, and inspiration, highlighting the dramatic contrast between the hiker’s small stature and the immense landscape.
Prompt
Baroque: Awe-inspiring, contemplative ; A lone traveler, gazing out at a breathtaking mountain range; medium shot; Travel; A vast, snow-capped mountain range with a winding road leading into the distance.; cinematic
Characteristic
Shot : A lone hiker stands on a mountain ridge overlooking a vast valley with a river snaking through it, the mountains rise high in the distance with a clear blue sky above.
Aesthetic Score : 0.8
Mood : serene, majestic, adventurous
Quality
Entropy : 6.60
Noise : 106
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : None
Candlelit Gathering: A Family’s Intimate Moment
A warm and inviting living room scene captures a family gathered around a dining table, bathed in the soft glow of candlelight. The fireplace adds a cozy ambiance, while the intimate setting highlights the family’s connection and shared moments.
Prompt
Baroque: Warm, intimate ; A family gathered around a fireplace, sharing stories and laughter; medium shot; Family; A cozy, candlelit room with portraits of ancestors on the walls.; cinematic
Characteristic
Shot : A group of four people are gathered around a dining table in a dimly lit room. They are all smiling and engaged in conversation. The fireplace behind them is lit and the warm glow of the candles on the table adds to the cozy atmosphere. There are plates of food on the table, suggesting that they are enjoying a meal together.
Aesthetic Score : 0.7
Mood : warm, cozy, intimate
Quality
Entropy : 6.34
Noise : 71
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some slight artifacts in the image, particularly around the edges of the objects. The lighting is also a bit uneven, with some areas being brighter than others.
A Knight’s Tale: Epic Battle in Dust and Glory
A powerful image captures the heart of a knight in full armor, astride his steed, sword raised against a backdrop of swirling dust and distant comrades. The dramatic lighting and composition evoke a sense of epic grandeur and power, transporting the viewer to the heart of a fierce battle.
Prompt
Baroque: Brave, determined ; A knight, charging into battle, his armor gleaming in the sunlight; dynamic, close-up; Heroism; A chaotic battlefield with smoke and dust swirling in the air.; cinematic
Characteristic
Shot : A lone knight in full armor, riding a horse, with a sword in hand, in a dusty battlefield. There are other knights in the background, blurred and out of focus, suggesting battle.
Aesthetic Score : 0.7
Mood : epic, dramatic, heroic
Quality
Entropy : 6.81
Noise : 76
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major image errors, some noise in the background
Enchanted Treasure Awaits in Candlelit Shadows
A treasure chest overflowing with gold coins sits bathed in the warm glow of a single candle. The scene evokes a sense of mystery and magic, hinting at untold riches and ancient secrets. The dark background adds to the intrigue, leaving you wondering what lies beyond the flickering flame.
Prompt
Baroque: Intriguing, mysterious ; A treasure chest, overflowing with gold and jewels, illuminated by a single candle; close-up; Adventure; A dark, mysterious cave with cobwebs and shadows.; cinematic
Characteristic
Shot : A treasure chest overflowing with gold coins, lit by a single candle
Aesthetic Score : 0.75
Mood : mysterious, adventurous, magical
Quality
Entropy : 6.61
Noise : 75
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some minor noise in the shadows and the gold appears slightly pixelated
Dreamy Sunset Over Mystical Waterfalls
A lone woman in a white dress stands on a cliff, bathed in the golden light of a breathtaking sunset. Multiple waterfalls cascade down mountain peaks in a surreal and fantastical landscape, creating a scene of ethereal beauty and magic.
Prompt
Baroque: Triumphant, surreal ; A player’s avatar, standing triumphantly on a virtual mountain peak; wide shot; Gaming; A fantastical, digital landscape with glowing waterfalls and floating islands.; cinematic
Characteristic
Shot : A woman in a white dress stands on a cliff overlooking a fantastical landscape with several waterfalls flowing down majestic mountains
Aesthetic Score : 0.75
Mood : mystical, dreamy, ethereal
Quality
Entropy : 6.81
Noise : 95
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.90
Image errors : The waterfalls and clouds appear somewhat blurry and lack detail. Some of the edges of the mountains are also slightly jagged.
City Life Meets Sacred Space: A Vibrant Street Scene Before a Majestic Church
Capture the energy of urban life juxtaposed against the timeless beauty of a grand church. This scene evokes a sense of history and scale, with bustling crowds passing by the impressive architecture.
Prompt
Baroque: Energetic, lively ; A bustling city square, filled with people from all walks of life; wide shot; Tourism; A grand, Baroque cathedral towering over the city.; cinematic
Characteristic
Shot : A bustling street scene in front of a grand cathedral. People are walking along the street, enjoying the day, and the architecture is beautiful.
Aesthetic Score : 0.7
Mood : lively, vibrant, historic
Quality
Entropy : 6.95
Noise : 97
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image appears to be well-exposed and free of artifacts or errors. The color saturation may be slightly high, making some colors appear a bit vibrant.
Conclusion
The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis.
Here’s a breakdown:
- Camera Position: The model scored 0.53, which falls within the “good” range (0.5 to 0.75). This indicates that the model was able to accurately capture the intended camera positions described in the prompts.
- Shot Analysis: The model scored 0.58, also within the “good” range. This suggests that the model understood the scene descriptions in the prompts and generated images that reflected those descriptions.
- Aesthetic Analysis: The model scored 0.15, which is significantly lower than the “very good” range (-0.2 to 0.1). This indicates that the generated images did not match the expected aesthetic style as closely as desired.
Overall, the model demonstrates a good understanding of camera positions and scene descriptions, but needs improvement in generating images that align with the desired aesthetic style.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://fal.ai/models/fal-ai/flux-pro/api