AI Captures the Scene, But Struggles with the Shot with Bfl-flux-pro
- 9 minutes read - 1793 wordsTable of Contents
Generative AI models are revolutionizing the way we create images. These models can generate realistic and visually appealing images based on text prompts, offering exciting possibilities for artists, designers, and content creators. However, these models are still under development, and their capabilities vary depending on the specific task. In this blog post, we delve into the performance of a generative AI model in creating images based on detailed scene descriptions, analyzing its strengths and weaknesses.
Created with: flux-pro
Silhouetted Against the Setting Sun: A Lone Figure Contemplates the City
A solitary figure, cloaked in mystery, stands on a rocky outcrop, their silhouette stark against the warm glow of a setting sun. The distant cityscape, shrouded in a hazy atmosphere, evokes a sense of longing and isolation, hinting at a journey both grand and melancholic.
Prompt
poses looking-back: Melancholy, yet hopeful ; Lone figure in a tattered cloak; wide shot; Heroism; Ruins of a fallen city bathed in the golden light of a setting sun; cinematic
Characteristic
Shot : A lone figure in a cloak stands on a rocky outcropping, looking out over a futuristic cityscape bathed in the glow of a setting sun.
Aesthetic Score : 0.7
Mood : mysterious, melancholic, hopeful
Quality
Entropy : 6.20
Noise : 72
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.90
Image errors : No obvious errors, but the edges of the figure look slightly pixelated, as if from a low resolution image.
Adventure Awaits: Exploring the Jungle Temple
Three intrepid explorers navigate a lush jungle path, their backpacks laden with anticipation. The majestic temple in the distance beckons, promising secrets and wonder. This adventurous scene captures a sense of mystery and curiosity, leaving viewers eager to discover what lies ahead.
Prompt
poses looking-back: Excited, adventurous ; A group of explorers; medium shot; Adventure; Lush jungle with ancient temples in the distance; cinematic
Characteristic
Shot : Three people, two women and a man, are walking through a lush green jungle with an ancient temple structure in the background. They are wearing backpacks and appear to be on a journey or adventure.
Aesthetic Score : 0.6
Mood : adventurous, peaceful, serene
Quality
Entropy : 6.90
Noise : 93
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no noticeable artifacts or errors in the image.
Lost in the Glow: A Moment of Digital Focus
A young person, headphones on, is deeply engrossed in their work, a vibrant abstract image with a glowing heart captivating their attention. The scene captures the focused energy of digital creation, infused with a touch of playful mystery.
Prompt
poses looking-back: Intense, focused ; A gamer’s hands on a keyboard; close-up; Gaming; Neon lights reflecting on the screen, displaying a virtual world; cinematic
Characteristic
Shot : A young person sits at a computer, wearing headphones and looking at the screen. The screen displays a heart shape and other colorful visuals. The room is dimly lit with pink and blue lighting.
Aesthetic Score : 0.6
Mood : focused, playful, digital
Quality
Entropy : 6.77
Noise : 68
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some minor noise in the image, particularly in the darker areas.
A Solitary Figure Against the Vastness of Winter
A lone hiker stands on a snow-capped mountain peak, dwarfed by the immense, snow-covered landscape. The scene evokes a sense of serenity, contemplation, and adventure, highlighting the dramatic contrast between the hiker’s small figure and the vastness of the mountains.
Prompt
poses looking-back: Awe-inspiring, peaceful ; A lone traveler standing on a mountain peak; long shot; Tourism; Breathtaking panoramic view of a snow-capped mountain range; cinematic
Characteristic
Shot : A lone hiker stands on a mountaintop overlooking a vast, snow-covered landscape.
Aesthetic Score : 0.75
Mood : serene, tranquil, adventurous
Quality
Entropy : 6.60
Noise : 81
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable errors.
Nostalgia on Rails: A Vintage Locomotive Chugs Through the Desert Sunset
A vintage steam locomotive journeys through a breathtaking desert landscape as the sun sets, casting a golden glow. The scene evokes a sense of nostalgia, adventure, and epic grandeur, with the dramatic lighting and locomotive’s movement adding to the captivating atmosphere.
Prompt
poses looking-back: Nostalgic, adventurous ; A vintage train speeding through a desert landscape; medium shot; Travel; Sun setting over the horizon, casting long shadows; cinematic
Characteristic
Shot : A vintage steam locomotive train is moving through a desert landscape. The sun is setting in the background, casting an orange glow over the scene. There is smoke billowing from the locomotive’s chimney.
Aesthetic Score : 0.7
Mood : nostalgic, adventurous, majestic
Quality
Entropy : 6.88
Noise : 70
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.80
Image errors : Some details look a little artificial, such as the smoke.
City Stroll with Laughter and Joy
Three friends, radiating happiness, walk through a vibrant city, their laughter echoing the carefree spirit of the moment. The scene captures the warmth and joy of friendship amidst a colorful urban backdrop.
Prompt
poses looking-back: Joyful, carefree ; A group of friends laughing and talking; medium shot; Groups; A bustling city street with vibrant street art; cinematic
Characteristic
Shot : Three friends walking down a street in a European city, smiling and looking at the camera. There are other people in the background.
Aesthetic Score : 0.7
Mood : joyful, carefree, friendly
Quality
Entropy : 6.80
Noise : 75
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is a bit blurry, especially in the background. There are some artifacts around the edges of the subjects, likely due to the camera lens.
Lost in the Cosmic Embrace: An Astronaut’s Solitary Journey
A lone astronaut floats amidst the infinite expanse of space, Earth a distant, partially obscured wonder. The scene evokes a profound sense of solitude, wonder, and the thrill of cosmic adventure.
Prompt
poses looking-back: Awe-inspiring, contemplative ; A lone astronaut floating in space; long shot; Heroism; Earth hanging in the distance, a blue marble against the black void; cinematic
Characteristic
Shot : An astronaut floating in space, with a view of Earth in the background.
Aesthetic Score : 0.7
Mood : solitude, awe, wonder
Quality
Entropy : 6.54
Noise : 65
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some artifacts, especially in the background. The Earth looks slightly flat.
Thrill Seekers Ride the Rapids!
A group of friends embrace the excitement of whitewater rafting, their smiles and the churning water capturing the adventurous spirit of the moment. The lush greenery in the background adds a touch of serenity to this exhilarating scene.
Prompt
poses looking-back: Thrilling, exhilarating ; A group of adventurers on a raft; medium shot; Adventure; Rapids churning whitewater, a sense of danger and excitement; cinematic
Characteristic
Shot : A group of friends are rafting down a river. The river is quite fast and there are rapids in the background.
Aesthetic Score : 0.7
Mood : adventurous, exciting, energetic
Quality
Entropy : 6.82
Noise : 99
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant image errors, the only issue might be the slight blurriness of the river in the background, possibly caused by camera shake.
Heroic Silhouette: A Moment of Triumph at Sunset
A superhero stands tall on a mountain peak, bathed in the golden light of the setting sun. The vast valley below and the winding river create a breathtaking backdrop, while the hero’s silhouette against the sky evokes a sense of hope and triumph.
Prompt
poses looking-back: Triumphant, accomplished ; A gamer’s avatar standing on a virtual mountain peak; close-up; Gaming; A vast, fantastical landscape stretching out before them; cinematic
Characteristic
Shot : A superhero stands triumphantly on a mountain peak overlooking a valley with a river winding through it. The sky is a vibrant sunset, with clouds painting the sky in shades of pink and orange. The scene is rendered in a cartoonish style, giving it a whimsical and heroic feel.
Aesthetic Score : 0.6
Mood : triumphant, heroic, hopeful
Quality
Entropy : 6.60
Noise : 68
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image shows slight blurring on the edges of the character, and the textures on the rocks appear somewhat grainy.
Sunset Romance on the Beach
A couple strolls hand-in-hand along a sandy shore as the sun dips below the horizon, casting a warm glow on their silhouettes. The scene evokes a sense of romance, serenity, and peace, making it a perfect picture of love and tranquility.
Prompt
poses looking-back: Romantic, peaceful ; A couple walking hand-in-hand on a beach; long shot; Tourism; Sunset painting the sky in vibrant hues of orange and pink; cinematic
Characteristic
Shot : A couple is walking on a beach at sunset, they are holding hands and looking out at the ocean
Aesthetic Score : 0.7
Mood : romantic, tranquil, happy
Quality
Entropy : 6.66
Noise : 72
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly overexposed, and the colors are a bit washed out. The background also looks a bit blurry and lacks detail.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.35, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.52, which is considered good. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.08, which is considered very good. This means that the generated image closely matched the expected aesthetic style.
Overall, the model demonstrates a good understanding of the scene and shot composition, but needs improvement in accurately capturing the intended camera position. The aesthetic quality of the generated image is very good.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://api.bfl.ml/docs#/util/get_result_v1_get_result_get