AI's Artistic Eye: Capturing Poses, But Missing the Shot with Bfl-flux-pro
- 9 minutes read - 1772 wordsTable of Contents
In the realm of artificial intelligence, generative models are rapidly pushing the boundaries of creativity. These models can generate images, text, and even music, often mimicking human artistic expression. One intriguing area of exploration is the ability of AI to understand and execute specific poses within a given scene. This blog post delves into the results of a recent experiment where a generative AI model was tasked with creating images based on various poses and scene descriptions. The results reveal both the model’s strengths and weaknesses, highlighting the ongoing journey towards AI-generated imagery that truly captures the essence of human vision.
Created with: flux-pro
Back-to-Back, Facing the Unknown: Soldiers in a War-Torn Landscape
Two soldiers, clad in military fatigues, stand back-to-back in a desolate, war-torn landscape. The composition emphasizes their camaraderie and the stark reality of their surroundings, creating a mood of seriousness, drama, and tension.
Prompt
poses embrace: triumphant, camaraderie ; Two soldiers; wide shot; heroism; battlefield with smoke and explosions in the background; cinematic
Characteristic
Shot : Two men in military gear, standing in a foggy, war-torn environment. The background is blurry and suggests a scene of conflict.
Aesthetic Score : 0.7
Mood : serious, tense, contemplative
Quality
Entropy : 6.84
Noise : 75
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major artifacts or errors. The image is slightly grainy, but this contributes to the mood.
Jungle Encounter: A Clash of Cultures in the Wild
A shirtless man and a woman adorned in vibrant tribal attire stand amidst the lush jungle, their contrasting appearances hinting at a story of mystery and conflict. The backdrop of a looming, indistinct structure adds to the sense of intrigue, leaving viewers to wonder about the secrets hidden within this dramatic scene.
Prompt
poses embrace: trust, respect ; A lone explorer and a local guide; medium shot; adventure; lush jungle with ancient ruins in the distance; cinematic
Characteristic
Shot : Two people, a man and a woman, standing in a lush jungle setting, possibly in a tropical region. An old building is visible in the background, adding an element of mystery.
Aesthetic Score : 0.6
Mood : mysterious, exotic, intense
Quality
Entropy : 6.88
Noise : 86
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has minor noise and some chromatic aberration. There are some areas of blown-out highlights.
Neon Nights: Fun and Games with a Touch of Drama
Capture the vibrant energy of youth with this image featuring two friends enjoying a night in, bathed in neon light. The playful mood is palpable, while the dramatic lighting adds a touch of intrigue.
Prompt
poses embrace: excitement, joy ; Two gamers celebrating a victory; close-up; gaming; brightly lit gaming room with monitors and controllers; cinematic
Characteristic
Shot : Two young people, a man and a woman, are sitting on a couch in a brightly lit room with gaming monitors and other electronics. The man is holding a video game controller and the woman is looking at him with a smile.
Aesthetic Score : 0.7
Mood : playful, friendly, happy
Quality
Entropy : 6.77
Noise : 70
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some minor artifacts, particularly in the hair and skin of the characters. The lighting also appears to be somewhat unrealistic.
Sunset Serenade: A Love Story Unfolds in Silhouette
In this romantic and dreamy scene, a couple stands silhouetted against a breathtaking sunset cityscape. The woman, dressed in a vibrant pink dress, and the man, in a sharp suit, create a sense of mystery and intimacy. Their hopeful stance against the sunset paints a picture of love and endless possibilities.
Prompt
poses embrace: romantic, awe ; A couple gazing at a breathtaking sunset; long shot; tourism; panoramic view of a city skyline; cinematic
Characteristic
Shot : A couple is silhouetted against the setting sun, gazing at the cityscape
Aesthetic Score : 0.7
Mood : romantic, hopeful, nostalgic
Quality
Entropy : 6.76
Noise : 63
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image appears to be slightly overexposed, leading to a lack of detail in the cityscape.
Silhouettes of Joy: A Family’s Moment of Wonder at Sunset
A heartwarming scene of a family of four silhouetted against a majestic mountain range and vibrant sunset. Their shared gaze towards the horizon speaks volumes of happiness, peace, and hope. The dramatic effect of the silhouettes against the fiery sky creates a sense of awe and wonder, capturing a precious moment of togetherness.
Prompt
poses embrace: unity, accomplishment ; A family standing on a mountain peak; medium shot; travel; majestic mountain range with clouds in the background; cinematic
Characteristic
Shot : A family of four silhouetted against a sunset in the mountains.
Aesthetic Score : 0.7
Mood : joyful, hopeful, adventurous
Quality
Entropy : 6.73
Noise : 66
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some noise and artifacts.
Cheers to Friendship: A Toast in the Warm Glow of a Bar
Capture the joy and camaraderie of a group of friends raising their beers in a dimly lit bar. The warm lighting and focus on their hands create a sense of intimacy and celebration, perfect for evoking a happy and convivial mood.
Prompt
poses embrace: celebratory, friendship ; A group of friends raising their glasses in a toast; close-up; groups; lively bar or restaurant setting; cinematic
Characteristic
Shot : A group of friends raising their glasses in a toast at a dimly lit bar or restaurant.
Aesthetic Score : 0.7
Mood : joyful, celebratory, casual
Quality
Entropy : 6.89
Noise : 79
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors in the image.
A Moment of Unconditional Love
A tender embrace between a mother and daughter, captured in a moment of pure affection. The soft lighting and close-up framing enhance the intimacy and emotion of this heartwarming scene.
Prompt
poses embrace: love, gratitude ; A young woman and her grandmother; medium shot; heroism; a peaceful park with a fountain in the background; cinematic
Characteristic
Shot : Two women, likely mother and daughter, embracing in a park with a fountain in the background.
Aesthetic Score : 0.8
Mood : loving, tender, happy
Quality
Entropy : 6.88
Noise : 75
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Lost in the Vastness: Astronauts Embrace the Cosmic Dance
Two astronauts, silhouetted against the radiant glow of a distant sun, float amidst the infinite expanse of space. Their contrasting poses and the dramatic lighting evoke a sense of awe and wonder, highlighting the adventurous spirit of human exploration.
Prompt
poses embrace: wonder, awe ; Two astronauts floating in space; long shot; adventure; Earth in the distance; cinematic
Characteristic
Shot : Two astronauts floating in space, one in the foreground and one in the background, against a backdrop of Earth and stars. The astronaut in the foreground is facing the camera with his arms outstretched, while the other astronaut is looking away from the camera.
Aesthetic Score : 0.7
Mood : dreamy, hopeful, adventurous
Quality
Entropy : 6.88
Noise : 83
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image has some minor artifacts and errors, such as some blurring in the background, the astronaut’s helmets look slightly plastic-like, and the lighting is a little flat.
Live Rock Concert: Energy, Excitement, and a Haze of Mystery
Capture the raw energy of a live rock concert with this image. The band performs on stage, bathed in vibrant stage lights, while a cheering crowd fills the venue. Smoke and backlighting create a dramatic effect, highlighting the silhouettes of the band members and adding a sense of mystery and intensity.
Prompt
poses embrace: passion, energy ; A group of musicians performing on stage; wide shot; gaming; a concert venue with flashing lights; cinematic
Characteristic
Shot : A rock band is performing on stage in front of a large crowd. The stage is lit with spotlights and pyrotechnics. The band members are all wearing dark clothing and are playing their instruments with passion.
Aesthetic Score : 0.7
Mood : energetic, passionate, ecstatic
Quality
Entropy : 6.40
Noise : 82
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some noise and blur are present in the crowd, possibly due to low lighting and long exposure. The pyrotechnics seem a little overexposed.
Sunset Romance on the Beach
A couple embraces on a golden sandy beach as the sun dips below the horizon, creating a warm and romantic atmosphere. The scene evokes feelings of love, serenity, and contentment.
Prompt
poses embrace: love, hope ; A couple standing on a beach at sunrise; close-up; travel; ocean waves crashing on the shore; cinematic
Characteristic
Shot : A couple is silhouetted against a sunset on a beach, embracing and looking into each other’s eyes.
Aesthetic Score : 0.7
Mood : romantic, tender, intimate
Quality
Entropy : 6.67
Noise : 58
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.30
Image errors : There is a slight halo around the couple due to the harsh light and the contrast adjustment applied to the image.
Conclusion
The results show that the generative AI model performed well in understanding and executing the camera position and shot instructions.
Here’s a breakdown:
- Camera Position: The model scored 0.45, which is considered below average. This suggests that the model struggled to accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.57, which is considered average. This indicates that the model was able to understand the scene and create a shot that somewhat aligned with the prompt’s description.
- Aesthetic Analysis: The model scored 0.09, which is considered very good. This means that the generated image closely matched the expected aesthetic style described in the prompt.
Overall, the model demonstrated a strong ability to understand and execute the aesthetic style, but struggled with accurately capturing the camera position and shot instructions.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://api.bfl.ml/docs#/util/get_result_v1_get_result_get