AI's Artistic Journey: Capturing Poses, But Missing the Essence with Bfl-flux-pro
Text-to-image generation is advancing rapidly, but capturing the nuances of human expression, particularly in poses, remains a challenge. This post describes an experiment in which an AI model was asked to generate images from prompts that specify a pose, a camera angle, and a scene. The model reproduces the described scenes fairly well, but it is less reliable on camera position and struggles to achieve the intended aesthetic, which highlights the ongoing challenges in AI’s artistic development.
Dramatic poses, often used to convey emotion, heroism, or awe, depend on a delicate balance of body language, lighting, and composition. Examples appear throughout art, from classical paintings to modern photography. In film, dramatic poses emphasize a character’s emotions or build tension and suspense; in photography, they suggest movement, power, or vulnerability. The model’s attempts to recreate these poses show what current image generators can and cannot do when asked to replicate human expression.
Created with: flux-pro
Silhouetted Triumph: A Hiker’s Sunset Victory
A breathtaking sunset paints the sky as a lone hiker stands triumphant on a mountain peak. The silhouette against the fiery hues evokes a sense of serenity, hope, and accomplishment. This powerful image captures the essence of human ambition and the beauty of nature’s grand spectacle.
Prompt
poses low-angle: inspiring, triumphant ; A lone figure standing atop a mountain peak, silhouetted against the rising sun; wide shot; heroism; majestic mountain range with clouds swirling below; cinematic
Characteristic
Shot : A lone hiker stands on a mountain peak, arms outstretched, with a panoramic view of snow-capped mountains below and a vibrant sunrise behind them.
Aesthetic Score : 0.8
Mood : inspirational, adventurous, triumphant
Quality
Entropy : 6.75
Noise : 59
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable artifacts or errors.
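Every prompt in this experiment follows the same semicolon-separated layout: a pose and camera angle with mood keywords, a subject description, a shot size, a category, a background, and a style keyword. As a minimal sketch, the Python snippet below splits a prompt into those parts. The field names (`pose`, `moods`, `subject`, `shot`, `category`, `background`, `style`) are labels chosen here for illustration, not terms defined by flux-pro or its API.

```python
from dataclasses import dataclass


@dataclass
class PosePrompt:
    """One prompt from this post, split into its parts (field names are illustrative)."""
    pose: str
    moods: list[str]
    subject: str
    shot: str
    category: str
    background: str
    style: str


def parse_prompt(prompt: str) -> PosePrompt:
    """Split a 'pose: moods ; subject; shot; category; background; style' prompt."""
    head, *rest = [part.strip() for part in prompt.split(";")]
    if len(rest) != 5:
        raise ValueError(f"expected 6 semicolon-separated parts, got {len(rest) + 1}")
    # The first part holds the pose/camera angle, then a colon, then mood keywords.
    pose, _, moods = head.partition(":")
    subject, shot, category, background, style = rest
    return PosePrompt(
        pose=pose.strip(),
        moods=[m.strip() for m in moods.split(",") if m.strip()],
        subject=subject,
        shot=shot,
        category=category,
        background=background,
        style=style,
    )


# Example: the hiker prompt shown above.
hiker = parse_prompt(
    "poses low-angle: inspiring, triumphant ; A lone figure standing atop a "
    "mountain peak, silhouetted against the rising sun; wide shot; heroism; "
    "majestic mountain range with clouds swirling below; cinematic"
)
print(hiker.shot, hiker.moods)
```

Splitting prompts this way makes it easier to compare each requested shot size and mood against the Shot and Mood lines reported for the generated image.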
Lost in the Verdant Mystery
Four adventurers, shrouded in mist, navigate a dense, green forest. Their journey promises both serenity and intrigue, as they delve deeper into the unknown.
Prompt
poses low-angle: mysterious, adventurous ; A group of explorers navigating a dense jungle, their faces illuminated by the light of their headlamps; medium shot; adventure; lush green foliage and ancient ruins in the background; cinematic
Characteristic
Shot : A group of people are hiking through a lush, green jungle. The scene is filled with dense foliage and a misty atmosphere, creating a sense of mystery and adventure.
Aesthetic Score : 0.7
Mood : mysterious, adventurous, tranquil
Quality
Entropy : 6.86
Noise : 98
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slight blur, possibly due to motion.
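The post does not say how the Entropy values are computed. Values between 6 and 7 would be consistent with the Shannon entropy of an 8-bit grayscale histogram, which tops out at 8 bits, so the sketch below assumes that definition. The function name `shannon_entropy` and the flat-list input are illustrative choices; a real pipeline would first load the image and convert it to grayscale.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a flat sequence of pixel intensities.

    Assumption: this mirrors the 'Entropy' metric in the post, which is
    not documented there.
    """
    total = len(pixels)
    if total == 0:
        raise ValueError("pixels must not be empty")
    counts = Counter(pixels)
    # H = sum over intensity levels of p * log2(1 / p)
    return sum((c / total) * math.log2(total / c) for c in counts.values())


# A flat image carries no information; an image that uses all 256 grey
# levels equally often reaches the 8-bit maximum.
flat = [128] * 1024
uniform = list(range(256)) * 4
print(shannon_entropy(flat), shannon_entropy(uniform))
```

Under this reading, higher entropy means the image spreads its pixels over more intensity levels, which tends to go along with more texture and detail.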
Lost in the Neon Glow: A Gamer’s Cyberpunk Oasis
A solitary figure, controller in hand, stands against a backdrop of a vibrant, blurred cityscape. The screen reflects the neon lights, creating a futuristic and mysterious atmosphere. This image captures the essence of cyberpunk gaming, where reality and virtual worlds collide.
Prompt
poses low-angle: intense, focused ; A gamer’s hands intensely manipulating a controller, their face illuminated by the glow of the monitor; close-up; gaming; a vibrant, futuristic cityscape projected on the screen; cinematic
Characteristic
Shot : A person’s hand holding a video game controller in front of a blurry background of city lights. The controller is the focus of the image and the background is out of focus.
Aesthetic Score : 0.6
Mood : intense, focused, playful
Quality
Entropy : 6.95
Noise : 64
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slight blur in the background, which is likely intentional to focus on the controller.
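The Prompt Clip Score is presumably a CLIP-style similarity between the prompt text and the generated image, although the post does not state the exact formula. Assuming it is the cosine similarity between a text embedding and an image embedding, the comparison step could look like the sketch below. The vectors are toy values; producing real embeddings requires a CLIP model, which is outside the scope of this sketch.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("vectors must be non-zero")
    return dot / (norm_a * norm_b)


# Hypothetical embeddings standing in for a prompt and an image.
text_embedding = [0.2, 0.7, 0.1]
image_embedding = [0.3, 0.5, 0.4]
print(round(cosine_similarity(text_embedding, image_embedding), 3))
```

If the metric works this way, the narrow spread of scores in this post (0.24 to 0.31) suggests that all ten images match their prompts to a broadly similar degree.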
Pharaoh’s Majesty: A Grand Spectacle in Stone
A towering statue of an Egyptian pharaoh commands attention, its imposing presence amplified by the perspective of the viewer. Surrounded by a crowd of onlookers, the statue evokes a sense of grandeur, mystery, and historical significance.
Prompt
poses low-angle: awe-inspiring, historical ; A towering statue of a historical figure, viewed from the perspective of a tourist looking up in awe; wide shot; tourism; a bustling city square with other tourists and vendors; cinematic
Characteristic
Shot : A large statue of an Egyptian pharaoh standing in front of a building with many columns. There are people in the foreground, looking up at the statue.
Aesthetic Score : 0.6
Mood : grand, mysterious, awe
Quality
Entropy : 6.76
Noise : 77
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors in the image. There is a bit of noise in some areas but it’s not very noticeable.
Lost in the Vastness: A Solitary Figure Contemplates the Desert
A lone figure traverses a sprawling, deserted sand dune under a clear, blue sky. The vastness of the landscape emphasizes the figure’s isolation, creating a mood of loneliness and contemplation. The dramatic effect of the figure’s small size against the expansive desert evokes a sense of scale and perspective.
Prompt
poses low-angle: solitude, contemplative ; A lone traveler gazing out at a vast desert landscape, their back to the camera; medium shot; travel; endless sand dunes stretching out to the horizon; cinematic
Characteristic
Shot : A lone figure in a cowboy hat and backpack walks across a vast desert landscape with a warm, hazy light in the sky.
Aesthetic Score : 0.7
Mood : solitude, adventure, wanderlust
Quality
Entropy : 6.09
Noise : 52
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image appears to have a slight color cast, and the details on the figure are not very sharp.
Confetti Celebration: Young Adults Embrace the Joy of the Moment
Capture the vibrant energy of youth as a group of friends celebrate outdoors in an urban setting. Surrounded by colorful confetti, their laughter and smiles radiate pure happiness and carefree joy. This image embodies the spirit of celebration and the beauty of shared moments.
Prompt
poses low-angle: joyful, celebratory ; A group of friends celebrating a victory, their arms raised in the air, viewed from the perspective of someone standing below; wide shot; groups; a brightly lit party scene with confetti and balloons; cinematic
Characteristic
Shot : A group of young people are celebrating and throwing confetti in the air. It looks like a festival or a party. There are buildings in the background.
Aesthetic Score : 0.7
Mood : joyful, festive, celebratory
Quality
Entropy : 6.76
Noise : 82
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : None
Silhouette of Courage: Firefighter Battles Blaze
A dramatic image captures the intensity of a firefighter’s bravery as they stand silhouetted against a burning building, axe in hand. The flames engulf the structure, highlighting the danger and heroism of their job.
Prompt
poses low-angle: intense, heroic ; A lone firefighter battling a raging inferno, their silhouette framed against the flames; medium shot; heroism; a burning building with smoke billowing into the sky; cinematic
Characteristic
Shot : A firefighter, silhouetted against a burning building, is holding an axe in his right hand and preparing to enter the building to fight the fire.
Aesthetic Score : 0.6
Mood : dramatic, heroic, tense
Quality
Entropy : 6.85
Noise : 71
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable errors
On the Edge of Adventure: Hikers Embrace the Majestic View
A group of hikers stand on the precipice of a dramatic cliff, gazing out at a breathtaking valley. Bathed in warm sunlight, the scene evokes a sense of serenity and adventure, with the winding river and mist-shrouded mountains adding to the majestic beauty.
Prompt
poses low-angle: thrilling, adventurous ; A group of adventurers rappelling down a sheer cliff face, their ropes dangling below; medium shot; adventure; a breathtaking view of a mountain range and a valley below; cinematic
Characteristic
Shot : A group of climbers standing on a cliff edge, looking out at a stunning mountain vista with a river snaking through the valley below. The sun is shining and the sky is a soft blue with wispy clouds.
Aesthetic Score : 0.8
Mood : serene, adventurous, inspiring
Quality
Entropy : 6.71
Noise : 86
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable artifacts or errors.
Reaching for the Digital Horizon
A hand, bathed in warm pink light, extends towards a keyboard, poised to navigate a digital landscape displayed on the monitor behind. The scene evokes a sense of calm contemplation and anticipation, inviting viewers to ponder the possibilities that lie ahead.
Prompt
poses low-angle: immersive, fantastical ; A gamer’s hands deftly navigating a virtual world, their fingers flying across the keyboard; close-up; gaming; a vibrant, fantasy world displayed on the monitor; cinematic
Characteristic
Shot : A person’s hand is typing on a keyboard in front of a computer monitor. The monitor is displaying a fantasy landscape with a bright blue sky and pink clouds.
Aesthetic Score : 0.4
Mood : dreamy, calm, whimsical
Quality
Entropy : 6.92
Noise : 65
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some minor artifacts, such as the pixelation of the hand and the blurriness of the background.
Mystical Gateway to Hope
A group of figures stand silhouetted against the golden light streaming through a massive, ancient gateway. The scene evokes a sense of tranquility and hope, hinting at a journey into the unknown.
Prompt
poses low-angle: awe-inspiring, historical ; A group of tourists standing in awe before a magnificent ancient temple, their faces illuminated by the setting sun; wide shot; tourism; a sprawling temple complex with intricate carvings and statues; cinematic
Characteristic
Shot : A group of people standing in front of a large stone archway. The sun is setting in the background, casting a golden glow on the scene.
Aesthetic Score : 0.6
Mood : peaceful, warm, contemplative
Quality
Entropy : 6.73
Noise : 84
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly overexposed, resulting in a washed-out appearance.
Conclusion
The results show that the generative AI model was better at understanding the scene than at matching the requested camera position or the intended aesthetic. Here’s a breakdown:
- Camera Position: The model scored 0.4, which is considered below average. This suggests that the model often did not capture the camera position described in the prompt.
- Shot Analysis: The model scored 0.58, which is considered good and is the highest of the three scores. This indicates that the model understood the scene and produced shots relatively close to what the prompts described.
- Aesthetic Analysis: The model scored 0.33, the lowest of the three scores. The generated images’ aesthetic was only somewhat close to the expected aesthetic.
Overall, the model is better at understanding the scene and composing the shot than at capturing the requested camera angle or the desired aesthetic.
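The per-image numbers reported for the ten images can also be summarized directly. The sketch below is a small Python script that averages the Aesthetic Score, Prompt Clip Score, and Likelihood of AI values copied from this post; the shortened titles and the tuple layout are just a convenient representation. These per-image averages are separate from the three summary scores listed in the conclusion.

```python
from statistics import mean

# (title, aesthetic score, prompt CLIP score, likelihood of AI), as reported above
RESULTS = [
    ("Silhouetted Triumph", 0.8, 0.25, 0.20),
    ("Lost in the Verdant Mystery", 0.7, 0.25, 0.20),
    ("Lost in the Neon Glow", 0.6, 0.27, 0.20),
    ("Pharaoh's Majesty", 0.6, 0.26, 0.20),
    ("Lost in the Vastness", 0.7, 0.25, 0.70),
    ("Confetti Celebration", 0.7, 0.25, 0.20),
    ("Silhouette of Courage", 0.6, 0.24, 0.10),
    ("On the Edge of Adventure", 0.8, 0.29, 0.20),
    ("Reaching for the Digital Horizon", 0.4, 0.28, 0.20),
    ("Mystical Gateway to Hope", 0.6, 0.31, 0.10),
]

mean_aesthetic = mean(r[1] for r in RESULTS)
mean_clip = mean(r[2] for r in RESULTS)
mean_ai_likelihood = mean(r[3] for r in RESULTS)

# Images that stand out: lowest aesthetic score, highest likelihood of AI.
weakest = min(RESULTS, key=lambda r: r[1])
most_flagged = max(RESULTS, key=lambda r: r[3])

print(f"mean aesthetic score:  {mean_aesthetic:.3f}")
print(f"mean prompt CLIP score: {mean_clip:.3f}")
print(f"mean likelihood of AI: {mean_ai_likelihood:.3f}")
print(f"lowest aesthetic:      {weakest[0]} ({weakest[1]})")
print(f"highest AI likelihood: {most_flagged[0]} ({most_flagged[3]})")
```

The two outliers it surfaces are the keyboard close-up, which has the lowest aesthetic score at 0.4, and the desert image, which is the only one with a likelihood of AI above 0.20.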