AI's Artistic Struggle: Capturing the Essence of Poses with Flux-schnell
- 9 minutes read - 1906 wordsTable of Contents
In the realm of artificial intelligence, the ability to generate images based on textual descriptions is a rapidly evolving field. This blog post delves into an experiment where an AI model was tasked with creating images based on specific poses and scene descriptions. While the model demonstrated a good understanding of camera positions and shot composition, it struggled to match the desired aesthetic, highlighting the ongoing challenges in AI’s artistic capabilities. This exploration sheds light on the complexities of translating human artistic vision into the digital realm, showcasing both the strengths and limitations of current AI technology.
Created with: flux-schnell
Silhouette of Hope in a Ruined World
A solitary figure walks towards a vibrant sunset, their silhouette casting a mysterious aura against the backdrop of a crumbling colonnade. The scene evokes a sense of solitude, mystery, and a glimmer of hope amidst the ruins.
Prompt
poses walking-away: Melancholy, yet hopeful ; Lone figure in a tattered cloak; wide shot; Heroism; Ruins of a fallen city bathed in the golden light of a setting sun; cinematic
Characteristic
Shot : A lone figure walks towards a bright sunset in a ruined city, the figure is silhouetted against the light, the city is in the background and is composed of crumbling columns.
Aesthetic Score : 0.7
Mood : melancholy, hopeful, mysterious
Quality
Entropy : 6.28
Noise : 89
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry, especially in the background. There are some minor artifacts in the image, such as noise and banding, that are visible in the shadows. The image is also slightly overexposed, which is causing some of the details in the highlights to be lost.
Lost in the Lush: A Tranquil Journey Through the Jungle
A lone traveler with a backpack ventures through a vibrant green jungle, the path ahead beckoning with adventure. The tranquil atmosphere and the distant figure add a sense of mystery and serenity to this captivating scene.
Prompt
poses walking-away: Excited, adventurous ; A young adventurer with a backpack; medium shot; Adventure; Lush jungle with a hidden path leading into the unknown; cinematic
Characteristic
Shot : A man is walking on a path in a lush green forest. He is wearing a backpack and looking ahead. A person can be seen in the distance, walking towards him.
Aesthetic Score : 0.6
Mood : serene, adventurous, mysterious
Quality
Entropy : 6.69
Noise : 118
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.30
Image errors : There is a slight blurriness in the image, especially in the background. The colors are also slightly muted.
Lost in Thought on a City Street
A young person, lost in contemplation, stands alone on a dimly lit city street. The shallow depth of field isolates them, creating a sense of mystery and urban solitude.
Prompt
poses walking-away: Focused, determined ; A gamer with a headset; close-up; Gaming; Neon-lit cityscape reflected in a computer screen; cinematic
Characteristic
Shot : A young person wearing headphones and a microphone walks through a city street at night. They are silhouetted against the bright neon signs and blurry lights of the city. The person is in focus, while the background is out of focus.
Aesthetic Score : 0.7
Mood : mysterious, urban, introspective
Quality
Entropy : 6.30
Noise : 47
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image seems to have some noise in the background and around the edges of the subject. The neon sign in the background is slightly blurry and not entirely sharp.
Lost in Love, Framed by Time
A couple strolls hand-in-hand down a cobblestone street in a European city, their silhouettes framed by the rustic charm of ancient buildings. The empty street and intimate setting evoke a sense of romantic isolation and nostalgic longing.
Prompt
poses walking-away: Romantic, carefree ; A couple holding hands; medium shot; Tourism; Picturesque European street with cobblestone paths and colorful buildings; cinematic
Characteristic
Shot : A couple walks hand in hand down a narrow street in a European city. The buildings are old and have a vintage feel. The woman is wearing a pink dress and the man is wearing a black jacket. The street is cobblestone.
Aesthetic Score : 0.6
Mood : romantic, nostalgic, quaint
Quality
Entropy : 6.80
Noise : 97
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly underexposed and lacks sharpness. There is some noise in the shadows.
Silhouettes of Departure: A Man’s Journey into the Twilight
A solitary figure walks away from the camera, his luggage in tow, towards a departing airplane in the distance. The setting sun casts long shadows, creating a melancholic yet hopeful scene of departure. The man’s silhouette against the twilight sky evokes a sense of longing and the bittersweet nature of farewells.
Prompt
poses walking-away: Nostalgic, bittersweet ; A lone traveler with a suitcase; long shot; Travel; Airport runway with a departing airplane in the distance; cinematic
Characteristic
Shot : A man with a backpack and a suitcase is walking away from the camera towards a runway with an airplane in the background. The sun is setting and there is a colorful gradient in the sky.
Aesthetic Score : 0.6
Mood : lonely, contemplative, travel
Quality
Entropy : 6.86
Noise : 70
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable artifacts or errors.
Sunset Smiles: Friends Enjoy a Carefree Beach Stroll
Capture the joy of summer with this heartwarming scene of six young women laughing and walking along a beach at sunset. The golden light and their infectious smiles create a mood of pure happiness and carefree fun.
Prompt
poses walking-away: Joyful, carefree ; A group of friends laughing; wide shot; Groups; Beach at sunset with the ocean waves crashing in the background; cinematic
Characteristic
Shot : A group of six young women are walking along a beach at sunset, their silhouettes are framed against the warm light. The beach is sandy and the waves are crashing softly in the background. The women are smiling and looking happy.
Aesthetic Score : 0.7
Mood : happy, carefree, summery
Quality
Entropy : 6.75
Noise : 52
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant errors are visible in the image.
Lost in the Mist: A Silhouette of Mystery
A lone figure ventures through a misty forest, their path illuminated by a distant light. The silhouette against the glow creates an atmosphere of suspense and intrigue, leaving you wondering what secrets lie ahead.
Prompt
poses walking-away: Determined, resolute ; A lone warrior with a sword; medium shot; Heroism; Dark forest with a path leading into the shadows; cinematic
Characteristic
Shot : A lone figure walks through a dark, misty forest. The path ahead is shrouded in fog, creating a sense of mystery and intrigue. The trees are tall and shadowy, adding to the atmospheric feel of the scene.
Aesthetic Score : 0.7
Mood : dark, mysterious, foreboding
Quality
Entropy : 5.10
Noise : 41
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors in the image.
Silhouettes of Mystery: Four Figures Disappear into the Light
A sense of adventure and somber mystery hangs in the air as four figures walk away from the camera, their forms silhouetted against the bright light streaming through a stone archway. The dramatic lighting adds to the intrigue, leaving the viewer wondering what lies beyond the portal.
Prompt
poses walking-away: Curious, excited ; A group of explorers with maps; wide shot; Adventure; Ancient ruins with a mysterious entrance; cinematic
Characteristic
Shot : Four people walking towards the camera in front of a large stone archway.
Aesthetic Score : 0.4
Mood : calm, contemplative, introspective
Quality
Entropy : 6.78
Noise : 117
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors, but the image is a bit flat and lacking in contrast.
Immersed in the Future: A Woman’s Joyful VR Gaming Experience
This image captures the excitement of virtual reality gaming. A woman, wearing a VR headset and holding a controller, is fully immersed in a digital world. The futuristic technology and her playful expression create a sense of wonder and excitement.
Prompt
poses walking-away: Immersed, excited ; A gamer with a controller; close-up; Gaming; Virtual reality headset with a fantastical world displayed; cinematic
Characteristic
Shot : A woman is wearing a VR headset and holding a game controller. She is in a crowded indoor space, likely a convention or trade show, and is engrossed in a virtual experience.
Aesthetic Score : 0.6
Mood : excited, curious, playful
Quality
Entropy : 6.77
Noise : 67
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry, particularly in the background. There’s a slight distortion around the edges of the VR headset.
A Family’s Journey Begins: A Moment of Departure Captured in Depth
A family, silhouetted against the vastness of a train station, walks away from the camera, their luggage in tow. The shallow depth of field blurs the background, drawing attention to their small figures and the sense of mystery surrounding their destination. The image evokes a feeling of calm anticipation and the excitement of a new adventure.
Prompt
poses walking-away: Emotional, bittersweet ; A family with luggage; long shot; Travel; Train station platform with a departing train in the distance; cinematic
Characteristic
Shot : A family of three is walking away from the camera at a train station. The father is in the foreground, followed by the daughter and then the mother. They are all carrying luggage.
Aesthetic Score : 0.7
Mood : tranquil, hopeful, family
Quality
Entropy : 6.79
Noise : 85
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable image errors
Conclusion
The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis. Here’s a breakdown:
Camera Position:
- Score: 0.45
- Interpretation: This score falls below the “good” range of 0.5 to 0.75. It suggests that the model didn’t perfectly capture the intended camera positions described in the prompt.
Shot Analysis:
- Score: 0.54
- Interpretation: This score falls within the “good” range of 0.5 to 0.75. It indicates that the model was able to understand and translate the scene description from the prompt into the generated image fairly well.
Aesthetic Analysis:
- Score: 0.14
- Interpretation: This score is significantly higher than the “very good” range of -0.2 to 0.1. It suggests that the generated image’s aesthetic deviated considerably from the expected aesthetic described in the prompt.
Overall:
The model demonstrates a good understanding of camera positions and shot composition, but struggles to match the desired aesthetic. This suggests that the model might need further training to better understand and translate aesthetic preferences into its generated images.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://fal.ai/models/fal-ai/flux/schnell/api