AI's Artistic Journey: Capturing Poses and Scenes with Flux-schnell
- 9 minutes read - 1788 wordsTable of Contents
Dramatic style poses are a powerful tool in visual storytelling, adding depth and emotion to scenes. They are often used in photography, film, and art to convey a specific mood or message. For example, a lone figure standing on a mountain peak with arms outstretched can evoke a sense of triumph and freedom, while a group of friends huddled together in a dark forest can create a feeling of suspense and mystery. Generative AI models are increasingly being used to create images with dramatic poses, but they still face challenges in accurately capturing the intended camera positions and achieving the desired aesthetic.
Created with: flux-schnell
Awe-Inspiring View: Two Figures Conquer the Clouds
Standing atop a majestic mountain peak, two figures gaze out at a breathtaking panorama of endless clouds. The vastness of the scene evokes a sense of awe and wonder, while the figures’ small stature emphasizes the humbling power of nature. This serene and adventurous moment captures the spirit of contemplation and the beauty of the natural world.
Prompt
poses face-to-face: Determined, awe-inspiring ; A lone adventurer, standing on a mountain peak; wide shot; Adventure; Majestic mountain range with clouds swirling around; cinematic
Characteristic
Shot : Two figures, one taller than the other, standing on a mountaintop with a vast expanse of clouds below.
Aesthetic Score : 0.7
Mood : tranquil, adventurous, contemplative
Quality
Entropy : 6.65
Noise : 79
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : None visible
Silhouettes in the Twilight Forest
A haunting scene unfolds in a dark forest, where four figures stand silhouetted against the piercing sunlight filtering through the trees. The mood is heavy with mystery, eerieness, and melancholic undertones, leaving viewers captivated by the intrigue of the moment.
Prompt
poses face-to-face: Suspenseful, mysterious ; A group of friends, huddled together in a dark forest; medium shot; Adventure; Tall trees casting long shadows, sunlight filtering through the leaves; cinematic
Characteristic
Shot : A group of four people are standing in a dark forest. The sun is shining through the trees, creating a dramatic effect.
Aesthetic Score : 0.4
Mood : mysterious, dark, brooding
Quality
Entropy : 5.24
Noise : 60
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly underexposed, and the silhouettes are not very detailed. There are some visible artifacts in the shadows.
Man Faces Down a Dragon’s Fury
A tense standoff in a fantasy world. A man stands before a fearsome dragon, its red eye burning with intensity. The close-up shot and dramatic lighting create a sense of suspense and impending danger.
Prompt
poses face-to-face: Brave, intense ; A seasoned warrior, facing down a fearsome dragon; close-up; Heroism; Fiery dragon with glowing eyes, smoke billowing around; cinematic
Characteristic
Shot : A man in a dark hooded coat stands close to a giant, black dragon. The dragon’s red eye is glowing intensely, and its teeth are bared. The man’s face is tense and determined.
Aesthetic Score : 0.7
Mood : intense, dramatic, mysterious
Quality
Entropy : 6.64
Noise : 95
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image appears to have some slight artifacts around the edges of the dragon’s body.
Lost in the Digital World: A Boy’s Moment of Focus
A young boy, headphones on, gazes intently at his tablet, lost in a world of digital exploration. The blurred cityscape behind him adds a sense of depth and isolation, highlighting his focused and contemplative mood. The dramatic lighting and the soft focus create a sense of intrigue and curiosity, inviting the viewer to wonder what captivating content holds his attention.
Prompt
poses face-to-face: Focused, determined ; A young gamer, staring intently at a computer screen; close-up; Gaming; Vibrant, futuristic cityscape reflected in the screen; cinematic
Characteristic
Shot : A young boy wearing headphones is looking at a digital device, possibly a tablet or laptop. The background is blurry and appears to be an urban setting.
Aesthetic Score : 0.6
Mood : focused, thoughtful, contemplative
Quality
Entropy : 6.79
Noise : 68
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors
Parisian Romance: A Silhouette of Love
A couple, lost in each other’s eyes, stands before the iconic Eiffel Tower at dusk. The silhouette of the tower adds a touch of mystery and romance to this intimate moment, captured with a dreamy aesthetic.
Prompt
poses face-to-face: Romantic, nostalgic ; A couple, gazing at each other in front of the Eiffel Tower; medium shot; Tourism; Romantic Parisian cityscape with the Eiffel Tower in the background; cinematic
Characteristic
Shot : A couple is silhouetted against the Eiffel Tower at sunset, looking at each other.
Aesthetic Score : 0.7
Mood : romantic, intimate, dreamy
Quality
Entropy : 6.16
Noise : 70
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are slight compression artifacts visible in the background, and some noise in the shadows.
Lost in the Market’s Glow
A man stands amidst the vibrant chaos of a bustling street market, his presence commanding attention as the colorful background blurs into a dreamy haze. The scene evokes a sense of relaxed contemplation and friendly energy, capturing the essence of a moment lost in the heart of the market.
Prompt
poses face-to-face: Curious, vibrant ; A traveler, standing on a bustling street market; medium shot; Travel; Colorful stalls overflowing with exotic goods, people bustling around; cinematic
Characteristic
Shot : A man standing in a bustling market with colorful decorations in the background
Aesthetic Score : 0.6
Mood : casual, friendly, contemplative
Quality
Entropy : 6.84
Noise : 85
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant image errors. The image is well-lit and the subject is in focus.
Shadows and Flames: A Night of Mystery and Camaraderie
A group of young men gather around a flickering campfire in a shadowy forest, their faces illuminated by the warm orange glow. The scene evokes a sense of mystery, adventure, and the strong bonds of friendship forged in the wilderness.
Prompt
poses face-to-face: Intimate, suspenseful ; A group of explorers, huddled around a campfire; medium shot; Adventure; Dark forest with flickering flames illuminating their faces; cinematic
Characteristic
Shot : A group of four men are gathered around a campfire in a forest. The fire is small and the light is dim. The men are wearing casual clothing and look relaxed and friendly.
Aesthetic Score : 0.6
Mood : mysterious, intimate, warm
Quality
Entropy : 6.02
Noise : 70
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly grainy and has some noise.
Tiny Dreamer Gazes at the Giant: A Girl’s Wonder at the Freedom Tower
A young girl stands in awe, her small figure dwarfed by the towering Freedom Tower. The blurred cityscape behind her adds to the sense of wonder and hope, capturing the spirit of urban dreams.
Prompt
poses face-to-face: Awe-inspiring, hopeful ; A young girl, looking up at a towering skyscraper; wide shot; Tourism; Modern cityscape with towering skyscrapers and bustling streets; cinematic
Characteristic
Shot : A young girl looking up at a tall building, possibly the Freedom Tower in New York City.
Aesthetic Score : 0.6
Mood : curious, hopeful, urban
Quality
Entropy : 6.86
Noise : 82
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry, especially in the background.
Friends, Laughter, and Video Game Glory!
Capture the joy of shared gaming with this vibrant scene of four friends laughing and competing. The playful mood and dynamic lighting create a fun and energetic atmosphere.
Prompt
poses face-to-face: Joyful, celebratory ; A group of friends, celebrating a victory in a video game; close-up; Gaming; Brightly lit gaming room with controllers and headsets; cinematic
Characteristic
Shot : Four young adults are playing a video game, laughing and having fun. They are all wearing headphones and holding controllers.
Aesthetic Score : 0.6
Mood : joyful, playful, energetic
Quality
Entropy : 6.83
Noise : 85
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some minor artifacts around the edges of the people’s faces, possibly due to compression.
Silhouetted in Sunset’s Embrace
A solitary figure stands against the fiery hues of a setting sun, their face a canvas of contemplation and quiet melancholy. The dramatic silhouette evokes a sense of mystery and peace, inviting viewers to ponder the moment’s unspoken story.
Prompt
poses face-to-face: Melancholy, contemplative ; A lone traveler, standing on a deserted beach; wide shot; Travel; Vast ocean stretching out to the horizon, golden sunset; cinematic
Characteristic
Shot : A man standing in silhouette against a golden sunset, the beach and ocean behind him. He has a backpack on.
Aesthetic Score : 0.6
Mood : pensive, contemplative, tranquil
Quality
Entropy : 6.27
Noise : 37
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Conclusion
The results show that the generative AI model performed well in terms of understanding camera positions and scene composition, but struggled with achieving the desired aesthetic. Here’s a breakdown:
- Camera Position: The model scored 0.45, which falls below the “good” range of 0.5 to 0.75. This suggests that the model didn’t perfectly capture the intended camera positions described in the prompt.
- Shot Analysis: The model scored 0.5, which is considered “good”. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.11, which is within the “very good” range of -0.2 to 0.1. This means that the generated image’s aesthetic was very close to the expected aesthetic, despite the model’s struggles with camera positioning.
Overall, the model demonstrates a good understanding of scene composition and a strong ability to achieve the desired aesthetic. However, it needs improvement in accurately capturing the intended camera positions.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://fal.ai/models/fal-ai/flux/schnell/api