AI's Artistic Journey: Capturing Poses, But Missing the Mark on Camera Angles with Flux-schnell
The world of AI art is constantly evolving, with models becoming increasingly adept at understanding and replicating human creativity. One area of particular interest is how well these models capture poses and scenes. This blog post looks at images that a generative AI model, flux-schnell, created from prompts specifying poses and scenes, and highlights where it succeeds and where it falls short.

Dramatic poses are often used in photography, film, and art to convey emotion, action, or a sense of power, and they can create drama, excitement, or even humor. For example, a dramatic pose might capture the intensity of a sporting event, the excitement of a wedding, or the sadness of a funeral. The sections below examine the model’s ability to capture the desired aesthetic and to follow the camera angles requested in each prompt.
Created with: flux-schnell
Soldiers on the Brink: A Moment of Tense Anticipation
A group of soldiers stand in formation, their faces etched with seriousness, against a backdrop of a smoky, apocalyptic landscape. The image is composed to create a sense of tension and anticipation, suggesting a looming threat or a moment of intense action.
Prompt
poses standing-in-a-row: determined, courageous, hopeful ; A group of soldiers; wide shot; heroism; a battlefield with smoke and explosions in the background; cinematic
Characteristic
Shot : A group of soldiers in military uniform stand in a line, with a backdrop of smoke and explosions. They are all looking forward, except for the soldier on the far right, who is looking to the side.
Aesthetic Score : 0.7
Mood : serious, military, tense
Quality
Entropy : 6.38
Noise : 93
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slight amount of noise and grain, especially in the shadows and highlights. There are also some minor imperfections in the edges of the soldiers’ uniforms.
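The prompts in this post all follow the same semicolon-delimited layout. As a minimal sketch, the snippet below splits such a prompt into named parts so the requested shot type can be compared with the result. The field names (`pose`, `emotions`, `subject`, `shot`, `theme`, `setting`, `style`) and the `parse_prompt` helper are assumptions made for illustration; the post does not name the fields or describe how the prompts were built.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PosePrompt:
    # All field names are hypothetical labels for the prompt's six parts.
    pose: str            # e.g. "standing-in-a-row"
    emotions: List[str]  # e.g. ["determined", "courageous", "hopeful"]
    subject: str         # e.g. "A group of soldiers"
    shot: str            # requested camera framing, e.g. "wide shot"
    theme: str           # e.g. "heroism"
    setting: str         # background description
    style: str           # e.g. "cinematic"


def parse_prompt(prompt: str) -> PosePrompt:
    """Split a semicolon-delimited prompt into its assumed fields."""
    parts = [p.strip() for p in prompt.split(";")]
    if len(parts) != 6:
        raise ValueError(f"expected 6 semicolon-separated parts, got {len(parts)}")
    head, subject, shot, theme, setting, style = parts
    # The first part looks like "poses <pose-name>: <emotion>, <emotion>, ..."
    pose_part, _, emotion_part = head.partition(":")
    pose = pose_part.replace("poses", "", 1).strip()
    emotions = [e.strip() for e in emotion_part.split(",") if e.strip()]
    return PosePrompt(pose, emotions, subject, shot, theme, setting, style)


soldiers = parse_prompt(
    "poses standing-in-a-row: determined, courageous, hopeful ; "
    "A group of soldiers; wide shot; heroism; "
    "a battlefield with smoke and explosions in the background; cinematic"
)
print(soldiers.shot)
```

Pulling out the `shot` field on its own makes it easier to check whether a "wide shot" or "close-up shot" request actually shows up in the generated image.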
Friends Explore Ancient Wonders with Joyful Smiles
A group of six young adults stand before an ancient temple, basking in the scenic beauty. Their happy and relaxed expressions capture the adventurous spirit of their journey. The image boasts a balanced composition, vibrant colors, and a sense of depth, creating a visually pleasing and uplifting scene.
Prompt
poses standing-in-a-row: excited, curious, adventurous ; A team of explorers; medium shot; adventure; a lush jungle with ancient ruins in the distance; cinematic
Characteristic
Shot : A group of six friends are exploring a jungle in front of an ancient temple.
Aesthetic Score : 0.6
Mood : adventurous, curious, excited
Quality
Entropy : 6.85
Noise : 104
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant artifacts or errors are visible in the image.
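The post does not say how the Entropy value is computed. Values between 6 and 7 are consistent with the Shannon entropy of an 8-bit grayscale histogram, which ranges from 0 to 8 bits, so the sketch below assumes that definition. The function name `shannon_entropy` is illustrative, not taken from the original evaluation pipeline.

```python
import numpy as np


def shannon_entropy(gray: np.ndarray) -> float:
    """Shannon entropy, in bits, of an 8-bit grayscale image's histogram.

    Assumption: this is one common definition of image entropy; the
    post does not state which definition its Entropy metric uses.
    """
    counts = np.bincount(gray.ravel().astype(np.uint8), minlength=256)
    probs = counts[counts > 0] / gray.size  # skip empty bins to avoid log2(0)
    return float(-np.sum(probs * np.log2(probs)))


flat = np.zeros((16, 16), dtype=np.uint8)                  # a single gray level
ramp = np.arange(256, dtype=np.uint8).reshape(16, 16)      # every level once
print(shannon_entropy(flat), shannon_entropy(ramp))
```

A perfectly flat image gives 0 bits and an image that uses all 256 levels equally gives 8 bits, so under this assumption the values reported here (roughly 6.2 to 6.9) would indicate images with a broad tonal range. For a real file, a grayscale array could be obtained with Pillow via `np.asarray(Image.open(path).convert("L"))`.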
The Stage is Set: Gamers Ready for Epic Tournament Showdown
A group of young men, heads down and focused, stand poised in a brightly lit arena, their headsets reflecting the anticipation of an upcoming gaming tournament. The dynamic composition, with the players arranged in a diagonal line, builds excitement and draws the viewer’s eye towards the screen, where the battle is about to begin.
Prompt
poses standing-in-a-row: focused, competitive, passionate ; A group of gamers; close-up shot; gaming; a brightly lit esports arena with cheering fans; cinematic
Characteristic
Shot : A group of young men wearing headsets are standing in a dimly lit stadium-like venue. They are likely attending a gaming tournament or esports event.
Aesthetic Score : 0.6
Mood : excited, focused, competitive
Quality
Entropy : 6.63
Noise : 92
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable errors or artifacts in the image
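The Prompt Clip Score values in this post (0.24 to 0.32) are in the range typically seen when a CLIP-style score is taken as the raw cosine similarity between an image embedding and a text embedding. Assuming that is how the score is defined here, the core computation looks like the sketch below. The embedding vectors are made-up placeholders; in practice they would come from a CLIP image encoder and text encoder.

```python
import numpy as np


def cosine_similarity(a, b) -> float:
    """Cosine similarity between two embedding vectors."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    if denom == 0:
        raise ValueError("embeddings must be non-zero")
    return float(np.dot(a, b) / denom)


# Placeholder embeddings for illustration only (not real CLIP outputs).
image_embedding = [0.2, 0.7, 0.1, 0.4]
text_embedding = [0.6, 0.1, 0.5, 0.3]
print(round(cosine_similarity(image_embedding, text_embedding), 2))
```

Because such a score compares the whole prompt with the whole image, a mismatch in camera framing alone (for example, a "close-up shot" rendered as a group shot) may shift it only slightly.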
Friends Embrace the Mountain Majesty
A group of friends stand together, radiating joy and adventure, against the backdrop of a breathtaking mountain valley. The clear blue sky and stunning views create a sense of awe and wonder, while the towering peaks add a touch of grandeur to the scene.
Prompt
poses standing-in-a-row: happy, relaxed, joyful ; A family of tourists; long shot; tourism; a breathtaking view of a mountain range with a clear blue sky; cinematic
Characteristic
Shot : A group of friends or family standing in front of a mountain range with a clear blue sky in the background
Aesthetic Score : 0.6
Mood : happy, friendly, adventurous
Quality
Entropy : 6.80
Noise : 76
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly overexposed, and the colors are a bit washed out. Additionally, the mountains in the background are somewhat blurry.
Adventure Awaits: Friends Embark on a Journey Through Nature
A group of six young adults, radiating happiness and camaraderie, stand on a dirt road surrounded by lush greenery. Their casual attire and backpacks hint at an exciting adventure ahead, captured in a vibrant and energetic image.
Prompt
poses standing-in-a-row: free-spirited, adventurous, optimistic ; A group of backpackers; medium shot; travel; a dusty road leading to a distant village with palm trees; cinematic
Characteristic
Shot : A group of six friends are walking down a dirt road in a rural area. They are all wearing casual clothing and backpacks, and they seem to be enjoying their journey.
Aesthetic Score : 0.7
Mood : joyful, adventurous, hopeful
Quality
Entropy : 6.86
Noise : 86
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Spotlight Serenade: Women in Formal Attire Command the Stage
A captivating scene unfolds as a group of women, adorned in elegant attire, stand bathed in the glow of spotlights against a dark backdrop. The dramatic lighting accentuates their presence, creating a sense of hope and anticipation. This performance promises an unforgettable experience.
Prompt
poses standing-in-a-row: harmonious, powerful, emotional ; A choir singing in harmony; close-up shot; groups; a dimly lit stage with spotlights; cinematic
Characteristic
Shot : A group of women in choir dress standing on a stage in a dimly lit room with spotlights. The women are singing and appear to be in a performance.
Aesthetic Score : 0.6
Mood : serene, hopeful, celebratory
Quality
Entropy : 6.22
Noise : 77
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : None
Colorful Choreography Lights Up the Stage with Joyful Energy
A group of young women in vibrant outfits bring the stage to life with their energetic dance routine. The playful mood is amplified by the dramatic lighting, highlighting their every move and creating a captivating performance.
Prompt
poses standing-in-a-row: energetic, synchronized, joyful ; A line of dancers; wide shot; groups; a brightly lit stage with colorful costumes; cinematic
Characteristic
Shot : A group of young women in colorful dancewear are performing on a stage with a red curtain in the background.
Aesthetic Score : 0.7
Mood : joyful, energetic, celebratory
Quality
Entropy : 6.86
Noise : 108
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has slight chromatic aberration around the edges, particularly noticeable on the dancers’ legs. The lighting is slightly uneven, with some areas being too bright or too dark.
Sunset Silhouettes: Friends Embrace the Golden Hour
A group of friends stand on a beach, their silhouettes painted against the vibrant sunset. The warm glow of the sky creates a joyful and relaxed atmosphere, highlighting the intimacy and connection between the group.
Prompt
poses standing-in-a-row: relaxed, happy, nostalgic ; A group of friends; medium shot; groups; a sunset over a beach with waves crashing in the background; cinematic
Characteristic
Shot : A group of friends are standing on a beach at sunset. They are all smiling and looking happy. The sky is a beautiful orange and pink color.
Aesthetic Score : 0.6
Mood : joyful, carefree, nostalgic
Quality
Entropy : 6.71
Noise : 93
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : None
A Team United in Purpose: Professionals Working Together in a Sterile Environment
This image captures a group of four individuals in white lab coats, standing in a sterile room, likely a hospital or laboratory. Their focused expressions and unified pose convey a sense of professionalism and shared purpose. The image evokes a mood of seriousness and dedication, highlighting the importance of teamwork in a professional setting.
Prompt
poses standing-in-a-row: focused, determined, innovative ; A team of scientists; close-up shot; groups; a laboratory with complex machinery and glowing screens; cinematic
Characteristic
Shot : A group of four people, three women and one man, are standing in a room that appears to be a laboratory or a medical facility. They are all wearing white lab coats and are looking at the camera. The man in the center of the image is looking slightly to the left, while the others are looking directly at the camera.
Aesthetic Score : 0.6
Mood : professional, serious, sterile
Quality
Entropy : 6.89
Noise : 91
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly overexposed, but this does not significantly affect the quality of the image. The background is also a bit blurry, but this is not a major issue.
Hopeful Marchers Demand Change in City Streets
A group of determined protesters march through the city, their signs and banners a powerful display of hope and resolve. The urban backdrop emphasizes the scale of their movement and the urgency of their message.
Prompt
poses standing-in-a-row: determined, passionate, hopeful ; A group of protesters; long shot; groups; a city street with banners and signs; cinematic
Characteristic
Shot : A group of people, mostly women, are protesting or demonstrating in a city street. They are holding signs and banners, and some are wearing hats or scarves.
Aesthetic Score : 0.6
Mood : determined, hopeful, political
Quality
Entropy : 6.81
Noise : 98
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some minor image artifacts are present, particularly in the background.
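To compare the ten images side by side, the per-image numbers reported above can be collected and averaged. The sketch below copies the Aesthetic Score and Prompt Clip Score of each image from this post and groups the CLIP score by the shot type requested in the prompt. The short labels and the `mean_clip_by_shot` helper are illustrative, and with only two or three images per shot type the group means are descriptive rather than conclusive.

```python
from collections import defaultdict
from statistics import mean

# (label, shot type requested in the prompt, Aesthetic Score, Prompt Clip Score)
RESULTS = [
    ("soldiers",    "wide shot",     0.7, 0.32),
    ("explorers",   "medium shot",   0.6, 0.32),
    ("gamers",      "close-up shot", 0.6, 0.29),
    ("tourists",    "long shot",     0.6, 0.28),
    ("backpackers", "medium shot",   0.7, 0.29),
    ("choir",       "close-up shot", 0.6, 0.28),
    ("dancers",     "wide shot",     0.7, 0.25),
    ("friends",     "medium shot",   0.6, 0.27),
    ("scientists",  "close-up shot", 0.6, 0.30),
    ("protesters",  "long shot",     0.6, 0.24),
]


def mean_clip_by_shot(results):
    """Average Prompt Clip Score for each requested shot type."""
    groups = defaultdict(list)
    for _label, shot, _aesthetic, clip in results:
        groups[shot].append(clip)
    return {shot: mean(values) for shot, values in groups.items()}


overall_aesthetic = mean(r[2] for r in RESULTS)
overall_clip = mean(r[3] for r in RESULTS)
by_shot = mean_clip_by_shot(RESULTS)

print(f"mean aesthetic: {overall_aesthetic:.2f}, mean CLIP: {overall_clip:.3f}")
for shot, value in sorted(by_shot.items()):
    print(f"{shot}: {value:.3f}")
```

Across the ten images the mean Aesthetic Score is 0.63 and the mean Prompt Clip Score is 0.284.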
Conclusion
The results show that the generative AI model understood scene content reasonably well and came close to the desired aesthetic, but struggled with the camera positions requested in the prompts. Here’s a breakdown:
- Camera Position: The model scored 0.45, which falls below the “good” range of 0.5 to 0.75. This suggests that the model didn’t perfectly capture the intended camera angles or perspectives described in the prompt.
- Shot Analysis: The model scored 0.5, which is considered “good”. This indicates that the model was able to understand the scene and its elements reasonably well, but there might be some discrepancies between the prompt and the generated image.
- Aesthetic Analysis: The model scored 0.11, which sits at the upper edge of (strictly, just above) the “very good” range of -0.2 to 0.1. This means that the generated images’ aesthetic was quite close to the expected aesthetic, despite the model’s struggles with camera position.
Overall, the model shows promise in understanding the scene and achieving the desired aesthetic, but needs improvement in accurately interpreting camera positions.
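The breakdown compares each score against a named range. A minimal sketch of that check, using only the scores and ranges quoted in this conclusion, is shown below. The `BANDS` table and the `in_band` helper are illustrative names, and the range for Shot Analysis is an assumption: the post calls 0.5 "good" without stating the range, so the camera-position range is reused here.

```python
# Ranges as quoted in the conclusion: metric -> (label, low, high).
BANDS = {
    "camera_position": ("good", 0.5, 0.75),
    "shot_analysis": ("good", 0.5, 0.75),   # assumed: range not stated in the post
    "aesthetic": ("very good", -0.2, 0.1),
}

# Scores as quoted in the conclusion.
SCORES = {"camera_position": 0.45, "shot_analysis": 0.5, "aesthetic": 0.11}


def in_band(score: float, low: float, high: float) -> bool:
    """True when the score lies inside the inclusive range [low, high]."""
    return low <= score <= high


for metric, score in SCORES.items():
    label, low, high = BANDS[metric]
    status = "inside" if in_band(score, low, high) else "outside"
    print(f"{metric}: {score} is {status} the '{label}' range [{low}, {high}]")
```

With inclusive bounds, 0.45 falls outside the "good" range, 0.5 sits exactly on its lower edge, and 0.11 falls marginally outside the quoted "very good" range, so the aesthetic result is best read as borderline.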
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://fal.ai/models/fal-ai/flux/schnell/api