AI Captures the Moment, But Misses the Mood with Flux-dev
- 9 minutes read - 1783 wordsTable of Contents
Dramatic poses are a powerful tool in visual storytelling, used to convey emotion, action, and character. From the iconic silhouette of a lone hero against a setting sun to the intimate huddle of adventurers around a campfire, these poses evoke a sense of drama and intrigue. However, capturing the essence of these poses in AI-generated images remains a challenge. While the model can understand the basic elements of composition and camera position, it often struggles to translate the intended aesthetic into a visually compelling image.
Created with: flux-dev
Silhouetted Against the Setting Sun
A solitary figure stands in silhouette against a vibrant orange sunset, bathed in the golden glow of the setting sun. The scene evokes a sense of serenity, contemplation, and isolation, leaving the viewer to ponder the figure’s thoughts and emotions.
Prompt
poses leaning: epic, hopeful ; A lone figure, silhouetted against a setting sun; wide shot; heroism; a vast, desolate landscape; cinematic
Characteristic
Shot : A lone figure stands in silhouette against a vibrant sunset, the sun creating a warm glow across the horizon. The figure is standing on a barren landscape, likely a desert or a plain.
Aesthetic Score : 0.7
Mood : tranquil, contemplative, hopeful
Quality
Entropy : 6.23
Noise : 35
Prompt Clip Score : 0.20
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant errors in the image
Secrets in the Shadows: A Dramatic Encounter in a Dark Cave
A group of men huddle together in the eerie glow of flickering torches, their faces etched with tension and anticipation. The suspenseful atmosphere and dramatic lighting create a sense of mystery and intrigue, leaving you wondering what secrets lie hidden within the cave’s depths.
Prompt
poses leaning: suspenseful, adventurous ; A group of adventurers, their faces illuminated by flickering torchlight; medium shot; adventure; a dark, mysterious cave; cinematic
Characteristic
Shot : Three men huddle together in a dark cave, lit by flickering flames from torches they hold.
Aesthetic Score : 0.7
Mood : mysterious, suspenseful, dramatic
Quality
Entropy : 6.49
Noise : 73
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors
Red Hot Focus: A Woman’s Intense Concentration at the Keyboard
A young woman sits in a dimly lit room, her fingers flying across a vibrant red keyboard. The scene exudes an atmosphere of intense focus and concentration, heightened by the dramatic lighting and the striking color of the keyboard.
Prompt
poses leaning: intense, focused ; A gamer’s hands, fingers flying across a keyboard; close-up; gaming; a brightly lit gaming setup; cinematic
Characteristic
Shot : A young woman is sitting at a desk in a dimly lit room, typing on a keyboard. She is wearing headphones and has a computer monitor in front of her. There are some colorful lights reflecting on the computer screen, suggesting she’s playing a game.
Aesthetic Score : 0.6
Mood : focused, intense, concentrated
Quality
Entropy : 6.56
Noise : 59
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some noise and slight compression artifacts are visible, especially in the dark areas around the subject’s hair.
Silhouettes of Love Against the Sunset
A couple embraces on a rooftop, their silhouettes painted against a breathtaking sunset over the city. The scene evokes a sense of romance, intimacy, and hope, capturing the beauty of a shared moment.
Prompt
poses leaning: romantic, awe-inspiring ; A couple leaning on a railing, gazing out at a breathtaking cityscape; medium shot; tourism; a vibrant, bustling city; cinematic
Characteristic
Shot : A couple silhouetted against a cityscape at sunset, they are standing on a rooftop overlooking the city. The sun is setting in the distance, casting a warm glow over the scene. The buildings are tall and slender, and the couple is looking out at the view.
Aesthetic Score : 0.6
Mood : romantic, calm, nostalgic
Quality
Entropy : 6.79
Noise : 52
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.30
Image errors : There are some minor artifacts in the image, particularly around the edges of the couple’s figures. There is also some slight noise in the image.
Lost in the Mountains: A Moment of Contemplation
A solitary figure stands at a mountain crossroads, lost in thought as they gaze out at the breathtaking vista. The winding road and vast backdrop evoke a sense of adventure and reflection, capturing the essence of a contemplative journey.
Prompt
poses leaning: reflective, adventurous ; A backpacker, leaning against a weathered signpost, looking out at a winding mountain road; medium shot; travel; a scenic mountain range; cinematic
Characteristic
Shot : A lone hiker with a backpack stands by a signpost on a mountain road with stunning views.
Aesthetic Score : 0.7
Mood : serene, contemplative, adventurous
Quality
Entropy : 6.81
Noise : 65
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : Slight noise in the background, especially in the mountains.
Friendship’s Glow in a City Backstreet
Four friends, radiating joy, stroll down a narrow city street, their laughter echoing in the intimate space. The light at the end of the street adds a touch of mystery, hinting at the adventures that await them.
Prompt
poses leaning: joyful, carefree ; A group of friends, laughing and leaning on each other, as they walk down a cobblestone street; wide shot; groups; a charming, historic town; cinematic
Characteristic
Shot : Four young adults, two men and two women, are walking together in a city street. They appear to be friends, enjoying each other’s company. The street is lined with buildings on both sides.
Aesthetic Score : 0.7
Mood : happy, friendly, urban
Quality
Entropy : 6.76
Noise : 87
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible artifacts or errors in the image. The quality is good.
Silhouetted Against the Storm: A Moment of Solitude
A lone figure stands defiant on a windswept cliff, the crashing waves of a stormy sea creating a dramatic backdrop. The silhouette against the turbulent sky evokes a sense of loneliness and melancholic beauty.
Prompt
poses leaning: powerful, defiant ; A lone figure, standing on a cliff edge, arms outstretched, leaning into the wind; wide shot; heroism; a dramatic, stormy sea; cinematic
Characteristic
Shot : A solitary figure stands on the edge of a cliff overlooking a stormy sea. The waves are crashing against the rocks, and the sky is dark and cloudy.
Aesthetic Score : 0.7
Mood : dramatic, melancholic, contemplative
Quality
Entropy : 6.58
Noise : 70
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image has a slightly washed-out look. The edges are slightly blurred.
Mystery Awaits: Friends Gather Around a Campfire in the Misty Woods
A group of four men huddle around a crackling campfire, their faces illuminated by the dancing flames. The surrounding forest is shrouded in a thick mist, adding an air of mystery and intrigue. The scene evokes a sense of adventure and camaraderie, hinting at the unknown that lies ahead.
Prompt
poses leaning: intimate, suspenseful ; A group of explorers, huddled around a campfire, sharing stories; medium shot; adventure; a dense, mysterious forest; cinematic
Characteristic
Shot : Four men are sitting around a campfire in a forest at night. The fire is burning brightly and the men are talking and laughing. The forest is dark and mysterious, with trees all around. It’s cold and there is fog in the air.
Aesthetic Score : 0.6
Mood : cozy, mysterious, adventurous
Quality
Entropy : 6.46
Noise : 90
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible errors in the image, but the fire could be more visible.
Caught in the Moment: Headphones, Surprise, and Pure Excitement
A close-up shot captures the raw emotion of a young person, headphones on, eyes wide with surprise and excitement. The intensity of the moment is palpable, leaving you wondering what sparked this reaction.
Prompt
poses leaning: intense, focused ; A gamer’s face, illuminated by the glow of a monitor, eyes wide with excitement; close-up; gaming; a dimly lit room; cinematic
Characteristic
Shot : A close-up shot of a young person wearing headphones, with their mouth open in a surprised or excited expression. The lighting is dramatic, with a dark background and a soft glow on the subject’s face.
Aesthetic Score : 0.7
Mood : intense, surprised, focused
Quality
Entropy : 6.24
Noise : 68
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image contains some noise and grain, particularly in the shadows.
Silhouettes of Love at Sunset
A romantic couple sits on the beach, their silhouettes framed against the fiery sunset. The scene evokes a sense of peace, serenity, and intimacy, capturing the essence of a perfect evening.
Prompt
poses leaning: peaceful, heartwarming ; A family, leaning on each other, watching a sunset over a vast ocean; wide shot; travel; a serene, sandy beach; cinematic
Characteristic
Shot : A couple sitting on a beach at sunset, silhouetted against the golden sky.
Aesthetic Score : 0.7
Mood : romantic, serene, peaceful
Quality
Entropy : 6.31
Noise : 55
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors.
Conclusion
The results of the image analysis show that the generative AI model performed well in understanding the camera position and shot composition, but struggled with the aesthetic aspect.
Here’s a breakdown:
- Camera Position: The model scored 0.5, which falls within the “good” range. This indicates that the model was able to capture the intended camera position fairly well, but there’s room for improvement to reach the “very good” level.
- Shot Analysis: The model scored 0.55, also within the “good” range. This suggests that the model understood the scene and its composition reasonably well, but could benefit from further refinement to achieve a more accurate representation.
- Aesthetic Analysis: The model scored 0.08, which is significantly lower than the ideal range of -0.2 to 0.1. This indicates that the generated image’s aesthetic deviated considerably from the expected aesthetic based on the prompt.
Overall, the model demonstrates a good understanding of camera position and shot composition, but needs improvement in capturing the desired aesthetic.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://fal.ai/models/fal-ai/flux/dev/api