AI's Artistic Journey: Capturing Poses, But Missing the Shot with DALL-E 3
Generating images from text prompts is a rapidly evolving area of artificial intelligence. This blog post examines how well a generative AI model, DALL-E 3, captures the essence of poses and scenes. The model handles the aesthetic aspects of a prompt remarkably well, but struggles to reproduce the requested camera angle and shot composition. We explore the model’s strengths and weaknesses across ten prompts, and what they suggest about the challenges and opportunities in AI-generated imagery.
Created with: dall-e-3
Silhouetted Against the Sunset: A Moment of Hope on the Mountaintop
A lone figure stands on a majestic mountain peak, their silhouette stark against the fiery hues of the setting sun. The vast valley below stretches out, promising endless possibilities. This epic scene evokes a sense of hope and grandeur, leaving you pondering the mysteries that lie ahead.
Prompt
poses profile: Epic, hopeful, determined ; A lone figure, silhouetted against a setting sun; wide shot; Heroism; A vast, mountainous landscape; cinematic
Characteristic
Shot : A lone figure, silhouetted against a vibrant sunset, stands on a rocky ridge overlooking a vast, sun-drenched valley carved by majestic mountains.
Aesthetic Score : 0.8
Mood : epic, majestic, serene
Quality
Entropy : 6.48
Noise : 88
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.50
Image errors : Slight color banding in the sky, particularly around the sun, and some blurring in the distant mountains.
A Moment of Solitude Amidst Nature’s Grandeur
A lone hiker finds peace and awe on a rocky cliff, overlooking a breathtaking valley bathed in the golden light of sunset. Cascading waterfalls and a winding river weave through rolling green hills, creating a scene of serene beauty and adventurous spirit.
Prompt
poses profile: Adventurous, free-spirited, awe-inspired ; A backpacker standing on a cliff edge, looking out at a breathtaking view; medium shot; Adventure; A sprawling valley with cascading waterfalls; cinematic
Characteristic
Shot : A woman with a backpack is kneeling on a cliff edge overlooking a vast valley with a river snaking through it. In the foreground, there is a majestic waterfall cascading down the cliff face. The scene is bathed in the warm glow of a setting sun.
Aesthetic Score : 0.7
Mood : serene, adventurous, contemplative
Quality
Entropy : 6.76
Noise : 103
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.60
Image errors : The waterfall appears slightly blurred and unnatural. The colors in the image are slightly desaturated, and there is a slight halo effect around the woman’s figure.
The Glow of Victory: A Gamer’s Focus
A young man’s hands grip the controller, illuminated by a vibrant glow. The intensity of his focus is palpable, as he navigates the virtual world with determination. The blurred background hints at the escape he finds in the game, a world where anything is possible.
Prompt
poses profile: Focused, intense, passionate ; A gamer’s hands, illuminated by the glow of a monitor, holding a controller; close-up; Gaming; A dimly lit room with gaming posters on the walls; cinematic
Characteristic
Shot : A young man is playing video games in his room. He is holding a controller in his hands, and his face is illuminated by the glow of the screen. There are posters on the wall behind him, and a desk lamp is shining on his desk.
Aesthetic Score : 0.7
Mood : intense, focused, energetic
Quality
Entropy : 6.79
Noise : 93
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.90
Image errors : The lighting and composition of the image are inconsistent. The lighting on the man’s face is too bright compared to the overall lighting of the scene. The composition is a bit awkward, as the man’s head is cut off at the top of the image.
Awe-Inspiring Cathedral Captures Tourist’s Heart
A young man stands in awe before a grand cathedral, his expression reflecting the wonder and excitement of the moment. The majestic architecture and the blurred background of bustling activity create a sense of joyful discovery.
Prompt
poses profile: Curious, excited, appreciative ; A tourist gazing up at a majestic cathedral; medium shot; Tourism; A bustling city square with cobblestone streets; cinematic
Characteristic
Shot : A young man with a backpack is standing in front of a large cathedral, looking up in awe, possibly taking a selfie. The architecture of the cathedral is beautiful and the man’s expression is one of wonder.
Aesthetic Score : 0.6
Mood : joyful, adventurous, curious
Quality
Entropy : 6.80
Noise : 96
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to be slightly over-sharpened and has some noticeable artifacts around the man’s face and hair. The background is a bit too blurry and the colors are not very vibrant.
A Moment of Contemplation: Hijab-Clad Woman Gazes at the Vastness
A woman, her face partially obscured by a hijab, sits on a train, her gaze fixed on a sprawling field of crops stretching out beyond the window. The soft light and the contrast between the woman and the landscape create a sense of calm contemplation and a hint of mystery. The scene evokes a feeling of wistfulness, leaving the viewer to ponder the woman’s thoughts and the journey she is on.
Prompt
poses profile: Reflective, contemplative, nostalgic ; A traveler sitting on a train, looking out the window at passing scenery; medium shot; Travel; A scenic train journey through rolling hills and fields; cinematic
Characteristic
Shot : A woman in a hijab is sitting by the window of a train looking out at the scenic view of green fields and mountains.
Aesthetic Score : 0.7
Mood : peaceful, contemplative, serene
Quality
Entropy : 6.57
Noise : 96
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no significant artifacts or errors in the image.
Friends Celebrate with Unbridled Joy
A group of friends radiate pure happiness as they laugh and celebrate together. The scene is bursting with energy and captures the spontaneous joy of a shared moment.
Prompt
poses profile: Joyful, celebratory, connected ; A group of friends laughing and celebrating together; wide shot; Groups; A lively party with colorful decorations and music; cinematic
Characteristic
Shot : A group of friends are laughing and having a good time at a party. The image is taken from a low angle, giving the viewer a close-up perspective of the friends’ faces.
Aesthetic Score : 0.7
Mood : joyful, celebratory, carefree
Quality
Entropy : 6.76
Noise : 103
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image appears to be slightly overexposed, and the background is slightly blurry. There is also some noise in the image, particularly in the shadows.
Heroic Silhouette Against the Future
A superhero stands tall on a rooftop, bathed in the golden light of a futuristic sunset. Behind him, a swirling portal promises adventure and hope. This epic scene captures the strength and determination of a hero facing an uncertain future.
Prompt
poses profile: Powerful, confident, inspiring ; A superhero standing tall, cape billowing in the wind; medium shot; Heroism; A cityscape with towering skyscrapers; cinematic
Characteristic
Shot : A futuristic superhero stands on a rooftop overlooking a city at sunset, a glowing circular portal behind him.
Aesthetic Score : 0.7
Mood : epic, hopeful, powerful
Quality
Entropy : 6.75
Noise : 117
Prompt Clip Score : 0.20
AI Evaluation
Likelihood of AI : 0.90
Image errors : The city skyline is a bit blurry and lacks depth, and the superhero’s cape looks a bit stiff and unnatural.
Lost in the Jungle’s Embrace: A Temple Beckons
A mystical journey awaits as hikers venture through a dense, verdant jungle, the path leading to ancient temple ruins bathed in ethereal light. The scene evokes a sense of adventure, serenity, and mystery, with dramatic light and shadow highlighting the jungle’s beauty and the temple’s secrets.
Prompt
poses profile: Intrigued, adventurous, determined ; A group of explorers navigating a dense jungle; wide shot; Adventure; Lush greenery, ancient ruins, and dappled sunlight; cinematic
Characteristic
Shot : A group of people walk through an overgrown jungle, passing by the ruins of ancient structures. The jungle is dense and lush, with large trees and vines. The sun shines through the canopy, creating a dramatic lighting effect.
Aesthetic Score : 0.8
Mood : mysterious, adventurous, serene
Quality
Entropy : 6.70
Noise : 125
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image is slightly blurry, and the colors are a bit too saturated.
Lost in the Neon Glow: A Moment of Intense Focus
A young woman, bathed in the vibrant hues of neon light, sits transfixed before her computer screen. Headphones isolate her from the world, her gaze unwavering as she navigates a digital landscape. The dimly lit room amplifies the drama, creating a sense of futuristic intensity.
Prompt
poses profile: Focused, competitive, determined ; A gamer’s face, lit by the screen, showing intense concentration; close-up; Gaming; A dimly lit room with a gaming setup and neon lights; cinematic
Characteristic
Shot : A young woman wearing headphones is sitting at a desk in front of a computer, typing on a keyboard, with neon lights illuminating the scene. The background is blurred, suggesting a night time setting or a dimly lit room.
Aesthetic Score : 0.8
Mood : intense, futuristic, focused
Quality
Entropy : 6.58
Noise : 98
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some minor artifacts, particularly in the background, which are likely due to the use of artificial intelligence in its creation.
Sunset Romance on the Beach
A couple strolls hand-in-hand along a sandy beach as the sun dips below the horizon, casting a warm glow on the picturesque coastline. The silhouette of the couple against the vibrant sunset creates a romantic and nostalgic scene, capturing the essence of tranquility and love.
Prompt
poses profile: Romantic, peaceful, serene ; A couple holding hands, walking along a beach at sunset; medium shot; Tourism; A golden beach with turquoise waters and a vibrant sky; cinematic
Characteristic
Shot : A couple walks hand-in-hand on a beach at sunset. The man is looking down, the woman is looking ahead. There is a small town in the background.
Aesthetic Score : 0.7
Mood : romantic, dreamy, peaceful
Quality
Entropy : 6.69
Noise : 112
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some slight blurriness, especially on the faces of the couple. The sand on the beach also looks a bit too smooth.
Conclusion
The results show that the generative AI model handled the aesthetic aspect of the prompts well, but was less accurate on camera position and shot composition. The scores below appear to measure deviation from the prompt, so lower is better. Here’s a breakdown:
- Camera Position: The model scored 0.4, which is considered okay. The camera position in the generated images differed somewhat from what the prompts requested.
- Shot Analysis: The model scored 0.43, also considered okay. The shot composition in the generated images differed somewhat from what the prompts requested.
- Aesthetic Analysis: The model scored 0.01, which is considered very good. The aesthetic of the generated images was very close to what the prompts called for.
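For context alongside the three scores above, the per-image metrics reported in this post can be averaged in a few lines of Python. This is only a sketch: the values are copied from the ten image sections, the names `RESULTS` and `summarize` are illustrative, and it does not reproduce the camera, shot, or aesthetic analysis scores, since the post does not state how those were computed.

```python
from collections import Counter
from statistics import mean

# (requested shot, aesthetic score, prompt CLIP score, likelihood of AI),
# copied from the ten image sections of this post, in order of appearance.
RESULTS = [
    ("wide", 0.8, 0.27, 0.50),      # mountaintop silhouette
    ("medium", 0.7, 0.28, 0.60),    # backpacker on a cliff
    ("close-up", 0.7, 0.24, 0.90),  # gamer's hands
    ("medium", 0.6, 0.29, 0.80),    # tourist and cathedral
    ("medium", 0.7, 0.29, 0.10),    # traveler on a train
    ("wide", 0.7, 0.24, 0.30),      # friends celebrating
    ("medium", 0.7, 0.20, 0.90),    # superhero on a rooftop
    ("wide", 0.8, 0.28, 0.80),      # explorers in the jungle
    ("close-up", 0.8, 0.25, 0.90),  # gamer's face
    ("medium", 0.7, 0.30, 0.80),    # couple on the beach
]


def summarize(results):
    """Return mean metrics and a tally of the requested shot types."""
    shots, aesthetic, clip, ai = zip(*results)
    return {
        "mean_aesthetic": round(mean(aesthetic), 3),
        "mean_clip": round(mean(clip), 3),
        "mean_ai_likelihood": round(mean(ai), 3),
        "requested_shots": Counter(shots),
    }


summary = summarize(RESULTS)
print(summary)
```

Across the ten images the mean aesthetic score is 0.72, the mean prompt CLIP score is 0.264, and the mean likelihood-of-AI estimate is 0.66. Five prompts asked for a medium shot, three for a wide shot, and two for a close-up.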
Overall, the model seems to be better at understanding the aesthetic aspects of the prompt than the camera position and shot composition. It might be helpful to provide more specific instructions regarding camera angles and shot types in future prompts to improve the model’s performance in these areas.
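One way to act on that suggestion is to treat the prompt as structured data, so a camera instruction can be added without disturbing the other fields. The sketch below assumes the six semicolon-separated fields seen in the prompts above. The field names (`moods`, `subject`, `shot`, `theme`, `setting`, `style`) and the optional `camera` field are hypothetical labels, not part of any DALL-E 3 API. Whether a more explicit camera instruction actually improves the results has not been tested here.

```python
from dataclasses import dataclass, replace
from typing import Optional

PREFIX = "poses profile:"


@dataclass
class PosePrompt:
    moods: str
    subject: str
    shot: str
    theme: str
    setting: str
    style: str = "cinematic"
    camera: Optional[str] = None  # hypothetical extra instruction

    def render(self) -> str:
        """Rebuild the prompt string, appending the camera hint to the shot."""
        shot = self.shot if self.camera is None else f"{self.shot}, {self.camera}"
        return (
            f"{PREFIX} {self.moods} ; {self.subject}; {shot}; "
            f"{self.theme}; {self.setting}; {self.style}"
        )


def parse_prompt(text: str) -> PosePrompt:
    """Split a prompt in the post's six-field format into named parts."""
    fields = [part.strip() for part in text.split(";")]
    if len(fields) != 6:
        raise ValueError(f"expected 6 ';'-separated fields, got {len(fields)}")
    moods = fields[0]
    if moods.startswith(PREFIX):
        moods = moods[len(PREFIX):].strip()
    return PosePrompt(moods, *fields[1:])


original = (
    "poses profile: Epic, hopeful, determined ; A lone figure, silhouetted "
    "against a setting sun; wide shot; Heroism; A vast, mountainous "
    "landscape; cinematic"
)
prompt = parse_prompt(original)
detailed = replace(prompt, camera="eye-level camera placed far from the subject")
print(detailed.render())
```

Parsing and re-rendering the first prompt from this post returns it unchanged, so the extra `camera` text is the only difference between the original and the more specific version.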
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://openai.com/index/dall-e-3/