AI's Artistic Struggle: Capturing the Essence of Poses with Dall-e-3
- 10 minutes read - 1934 wordsTable of Contents
In the realm of artificial intelligence, the ability to generate images based on textual descriptions is a rapidly evolving field. This blog post delves into the results of an experiment where an AI model was tasked with creating images based on specific poses and scene descriptions. While the model demonstrated a good understanding of camera positions and scene composition, it struggled to capture the desired aesthetic, highlighting the ongoing challenges in AI’s artistic capabilities. This exploration will delve into the model’s strengths and weaknesses, providing insights into the complexities of AI-generated art and the potential for future advancements.
Created with: dall-e-3
Conquering the Peaks: A Moment of Solitude and Strength
A lone figure stands defiant against the elements, silhouetted against a breathtaking panorama of mountains and sky. The dramatic lighting and the man’s pose evoke a sense of power and adventure, while the vastness of the landscape invites contemplation and a sense of awe.
Prompt
poses ankle-cross: Determined, confident, facing the unknown ; A lone adventurer, standing atop a windswept mountain peak; wide shot; Adventure; Dramatic sky with swirling clouds; cinematic
Characteristic
Shot : A man is standing on a mountain peak with a view of a misty valley in the background. The sky is dramatic with dark clouds and a hint of sunlight.
Aesthetic Score : 0.7
Mood : dramatic, adventurous, contemplative
Quality
Entropy : 6.61
Noise : 103
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image appears to be slightly over-processed, and the details on the mountain ranges are blurred. The lighting on the subject is not evenly spread.
Silhouette of Hope: Superhero Stands Tall Against the Sunset
A dramatic silhouette of a superhero, arms raised in victory, stands against a vibrant city skyline at sunset. The image evokes a sense of heroism, hope, and the promise of a brighter future.
Prompt
poses ankle-cross: Powerful, heroic, standing tall ; A superhero, silhouetted against a blazing sunset; medium shot; Heroism; City skyline with towering buildings; cinematic
Characteristic
Shot : A superhero stands silhouetted against a fiery sunset over a city skyline, their cape flowing dramatically.
Aesthetic Score : 0.7
Mood : epic, powerful, hopeful
Quality
Entropy : 6.68
Noise : 80
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some artifacts and errors, particularly in the sky and the city skyline. The superhero’s figure is also somewhat unrealistic and lacks detail.
Lost in the Game: A Gamer’s Intense Focus Under Neon Lights
A young man is completely absorbed in his video game, his eyes locked on the screen as he navigates the virtual world. The dark room is illuminated by vibrant blue and red lights, creating a dramatic atmosphere that emphasizes the intensity of his focus and the immersive nature of the game.
Prompt
poses ankle-cross: Immersed, concentrated, in the zone ; A gamer, intensely focused on a virtual reality headset; close-up; Gaming; Futuristic, neon-lit gaming room; cinematic
Characteristic
Shot : A young man wearing headphones is playing a video game. He is holding a controller in his hands and his eyes are focused on the screen. He is sitting in a chair with his legs crossed. The scene is lit by colorful neon lights. The monitor screen is blurred and it is not clear what game he is playing.
Aesthetic Score : 0.6
Mood : intense, focused, determined
Quality
Entropy : 6.48
Noise : 95
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some minor artifacts in the background, particularly around the lights. The sharpness of the image is also somewhat lacking.
Silhouetted Against Time: A Woman Contemplates Ancient Wonders
A lone figure, silhouetted against the setting sun, stands on a stone platform overlooking a sprawling complex of ancient temples. The misty air and warm glow create a tranquil and contemplative atmosphere, inviting you to imagine the stories held within these weathered stones.
Prompt
poses ankle-cross: Awe-struck, contemplative, taking in the beauty ; A tourist, gazing out at a breathtaking vista; medium shot; Tourism; Ancient ruins with a panoramic view; cinematic
Characteristic
Shot : A woman in a hat stands on a stone ledge overlooking an ancient temple complex, bathed in the soft light of sunrise. The background features a hazy mountain range.
Aesthetic Score : 0.8
Mood : serene, contemplative, adventurous
Quality
Entropy : 6.61
Noise : 96
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Sunset Adventure in the Desert
A lone hiker traverses the vast desert landscape as the sun sets, casting long shadows across the sand dunes. The vibrant orange and yellow sky creates a breathtaking backdrop, evoking a sense of adventure, peace, and inspiration.
Prompt
poses ankle-cross: Free-spirited, adventurous, embracing the unknown ; A backpacker, standing at the edge of a vast desert; wide shot; Travel; Endless sand dunes stretching into the horizon; cinematic
Characteristic
Shot : A lone hiker is walking on a sand dune, looking towards a distant sun setting over a vast desert, the sun is shining brightly.
Aesthetic Score : 0.7
Mood : adventure, hopeful, vast
Quality
Entropy : 6.75
Noise : 108
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image appears to have some slight distortion near the horizon, likely caused by the wide-angle lens used.
Urban Night Laughter: Friends Embrace the Joy
Four young adults radiate pure happiness as they share laughter on a vibrant city street. The colorful lights illuminate their carefree spirits, capturing a moment of pure joy and connection.
Prompt
poses ankle-cross: Joyful, carefree, enjoying each other’s company ; A group of friends, laughing and celebrating; medium shot; Groups; Vibrant, bustling street scene with colorful lights; cinematic
Characteristic
Shot : Group of young adults laughing and hanging out at night in an urban setting. The lighting is warm and inviting, and the atmosphere is fun and friendly.
Aesthetic Score : 0.7
Mood : joyful, vibrant, carefree
Quality
Entropy : 6.53
Noise : 89
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : The lighting is a little uneven, some areas are overexposed and others are underexposed. There’s some noise in the image, particularly in the shadows.
A Knight’s Shadow at Dusk
A lone knight stands silhouetted against the imposing gate of a grand castle at dusk. The play of light and shadow creates a sense of mystery and intrigue, hinting at the adventure that awaits within.
Prompt
poses ankle-cross: Stoic, vigilant, protecting the realm ; A lone warrior, standing guard at a castle gate; medium shot; Heroism; Majestic castle with a moat and drawbridge; cinematic
Characteristic
Shot : A lone knight stands in front of a grand, imposing castle with a drawbridge leading to the entrance. The scene is set at dusk, with a soft blue light illuminating the sky and casting long shadows. The knight seems to be contemplating the castle ahead.
Aesthetic Score : 0.75
Mood : mystical, dramatic, melancholic
Quality
Entropy : 6.75
Noise : 105
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : Slight blurring in some areas, particularly around the edges of the castle, likely due to post-processing. The knight’s hand looks a bit unnatural.
Whispers in the Woods: A Campfire Mystery
A group of friends huddle around a crackling campfire, their faces illuminated by the dancing flames. A sense of wonder and anticipation fills the air as they gaze into the shadowy depths of the forest. What secrets lie hidden in the darkness? This captivating scene evokes a mood of mystery, adventure, and suspense, leaving you eager to uncover the truth.
Prompt
poses ankle-cross: Intrigued, curious, sharing stories ; A group of explorers, huddled around a campfire; close-up; Adventure; Dense forest with flickering flames; cinematic
Characteristic
Shot : A group of friends gathered around a campfire in a forest setting, with a dramatic lighting that creates an eerie and suspenseful atmosphere.
Aesthetic Score : 0.7
Mood : suspenseful, eerie, adventurous
Quality
Entropy : 6.82
Noise : 93
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some minor noise and slight blurriness in the background.
Victory Dance on a Controller: Gamer’s Triumphant Moment Captured
A young man, bathed in a spotlight, celebrates his victory with a playful and energetic pose, standing on a gaming controller. The dimly lit room adds to the dramatic effect, highlighting his triumphant moment.
Prompt
poses ankle-cross: Excited, victorious, celebrating success ; A gamer, triumphantly raising their hands after winning a game; close-up; Gaming; Brightly lit gaming console with flashing lights; cinematic
Characteristic
Shot : A person in a blue hoodie is standing on a game controller, with their foot on the control pad and their arms raised in the air, with a bright light behind them.
Aesthetic Score : 0.6
Mood : energetic, triumphant, playful
Quality
Entropy : 6.74
Noise : 81
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image is slightly blurry and there are some artifacts around the edges of the subject. The light is overexposed in the background, which makes it look artificial.
City Lights, Silhouetted Love: A Rooftop Romance at Dusk
A couple stands on a rooftop, their figures silhouetted against the twinkling city lights. The scene evokes a sense of romance, peace, and contemplation, capturing the magic of a shared moment under the stars.
Prompt
poses ankle-cross: Intimate, romantic, enjoying the view together ; A couple, standing on a balcony overlooking a bustling city; medium shot; Travel; Romantic cityscape with twinkling lights; cinematic
Characteristic
Shot : A couple stands on a rooftop overlooking a city at dusk. They are holding hands and looking out at the cityscape. The city lights are twinkling in the distance.
Aesthetic Score : 0.7
Mood : romantic, dreamy, hopeful
Quality
Entropy : 6.72
Noise : 100
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image has some artifacts and errors, particularly in the cityscape. The buildings are somewhat blurry and the streetlights are not very realistic.
Conclusion
The results indicate that the generative AI model performed well in terms of understanding camera positions and scene composition, but struggled with achieving the desired aesthetic. Here’s a breakdown:
- Camera Position: The model scored a 0.46 on camera position analysis, which falls slightly below the “good” range of 0.5 to 0.75. This suggests that while the model generally understood the camera position described in the prompt, there might be some discrepancies between the intended and generated camera angles.
- Shot Analysis: The model scored a 0.505, placing it within the “good” range for shot analysis. This indicates that the model successfully captured the overall scene composition and shot type described in the prompt.
- Aesthetic Analysis: The model scored a 0.13, which is significantly higher than the “very good” range of -0.2 to 0.1. This suggests that the generated image’s aesthetic deviated considerably from the expected aesthetic described in the prompt.
Overall, the model demonstrates a good understanding of camera positions and scene composition, but needs improvement in generating images that match the desired aesthetic.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://openai.com/index/dall-e-3/