AI's Artistic Struggle: Capturing the Essence of Poses with Stable Diffusion
Generating images from textual descriptions is a rapidly evolving field of artificial intelligence. This experiment examines how well an AI model captures the essence of poses, looking at how it handles camera angles, shot types, and aesthetic interpretation. The results reveal both strengths and weaknesses and highlight how far there is still to go toward truly artistic AI.
Created with: stability-ai-core
Silhouettes of Sorrow: A Sunset Walk Through Ruins
Two cloaked figures walk through a ruined city as the sun sets, casting a golden glow that illuminates their path and the crumbling structures around them. The scene evokes a sense of melancholy and mystery, with the dramatic lighting adding to the somber mood.
Prompt
poses walking-away: Melancholy, yet hopeful ; Lone figure in a tattered cloak; wide shot; Heroism; Ruins of a fallen city bathed in the golden light of a setting sun; cinematic
Characteristic
Shot : Two figures, likely men, walk through a destroyed city street. There are ruins of buildings on either side of the street. The sun is setting in the background and creates a lens flare in the image. The figures are wearing long cloaks. There is a hazy look in the air, perhaps due to dust or smoke.
Aesthetic Score : 0.7
Mood : dark, bleak, post-apocalyptic
Quality
Entropy : 6.62
Noise : 84
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are some minor artifacts in the image, particularly around the edges of the figures. The sky also appears to be slightly blurry, which could be an artifact or a result of post-processing.
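Every prompt in this experiment follows the same semicolon-separated layout. A minimal Python sketch of splitting one into its parts might look like the following; the field names (`pose`, `mood`, `subject`, `shot`, `theme`, `setting`, `style`) are labels assumed here for illustration and do not come from the original tooling.

```python
from dataclasses import dataclass


# Field names are hypothetical labels chosen for this sketch.
@dataclass
class PosePrompt:
    pose: str
    mood: str
    subject: str
    shot: str
    theme: str
    setting: str
    style: str


def parse_prompt(prompt: str) -> PosePrompt:
    """Split 'poses <pose>: <mood> ; <subject>; <shot>; <theme>; <setting>; <style>'."""
    head, _, rest = prompt.partition(":")
    pose = head.removeprefix("poses").strip()
    parts = [p.strip() for p in rest.split(";")]
    if len(parts) != 6:
        raise ValueError(f"expected 6 ';'-separated fields, got {len(parts)}")
    return PosePrompt(pose, *parts)


example = (
    "poses walking-away: Melancholy, yet hopeful ; Lone figure in a tattered cloak; "
    "wide shot; Heroism; Ruins of a fallen city bathed in the golden light of a "
    "setting sun; cinematic"
)
parsed = parse_prompt(example)
print(parsed.subject, "|", parsed.shot)
```

Having the requested subject and shot type as separate fields makes it easier to compare them with the Shot description of each result, for example a requested lone figure against the two figures described above.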
Lost in the Emerald Labyrinth: A Journey Through Mystery and Adventure
A solitary figure ventures through a dense jungle, the lush greenery shrouding the path in an ethereal mist. The scene evokes a sense of serene mystery and adventurous anticipation, inviting you to explore the unknown.
Prompt
poses walking-away: Excited, adventurous ; A young adventurer with a backpack; medium shot; Adventure; Lush jungle with a hidden path leading into the unknown; cinematic
Characteristic
Shot : A lone hiker walks down a path in a lush, tropical forest. The air is thick with moisture, and sunlight filters through the dense canopy.
Aesthetic Score : 0.7
Mood : mysterious, tranquil, adventurous
Quality
Entropy : 6.80
Noise : 97
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image exhibits slight noise and compression artifacts, particularly noticeable in the background and the foliage. The overall sharpness could also be improved.
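The post does not say how the Entropy figure is computed. Values between 6 and 7 are consistent with the Shannon entropy, in bits, of an 8-bit grayscale histogram (which tops out at 8), so the sketch below assumes that definition. The function name and the toy pixel lists are illustrative only.

```python
import math
from collections import Counter
from typing import Iterable


def shannon_entropy(pixels: Iterable[int]) -> float:
    """Shannon entropy, in bits, of a sequence of intensity values."""
    counts = Counter(pixels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no pixels given")
    # H = sum over levels of p * log2(1 / p)
    return sum((n / total) * math.log2(total / n) for n in counts.values())


flat = [128] * 256          # a single gray level: no variation at all
uniform = list(range(256))  # every 8-bit level used equally often

print(shannon_entropy(flat), shannon_entropy(uniform))
```

Under this assumption, a higher value means the intensities are spread over more gray levels, which typically goes with more texture and detail.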
Lost in the Code: A Hacker’s Focus Under Neon Lights
A young man, immersed in his work, sits before a computer bathed in blue and red neon light. The blurry background and dramatic lighting emphasize his intense concentration as he navigates the digital world.
Prompt
poses walking-away: Focused, determined ; A gamer with a headset; close-up; Gaming; Neon-lit cityscape reflected in a computer screen; cinematic
Characteristic
Shot : A young man is sitting at a computer in a dimly lit room, wearing headphones and typing on a keyboard. The room is decorated with neon lights, giving it a futuristic feel.
Aesthetic Score : 0.7
Mood : focused, intense, futuristic
Quality
Entropy : 6.24
Noise : 65
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors or artifacts; the image appears to be of high quality.
Lost in Love on a Cobblestone Street
A romantic stroll through a charming European town, captured in a moment of intimacy and wanderlust. The narrow perspective of the street creates a sense of mystery, inviting you to imagine the stories unfolding within the colorful buildings.
Prompt
poses walking-away: Romantic, carefree ; A couple holding hands; medium shot; Tourism; Picturesque European street with cobblestone paths and colorful buildings; cinematic
Characteristic
Shot : A couple walks hand-in-hand down a cobblestone street lined with colorful buildings.
Aesthetic Score : 0.7
Mood : romantic, nostalgic, cozy
Quality
Entropy : 6.79
Noise : 84
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : No obvious artifacts or errors in the image.
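Prompt Clip Score is not defined in the post. One common way to produce such a number is the cosine similarity between a CLIP text embedding of the prompt and a CLIP image embedding of the result. The sketch below shows only that similarity step, under that assumption, with short made-up vectors standing in for real embeddings; it does not load or call any CLIP model.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Placeholder vectors, NOT real CLIP embeddings.
text_embedding = [0.2, 0.1, 0.7]
image_embedding = [0.6, 0.3, 0.1]

print(round(cosine_similarity(text_embedding, image_embedding), 2))
```

If the score is computed this way, it measures how closely the image matches the prompt text as a whole, not whether any single element such as the shot type was followed.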
Love Takes Flight: A Couple’s Journey Begins
A romantic couple walks hand-in-hand down an airport runway, their backs turned towards the viewer, creating a sense of mystery and anticipation. A plane takes off in the background, symbolizing the start of their adventurous journey. This heartwarming scene evokes feelings of hope and excitement for the future.
Prompt
poses walking-away: Nostalgic, bittersweet ; A lone traveler with a suitcase; long shot; Travel; Airport runway with a departing airplane in the distance; cinematic
Characteristic
Shot : A couple is walking away from the camera on a runway while an airplane flies overhead.
Aesthetic Score : 0.7
Mood : adventure, wanderlust, travel
Quality
Entropy : 6.58
Noise : 57
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no artifacts or visible errors in the image.
Golden Hour Friendship
A group of friends stroll along a sandy beach, their silhouettes painted against a breathtaking sunset. The scene evokes a sense of happiness, carefree joy, and nostalgia, capturing the beauty of shared moments and the magic of nature’s spectacle.
Prompt
poses walking-away: Joyful, carefree ; A group of friends laughing; wide shot; Groups; Beach at sunset with the ocean waves crashing in the background; cinematic
Characteristic
Shot : A group of friends are walking barefoot on a sandy beach at sunset. The sun is setting in the background, creating a warm glow.
Aesthetic Score : 0.7
Mood : tranquil, happy, carefree
Quality
Entropy : 6.75
Noise : 68
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant artifacts or errors.
A Knight’s Journey into the Mist
A lone knight, shrouded in mystery, ventures through a misty forest, his sword held tight. The atmosphere is thick with intrigue, promising an adventure filled with danger and discovery.
Prompt
poses walking-away: Determined, resolute ; A lone warrior with a sword; medium shot; Heroism; Dark forest with a path leading into the shadows; cinematic
Characteristic
Shot : A lone figure, possibly a knight or warrior, walks through a misty forest, his back to the viewer, sword at his side. The forest is thick with trees, and the air is filled with a sense of mystery.
Aesthetic Score : 0.7
Mood : mysterious, dramatic, solitary
Quality
Entropy : 6.65
Noise : 77
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is somewhat blurry, particularly in the foreground, and lacks sharpness. The lighting seems inconsistent, with some areas being too bright and others too dark.
Unveiling the Secrets: Adventurers Explore Ancient Ruins
A group of intrepid explorers venture through the remnants of a forgotten temple, their backs to the camera as they delve deeper into the mysteries within. The sunlight filtering through the ruins creates an atmosphere of wonder and anticipation, beckoning viewers to join their quest.
Prompt
poses walking-away: Curious, excited ; A group of explorers with maps; wide shot; Adventure; Ancient ruins with a mysterious entrance; cinematic
Characteristic
Shot : Five figures in explorer-style attire walk through the ruins of an ancient stone temple. The scene is set in a jungle, with lush greenery surrounding the temple.
Aesthetic Score : 0.7
Mood : mysterious, adventurous, hopeful
Quality
Entropy : 6.76
Noise : 83
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.90
Image errors : There are some minor artifacts and errors in the image, such as the slight blurring of the background. These are not significant enough to detract from the overall quality of the image.
Lost in the Neon Glow: A Man Embraces the Future of VR
A bustling city street at night becomes a canvas for virtual reality, as a man, fully immersed in his headset, experiences the wonder and excitement of a futuristic world. The neon lights and bustling crowds create a vibrant backdrop for this immersive journey.
Prompt
poses walking-away: Immersed, excited ; A gamer with a controller; close-up; Gaming; Virtual reality headset with a fantastical world displayed; cinematic
Characteristic
Shot : A man wearing a VR headset and holding controllers walks down a busy street at night, with colorful advertising signs in the background.
Aesthetic Score : 0.7
Mood : futuristic, urban, immersive
Quality
Entropy : 6.71
Noise : 65
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.30
Image errors : No visible artifacts or errors.
A Family’s Departure: Hope and Melancholy on the Platform
A poignant scene unfolds as a family of three, silhouetted against the departing train, walks away from the camera. Their winter attire and luggage suggest a journey, while the mood evokes a mix of melancholy and hope, leaving the viewer to ponder their destination and the emotions behind their departure.
Prompt
poses walking-away: Emotional, bittersweet ; A family with luggage; long shot; Travel; Train station platform with a departing train in the distance; cinematic
Characteristic
Shot : A family of three, a man, a woman and a girl, walk away from the camera on a train platform. A train is in the background. They are pulling luggage.
Aesthetic Score : 0.5
Mood : melancholy, travel, departure
Quality
Entropy : 6.48
Noise : 66
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant errors. Some slight noise, but not noticeable.
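The per-image numbers above are easier to compare once aggregated. The snippet below transcribes the ten Aesthetic Score, Prompt Clip Score, and Likelihood of AI values from this post, in the order the images appear, and averages them. The variable names are chosen here for illustration.

```python
from statistics import mean

# Values copied from the ten image records above, in order of appearance.
aesthetic = [0.7, 0.7, 0.7, 0.7, 0.7, 0.7, 0.7, 0.7, 0.7, 0.5]
clip_score = [0.28, 0.26, 0.28, 0.30, 0.23, 0.25, 0.30, 0.28, 0.26, 0.26]
ai_likelihood = [0.80, 0.10, 0.20, 0.20, 0.20, 0.20, 0.10, 0.90, 0.30, 0.10]

metrics = {
    "aesthetic": aesthetic,
    "clip_score": clip_score,
    "ai_likelihood": ai_likelihood,
}

# Mean, minimum, and maximum per metric, rounded for display.
summary = {
    name: (round(mean(values), 2), min(values), max(values))
    for name, values in metrics.items()
}

for name, (avg, low, high) in summary.items():
    print(f"{name}: mean={avg} min={low} max={high}")
```

With these values the means come out to 0.68, 0.27, and 0.31. Nine of the ten images share the same aesthetic score, so that metric does little to separate them, whereas Likelihood of AI ranges from 0.10 to 0.90.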
Conclusion
The results are mixed: the scores for camera position and shot analysis both fall just below the “good” range, while the aesthetic score lands inside the “very good” range. Here’s a breakdown:
Camera Position:
- Score: 0.41
- Interpretation: This score falls below the “good” range of 0.5 to 0.75. It suggests that the model did not fully capture the camera positions described in the prompts.
Shot Analysis:
- Score: 0.48
- Interpretation: This score also falls just below the “good” range. It indicates that the model had some difficulty translating the scene descriptions from the prompts into the generated images.
Aesthetic Analysis:
- Score: 0.06
- Interpretation: This score lies inside the “very good” range of -0.2 to 0.1. It suggests only a small difference between the expected aesthetic and the actual aesthetic of the generated images, even though individual images drifted from the requested mood.
Overall:
The model came close on camera positions and shot descriptions without reaching the “good” range, and its aesthetics were broadly in line with expectations. The clearest room for improvement is in following the requested subject and framing.
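The range comparisons in the breakdown can be checked mechanically. This sketch uses the three scores and the two ranges quoted above. Since the post names only one “good” range, applying 0.5 to 0.75 to shot analysis as well is an assumption, as is the helper name `position_in_range`.

```python
def position_in_range(score: float, low: float, high: float) -> str:
    """Report whether score is below, within, or above the closed range [low, high]."""
    if low > high:
        raise ValueError("low must not exceed high")
    if score < low:
        return "below"
    if score > high:
        return "above"
    return "within"


# (score, range low, range high) as quoted in the breakdown.
checks = {
    "camera position": (0.41, 0.5, 0.75),
    "shot analysis": (0.48, 0.5, 0.75),      # range assumed to match camera position
    "aesthetic analysis": (0.06, -0.2, 0.1),
}

results = {
    name: position_in_range(score, low, high)
    for name, (score, low, high) in checks.items()
}

for name, verdict in results.items():
    score, low, high = checks[name]
    print(f"{name}: {score} is {verdict} the range [{low}, {high}]")
```

Writing the comparison as a function keeps the verdict tied to the numbers, which helps when the ranges have different meanings, as with the aesthetic range that spans negative values.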