AI Captures the Moment: A Look at Generative AI's Pose Prowess with Titan-g1
- 9 minutes read - 1766 wordsTable of Contents
In the realm of visual storytelling, capturing the right pose can make all the difference. It’s the subtle tilt of the head, the confident stance, or the dramatic gesture that conveys emotion, action, and character. Generative AI is now stepping into this arena, learning to create poses that match the mood and setting of a scene. From epic battle scenes to serene landscapes, these models are learning to understand camera angles, shot types, and even the desired aesthetic to bring scenes to life. This blog post explores the exciting world of AI-generated poses, examining their strengths, limitations, and the potential they hold for the future of visual storytelling.
Created with: titan-g1
Warrior’s Fury: A Dramatic Chase Through Smoke and Fire
A lone warrior, possibly female, races through a desolate landscape shrouded in fog and dust. The fiery backdrop and the character’s determined stride create a sense of urgency and intensity, hinting at a dramatic struggle unfolding.
Prompt
poses action-pose: determined, heroic ; Lone warrior; wide shot; Heroism; Epic battle scene with smoke and fire; cinematic
Characteristic
Shot : A warrior in full armor is running away from a large explosion. The explosion creates a huge cloud of smoke and fire, but the warrior seems to be unharmed.
Aesthetic Score : 0.6
Mood : dramatic, intense, action
Quality
Entropy : 6.94
Noise : 102
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some artifacts in the smoke and fire, and the edges of the warrior’s armor are slightly blurry. These errors are not particularly noticeable, but they could be improved upon with post-processing.
Lost in the Clouds: A Hiker’s Moment of Serenity
A solitary figure stands on a mountain peak, gazing out at a breathtaking landscape veiled in clouds. The vastness of the scene evokes a sense of peace and adventure, capturing the essence of contemplative exploration.
Prompt
poses action-pose: adventurous, awe-inspired ; Adventurer standing on a cliff edge; medium shot; Adventure; Majestic mountain range with clouds; cinematic
Characteristic
Shot : A lone hiker stands on a mountain peak, overlooking a vast expanse of fog-covered mountains.
Aesthetic Score : 0.7
Mood : serene, adventurous, contemplative
Quality
Entropy : 6.72
Noise : 103
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly overexposed, leading to a washed-out look. Some parts of the image are slightly blurry.
Lost in the Game: A Moment of Intense Focus
A player is deeply engrossed in a video game, their hands moving with precision on the controller. The dimly lit scene, bathed in blue and purple hues, creates a sense of intensity and focus, highlighting the digital world they’ve become immersed in.
Prompt
poses action-pose: focused, intense ; Gamer holding a controller; close-up; Gaming; Neon-lit gaming room with multiple screens; cinematic
Characteristic
Shot : A person playing a video game in a dimly lit room. They are holding a controller in their hands, and the screen of their computer is showing a game in progress. There are also two gaming PCs in the background.
Aesthetic Score : 0.5
Mood : intense, focused, digital
Quality
Entropy : 6.91
Noise : 96
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.30
Image errors : There are some minor artifacts present in the image, particularly in the background. These appear as small, pixelated areas.
Capturing Joy in Front of Majesty: A Selfie at the Cathedral
A man beams with happiness as he takes a selfie in front of a grand cathedral. The majestic architecture provides a stunning backdrop, while the blurred figures in the background hint at the bustling atmosphere. This photo captures a moment of pure joy and adventure, showcasing the beauty of both the man’s smile and the cathedral’s grandeur.
Prompt
poses action-pose: happy, excited ; Tourist taking a selfie in front of a famous landmark; medium shot; Tourism; Busy city square with people and street performers; cinematic
Characteristic
Shot : A young man is taking a selfie in front of the Notre Dame Cathedral in Paris.
Aesthetic Score : 0.6
Mood : happy, cheerful, carefree
Quality
Entropy : 6.84
Noise : 97
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : None
Winding Roads and Endless Vineyards: A Romantic Motorcycle Adventure
Experience the thrill of adventure and the beauty of romance as a couple embarks on a motorcycle journey through a picturesque vineyard. With the road curving gently to the left and the sun shining bright, this classic cruiser ride is the perfect escape to freedom.
Prompt
poses action-pose: free, adventurous ; Couple riding a motorcycle on a winding road; wide shot; Travel; Scenic countryside with rolling hills and vineyards; cinematic
Characteristic
Shot : A couple riding a motorcycle on a winding road through a vineyard.
Aesthetic Score : 0.7
Mood : romantic, adventurous, serene
Quality
Entropy : 6.89
Noise : 110
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly overexposed, and there is some noise in the shadows.
Friends Toast to the City Lights at Dusk
A group of four friends raise their glasses in a celebratory toast on a rooftop, bathed in the warm glow of the setting sun. The city lights twinkle in the distance, creating a magical backdrop for their joyous moment.
Prompt
poses action-pose: joyful, celebratory ; Group of friends celebrating with drinks; medium shot; Groups; Rooftop bar with city lights in the background; cinematic
Characteristic
Shot : Four friends are toasting with champagne glasses on a rooftop with city lights in the background. The scene is lit by a warm, golden light, creating a romantic and celebratory atmosphere.
Aesthetic Score : 0.7
Mood : joyful, celebratory, romantic
Quality
Entropy : 6.91
Noise : 100
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant errors.
City Lights, Urban Dreams
A woman in a leather jacket stands confidently on a rooftop, gazing out at the twinkling cityscape. The dramatic contrast of light and shadow creates a sense of mystery and strength, capturing the essence of urban life.
Prompt
poses action-pose: confident ; landing on a rooftop; wide shot; City skyline with skyscrapers and neon lights; cinematic
Characteristic
Shot : A young woman in a black leather jacket stands on a rooftop overlooking a city at dusk. She is looking out at the city lights, which are blurred in the distance.
Aesthetic Score : 0.7
Mood : cool, urban, confident
Quality
Entropy : 6.82
Noise : 99
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly overexposed, which is causing some of the details in the city lights to be lost. There is also some noise in the image, which is most noticeable in the shadows.
Lost in the Jungle: A Hiker’s Determined Journey
A lone hiker navigates a dense jungle path, his determined gaze hinting at a hidden purpose. The lush foliage and mysterious atmosphere create a sense of adventure and intrigue.
Prompt
poses action-pose: determined, adventurous ; Explorer navigating a jungle path; medium shot; Adventure; Lush green jungle with vines and sunlight filtering through the canopy; cinematic
Characteristic
Shot : A man is hiking through a dense jungle, he is wearing a hat and a backpack. The light is dim and the foliage is very thick.
Aesthetic Score : 0.6
Mood : adventurous, mysterious, green
Quality
Entropy : 6.73
Noise : 119
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry and the colors are a little washed out.
Immersed in the Game: A Gamer’s Intense Focus
A gamer, headphones on, eyes glued to the screen, watches a competitive match unfold. The shallow depth of field draws you into the action, highlighting the intensity and focus of the moment. The blurry stadium backdrop adds a sense of scale and excitement to the scene.
Prompt
poses action-pose: intense, focused ; Gamer competing in an esports tournament; close-up; Gaming; Stadium filled with cheering fans and bright lights; cinematic
Characteristic
Shot : A person wearing headphones sits in front of a computer screen, watching a video of a man playing a game.
Aesthetic Score : 0.6
Mood : focused, intense, competitive
Quality
Entropy : 6.83
Noise : 98
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some slight graininess, particularly in the background. The subject’s face is blurred.
Golden Hour Family Portrait: Capturing Love and Joy on the Beach
A heartwarming family portrait bathed in the warm glow of a sunset. The soft lighting creates an intimate atmosphere, highlighting the love and happiness shared by this family on the beach.
Prompt
poses action-pose: happy, relaxed ; Family posing for a photo in front of a sunset; medium shot; Travel; Beach with golden sand and turquoise water; cinematic
Characteristic
Shot : A family portrait on a beach at sunset. The family consists of a couple, a young boy and a young woman. The couple is looking at the camera and the boy is smiling. The background is a blurry beach scene with a sunset.
Aesthetic Score : 0.6
Mood : happy, joyful, relaxed
Quality
Entropy : 6.76
Noise : 101
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible artifacts or errors in the image.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.33, indicating a moderate ability to react to camera positions in the prompt. This is considered decent, but not excellent.
- Shot Analysis: The model scored 0.43, also indicating a moderate ability to understand the scene described in the prompt. This is considered decent, but not excellent.
- Aesthetic Analysis: The model scored 0.05, which is considered very good. This means the generated image’s aesthetic closely matched the expected aesthetic.
Overall, the model demonstrates a good understanding of the scene and camera position, but could benefit from improvements in its ability to accurately capture the desired aesthetic.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://docs.aws.amazon.com/bedrock/latest/userguide/titan-image-models.html