AI's Artistic Journey: Capturing Poses, But Missing the Vibe with Imagen-v3
- 9 minutes read - 1711 wordsTable of Contents
In the realm of artificial intelligence, the ability to generate realistic and visually appealing images is a rapidly evolving field. One area of focus is the generation of images based on specific poses and scenes. This involves understanding not only the physical positioning of subjects but also the intended aesthetic and mood of the image. In this blog post, we examine the results of an AI model tasked with generating images based on various poses and scenes, exploring its strengths and weaknesses in capturing the desired aesthetic.
Created with: imagen-v3
Conquering the Storm: A Moment of Triumph on the Mountain Peak
A lone hiker stands defiant against a dramatic stormy sky, capturing the essence of adventure and contemplation. The scene evokes a sense of anticipation and the thrill of pushing boundaries.
Prompt
poses classic-headshot: determined, confident ; A lone adventurer, standing on a mountain peak; close-up; heroism; dramatic sky with clouds; cinematic
Characteristic
Shot : A man in hiking gear stands on a mountain peak, looking towards the camera, with a stormy sky behind him.
Aesthetic Score : 0.7
Mood : dramatic, adventurous, contemplative
Quality
Entropy : 6.79
Noise : 81
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors.
A Pirate’s Tale Begins in the Stormy Sea
A weathered pirate captain, his long black braid whipping in the wind, stands resolute before a tempestuous sea. His ship sails bravely in the distance, while he holds a compass, hinting at the mysteries and adventures that lie ahead. The image evokes a sense of darkness, adventure, and intrigue, promising a story of daring and danger.
Prompt
poses classic-headshot: bold, adventurous ; A pirate captain, holding a compass; medium shot; adventure; stormy sea with a ship in the background; cinematic
Characteristic
Shot : A pirate captain with a long black braid, holding a compass, stands before a stormy sea with his ship sailing in the background.
Aesthetic Score : 0.7
Mood : dark, adventurous, mysterious
Quality
Entropy : 6.70
Noise : 104
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.90
Image errors : Some minor blurriness in the background, potentially caused by motion blur.
Lost in the Neon Glow: A Gamer’s Focus
A young man, headphones on, is completely immersed in his video game. The dimly lit room, bathed in red and blue neon, creates a dramatic and intense atmosphere, highlighting the player’s focused expression and skillful movements.
Prompt
poses classic-headshot: focused, intense ; A gamer, holding a controller; close-up; gaming; neon lights and a gaming setup in the background; cinematic
Characteristic
Shot : A young man wearing headphones is playing a video game in a dimly lit room with red and blue neon lights.
Aesthetic Score : 0.6
Mood : focused, intense, cool
Quality
Entropy : 6.25
Noise : 79
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image has some slight noise and compression artifacts, particularly in the shadows. The lighting is a bit uneven, with highlights and shadows that look artificial.
A Moment of Joy in the City
A young man, radiating happiness, stands before a grand archway in a bustling city. The blurred background and focused archway create a sense of depth and perspective, highlighting the man’s adventurous spirit.
Prompt
poses classic-headshot: happy, excited ; A tourist, smiling in front of a famous landmark; medium shot; tourism; bustling city street; cinematic
Characteristic
Shot : A young man is standing in front of an archway in a city. There are other people in the background. The man is smiling. The archway is in focus, and the man is blurred in the background.
Aesthetic Score : 0.6
Mood : happy, joyful, adventurous
Quality
Entropy : 6.78
Noise : 86
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is a little bit of noise in the image, particularly in the background.
Lost in Thought: A Woman’s Contemplative Journey
A woman sits by a train window, her gaze lost in the passing scenery. Her reflection in the glass reveals a sense of introspection and melancholy. The muted lighting and blurry background create a sense of mystery and intrigue, leaving the viewer to ponder her thoughts and emotions.
Prompt
poses classic-headshot: reflective, contemplative ; A traveler, looking out of a train window; close-up; travel; scenic landscape passing by; cinematic
Characteristic
Shot : A woman sits by a train window, looking out at the passing scenery. Her reflection in the glass is visible, creating a sense of introspection and contemplation.
Aesthetic Score : 0.7
Mood : melancholy, contemplative, introspective
Quality
Entropy : 6.03
Noise : 74
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry and the lighting is uneven, creating some shadows and highlights.
Friends Laughing It Up in the City
A group of five young adults share a moment of pure joy and laughter against a vibrant cityscape backdrop. Their casual attire and carefree smiles capture the essence of friendship and happiness.
Prompt
poses classic-headshot: joyful, carefree ; A group of friends, laughing together; medium shot; groups; vibrant outdoor setting; cinematic
Characteristic
Shot : A group of five young adults are laughing together outdoors. They appear to be friends, and are dressed casually. There is a cityscape in the background.
Aesthetic Score : 0.8
Mood : joyful, carefree, happy
Quality
Entropy : 6.70
Noise : 88
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is well-composed and has no visible artifacts or errors.
Hero Stands Tall Amidst Blazing Cityscape
A powerful superhero, possibly Superman, faces a burning city skyline with unwavering determination. The dramatic scene evokes a sense of urgency and heroism, as the hero prepares to confront the danger.
Prompt
poses classic-headshot: brave, heroic ; A superhero, standing in front of a burning building; close-up; heroism; city skyline with smoke and flames; cinematic
Characteristic
Shot : A superhero, possibly Superman, stands in front of a city skyline with buildings on fire in the background.
Aesthetic Score : 0.7
Mood : dramatic, heroic, intense
Quality
Entropy : 6.83
Noise : 107
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are no visible artifacts or errors in the image.
Lost in the Jungle: A Young Man’s Determined Quest
A young adventurer, clad in khaki and armed with a map, stands resolute in a dense, verdant jungle. The moody lighting casts long shadows, adding an air of mystery and suspense to his determined expression. Will he find his way out, or will the jungle claim him?
Prompt
poses classic-headshot: curious, adventurous ; An explorer, holding a map; medium shot; adventure; dense jungle with ancient ruins in the background; cinematic
Characteristic
Shot : A young man wearing a khaki shirt and a backpack is standing in a lush green jungle. He is holding a map and looks determined.
Aesthetic Score : 0.7
Mood : adventurous, mysterious, suspenseful
Quality
Entropy : 6.51
Noise : 83
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image shows some slight artifacts, particularly around the edges of the subject and the map.
Lost in the Game: The Thrill of Virtual Reality
A young man, eyes lit up with excitement, is fully immersed in a virtual reality game. His clenched fists and beaming smile reveal the intensity of his experience, showcasing the power of VR to transport us to new worlds.
Prompt
poses classic-headshot: immersed, excited ; A gamer, wearing VR headset; close-up; gaming; futuristic virtual reality environment; cinematic
Characteristic
Shot : A young man wearing a VR headset is playing a game. He is smiling and has his fists clenched.
Aesthetic Score : 0.7
Mood : excited, focused, futuristic
Quality
Entropy : 6.36
Noise : 74
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable artifacts or errors.
Silhouette of Serenity: A Man Bathed in Golden Sunset
A peaceful scene unfolds as a man stands on a beach, bathed in the warm glow of the setting sun. His silhouette is dramatically highlighted against the vibrant sky, capturing a moment of pure tranquility and happiness.
Prompt
poses classic-headshot: happy, relaxed ; standing in front of a sunset; medium shot; tourism; beach with golden sand and waves; cinematic
Characteristic
Shot : A man is standing on a beach, looking at the camera, with the sun setting behind him
Aesthetic Score : 0.7
Mood : happy, peaceful, relaxed
Quality
Entropy : 6.27
Noise : 79
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some minor imperfections in the background, particularly in the sky. Some artifacts and noise can be seen.
Conclusion
The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis. Here’s a breakdown:
- Camera Position Analysis: The score of 0.4 indicates that the model’s ability to understand and implement camera positions in the generated image is fairly good. A score between 0.5 and 0.75 would be considered good, and above 0.75 would be very good.
- Shot Analysis: The score of 0.48 suggests that the model is fairly good at understanding the scene described in the prompt and translating it into a visual shot. Again, a score between 0.5 and 0.75 would be considered good, and above 0.75 would be very good.
- Aesthetic Analysis: The score of 2.2204460492503132e-17 is essentially zero. This indicates that the model did not accurately capture the intended aesthetic of the image. A score between -0.2 and 0.1 would be considered very good, meaning the generated image closely matched the expected aesthetic.
Overall: The model demonstrates a decent understanding of camera positions and shot composition, but needs improvement in capturing the desired aesthetic.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://deepmind.google/technologies/imagen-3/