AI's Over-the-Shoulder Struggle: Capturing the Right Feel with Imagen-v3
The over-the-shoulder shot, a staple of filmmaking and photography, is often used to create a sense of immersion and immediacy. It places the viewer directly behind a character, so the scene is seen from that character’s vantage point. The technique is particularly effective at conveying the character’s emotions, actions, and perspective. Replicating it with AI is challenging, however, because the model must handle not only camera positioning but also the nuances of visual storytelling and aesthetic appeal.
Created with: imagen-v3
Amidst the Ruins, a Soldier’s Grim Resolve
A lone soldier, shrouded in camouflage, stands defiant against a backdrop of a war-torn cityscape consumed by smoke and flames. The image, with its stark contrasts and dramatic lighting, evokes a sense of urgency and danger, capturing the intensity of the moment.
Prompt
camera-positions Over the shoulder: intense, determined ; A lone soldier; over-the-shoulder; heroism; smoke and explosions in the background; cinematic
Characteristic
Shot : A soldier in camouflage stands against a backdrop of a war-torn cityscape, engulfed in smoke and flames.
Aesthetic Score : 0.6
Mood : grim, intense, dramatic
Quality
Entropy : 6.42
Noise : 97
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has a few minor artifacts, such as a bit of blurriness around the edges.
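The post does not say how the Entropy figure in each Quality block is computed. One common definition is the Shannon entropy of the image's grayscale histogram, which runs from 0 to 8 bits for 8-bit images; the values reported here (5.18 to 6.74) would be consistent with that scale, but this is an assumption. A minimal sketch of that definition, using a hypothetical `shannon_entropy` helper on a flat list of pixel values:

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a sequence of 8-bit grayscale values (0-255).

    Assumption: this is one common way to define image entropy; it is not
    confirmed to be the metric used for the figures in this post.
    """
    values = list(pixels)
    if not values:
        raise ValueError("need at least one pixel value")
    total = len(values)
    counts = Counter(values)
    # Sum p * log2(1/p) over the intensity levels that actually occur.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


flat = [128] * 64          # a single gray level: no uncertainty
spread = list(range(256))  # every level exactly once: maximum uncertainty
print(shannon_entropy(flat), shannon_entropy(spread))
```

Under this definition, a flat single-tone image scores 0 and an image that uses all 256 levels equally scores 8, so a value around 6.4 would indicate a fairly wide spread of tones.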
Into the Unknown: A Lone Explorer Ventures Deeper
A solitary figure, clad in explorer’s garb, navigates a dense jungle, his back to the viewer. Sunlight filters through the canopy, casting a mysterious glow on the path ahead. This evocative scene whispers of adventure, hope, and the allure of the unknown.
Prompt
camera-positions Over the shoulder: curious, adventurous ; An explorer; over-the-shoulder; adventure; a dense jungle with sunlight filtering through the canopy; cinematic
Characteristic
Shot : A lone man, dressed in an explorer’s garb, walks through a dense jungle, looking at the path ahead. The sunlight filters through the canopy, creating a dappled effect. The man’s back is to the viewer.
Aesthetic Score : 0.7
Mood : mysterious, adventurous, hopeful
Quality
Entropy : 6.53
Noise : 81
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has no noticeable artifacts or errors.
Lost in the Game: A Moment of Intense Focus
A young man, bathed in the glow of his computer screen, is completely absorbed in his video game. The dimly lit room and his focused gaze create a sense of suspense and intensity, highlighting the power of gaming to captivate and transport.
Prompt
camera-positions Over the shoulder: focused, intense ; A gamer; over-the-shoulder; gaming; a brightly lit computer screen displaying a complex video game; cinematic
Characteristic
Shot : A young man is playing a video game on a computer. He is wearing headphones and looking intently at the screen. The room is dimly lit, and the only light source is coming from the computer screen.
Aesthetic Score : 0.6
Mood : focused, intense, serious
Quality
Entropy : 6.43
Noise : 75
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image has minor artifacts, including noise in the shadows and slight compression artifacts, as well as chromatic aberration around the edges of the screen.
A Moment of Tranquility on the Mountaintop
A lone hiker stands silhouetted against the setting sun, gazing out at a breathtaking panorama of cloud-shrouded mountains. The scene evokes a sense of peace, contemplation, and awe at the vastness of nature.
Prompt
camera-positions Over the shoulder: awe-struck, amazed ; A tourist; over-the-shoulder; tourism; a breathtaking view of a mountain range with clouds in the distance; cinematic
Characteristic
Shot : A lone hiker stands on a mountaintop, looking out at a breathtaking view of a distant mountain range shrouded in clouds. The sky is a beautiful blue, and the sun is setting, casting a warm glow over the landscape.
Aesthetic Score : 0.7
Mood : tranquil, contemplative, inspiring
Quality
Entropy : 6.43
Noise : 85
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : No apparent errors in the image.
A Moment of Wonder in a Vibrant Marketplace
A young man, captivated by the colorful displays of a bustling marketplace, pauses to admire the decorative goods. Warm lights illuminate the scene, creating a sense of warmth and intrigue. His surprised expression invites you to wonder what has caught his eye, promising a story waiting to unfold.
Prompt
camera-positions Over the shoulder: excited, curious ; A traveler; over-the-shoulder; travel; vibrant colors and exotic goods; cinematic
Characteristic
Shot : A young man with a backpack is looking at a shop full of decorative goods, lit by warm lights. He is in a vibrant, bustling marketplace with many colorful items on display.
Aesthetic Score : 0.6
Mood : curious, vibrant, warm
Quality
Entropy : 6.53
Noise : 98
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image appears slightly soft and lacks sharpness, especially in the background.
Blue Light, Big Smile: The Joy of Gaming
A young man beams with happiness, bathed in the blue glow of his gaming setup. The light casts an air of mystery, highlighting his confident smile and capturing the pure joy of gaming.
Prompt
camera-positions Over the shoulder: happy, carefree ; A gamer; over-the-shoulder; all-together; virtual environment; cinematic
Characteristic
Shot : A young man is sitting in a gaming chair, smiling, with blue light reflecting on his face.
Aesthetic Score : 0.7
Mood : happy, joyful, confident
Quality
Entropy : 6.26
Noise : 72
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : Slight noise and some grain are present in the image, particularly in the shadows.
Firefighter Stands Tall Against Blazing Inferno
A dramatic image captures a firefighter in full gear, silhouetted against the fiery glow of a burning building. The contrast between the darkness and the flames creates a powerful visual, highlighting the bravery and intensity of the situation.
Prompt
camera-positions Over the shoulder: brave, determined ; A firefighter; over-the-shoulder; heroism; a burning building with flames and smoke billowing out; cinematic
Characteristic
Shot : A firefighter in full gear stands in front of a burning building, with flames visible through the windows.
Aesthetic Score : 0.7
Mood : intense, dramatic, heroic
Quality
Entropy : 6.70
Noise : 93
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some noise is present in the image, especially in the background. There are slight artifacts from the compression of the image, but they are not too noticeable.
Conquering the Summit, Embracing the Expanse
A lone climber stands atop a rocky peak, bathed in the golden light of sunset. The panoramic vista of mountains and valleys below evokes a sense of awe and adventure, highlighting the vastness and beauty of the natural world.
Prompt
camera-positions Over the shoulder: determined, focused ; A mountain climber; over-the-shoulder; adventure; a steep, rocky mountainside with a breathtaking view from above; cinematic
Characteristic
Shot : A climber with a backpack and helmet is standing on a rocky mountain peak looking out at a panoramic vista of mountains and a valley below, lit by the golden glow of the setting sun.
Aesthetic Score : 0.8
Mood : adventure, serene, expansive
Quality
Entropy : 6.74
Noise : 97
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant errors; the image is well exposed and shows no noticeable artifacts.
The Intensity of the Game, But a Little Too Close for Comfort
This image captures the focused intensity of a young man engrossed in a video game, bathed in the glow of his monitor. While the mood is palpable, the composition feels a bit cramped, with the subject too close to the camera. The lighting could also be improved to enhance the overall aesthetic.
Prompt
camera-positions Over the shoulder: intense, focused ; A competitive gamer; over-the-shoulder; gaming; a dimly lit room with a computer screen displaying a fast-paced game; cinematic
Characteristic
Shot : A young man is playing a video game in a dark room, illuminated by the glow of the monitor.
Aesthetic Score : 0.4
Mood : focused, intense, gamer
Quality
Entropy : 6.29
Noise : 80
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry, and there is a slight chromatic aberration around the edges of the monitor.
Silhouettes of Love at Sunset
A couple stands silhouetted against a breathtaking sunset on the beach, their love story unfolding against the backdrop of the ocean waves. The scene evokes a sense of romance, peace, and a touch of melancholy, creating a captivating and intimate moment.
Prompt
camera-positions Over the shoulder: romantic, peaceful ; A couple; over-the-shoulder; travel; a romantic sunset over a beach with the ocean waves crashing in the background; cinematic
Characteristic
Shot : A couple silhouetted against a sunset on the beach. The ocean waves and the setting sun are in the background.
Aesthetic Score : 0.6
Mood : romantic, peaceful, melancholic
Quality
Entropy : 5.18
Noise : 63
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has slight noise and some grain, especially in the darker areas. There is also a slight amount of blurring in the background, which could be caused by camera shake or a shallow depth of field.
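To see the ten images side by side, the per-image numbers above can be summarized in a few lines. This is only a sketch: the values are copied from the entries in this post, the titles are shortened for readability, and the `summarize` helper is hypothetical. It is not a reconstruction of how the overall scores in the conclusion were produced.

```python
from statistics import mean

# (short title, aesthetic score, prompt CLIP score, likelihood of AI),
# copied from the ten entries above; titles are abbreviated.
IMAGES = [
    ("Soldier amid the ruins", 0.6, 0.32, 0.80),
    ("Explorer in the jungle", 0.7, 0.31, 0.20),
    ("Lost in the game", 0.6, 0.30, 0.30),
    ("Tranquility on the mountaintop", 0.7, 0.29, 0.10),
    ("Wonder in the marketplace", 0.6, 0.29, 0.20),
    ("Blue light, big smile", 0.7, 0.24, 0.10),
    ("Firefighter against the inferno", 0.7, 0.31, 0.10),
    ("Conquering the summit", 0.8, 0.31, 0.10),
    ("Too close for comfort", 0.4, 0.29, 0.20),
    ("Silhouettes at sunset", 0.6, 0.35, 0.20),
]


def summarize(images):
    """Return simple averages and extremes for the per-image metrics."""
    return {
        "mean_aesthetic": round(mean(row[1] for row in images), 2),
        "mean_clip": round(mean(row[2] for row in images), 3),
        "mean_ai_likelihood": round(mean(row[3] for row in images), 2),
        "lowest_aesthetic": min(images, key=lambda row: row[1])[0],
        "highest_clip": max(images, key=lambda row: row[2])[0],
    }


for key, value in summarize(IMAGES).items():
    print(f"{key}: {value}")
```

On these figures the mean aesthetic score is 0.64 and the mean prompt CLIP score is about 0.30. The cramped gamer shot has the lowest aesthetic score, and the beach silhouette has the highest CLIP score.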
Conclusion
The model handled scene composition reasonably well, fell just short on camera position, and struggled most with the intended aesthetic. Here’s a breakdown:
Camera Position:
- Score: 0.45
- Interpretation: This score falls just below the “good” range (0.5-0.75). It suggests the model did not fully capture the camera position described in the prompts.
Shot Analysis:
- Score: 0.58
- Interpretation: This score falls within the “good” range (0.5-0.75). It indicates the model largely understood the scene composition described in the prompts, though there may be some minor discrepancies.
Aesthetic Analysis:
- Score: 0.14
- Interpretation: This score lies above the “very good” range (-0.2 to 0.1), exceeding its upper bound by 0.04. It suggests the aesthetic of the generated images deviated from the aesthetic described in the prompts.
Overall:
The model shows a good grasp of scene composition and comes close on camera position, but struggles to achieve the desired aesthetic. This suggests it might need further training to better understand aesthetic preferences and translate them into visual output.
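Each interpretation above compares a score with a named range, which can be checked mechanically. The sketch below uses only the scores and the two ranges quoted in this post. The helper name `distance_from_range` is hypothetical, and treating the range bounds as inclusive is an assumption.

```python
def distance_from_range(score, low, high):
    """Return 0.0 if score lies in [low, high], else the signed gap to the nearest bound.

    Assumption: the bounds are inclusive; the post does not say either way.
    """
    if low > high:
        raise ValueError("low must not exceed high")
    if score < low:
        return score - low   # negative: below the range
    if score > high:
        return score - high  # positive: above the range
    return 0.0


# (score, range low, range high) as quoted in the breakdown above.
REPORTED = {
    "Camera Position": (0.45, 0.5, 0.75),
    "Shot Analysis": (0.58, 0.5, 0.75),
    "Aesthetic Analysis": (0.14, -0.2, 0.1),
}

for name, (score, low, high) in REPORTED.items():
    gap = distance_from_range(score, low, high)
    status = "inside" if gap == 0.0 else f"outside by {gap:+.2f}"
    print(f"{name}: {score} vs [{low}, {high}] -> {status}")
```

Only the shot analysis score sits inside its range; the camera position score misses the lower bound by 0.05, and the aesthetic score overshoots the upper bound by 0.04.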