AI's Eye: Tracking Shots - A Cinematic Journey with Flux-schnell
Tracking shots, a cinematic technique where the camera follows a subject in motion, are often used to create a sense of dynamism and immersion. This blog post explores the capabilities of a generative AI model in creating tracking shots, analyzing its performance in capturing the essence of this technique across various scenes. We’ll delve into the model’s strengths and weaknesses, highlighting its success in understanding camera positions and shot composition, while also examining its limitations in achieving the desired aesthetic.
Created with: flux-schnell
Silhouetted Against the Setting Sun: A Moment of Tranquility
A lone figure, staff in hand, stands in a vast, open plain as the sun dips below the horizon. The warm glow of the setting sun casts a contemplative mood, highlighting the figure’s silhouette against the sky. This image evokes a sense of isolation and hope, inviting viewers to contemplate the beauty of the moment.
Prompt
camera-positions Tracking shot: Epic, hopeful ; A lone figure, silhouetted against the setting sun; tracking shot; Heroism; A vast, desolate landscape.; cinematic
Characteristic
Shot : A silhouette of a man standing with a staff in his hand against a sunset. The sun is almost out of the frame, creating a beautiful glow in the sky.
Aesthetic Score : 0.6
Mood : serene, contemplative, hopeful
Quality
Entropy : 5.38
Noise : 56
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors in the image.
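The prompts in this post appear to share one layout: a "camera-positions Tracking shot:" prefix, a mood, a subject, the shot type, a theme, a setting, and a closing "cinematic" tag. The sketch below rebuilds that layout as an illustration only. The function name and parameter names are hypothetical, and the template is inferred from the prompts shown here, not from any documented generator.

```python
def build_tracking_shot_prompt(mood: str, subject: str, theme: str, setting: str) -> str:
    """Assemble a prompt in the layout the post's prompts appear to follow.

    The field names are illustrative assumptions; only the separators and
    fixed tags are taken from the prompts quoted in the post.
    """
    return (
        f"camera-positions Tracking shot: {mood} ; {subject}; "
        f"tracking shot; {theme}; {setting}; cinematic"
    )


# Rebuild the prompt used for the silhouette image above.
prompt = build_tracking_shot_prompt(
    mood="Epic, hopeful",
    subject="A lone figure, silhouetted against the setting sun",
    theme="Heroism",
    setting="A vast, desolate landscape.",
)
print(prompt)
```

Keeping the fixed tags in one place makes it easier to vary only the mood, subject, theme, and setting between images.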
Tranquil Trek to a Mystical Temple
A group of hikers venture through a lush forest, their path leading towards a distant temple shrouded in mystery. The serene atmosphere and sense of depth invite you to explore this tranquil adventure.
Prompt
camera-positions Tracking shot: Intriguing, adventurous ; A group of explorers navigating a dense jungle; tracking shot; Adventure; Lush greenery, ancient ruins in the distance.; cinematic
Characteristic
Shot : A group of hikers walk down a path through lush vegetation with a temple structure visible in the background.
Aesthetic Score : 0.6
Mood : tranquil, adventurous, natural
Quality
Entropy : 6.83
Noise : 126
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors in the image.
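The post does not say how the Entropy value is computed. One common reading, assumed here, is the Shannon entropy of the image's 8-bit grayscale histogram, which ranges from 0 bits (a flat, single-tone image) to 8 bits (all 256 levels equally likely). The reported values of 5.38 and 6.83 would fit that scale. A minimal sketch under that assumption, using only the standard library and a hypothetical function name:

```python
import math
from collections import Counter
from typing import Iterable


def shannon_entropy_bits(pixels: Iterable[int]) -> float:
    """Shannon entropy, in bits, of a flat sequence of grayscale pixel values.

    Assumption: this is only one plausible definition of the post's
    "Entropy" metric; the post itself does not define it.
    """
    counts = Counter(pixels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no pixels given")
    # Sum -p * log2(p) over every gray level that actually occurs.
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Two equally frequent gray levels carry exactly one bit per pixel.
two_tone = [0, 255] * 8
print(shannon_entropy_bits(two_tone))
```

Under this reading, a higher value means tonal detail is spread across more gray levels, which would be consistent with the busy jungle scene scoring higher than the silhouette.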
Lost in the Game: A Moment of Focused Play
A low-angle shot captures the intensity of a gamer’s focus as they grip their controller. The blurry background fades away, highlighting the player’s immersion in the virtual world. Warm lighting creates a cozy and inviting atmosphere, emphasizing the playful nature of the experience.
Prompt
camera-positions Tracking shot: Intense, focused ; A gamer’s hands furiously manipulating a controller; tracking shot; Gaming; elevated virtual world; cinematic
Characteristic
Shot : A person is playing a video game. We see their hands holding a controller, and a blurry background of a screen showing a game and some other electronics.
Aesthetic Score : 0.5
Mood : focused, relaxed, playful
Quality
Entropy : 6.65
Noise : 48
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some minor blurring and graininess, especially in the background. Some of the colors are washed out, particularly in the background.
A Symphony of Colors: Life and Energy in a Bustling Street Market
Immerse yourself in the vibrant energy of a bustling street market, where colorful goods and a lively crowd create a captivating scene. The weathered buildings add a touch of history, enhancing the atmosphere of this lively marketplace.
Prompt
camera-positions Tracking shot: Energetic, lively ; A bustling marketplace in a foreign city; tracking shot; Tourism; Vibrant colors, exotic goods, diverse crowds.; cinematic
Characteristic
Shot : A bustling street market in a foreign country, with colorful hanging decorations, a variety of goods on display, and people shopping and interacting.
Aesthetic Score : 0.6
Mood : vibrant, lively, exotic
Quality
Entropy : 6.82
Noise : 113
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some minor artifacts and blurring in the background, but these are not particularly noticeable.
Sunset Drive Through the Desert: A Journey of Freedom and Nostalgia
Two friends embark on a desert adventure, the setting sun casting a warm glow as they cruise down the open road. The carefree wave from the backseat captures the spirit of joy and nostalgia that fills their journey.
Prompt
camera-positions Tracking shot: Nostalgic, heartwarming ; A family driving down a scenic highway; tracking shot; Travel; Rolling hills, open road, sunlight streaming through the car window.; cinematic
Characteristic
Shot : Two people in a car, driving through a desert landscape. The sun is setting in the background.
Aesthetic Score : 0.6
Mood : peaceful, nostalgic, hopeful
Quality
Entropy : 6.29
Noise : 47
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some slight noise in the image, especially in the shadows.
Lost in the Landscape: A Boy’s Pensive Journey
A young boy gazes out of a train window, his thoughtful expression mirroring the passing scenery. The contrast between the dark train interior and the bright, blurred landscape creates a sense of depth and nostalgia, capturing a moment of quiet contemplation.
Prompt
camera-positions Tracking shot: Innocent, hopeful ; A young boy gazing out of a train window; tracking shot; Family; Passing landscapes, a sense of anticipation and wonder.; cinematic
Characteristic
Shot : A young child is looking out the window of a moving train as the landscape blurs out of focus.
Aesthetic Score : 0.6
Mood : melancholy, contemplative, curious
Quality
Entropy : 6.11
Noise : 53
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant errors, some slight blur in the background
Firefighter Silhouetted Against Blazing Inferno
A dramatic image captures a firefighter standing bravely in front of a building consumed by flames. The intense fire and thick smoke create a powerful silhouette, highlighting the danger and intensity of the situation.
Prompt
camera-positions Tracking shot: Urgent, dramatic ; A firefighter rushing into a burning building; tracking shot; Heroism; Smoke and flames engulfing the structure.; cinematic
Characteristic
Shot : A firefighter is silhouetted against a raging fire, standing on a concrete structure, possibly a fire escape. The fire is engulfing a building, creating a dramatic and intense backdrop.
Aesthetic Score : 0.6
Mood : intense, dramatic, dangerous
Quality
Entropy : 6.84
Noise : 89
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image exhibits some slight noise, particularly in the darker areas. There is also a slight loss of detail in the firefighter’s silhouette.
Serene Mountain Hike with Breathtaking Views
Capture the tranquility of a mountain ridge hike with three adventurers enjoying the panoramic vista. The clear blue sky and lush greenery create a peaceful and adventurous atmosphere.
Prompt
camera-positions Tracking shot: Inspiring, adventurous ; A group of friends hiking through a breathtaking mountain range; tracking shot; Adventure; Majestic peaks, clear blue sky.; cinematic
Characteristic
Shot : Three people are hiking in the mountains. They are walking on a path along a ridge with a beautiful view of the mountains in the distance.
Aesthetic Score : 0.7
Mood : tranquil, adventurous, scenic
Quality
Entropy : 6.65
Noise : 72
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant errors are visible. The image has good clarity and sharpness.
Lost in the Neon: A Glimpse into the Future of Music
A young man, shrouded in the glow of neon lights, stands poised with a microphone, his VR headset hinting at a world beyond our own. This image captures the essence of futuristic music, blending technology and mystery in a captivating scene.
Prompt
camera-positions Tracking shot: Intriguing, futuristic ; A virtual reality headset being put on; tracking shot; Gaming; futuristic.; cinematic
Characteristic
Shot : A young man wearing a VR headset is in a room lit with neon lights. He is looking down and has a microphone in front of him. The scene seems to be a recording studio or a gaming environment.
Aesthetic Score : 0.6
Mood : futuristic, techy, focused
Quality
Entropy : 6.63
Noise : 61
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a slight blur around the edges, and some of the lights in the background appear to be overexposed.
Warm Lights, Happy Friends: A Cozy Restaurant Gathering
Capture the joy of shared meals with this intimate scene. Warm lighting and close-up shots create a sense of connection and warmth, perfect for showcasing the happiness of a casual gathering.
Prompt
camera-positions Tracking shot: Intimate, heartwarming ; A family enjoying a meal restaurant; tracking shot; Family; Warm lighting, open world.; cinematic
Characteristic
Shot : A group of friends are sitting at a table in a restaurant. They are eating and talking.
Aesthetic Score : 0.7
Mood : casual, warm, friendly
Quality
Entropy : 6.75
Noise : 91
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some minor chromatic aberration is visible around the edges of the image. Some minor noise is present in the image. The color balance is slightly off in the image.
Conclusion
The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis. Here’s a breakdown:
Camera Position:
- Score: 0.5
- Interpretation: This score falls within the “good” range, indicating that the model generally understood and implemented the camera positions described in the prompt.
Shot Analysis:
- Score: 0.565
- Interpretation: This score also falls within the “good” range, suggesting the model was able to grasp the scene and create shots that were generally consistent with the prompt’s description.
Aesthetic Analysis:
- Score: 0.16
- Interpretation: This score falls outside the ideal range of -0.2 to 0.1, sitting above the upper bound of 0.1. This indicates that the aesthetic of the generated images deviated from the aesthetic described in the prompts.
Overall:
The model demonstrates a good understanding of camera positions and shot composition. However, it needs improvement in generating images that match the desired aesthetic.
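As a rough cross-check, the per-image numbers reported in this post can be averaged, and the aesthetic analysis score of 0.16 can be compared with the stated ideal range of -0.2 to 0.1. The sketch below does only that. The variable and function names are hypothetical, and it makes no claim about how the three conclusion scores were derived.

```python
from statistics import mean

# Per-image values copied from the ten image sections of this post.
aesthetic_scores = [0.6, 0.6, 0.5, 0.6, 0.6, 0.6, 0.6, 0.7, 0.6, 0.7]
clip_scores = [0.28, 0.27, 0.25, 0.22, 0.27, 0.30, 0.27, 0.27, 0.24, 0.28]


def in_range(value: float, low: float, high: float) -> bool:
    """Return True when value lies inside the closed interval [low, high]."""
    return low <= value <= high


mean_aesthetic = round(mean(aesthetic_scores), 3)
mean_clip = round(mean(clip_scores), 3)
# Range and score as stated in the conclusion.
aesthetic_analysis_ok = in_range(0.16, -0.2, 0.1)

print(mean_aesthetic, mean_clip, aesthetic_analysis_ok)
```

The averages come out to 0.61 for the Aesthetic Score and 0.265 for the Prompt Clip Score, and the range check returns False because 0.16 is greater than 0.1.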