AI's Camera Eye: Good at Shots, Not So Much at Mood with Flux-pro
In the world of filmmaking, camera position is everything. It’s the invisible hand that guides the viewer’s eye, shaping their emotional response to the story unfolding on screen. We wanted to see if an AI model could grasp this concept, so we fed it descriptions of scenes with specific camera positions and aesthetics. The results were intriguing: the model demonstrated a good understanding of technical aspects like shot composition, but struggled to capture the intended mood and style. This suggests that while AI is making strides in understanding visual language, it still has a long way to go in replicating the artistic vision of a human director.
Created with: flux-pro
Silhouetted Against Hope
A solitary figure stands on a rocky outcrop, their silhouette stark against the vibrant hues of a setting sun. The scene evokes a sense of tranquility, contemplation, and a glimmer of hope amidst the vastness of the world.
Prompt
camera-positions Dutch angle: Epic, determined, hopeful ; A lone figure, silhouetted against the setting sun; wide shot; Heroism; A vast, desolate landscape; cinematic
Characteristic
Shot : A lone figure stands on a hilltop, silhouetted against a vibrant orange sunset.
Aesthetic Score : 0.7
Mood : tranquil, contemplative, hopeful
Quality
Entropy : 6.48
Noise : 65
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable artifacts or errors
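The prompts in this post appear to share one semicolon-separated layout: a camera position with mood keywords, then subject, shot size, category, setting, and style. The sketch below splits the first prompt into those parts. The field names (`camera_position`, `moods`, `shot_size`, and so on) are labels chosen here for illustration; they are not names used by flux-pro or by the tooling behind this post.

```python
from dataclasses import dataclass


@dataclass
class ScenePrompt:
    camera_position: str
    moods: list[str]
    subject: str
    shot_size: str
    category: str
    setting: str
    style: str


def parse_prompt(prompt: str) -> ScenePrompt:
    """Split a 'camera-positions ...' prompt into its six semicolon-separated parts."""
    head, subject, shot_size, category, setting, style = (
        part.strip() for part in prompt.split(";")
    )
    # The first part looks like "camera-positions Dutch angle: Epic, determined, hopeful".
    position, mood_text = head.removeprefix("camera-positions").split(":", 1)
    moods = [m.strip().lower() for m in mood_text.split(",")]
    return ScenePrompt(position.strip(), moods, subject, shot_size, category, setting, style)


example = parse_prompt(
    "camera-positions Dutch angle: Epic, determined, hopeful ; A lone figure, "
    "silhouetted against the setting sun; wide shot; Heroism; "
    "A vast, desolate landscape; cinematic"
)
print(example.camera_position, "|", example.shot_size, "|", example.moods)
```

Splitting the prompt this way makes it easy to compare the requested moods with the Mood line the evaluator reports for each image.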
Uncharted Territories: A Compass Beckons
A vintage compass rests on a weathered map, its needle pointing towards the unknown. Candles flicker in the background, casting a warm glow and adding to the sense of mystery and adventure. This image evokes a feeling of nostalgia and invites you to explore the uncharted territories of your imagination.
Prompt
camera-positions Dutch angle: Intriguing, mysterious, adventurous ; A weathered map, spread out on a table, with a compass pointing towards a distant destination; close-up; Adventure; A dimly lit room with flickering candlelight; cinematic
Characteristic
Shot : A close-up shot of an antique compass lying on a vintage map, with candles and a wooden table in the blurry background.
Aesthetic Score : 0.7
Mood : mysterious, adventurous, nostalgic
Quality
Entropy : 6.90
Noise : 63
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible image errors.
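The post does not say how the Entropy value is computed. One common definition is the Shannon entropy of the grayscale histogram, which tops out at 8 bits for an 8-bit image, and the values reported here (roughly 6.5 to 6.9) would be consistent with that. The following is a minimal sketch under that assumption, using a plain list of pixel values in place of a real image:

```python
import math
from collections import Counter


def shannon_entropy(pixels: list[int]) -> float:
    """Shannon entropy, in bits, of a sequence of grayscale pixel values."""
    counts = Counter(pixels)
    total = len(pixels)
    # Sum p * log2(1/p) over every gray level that actually occurs.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


# A flat image carries no information; an even spread over all 256 levels is the maximum.
flat = [128] * 1024
uniform = list(range(256)) * 4
print(shannon_entropy(flat))     # 0.0
print(shannon_entropy(uniform))  # 8.0
```

Under this reading, higher values mean the tonal range is used more evenly, which says something about detail and contrast but nothing about mood.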
Lost in the Code: A Hand Navigates the Digital Landscape
A solitary hand interacts with a mouse in a dimly lit room, bathed in a blueish hue. The out-of-focus lights and computer monitors in the background create a sense of mystery and isolation, highlighting the focused and serene atmosphere of the scene.
Prompt
camera-positions Dutch angle: Intense, focused, competitive ; A gamer’s hands, furiously tapping buttons on a controller; close-up; Gaming; A brightly lit room with flashing lights and screens; cinematic
Characteristic
Shot : A person’s hand is hovering over a mouse pad in a dimly lit room, with multiple computer monitors in the background, creating a sense of immersion and digital activity. The lighting is warm and inviting with vibrant colors illuminating the scene, creating a captivating ambiance.
Aesthetic Score : 0.6
Mood : cyberpunk, futuristic, digital
Quality
Entropy : 6.74
Noise : 52
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors.
Immerse Yourself in the Vibrant Chaos of a Middle Eastern Market
Experience the bustling energy of a sun-drenched market, filled with vibrant colors, interesting textures, and the lively chatter of vendors and shoppers. This captivating scene draws you in with its depth and immersion, transporting you to the heart of a bustling Middle Eastern or Mediterranean city.
Prompt
camera-positions Dutch angle: Energetic, lively, exciting ; A bustling marketplace, with vibrant colors and exotic goods; wide shot; Tourism; A sunny day with clear blue skies; cinematic
Characteristic
Shot : A bustling street market in a Mediterranean city, with colorful awnings, people shopping and vendors selling their wares.
Aesthetic Score : 0.6
Mood : vibrant, lively, bustling
Quality
Entropy : 6.92
Noise : 97
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some slight chromatic aberration around the edges of the image, noticeable in the sky and awnings.
A Serene Journey Through Majestic Mountains
Experience the thrill of a speeding train through a breathtaking valley, where vibrant green fields meet towering mountains under a clear blue sky. The motion blur captures the energy of the journey, while the dramatic lighting evokes a sense of awe and wonder. This serene and adventurous scene inspires hope and a longing for exploration.
Prompt
camera-positions Dutch angle: Dynamic, adventurous, liberating ; A train speeding through a picturesque countryside; medium shot; Travel; A rolling landscape with lush green fields and distant mountains; cinematic
Characteristic
Shot : A train traveling through a scenic mountain valley with a clear blue sky and bright sunshine. The train is in the foreground, and the mountains are in the background.
Aesthetic Score : 0.7
Mood : serene, peaceful, adventurous
Quality
Entropy : 6.87
Noise : 90
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.30
Image errors : Some artifacts are present in the image, particularly in the sky and the grass in the background. They are not major, but they do detract from the overall quality of the image.
Laughter and Hugs: A Night of Girl Power
Three young women share a moment of joy and intimacy at a social gathering. Their laughter and embrace capture the essence of friendship and camaraderie, creating a warm and lively atmosphere.
Prompt
camera-positions Dutch angle: Joyful, celebratory, connected ; A group of friends, laughing and celebrating, with their arms around each other; medium shot; Groups; A dimly lit bar with warm lighting and a lively atmosphere; cinematic
Characteristic
Shot : Three women are in a dimly lit bar or restaurant, laughing and hugging. The atmosphere is lively and casual.
Aesthetic Score : 0.7
Mood : joyful, friendly, intimate
Quality
Entropy : 6.75
Noise : 73
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are minor artifacts in the background, some slight blurring around the edges, and slight color bleeding on the right side.
Silhouetted Against the Storm: A Lone Figure Contemplates the City
A solitary figure stands on a clifftop, their silhouette stark against a dramatic, stormy sky. The distant city lights pierce the fog, creating a sense of isolation and grandeur. This image evokes a mood of loneliness, drama, and impending change.
Prompt
camera-positions Dutch angle: Dramatic, intense, powerful ; A lone warrior, standing on a precipice, gazing out at a vast battlefield; medium shot; Heroism; A stormy sky with dark clouds and flashes of lightning; cinematic
Characteristic
Shot : A lone figure stands on a clifftop, silhouetted against a stormy sky. Lightning strikes in the distance, illuminating a city below.
Aesthetic Score : 0.7
Mood : dramatic, ominous, contemplative
Quality
Entropy : 6.53
Noise : 64
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.80
Image errors : The city lights appear a bit blurry and lack detail. The figure is also quite blocky.
Unveiling the Secrets of a Candlelit Treasure
A mysterious treasure chest, overflowing with gold coins and shimmering gems, sits bathed in the warm glow of a single candle. The dimly lit room whispers of adventure and magic, inviting you to uncover the secrets within.
Prompt
camera-positions Dutch angle: Intriguing, mysterious, alluring ; A treasure chest, overflowing with gold and jewels, with a single, flickering candle illuminating its contents; close-up; Adventure; A dark, mysterious cave with damp walls and dripping water; cinematic
Characteristic
Shot : A treasure chest overflowing with gold and jewels, lit by a single candle, suggesting a scene of pirate wealth and mystery.
Aesthetic Score : 0.7
Mood : mysterious, romantic, adventurous
Quality
Entropy : 6.58
Noise : 81
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.60
Image errors : The image appears to be slightly blurry and the shadows are somewhat artificial, which may indicate AI generation.
Finding Hope in the Golden Hour
A solitary figure stands triumphant on a mountain peak, arms outstretched towards a breathtaking sunset. The misty valley below and the silhouette of the figure evoke a sense of awe and inspire hope for the future.
Prompt
camera-positions Dutch angle: Triumphant, exhilarating, immersive ; A player’s avatar, standing triumphantly on a virtual mountain peak, with a panoramic view of the game world; medium shot; Gaming; A brightly lit room with a gamer’s headset and controller; cinematic
Characteristic
Shot : A lone figure stands on a mountain peak, arms raised in triumph, as a dramatic sunset paints the sky with vibrant hues. The mountains are shrouded in a misty haze, lending an ethereal quality to the scene.
Aesthetic Score : 0.7
Mood : inspiring, hopeful, majestic
Quality
Entropy : 6.51
Noise : 82
Prompt Clip Score : 0.19
AI Evaluation
Likelihood of AI : 0.80
Image errors : Slight color banding in the sky; some edges of the figure appear a little rough.
Sunset Smiles at the Arc de Triomphe
Capture the joy of a Parisian evening as a group of friends pose in front of the iconic Arc de Triomphe, bathed in the warm glow of a setting sun. The vibrant sky and happy faces create a heartwarming scene, perfect for capturing the spirit of travel and adventure.
Prompt
camera-positions Dutch angle: Romantic, nostalgic, memorable ; A group of tourists, taking photos of a famous landmark, with their faces lit by the warm glow of the setting sun; medium shot; Tourism; A bustling city with iconic architecture and vibrant street life; cinematic
Characteristic
Shot : A group of people stand in front of a large archway, likely the Arc de Triomphe in Paris, while the sun sets behind them. One person is taking a photo of the archway, another is looking at their phone.
Aesthetic Score : 0.6
Mood : peaceful, nostalgic, golden hour
Quality
Entropy : 6.64
Noise : 51
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image appears slightly overexposed, losing detail in the brighter areas. The composition could be strengthened by ensuring that the subject is not cut off at the edges of the frame.
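To put the per-image numbers side by side, the sketch below averages three of the reported metrics across the ten images and lists the images the evaluator considered more likely than not to be AI-generated. The values are copied from the listings above; the variable names and the 0.5 cutoff are choices made for this illustration.

```python
from statistics import mean

# Values in the order the images appear in this post.
aesthetic = [0.7, 0.7, 0.6, 0.6, 0.7, 0.7, 0.7, 0.7, 0.7, 0.6]
clip_score = [0.24, 0.23, 0.22, 0.23, 0.27, 0.23, 0.26, 0.23, 0.19, 0.26]
ai_likelihood = [0.20, 0.10, 0.20, 0.10, 0.30, 0.20, 0.80, 0.60, 0.80, 0.20]

summary = {
    "aesthetic": round(mean(aesthetic), 3),
    "clip_score": round(mean(clip_score), 3),
    "ai_likelihood": round(mean(ai_likelihood), 3),
}
print(summary)

# Images whose AI likelihood is above 0.5 (1-based position in the post).
flagged = [i for i, p in enumerate(ai_likelihood, start=1) if p > 0.5]
print(flagged)
```

The flagged positions correspond to the storm silhouette, the treasure chest, and the mountain-peak scene, the three images with the highest Likelihood of AI values.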
Conclusion
The results show that the generative AI model handled shot composition well, landed slightly under the “good” range on camera position, and struggled to achieve the desired aesthetic. Here’s a breakdown:
Camera Position:
- Score: 0.41
- Interpretation: At 0.41, this score sits 0.09 below the lower bound of the “good” range of 0.5 to 0.75. The model did not fully capture the camera positions described in the prompts: every prompt asks for a Dutch angle, yet none of the shot descriptions mentions a tilted frame. The score is still close to the “good” range, which indicates a partial understanding.
Shot Analysis:
- Score: 0.57
- Interpretation: This score falls within the “good” range, indicating that the model successfully translated the prompt’s scene description into a visually coherent shot.
Aesthetic Analysis:
- Score: 0.06
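- Interpretation: At 0.06, this score is close to zero and far below the camera position and shot scores, even though it sits numerically inside the quoted range of -0.2 to 0.1. It suggests that the generated images’ aesthetic deviated considerably from the aesthetic described in the prompts. The mood labels show the gap: the gaming prompt asked for “Intense, focused, competitive” and the image was rated “cyberpunk, futuristic, digital”, while the train prompt asked for “Dynamic, adventurous, liberating” and the image was rated “serene, peaceful, adventurous”.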
Overall:
The model demonstrates a good understanding of shot composition and a partial grasp of camera positions, but needs improvement in capturing the intended aesthetic. This suggests that the model is better at the technical aspects of a scene than the artistic ones.
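As a sanity check on the breakdown, the three scores can be compared against the ranges quoted for them. This is only a numeric comparison of the figures reported in this post; it does not reproduce how the scores were computed. The helper name `position_in_range` is made up for this sketch, and the range for shot analysis is assumed to be the same “good” range quoted for camera position.

```python
def position_in_range(score: float, low: float, high: float) -> str:
    """Say whether a score is below, inside, or above the closed range [low, high]."""
    if score < low:
        return "below"
    if score > high:
        return "above"
    return "inside"


# (score, low, high) as quoted in the conclusion.
# Assumption: shot analysis uses the same 0.5 to 0.75 range as camera position.
reported = {
    "camera position": (0.41, 0.5, 0.75),
    "shot analysis": (0.57, 0.5, 0.75),
    "aesthetic analysis": (0.06, -0.2, 0.1),
}

verdicts = {
    name: position_in_range(score, low, high)
    for name, (score, low, high) in reported.items()
}
for name, verdict in verdicts.items():
    print(f"{name}: {verdict}")
```

Numerically, the aesthetic score of 0.06 lands inside the quoted -0.2 to 0.1 range, so the low rating of the aesthetic rests on the value being near zero, not on it falling outside that range.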
Sources:
- https://www.studiobinder.com/blog/types-of-camera-shot-angles-in-film/
- https://www.learnaboutfilm.com/film-language/picture/camera-position/
- https://boords.com/blog/16-types-of-camera-shots-and-angles-with-gifs
- https://shorthand.com/the-craft/8-tips-for-great-visual-storytelling/
- https://fal.ai/models/fal-ai/flux-pro/api