AI's Camera Eye: Good at Shots, Not So Much at Mood with Flux-pro

Testing AI's Ability to Capture Cinematic Scenes with Flux-pro

In filmmaking, camera position is everything. It’s the invisible hand that guides the viewer’s eye and shapes their emotional response to the story on screen. To see whether an AI model could grasp this, we gave flux-pro ten scene descriptions, each specifying a camera position (a Dutch angle), a shot size, and a target mood. The results were intriguing: the model handled technical aspects such as shot composition reasonably well but struggled to capture the intended mood and style. This suggests that while AI is making strides in understanding visual language, it still has a long way to go in replicating the artistic vision of a human director.

Created with: flux-pro

Silhouetted Against Hope

A solitary figure stands on a rocky outcrop, their silhouette stark against the vibrant hues of a setting sun. The scene evokes a sense of tranquility, contemplation, and a glimmer of hope amidst the vastness of the world.

Prompt

camera-positions Dutch angle: Epic, determined, hopeful ; A lone figure, silhouetted against the setting sun; wide shot; Heroism; A vast, desolate landscape; cinematic
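
The prompts in this article all appear to follow one semicolon-separated pattern: a camera-position label with mood words, then subject, shot size, category, setting, and style. The sketch below splits such a prompt into named fields. The field names and the `parse_prompt` helper are assumptions made for illustration; they are not part of flux-pro or of the tooling used for these tests.

```python
from dataclasses import dataclass


@dataclass
class ScenePrompt:
    camera_position: str
    moods: list[str]
    subject: str
    shot_size: str
    category: str
    setting: str
    style: str


def parse_prompt(prompt: str) -> ScenePrompt:
    """Split a prompt of the form used in this article into labeled fields."""
    parts = [p.strip() for p in prompt.split(";")]
    if len(parts) != 6:
        raise ValueError(f"expected 6 ';'-separated parts, got {len(parts)}")
    head, subject, shot_size, category, setting, style = parts
    # The first part looks like "camera-positions Dutch angle: Epic, determined, hopeful".
    label, _, mood_text = head.partition(":")
    camera_position = label.removeprefix("camera-positions").strip()
    moods = [m.strip().lower() for m in mood_text.split(",") if m.strip()]
    return ScenePrompt(camera_position, moods, subject, shot_size,
                       category, setting, style)


example = (
    "camera-positions Dutch angle: Epic, determined, hopeful ; "
    "A lone figure, silhouetted against the setting sun; wide shot; "
    "Heroism; A vast, desolate landscape; cinematic"
)
parsed = parse_prompt(example)
print(parsed.camera_position, parsed.moods, parsed.shot_size)
```

Separating the requested moods from the requested camera position makes it easier to compare each one with the Shot and Mood fields reported for the generated image.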

Characteristic

Shot : A lone figure stands on a hilltop, silhouetted against a vibrant orange sunset.

Aesthetic Score : 0.7

Mood : tranquil, contemplative, hopeful

Quality

Entropy : 6.48

Noise : 65

Prompt Clip Score : 0.24

AI Evaluation

Likelihood of AI : 0.20

Image errors : No noticeable artifacts or errors
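
The article does not say how the Entropy figure is computed. One common definition is the Shannon entropy of an image's 8-bit grayscale histogram, which ranges from 0 to 8 bits; the values reported here (6.48 to 6.92) are consistent with that scale. The sketch below assumes that definition. The `image_entropy` helper is hypothetical, and it works on NumPy arrays instead of loading the article's images.

```python
import numpy as np


def image_entropy(gray: np.ndarray) -> float:
    """Shannon entropy, in bits, of an 8-bit grayscale image's histogram."""
    counts = np.bincount(gray.ravel(), minlength=256).astype(np.float64)
    probs = counts / counts.sum()
    probs = probs[probs > 0]  # empty bins contribute nothing to the sum
    return float(np.sum(probs * np.log2(1.0 / probs)))


rng = np.random.default_rng(0)
# One gray level everywhere: no variation at all.
flat = np.full((64, 64), 128, dtype=np.uint8)
# Half black, half white: two equally likely levels.
two_tone = np.repeat(np.array([0, 255], dtype=np.uint8), 2048).reshape(64, 64)
# Uniform random pixels: close to the 8-bit maximum.
noisy = rng.integers(0, 256, size=(256, 256), dtype=np.uint8)

for name, img in [("flat", flat), ("two_tone", two_tone), ("noisy", noisy)]:
    print(name, round(image_entropy(img), 3))
```

Under this definition, higher values mean tonal levels are spread more evenly across the image, which says something about detail and contrast but nothing about mood.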

Uncharted Territories: A Compass Beckons

A vintage compass rests on a weathered map, its needle pointing towards the unknown. Candles flicker in the background, casting a warm glow and adding to the sense of mystery and adventure. This image evokes a feeling of nostalgia and invites you to explore the uncharted territories of your imagination.

Prompt

camera-positions Dutch angle: Intriguing, mysterious, adventurous ; A weathered map, spread out on a table, with a compass pointing towards a distant destination; close-up; Adventure; A dimly lit room with flickering candlelight; cinematic

Characteristic

Shot : A close-up shot of an antique compass lying on a vintage map, with candles and a wooden table in the blurry background.

Aesthetic Score : 0.7

Mood : mysterious, adventurous, nostalgic

Quality

Entropy : 6.90

Noise : 63

Prompt Clip Score : 0.23

AI Evaluation

Likelihood of AI : 0.10

Image errors : No visible image errors.

Lost in the Code: A Hand Navigates the Digital Landscape

A solitary hand interacts with a mouse in a dimly lit room, bathed in a blueish hue. The out-of-focus lights and computer monitors in the background create a sense of mystery and isolation, highlighting the focused and serene atmosphere of the scene.

Prompt

camera-positions Dutch angle: Intense, focused, competitive ; A gamer’s hands, furiously tapping buttons on a controller; close-up; Gaming; A brightly lit room with flashing lights and screens; cinematic

Characteristic

Shot : A person’s hand is hovering over a mouse pad in a dimly lit room, with multiple computer monitors in the background, creating a sense of immersion and digital activity. The lighting is warm and inviting with vibrant colors illuminating the scene, creating a captivating ambiance.

Aesthetic Score : 0.6

Mood : cyberpunk, futuristic, digital

Quality

Entropy : 6.74

Noise : 52

Prompt Clip Score : 0.22

AI Evaluation

Likelihood of AI : 0.20

Image errors : No visible errors.

Immerse Yourself in the Vibrant Chaos of a Middle Eastern Market

Experience the bustling energy of a sun-drenched market, filled with vibrant colors, interesting textures, and the lively chatter of vendors and shoppers. This captivating scene draws you in with its depth and immersion, transporting you to the heart of a bustling Middle Eastern or Mediterranean city.

Prompt

camera-positions Dutch angle: Energetic, lively, exciting ; A bustling marketplace, with vibrant colors and exotic goods; wide shot; Tourism; A sunny day with clear blue skies; cinematic

Characteristic

Shot : A bustling street market in a Mediterranean city, with colorful awnings, people shopping and vendors selling their wares.

Aesthetic Score : 0.6

Mood : vibrant, lively, bustling

Quality

Entropy : 6.92

Noise : 97

Prompt Clip Score : 0.23

AI Evaluation

Likelihood of AI : 0.10

Image errors : Some slight chromatic aberration around the edges of the image, noticeable in the sky and awnings.

A Serene Journey Through Majestic Mountains

Experience the thrill of a speeding train through a breathtaking valley, where vibrant green fields meet towering mountains under a clear blue sky. The motion blur captures the energy of the journey, while the dramatic lighting evokes a sense of awe and wonder. This serene and adventurous scene inspires hope and a longing for exploration.

Prompt

camera-positions Dutch angle: Dynamic, adventurous, liberating ; A train speeding through a picturesque countryside; medium shot; Travel; A rolling landscape with lush green fields and distant mountains; cinematic

Characteristic

Shot : A train traveling through a scenic mountain valley with a clear blue sky and bright sunshine. The train is in the foreground, and the mountains are in the background.

Aesthetic Score : 0.7

Mood : serene, peaceful, adventurous

Quality

Entropy : 6.87

Noise : 90

Prompt Clip Score : 0.27

AI Evaluation

Likelihood of AI : 0.30

Image errors : Some artifacts are present in the image, particularly in the sky and the grass in the background. They are not major, but they do detract from the overall quality of the image.

Laughter and Hugs: A Night of Girl Power

Three young women share a moment of joy and intimacy at a social gathering. Their laughter and embrace capture the essence of friendship and camaraderie, creating a warm and lively atmosphere.

Prompt

camera-positions Dutch angle: Joyful, celebratory, connected ; A group of friends, laughing and celebrating, with their arms around each other; medium shot; Groups; A dimly lit bar with warm lighting and a lively atmosphere; cinematic

Characteristic

Shot : Three women are in a dimly lit bar or restaurant, laughing and hugging. The atmosphere is lively and casual.

Aesthetic Score : 0.7

Mood : joyful, friendly, intimate

Quality

Entropy : 6.75

Noise : 73

Prompt Clip Score : 0.23

AI Evaluation

Likelihood of AI : 0.20

Image errors : There are minor artifacts in the background, slight blurring around the edges, and some color bleeding on the right side.

Silhouetted Against the Storm: A Lone Figure Contemplates the City

A solitary figure stands on a clifftop, their silhouette stark against a dramatic, stormy sky. The distant city lights pierce the fog, creating a sense of isolation and grandeur. This image evokes a mood of loneliness, drama, and impending change.

Prompt

camera-positions Dutch angle: Dramatic, intense, powerful ; A lone warrior, standing on a precipice, gazing out at a vast battlefield; medium shot; Heroism; A stormy sky with dark clouds and flashes of lightning; cinematic

Characteristic

Shot : A lone figure stands on a clifftop, silhouetted against a stormy sky. Lightning strikes in the distance, illuminating a city below.

Aesthetic Score : 0.7

Mood : dramatic, ominous, contemplative

Quality

Entropy : 6.53

Noise : 64

Prompt Clip Score : 0.26

AI Evaluation

Likelihood of AI : 0.80

Image errors : The city lights appear a bit blurry and lack detail. The figure is also quite blocky.

Unveiling the Secrets of a Candlelit Treasure

A mysterious treasure chest, overflowing with gold coins and shimmering gems, sits bathed in the warm glow of a single candle. The dimly lit room whispers of adventure and magic, inviting you to uncover the secrets within.

Prompt

camera-positions Dutch angle: Intriguing, mysterious, alluring ; A treasure chest, overflowing with gold and jewels, with a single, flickering candle illuminating its contents; close-up; Adventure; A dark, mysterious cave with damp walls and dripping water; cinematic

Characteristic

Shot : A treasure chest overflowing with gold and jewels, lit by a single candle, suggesting a scene of pirate wealth and mystery.

Aesthetic Score : 0.7

Mood : mysterious, romantic, adventurous

Quality

Entropy : 6.58

Noise : 81

Prompt Clip Score : 0.23

AI Evaluation

Likelihood of AI : 0.60

Image errors : The image appears to be slightly blurry and the shadows are somewhat artificial, which may indicate AI generation.

Finding Hope in the Golden Hour

A solitary figure stands triumphant on a mountain peak, arms outstretched towards a breathtaking sunset. The misty valley below and the silhouette of the figure evoke a sense of awe and inspire hope for the future.

Prompt

camera-positions Dutch angle: Triumphant, exhilarating, immersive ; A player’s avatar, standing triumphantly on a virtual mountain peak, with a panoramic view of the game world; medium shot; Gaming; A brightly lit room with a gamer’s headset and controller; cinematic

Characteristic

Shot : A lone figure stands on a mountain peak, arms raised in triumph, as a dramatic sunset paints the sky with vibrant hues. The mountains are shrouded in a misty haze, lending an ethereal quality to the scene.

Aesthetic Score : 0.7

Mood : inspiring, hopeful, majestic

Quality

Entropy : 6.51

Noise : 82

Prompt Clip Score : 0.19

AI Evaluation

Likelihood of AI : 0.80

Image errors : Slight color banding in the sky; some edges of the figure appear a little rough.

Sunset Smiles at the Arc de Triomphe

Capture the joy of a Parisian evening as a group of friends pose in front of the iconic Arc de Triomphe, bathed in the warm glow of a setting sun. The vibrant sky and happy faces create a heartwarming scene, perfect for capturing the spirit of travel and adventure.

Prompt

camera-positions Dutch angle: Romantic, nostalgic, memorable ; A group of tourists, taking photos of a famous landmark, with their faces lit by the warm glow of the setting sun; medium shot; Tourism; A bustling city with iconic architecture and vibrant street life; cinematic

Characteristic

Shot : A group of people stand in front of a large archway, likely the Arc de Triomphe in Paris, while the sun sets behind them. One person is taking a photo of the archway, another is looking at their phone.

Aesthetic Score : 0.6

Mood : peaceful, nostalgic, golden hour

Quality

Entropy : 6.64

Noise : 51

Prompt Clip Score : 0.26

AI Evaluation

Likelihood of AI : 0.20

Image errors : The image appears slightly overexposed, losing detail in the brighter areas. The composition could be strengthened by ensuring that the subject is not cut off at the edges of the frame.
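
The per-image numbers listed for the ten images can be summarized in a few lines. The sketch below hard-codes the Aesthetic Score, Prompt Clip Score, and Likelihood of AI values reported above and averages them. The variable names and the 0.5 cut-off for "flagged" images are illustrative assumptions, not thresholds used by the article.

```python
from statistics import mean

# (title, aesthetic score, prompt CLIP score, likelihood of AI), as reported above
results = [
    ("Silhouetted Against Hope", 0.7, 0.24, 0.20),
    ("Uncharted Territories: A Compass Beckons", 0.7, 0.23, 0.10),
    ("Lost in the Code", 0.6, 0.22, 0.20),
    ("Vibrant Chaos of a Middle Eastern Market", 0.6, 0.23, 0.10),
    ("A Serene Journey Through Majestic Mountains", 0.7, 0.27, 0.30),
    ("Laughter and Hugs", 0.7, 0.23, 0.20),
    ("Silhouetted Against the Storm", 0.7, 0.26, 0.80),
    ("Unveiling the Secrets of a Candlelit Treasure", 0.7, 0.23, 0.60),
    ("Finding Hope in the Golden Hour", 0.7, 0.19, 0.80),
    ("Sunset Smiles at the Arc de Triomphe", 0.6, 0.26, 0.20),
]

summary = {
    "mean_aesthetic": round(mean(r[1] for r in results), 3),
    "mean_clip": round(mean(r[2] for r in results), 3),
    "mean_ai_likelihood": round(mean(r[3] for r in results), 3),
}

# Image whose content matched its prompt least, by CLIP score.
weakest_match = min(results, key=lambda r: r[2])[0]
# Images an evaluator rated as more likely than not to be AI-generated
# (0.5 is an arbitrary cut-off chosen for this example).
flagged = [r[0] for r in results if r[3] > 0.5]

print(summary)
print("weakest prompt match:", weakest_match)
print("flagged as likely AI:", flagged)
```

On these figures the mean aesthetic score is 0.67, the mean prompt CLIP score is 0.236, and the mean likelihood of AI is 0.35. The weakest prompt match is "Finding Hope in the Golden Hour", where the prompt asked for a game avatar and a gamer's room and the image shows neither.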

Conclusion

The results show that the generative AI model handled shot composition well and camera positions only moderately, and that it struggled to achieve the desired aesthetic. Here’s a breakdown:

Camera Position:

  • Score: 0.41
  • Interpretation: This score falls below the “good” range of 0.5 to 0.75, so the model did not reliably capture the camera positions described in the prompts. At 0.09 under the lower bound, it is still closer to “good” than “bad,” indicating partial understanding.

Shot Analysis:

  • Score: 0.57
  • Interpretation: This score falls within the “good” range, indicating that the model successfully translated the prompt’s scene description into a visually coherent shot.

Aesthetic Analysis:

  • Score: 0.06
  • Interpretation: This score is close to zero and was rated well short of ideal, suggesting that the generated images’ aesthetic deviated considerably from the one described in the prompts. The model may have struggled to capture the desired mood, style, or visual elements. Note that 0.06 lies inside the quoted range of -0.2 to 0.1, so that range is probably not the ideal one and should be read with caution.

Overall:

The model demonstrates a good understanding of shot composition and a partial one of camera positions, but needs improvement in capturing the intended aesthetic. This suggests that the model is better at the technical aspects of a scene than the artistic ones.
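
Each interpretation in the breakdown compares a score with a reference range. The sketch below makes that comparison explicit, taking the quoted bounds at face value. The `classify` helper is hypothetical. Applying the 0.5 to 0.75 range to the shot analysis score is an assumption, since the article only says that score is “good”, and how the ranges were derived is not stated.

```python
def classify(score: float, low: float, high: float) -> str:
    """Report where a score sits relative to an inclusive [low, high] range."""
    if score < low:
        return "below range"
    if score > high:
        return "above range"
    return "within range"


# (score, low, high) as quoted in the breakdown
checks = {
    "camera position": (0.41, 0.5, 0.75),
    "shot analysis": (0.57, 0.5, 0.75),  # range assumed, see the note above
    "aesthetic": (0.06, -0.2, 0.1),
}

for name, (score, low, high) in checks.items():
    verdict = classify(score, low, high)
    print(f"{name}: {score} is {verdict} [{low}, {high}]")
```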
