Lightning Strikes: AI's Struggle with Camera Shots with Flux-dev
- 9 minutes read - 1887 wordsTable of Contents
In the realm of artificial intelligence, image generation has become a captivating field. AI models are now capable of creating stunning visuals, often mimicking the styles of renowned photographers and artists. However, as with any emerging technology, there are areas where AI still needs to improve. One such area is the ability to accurately interpret and implement camera positions and shot types. This blog post explores the results of an experiment that sheds light on this fascinating aspect of AI image generation. We’ll delve into the data, analyze the strengths and weaknesses of the AI model, and discuss the implications for the future of this exciting technology.
Created with: flux-dev
Silhouetted in Melancholy: A Man’s Contemplative Moment
A solitary figure stands in silhouette, bathed in the warm glow of a window. The contrast between light and dark evokes a sense of melancholy and introspection, highlighting the man’s isolation and contemplative mood.
Prompt
lightning motivated-lighting: Melancholy, introspective ; A lone figure, silhouetted against a window; medium-shot; Single Person; A dimly lit room with a single window letting in the golden light of sunset.; cinematic
Characteristic
Shot : Silhouette of a man standing in a dark room, facing a window with a bright sunset outside.
Aesthetic Score : 0.7
Mood : melancholy, contemplative, lonely
Quality
Entropy : 5.01
Noise : 27
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant errors.
A Solitary Figure in the Shadows
A single shaft of light illuminates a cloaked figure standing in the doorway of a derelict building. The dramatic use of light and shadow creates a sense of mystery and hope, leaving the viewer to wonder about the figure’s story.
Prompt
lightning motivated-lighting: Epic, heroic, dramatic ; A superhero standing in a beam of light emanating from a shattered window; medium-shot; Hero; A dark, smoke-filled room with debris scattered around.; cinematic
Characteristic
Shot : A lone figure in a cape stands in a shadowy room with a large window in the background. Light pours in through the window, creating a dramatic silhouette.
Aesthetic Score : 0.6
Mood : mysterious, dark, dramatic
Quality
Entropy : 6.37
Noise : 63
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some minor artifacts are present, especially in the shadows and around the edges of the figure. The lighting appears slightly uneven.
Intimate Moments: A Cozy Cafe Retreat
Experience the warmth and intimacy of a dimly lit cafe, where a woman finds solace in the soft glow of candlelight as she delves into her favorite book. The cozy atmosphere invites you to unwind and lose yourself in quiet contemplation.
Prompt
lightning motivated-lighting: Peaceful, intimate, cozy ; A young woman reading a book in a cozy cafe; medium-shot; Normal People; A warm, inviting cafe with soft lighting from lamps and candles.; cinematic
Characteristic
Shot : A woman is sitting at a table in a dimly lit cafe, reading a book. There is a lit candle on the table in front of her.
Aesthetic Score : 0.7
Mood : cozy, intimate, introspective
Quality
Entropy : 6.53
Noise : 69
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some slight noise visible in the shadows. The image appears slightly underexposed.
Lost in the Glow: A Hacker’s Haven
A solitary figure hunches over a computer screen, bathed in the crimson glow of a neon sign. The dimly lit room whispers of secrets and late-night coding, creating an atmosphere of focused intensity and enigmatic allure.
Prompt
lightning motivated-lighting: Intense, focused, mysterious ; A detective hunched over a desk, illuminated by a single desk lamp; medium-shot; Hero; A dimly lit office with stacks of files and a flickering neon sign outside the window.; cinematic
Characteristic
Shot : A person is working on a computer in a dimly lit room with a red neon sign in the background. The person is wearing a black hoodie and has their back to the camera.
Aesthetic Score : 0.6
Mood : dark, mysterious, focused
Quality
Entropy : 6.11
Noise : 51
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible errors in the image.
Sun-Kissed Joy: A Moment of Childhood Bliss
A young girl with golden hair radiates pure joy as she plays outdoors, bathed in warm sunlight. The shallow depth of field and soft lighting create a dreamy, nostalgic atmosphere, capturing the innocence and carefree spirit of childhood.
Prompt
lightning motivated-lighting: Joyful, carefree, innocent ; A child playing in a brightly lit playground; medium-shot; Normal People; A sunny afternoon with the playground bathed in warm sunlight.; cinematic
Characteristic
Shot : A young girl with blonde hair is running towards the camera in a park or outdoor area. The sun is shining brightly, creating a warm glow. There are other people in the background, but the focus is on the girl.
Aesthetic Score : 0.7
Mood : joyful, carefree, sunny
Quality
Entropy : 6.25
Noise : 66
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has a slight overexposure, which results in some loss of detail in the girl’s face. The background is a bit out of focus.
Lost in the Fog: A Solitary Figure Walks a Deserted Street
A lone figure walks through the mist-shrouded streets, their silhouette a stark contrast against the dim glow of streetlights. The fog adds an air of mystery and loneliness, creating a melancholic scene that evokes feelings of isolation and introspection.
Prompt
lightning motivated-lighting: Lonely, atmospheric, suspenseful ; A lone figure walking down a dark, rainy street, illuminated by a streetlamp; medium-shot; Single Person; A deserted street with puddles reflecting the dim light of the streetlamp.; cinematic
Characteristic
Shot : A lone figure walks down a fog-shrouded street lit by streetlamps
Aesthetic Score : 0.7
Mood : mysterious, lonely, atmospheric
Quality
Entropy : 6.52
Noise : 67
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : Slight noise and grain in the image.
Unveiling the Secrets: A Focused Scientist in a Dimly Lit Lab
A man in a lab coat, bathed in the soft glow of a computer screen, works intently in a dimly lit laboratory. The low lighting and his focused expression create an atmosphere of intensity and mystery, hinting at the secrets being uncovered within the lab’s walls.
Prompt
lightning motivated-lighting: Intriguing, futuristic, focused ; A scientist working in a laboratory, illuminated by the glow of a computer screen; medium-shot; Hero; A sterile, high-tech laboratory with flashing lights and complex machinery.; cinematic
Characteristic
Shot : A man in a lab coat is working on a computer in a laboratory setting. The room is lit with blue light, creating a cool and sterile atmosphere.
Aesthetic Score : 0.6
Mood : professional, focused, serious
Quality
Entropy : 6.81
Noise : 79
Prompt Clip Score : 0.19
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is a little too dark, and the resolution is a little low. The man’s hand in the foreground looks a little bit blurry, and the reflections on the screen are slightly distracting.
Candlelit Intimacy: A Romantic Moment Captured
In the heart of a dimly lit restaurant, a couple shares a moment of intimacy and romance. Illuminated by the soft glow of candlelight, their faces reveal a story of love and connection. The scene, with its warm hues and cozy atmosphere, exudes a sense of mystery and closeness, inviting viewers into their private world.
Prompt
lightning motivated-lighting: Romantic, intimate, sensual ; A couple sharing a romantic dinner, illuminated by candlelight; medium-shot; Normal People; A cozy restaurant with soft lighting and a warm, inviting atmosphere.; cinematic
Characteristic
Shot : A couple is sitting at a table with candles, likely on a date. They are looking at each other romantically.
Aesthetic Score : 0.7
Mood : romantic, intimate, warm
Quality
Entropy : 6.33
Noise : 56
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some minor artifacts in the image, particularly around the edges.
Silhouetted Musician Under Spotlight: A Moment of Dramatic Anticipation
A captivating silhouette of a musician holding a guitar on stage, bathed in dramatic spotlights. The mysterious and anticipatory mood is heightened by the use of silhouette and light, creating a powerful visual effect.
Prompt
lightning motivated-lighting: Dramatic, powerful, captivating ; A musician performing on stage, bathed in the spotlight; studio; Hero; A dark stage with a single spotlight illuminating the musician.; cinematic
Characteristic
Shot : A single spotlight shines on a lone musician holding an electric guitar, standing on a dark stage with a hazy blue atmosphere
Aesthetic Score : 0.6
Mood : mysterious, dramatic, introspective
Quality
Entropy : 6.18
Noise : 23
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant image errors, however, the image has a slight digital feel, potentially suggesting it was slightly edited.
Silhouettes of Solitude: A Moment of Contemplation in the Night
A lone woman finds solace on a park bench under the soft glow of a street lamp. The darkness envelops her, creating a sense of isolation and mystery. The image captures a moment of quiet contemplation, bathed in the dramatic play of light and shadow.
Prompt
lightning motivated-lighting: Peaceful, contemplative, nostalgic ; A young woman sitting on a park bench, illuminated by the warm glow of a streetlamp; medium-shot; Single Person; A quiet park at night with the streetlamp casting a soft light on the bench.; cinematic
Characteristic
Shot : A lone woman sitting on a bench under a streetlight at night
Aesthetic Score : 0.6
Mood : melancholy, solitude, contemplative
Quality
Entropy : 6.29
Noise : 75
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some noise and grain in the image, especially in the shadows.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.2, which is considered poor. This indicates a significant difference between the intended camera position in the prompt and the actual camera position in the generated image.
- Shot Analysis: The model scored 0.47, which is considered okay. This suggests that the model was able to understand the scene in the prompt to some extent, but there were still discrepancies between the intended shot and the generated image.
- Aesthetic Analysis: The model scored 0.1, which is considered very good. This means the generated image closely matched the expected aesthetic, indicating the model’s ability to capture the desired visual style.
Overall, the model seems to be better at understanding the aesthetic aspects of the prompt than the camera position and shot composition. This suggests that the model might need further training to improve its ability to accurately interpret and implement camera positions and shot types.