Lightning Strikes: AI's Struggle with Scene and Camera with Flux-schnell
- 9 minutes read - 1823 wordsTable of Contents
In the realm of artificial intelligence, the ability to generate images based on textual descriptions is a fascinating area of exploration. This blog post examines the performance of a generative AI model in interpreting and translating scene descriptions into visual representations. We focus on the concept of ‘dramatic style lightning’ and analyze the model’s ability to capture the intended camera position, shot composition, and aesthetic style. Through this analysis, we gain insights into the strengths and limitations of AI in understanding and translating complex visual concepts.
Created with: flux-schnell
Silhouetted in the Storm
A man stands silhouetted against a window as lightning flashes outside, creating a dramatic and suspenseful scene. The blurry window and contemplative expression add to the melancholy mood.
Prompt
lightning soft-lighting: Melancholy, introspective ; A lone figure, silhouetted against a window; medium-shot; Single Person; A dimly lit room with rain falling outside; cinematic
Characteristic
Shot : A silhouette of a man standing in front of a window with a stormy view outside. The room is dimly lit.
Aesthetic Score : 0.6
Mood : dark, mysterious, contemplative
Quality
Entropy : 5.82
Noise : 66
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some minor artifacts in the image, such as the jagged edges of the window.
Superman Stands Tall Against the Storm
A dramatic image of Superman, bathed in lightning, stands heroically against a stormy cityscape. The lighting and pose create a powerful and heroic mood, emphasizing the superhero’s strength and resilience.
Prompt
lightning soft-lighting: Hopeful, inspiring ; A superhero, standing tall and confident, bathed in warm light; studio; Heroes; A cityscape with a dramatic sky; cinematic
Characteristic
Shot : A superhero standing in a stormy cityscape, lightning strikes in the background
Aesthetic Score : 0.6
Mood : dramatic, powerful, heroic
Quality
Entropy : 6.73
Noise : 68
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.80
Image errors : The lightning is a bit pixelated and the cityscape is blurry
Candlelit Intimacy: A Romantic Moment Caught in Warm Glow
In the soft, warm glow of candlelight, a couple shares an intimate moment. The dim lighting sets a romantic mood, creating an atmosphere of mystery and intrigue. The scene is a testament to the power of simple moments, capturing the essence of love and togetherness.
Prompt
lightning soft-lighting: Intimate, romantic ; A couple sharing a quiet moment, their faces illuminated by candlelight; medium-shot; Normal People; A cozy living room with bookshelves and a fireplace; cinematic
Characteristic
Shot : A couple is sitting in a dimly lit room, looking at each other. There is a candle between them, and the soft light creates a romantic atmosphere.
Aesthetic Score : 0.7
Mood : romantic, intimate, cozy
Quality
Entropy : 5.55
Noise : 51
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly grainy and the lighting is uneven. There are also some minor artifacts in the background. The background is blurry, which might be intentional, but it doesn’t allow for much detail.
Lost in Thought: A Moment of Melancholy in the City Lights
A young woman finds solace in the quiet of a park at night, her gaze fixed on the distant city lights. The dramatic lighting casts an air of mystery and contemplation, capturing a moment of quiet reflection.
Prompt
lightning soft-lighting: Reflective, contemplative ; A young woman, lost in thought, sitting on a park bench; medium-shot; Single Person; A park at dusk with trees and a soft glow from streetlights; cinematic
Characteristic
Shot : A young woman is sitting on a bench in a park at night. She is looking to the right side of the image. There are lights in the background and trees.
Aesthetic Score : 0.6
Mood : melancholy, pensive, quiet
Quality
Entropy : 6.53
Noise : 71
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry, especially the woman’s face.
Campfire Laughter: A Night of Friendship and Warmth
Four young women share laughter and stories around a crackling campfire, their faces illuminated by the warm glow. The scene captures a sense of intimacy, connection, and pure joy.
Prompt
lightning soft-lighting: Joyful, carefree ; A group of friends laughing together, their faces lit by the warm glow of a campfire; medium-shot; Normal People; A forest clearing with a campfire and stars in the sky; cinematic
Characteristic
Shot : Four young women are sitting around a campfire in a forest. They are all smiling and laughing, and the fire is casting a warm glow on their faces.
Aesthetic Score : 0.7
Mood : happy, joyful, warm
Quality
Entropy : 6.17
Noise : 73
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly grainy, and there is some noise in the shadows.
Shadows and Secrets: A Man Works in the Gloom
A solitary figure hunches over paperwork in a dimly lit room, illuminated only by a lamp and neon signs. The atmosphere is heavy with mystery and intrigue, as shadows dance across the walls, hinting at secrets hidden in the darkness.
Prompt
lightning soft-lighting: Mysterious, suspenseful ; A detective, hunched over a desk, illuminated by a single desk lamp; medium-shot; Heroes; A dimly lit office with stacks of files and a flickering neon sign outside; cinematic
Characteristic
Shot : A man sitting at a desk in a dark office working late at night. There is a desk lamp on the desk, and a neon sign on the wall behind him.
Aesthetic Score : 0.3
Mood : dark, mysterious, lonely
Quality
Entropy : 4.91
Noise : 41
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is a bit noisy.
Silhouetted Dreams: A Child’s Pensive Gaze at Sunset
A young child sits by a window, their silhouette framed against a breathtaking sunset. The scene evokes a sense of melancholy and peace, with the child’s pensive gaze hinting at a world of dreams and possibilities. The soft orange and yellow hues of the sunset create a feeling of hope and beauty, leaving a lasting impression of quiet contemplation.
Prompt
lightning soft-lighting: Nostalgic, wistful ; A child, gazing out a window, their face illuminated by the soft light of the setting sun; medium-shot; Single Person; A bedroom with a window overlooking a field; cinematic
Characteristic
Shot : A child sits by a window, looking out at a sunset. The light from the sunset is visible through the window, illuminating the child’s face.
Aesthetic Score : 0.6
Mood : melancholy, pensive, hopeful
Quality
Entropy : 6.05
Noise : 56
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry and has a low resolution.
Passionate Performance Under the Spotlight
A musician pours their heart and soul into their guitar, bathed in dramatic lighting that highlights their every move. The energy is palpable, and the audience is captivated by the raw emotion on display.
Prompt
lightning soft-lighting: Passionate, energetic ; A musician, playing a guitar, bathed in the warm glow of stage lights; studio; Heroes; A concert stage with a cheering crowd; cinematic
Characteristic
Shot : A musician playing guitar on stage, there are other people in the background
Aesthetic Score : 0.7
Mood : energetic, passionate, focused
Quality
Entropy : 6.82
Noise : 61
Prompt Clip Score : 0.19
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible errors in the image.
Intimate Gathering Under Warm Lights
A cozy scene unfolds with a family or group of friends gathered around a dining table in a dimly lit room. The warm, inviting atmosphere is enhanced by the soft lighting, creating a sense of intimacy and comfort.
Prompt
lightning soft-lighting: Warm, inviting ; A family gathered around a dinner table, their faces lit by the warm glow of overhead lights; medium-shot; Normal People; A cozy kitchen with a rustic table and a window overlooking a garden; cinematic
Characteristic
Shot : A family is gathered around a table in a dimly lit dining room. They are eating a meal and engaging in conversation. The room is decorated with a chandelier and warm light.
Aesthetic Score : 0.6
Mood : warm, inviting, cozy
Quality
Entropy : 6.70
Noise : 89
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors in the image.
The Scientist’s Focus: A Moment of Intense Concentration
A dimly lit laboratory, filled with scientific equipment, is the setting for this image. A man in a lab coat, his face illuminated by the glow of a computer screen, is deeply engrossed in his work. The mood is serious and focused, highlighting the dedication and professionalism of scientific research.
Prompt
lightning soft-lighting: Intriguing, thought-provoking ; A scientist, working in a laboratory, illuminated by the soft glow of a monitor; medium-shot; Heroes; A laboratory with beakers, test tubes, and complex machinery; cinematic
Characteristic
Shot : A man in a lab coat is working at a computer in a laboratory setting. The room is dimly lit with the main light source coming from the computer screen.
Aesthetic Score : 0.6
Mood : focused, professional, serious
Quality
Entropy : 6.72
Noise : 80
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is some noticeable noise and grain in the image, particularly in the shadows. There is also some chromatic aberration visible along the edges of the subject.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.17, which is below the “good” range of 0.5 to 0.75. This indicates that the model didn’t accurately capture the intended camera position in the prompt.
- Shot Analysis: The model scored 0.485, which is also below the “good” range. This suggests that the model didn’t fully understand the scene described in the prompt and didn’t create the expected shot composition.
- Aesthetic Analysis: The model scored 0.19, which is within the “very good” range of -0.2 to 0.1. This means the generated image’s aesthetic was quite close to the expected aesthetic described in the prompt.
Overall, the model seems to be better at understanding the aesthetic style than the scene and camera position. It might need further training to improve its ability to accurately interpret and translate camera positions and scene descriptions into images.