Capturing the Essence of Dramatic Style: A Generative AI Experiment with Flux-dev
- 9 minutes read - 1862 wordsTable of Contents
The dramatic aesthetic, characterized by its use of strong contrasts, dramatic lighting, and evocative imagery, is a powerful tool in visual storytelling. It’s often used to create a sense of tension, mystery, or grandeur. However, capturing this aesthetic through AI image generation presents unique challenges. This blog post explores the results of an experiment that tested AI’s ability to understand and generate images with a dramatic style. We’ll analyze the strengths and weaknesses of the AI model, focusing on its ability to interpret camera position, shot composition, and the overall mood of a scene. By examining these results, we can gain valuable insights into the potential and limitations of AI in capturing the essence of dramatic storytelling through visuals.
Created with: flux-dev
A Suitcase Full of Memories
A vintage suitcase sits alone on a bustling train platform, its worn leather hinting at journeys taken and stories untold. The blurry background of passengers and a departing train adds to the sense of melancholy and nostalgia, leaving the viewer to wonder about the destination and the secrets held within.
Prompt
style-aesthetic Neo-realist: Nostalgic, bittersweet ; A worn suitcase, sitting on a train platform, with a single ticket protruding from it; medium shot; Travel; A bustling train station with people rushing to and fro; cinematic
Characteristic
Shot : A vintage suitcase with a ticket on top sits in a train station platform. The train is visible in the background but blurry, and other people are out of focus.
Aesthetic Score : 0.5
Mood : nostalgic, quiet, waiting
Quality
Entropy : 6.76
Noise : 71
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly blurry, especially in the background. Some minor noise is visible in the darker areas of the image.
Red Light, Ready to Play: A Gamer’s Focus Under the Glow
A hand grips a joystick, bathed in the crimson glow of a computer screen. The scene is dark, techy, and intense, hinting at a moment of focused gameplay. A can of soda in the background adds a touch of casualness to the otherwise dramatic image.
Prompt
style-aesthetic Neo-realist: Intense, focused ; A pair of hands, gripping a joystick, sweat dripping onto the buttons; close-up; Gaming; A dimly lit room with flickering monitor light, surrounded by empty pizza boxes and soda cans; cinematic
Characteristic
Shot : A person’s hands are shown, holding a gaming controller and pressing keys on a keyboard, a can of soda is visible, and there are two computer monitors in the background, glowing red.
Aesthetic Score : 0.5
Mood : intense, focused, cyberpunk
Quality
Entropy : 6.36
Noise : 53
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.30
Image errors : There are no visible errors in the image
Serene Streetscape with a Touch of Mystery
A contemplative mood fills this narrow street, where people stroll past historic buildings. The towering structure in the distance adds a sense of grandeur and intrigue, inviting you to explore further.
Prompt
style-aesthetic Neo-realist: Awe-inspiring, curious ; A group of tourists, huddled together, looking up at a towering ancient monument; medium shot; Tourism; A bustling marketplace with vendors selling souvenirs and local crafts; cinematic
Characteristic
Shot : A group of people walk through a narrow street lined with old buildings. The buildings are tall and have a distinctive architectural style. In the distance, a large structure with a pointed roof can be seen. The sky is a pale blue, and the sun is shining.
Aesthetic Score : 0.6
Mood : mysterious, ancient, historical
Quality
Entropy : 6.80
Noise : 83
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no obvious image errors, however, the colors are muted and the overall image is slightly blurry.
Campfire Tales Under the Milky Way
A group of friends gather around a crackling campfire, sharing stories and laughter under a breathtaking starry sky. The warmth of the fire and the camaraderie of their bond create a cozy and adventurous atmosphere, perfect for a night under the stars.
Prompt
style-aesthetic Neo-realist: Joyful, camaraderie ; A group of friends, huddled together around a campfire, sharing stories and laughter; medium shot; Adventure; A campsite under a starry night sky with a crackling fire; cinematic
Characteristic
Shot : A group of friends enjoying a campfire under a starry night sky.
Aesthetic Score : 0.7
Mood : cozy, warm, friendly
Quality
Entropy : 6.49
Noise : 86
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some noise and grain. The lighting is uneven, making some parts of the image darker than others.
Lost in the Shadows: A Lonely Figure Walks the Deserted Street
A solitary figure, shrouded in the darkness of a deserted street, carries a suitcase, their journey shrouded in mystery. The melancholic mood is amplified by the long shadows cast by the streetlights, creating a sense of isolation and introspection.
Prompt
style-aesthetic Neo-realist: Lonely, introspective ; A lone figure, walking down a deserted street, with a suitcase in hand; long shot; Travel; A city street at night with flickering streetlights and empty sidewalks; cinematic
Characteristic
Shot : A lone figure walks down a deserted city street at night, carrying a suitcase. The street is lined with buildings, and streetlights illuminate the scene.
Aesthetic Score : 0.6
Mood : melancholy, lonely, mysterious
Quality
Entropy : 6.62
Noise : 94
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no major artifacts or errors in the image.
A Touch of Time: A Vintage Photograph Evokes Nostalgia
This image captures the essence of nostalgia, with a hand holding a faded photograph of a young woman. The blurred background of a dimly lit room adds to the sense of time and memory, creating a warm and sentimental mood.
Prompt
style-aesthetic Neo-realist: Nostalgic, sentimental ; A weathered hand, holding a worn photograph, with a faded smile on the face in the picture; close-up; Family; A dimly lit room with a single lamp casting a warm glow; cinematic
Characteristic
Shot : A hand holding a faded photograph of a young woman in a vintage setting
Aesthetic Score : 0.7
Mood : nostalgic, warm, sentimental
Quality
Entropy : 6.33
Noise : 53
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some slight artifacts around the edges of the photo
A Journey Beckons: Cozy Mystery in Candlelight
A weathered map, an ancient compass, and flickering candles create a scene of nostalgic intrigue. The warm glow casts long shadows, inviting you to unravel the secrets hidden within this cozy, mysterious setting.
Prompt
style-aesthetic Neo-realist: Intriguing, mysterious ; A weathered map, spread out on a rickety wooden table, with a worn compass resting on it; close-up; Adventure; A dimly lit room with flickering candlelight; cinematic
Characteristic
Shot : A compass lies on a vintage map, surrounded by candles and a book, creating a warm, inviting atmosphere.
Aesthetic Score : 0.7
Mood : cozy, nostalgic, adventurous
Quality
Entropy : 6.80
Noise : 67
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable errors
A Boy’s Longing Gaze Through the Train Window
A young boy, lost in contemplation, watches a passing train from the window of his own. The forest scenery and the boy’s wistful expression evoke a sense of nostalgia and anticipation, hinting at a journey both physical and emotional.
Prompt
style-aesthetic Neo-realist: Wonder, anticipation ; A young boy, gazing out of a train window, watching the world go by; close-up; Adventure; A train speeding through a rural landscape with fields and forests; cinematic
Characteristic
Shot : A young boy is looking out the window of a train, watching a train pass by in the distance.
Aesthetic Score : 0.7
Mood : melancholy, thoughtful, nostalgic
Quality
Entropy : 6.65
Noise : 73
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is a slight blur on the train and the background, possibly from motion blur, but it does not affect the overall aesthetic of the image.
A Solitary Figure in a City of Smoke
A lone figure stands on a rooftop, silhouetted against a sunrise over a hazy, industrial cityscape. The scene evokes a sense of melancholy and solitude, with the figure appearing vulnerable against the backdrop of pollution and decay.
Prompt
style-aesthetic Neo-realist: Melancholy, yet hopeful ; A lone figure, silhouetted against the setting sun, standing atop a crumbling building; long shot; Heroism; A cityscape with smoke rising from factories in the distance; cinematic
Characteristic
Shot : A lone figure stands on a rooftop silhouetted against a bright orange sunrise, with smoke rising from nearby industrial buildings in the background.
Aesthetic Score : 0.6
Mood : melancholy, solitude, industrial
Quality
Entropy : 6.37
Noise : 39
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly grainy and has some noise, particularly in the background.
A Dinner of Secrets: Intimacy and Mystery at the Table
Four figures gather around a dimly lit table, their faces shrouded in shadow. The muted colors and somber atmosphere suggest a shared history and unspoken emotions, hinting at a deeper story unfolding beneath the surface.
Prompt
style-aesthetic Neo-realist: Warm, intimate ; A family gathered around a worn dining table, sharing a simple meal; medium shot; Family; A dimly lit kitchen with faded wallpaper and chipped paint; cinematic
Characteristic
Shot : A group of four people, three adults and one teenager, are sitting at a dining table. They are eating food and talking. The room is dimly lit and there is a window in the background.
Aesthetic Score : 0.6
Mood : cozy, intimate, relaxed
Quality
Entropy : 6.35
Noise : 63
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some of the lighting and shadows are a little unnatural.
Conclusion
The generative AI model performed well in terms of understanding the scene and camera position, but struggled with the aesthetic. Here’s a breakdown:
- Camera Position: The model scored 0.4, which is considered below average. This suggests that the model didn’t accurately translate the camera position described in the prompt into the generated image.
- Shot Analysis: The model scored 0.565, which is considered good. This indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.15, which is considered okay. This means that the generated image’s aesthetic was somewhat different from the expected aesthetic based on the prompt.
Overall, the model shows promise in understanding the scene and shot composition, but needs improvement in accurately capturing the intended camera position and aesthetic.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://fal.ai/models/fal-ai/flux/dev/api