AI's Artistic Struggle: Capturing the Essence of Dramatic Style with Flux-pro
- 10 minutes read - 1957 wordsTable of Contents
The dramatic aesthetic, characterized by its use of strong contrasts, intense emotions, and captivating visuals, is a powerful tool in storytelling and visual art. It evokes a sense of awe, suspense, and wonder, drawing viewers into the heart of the narrative. But can AI truly capture the essence of this aesthetic? In this blog post, we explore the challenges and successes of using AI to generate images with a dramatic style. We analyze a case study where an AI model was tasked with creating images based on specific scenes and aesthetics, revealing both the model’s strengths and weaknesses in capturing the desired mood and visual elements. By examining these results, we gain insights into the current capabilities of AI in artistic expression and explore potential solutions for improving its understanding of dramatic aesthetics.
Created with: flux-pro
Silhouetted Against the Sunset, a Lone Cowboy Awaits
A solitary figure, a cowboy in a hat, stands silhouetted against a vibrant orange sunset. The dramatic lighting and his poised stance with a gun hint at a tense and mysterious situation, evoking the classic tropes of the Wild West.
Prompt
French New Wave: epic, melancholic ; A lone figure, silhouetted against a setting sun; long shot; heroism; a vast, empty desert landscape; cinematic
Characteristic
Shot : A lone cowboy stands silhouetted against a setting sun in a vast desert landscape.
Aesthetic Score : 0.7
Mood : dramatic, solitary, nostalgic
Quality
Entropy : 6.04
Noise : 81
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant image errors, but the background could be sharper.
The Red Line: A Journey into the Unknown
A hand, shrouded in mystery, points towards a map marked with a crimson line. What secrets lie hidden within this historical journey? Prepare for a suspenseful adventure as you unravel the truth behind the red line.
Prompt
French New Wave: intriguing, suspenseful ; A close-up of a weathered map, with a finger tracing a route; medium shot; adventure; a cluttered, dimly lit room; cinematic
Characteristic
Shot : A hand points at a map, the map shows a geographic area with red lines, the scene is dark and mysterious
Aesthetic Score : 0.6
Mood : mysterious, suspenseful, intriguing
Quality
Entropy : 6.79
Noise : 75
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some minor artifacts, especially in the dark areas.
The Hand That Holds the Past
A lone hand rests on a joystick, bathed in the soft glow of an arcade cabinet. The air hums with the nostalgic energy of a bygone era, as blurred lights and sounds of classic games paint a scene of playful anticipation. This image captures the essence of a forgotten world, where the thrill of the game was all that mattered.
Prompt
French New Wave: intense, energetic ; A hand holding a joystick, fingers moving rapidly; close-up; gaming; a neon-lit arcade with flashing screens; cinematic
Characteristic
Shot : A person’s hand is about to press a button on a game machine. The person is in the foreground and the background is blurry.
Aesthetic Score : 0.5
Mood : intense, playful, focused
Quality
Entropy : 6.80
Noise : 53
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image has some noise and blur, particularly in the background. The color balance also appears slightly off.
Parisian Dreams: A Moment of Hope and Romance
A woman stands before the iconic Eiffel Tower, bathed in the warm glow of the setting sun. Her gaze is directed upwards, capturing a sense of wistful longing and hopeful anticipation. The dramatic lighting and the grandeur of the Parisian landmark create a romantic and unforgettable scene.
Prompt
French New Wave: romantic, nostalgic ; A young woman, her face filled with wonder, gazing at the Eiffel Tower; medium shot; tourism; a bustling Parisian street; cinematic
Characteristic
Shot : A young woman is gazing upwards, likely at the Eiffel Tower in the background. The image is taken from a low angle and the subject appears to be in a thoughtful or dreamy mood.
Aesthetic Score : 0.8
Mood : dreamy, romantic, nostalgic
Quality
Entropy : 6.75
Noise : 71
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image appears to have been edited to create a softer, more romantic tone. The lighting and colors are slightly unnatural, but this could be intentional.
Golden Fields and Tranquil Skies: A Journey Through Time
A peaceful train ride through a rural landscape evokes a sense of nostalgia and tranquility. The golden wheat fields, line of trees, and blue sky with white clouds create a picturesque scene, while the train’s movement adds a touch of dynamism to the image.
Prompt
French New Wave: reflective, contemplative ; A train speeding through a countryside landscape, with a lone figure looking out the window; long shot; travel; a vibrant, sun-drenched field; cinematic
Characteristic
Shot : A train is moving along a track through a field of wheat, with a view of the countryside in the background
Aesthetic Score : 0.6
Mood : calm, nostalgic, tranquil
Quality
Entropy : 6.83
Noise : 105
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some graininess and noise, especially in the sky and the background. There are some artifacts in the train window reflection.
Family Gathering: A Moment of Warmth and Togetherness
A heartwarming scene of a family sharing a meal, bathed in soft lighting that creates a sense of intimacy and cozy comfort. The relaxed atmosphere and warm colors evoke feelings of love and connection.
Prompt
French New Wave: intimate, heartwarming ; A family gathered around a table, sharing a meal, with laughter and conversation; medium shot; family; a warm, inviting kitchen; cinematic
Characteristic
Shot : A family is gathered around a dining table, enjoying a meal. There are plates of food, drinks, and a warm, inviting atmosphere.
Aesthetic Score : 0.7
Mood : cozy, intimate, happy
Quality
Entropy : 6.68
Noise : 88
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is some slight noise in the image, particularly in the shadows.
Lost in the Chaos: A Man Races Through a Bustling Market
A sense of urgency fills the air as a man sprints through a vibrant, crowded street market. The scene, possibly in India or South Asia, is a whirlwind of activity, with the man’s determined stride adding to the dramatic intensity.
Prompt
French New Wave: urgent, dramatic ; A young man, his face etched with determination, running through a crowded marketplace; medium shot; heroism; a chaotic, bustling market; cinematic
Characteristic
Shot : A young man is running through a crowded market street, his shirt is open and his chest is bare, he seems to be in a hurry and a bit agitated.
Aesthetic Score : 0.7
Mood : intense, urgent, adventurous
Quality
Entropy : 6.51
Noise : 78
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some minor artifacts and noise are visible in the background, especially around the edges of the buildings.
A Compass Points North: Embracing Adventure and Nostalgia
A vintage compass, its golden needle pointing true north, rests on a weathered wooden surface. The shallow depth of field draws your eye to the compass, emphasizing its symbolic significance and evoking a sense of adventure and rustic charm.
Prompt
French New Wave: mysterious, suspenseful ; A close-up of a compass needle spinning, pointing towards an unknown destination; close-up; adventure; a dimly lit, mysterious room; cinematic
Characteristic
Shot : A close-up of an old, brass compass with a single needle pointing towards the north. The compass is sitting on a wooden surface and the background is blurred.
Aesthetic Score : 0.8
Mood : mysterious, vintage, timeless
Quality
Entropy : 6.66
Noise : 68
Prompt Clip Score : 0.20
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some slight graininess and there is a slight blur to the edges of the compass.
The Glow of Innovation: A Team United in Focus
Four young minds huddle around a computer screen, their faces illuminated by the glow of the monitor. The intensity of their focus and the collaborative energy in the room suggest a moment of breakthrough and shared excitement. This image captures the essence of innovation, where ideas take shape and dreams are realized.
Prompt
French New Wave: intense, focused ; A group of friends huddled around a computer screen, their faces illuminated by the glow; medium shot; gaming; a dimly lit, cluttered room; cinematic
Characteristic
Shot : A group of young people are gathered around a computer, likely playing a game or working on a project. The image is captured in a dimly lit room, with warm lighting emanating from the computer screen.
Aesthetic Score : 0.6
Mood : focused, concentrated, intense
Quality
Entropy : 6.74
Noise : 69
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is some slight noise in the image, particularly in the shadows.
Silhouettes of Love at Sunset
A romantic stroll down a cobblestone street, bathed in the warm glow of a setting sun. The couple, silhouetted against the vibrant orange sky, exudes a sense of peace and nostalgia. This captivating scene evokes a feeling of timeless love and the beauty of shared moments.
Prompt
French New Wave: romantic, nostalgic ; A couple walking hand-in-hand along a cobblestone street, their silhouettes framed by the setting sun; long shot; tourism; a romantic, picturesque town; cinematic
Characteristic
Shot : A couple walks hand-in-hand down a cobblestone street in the evening, with a warm sunset behind them.
Aesthetic Score : 0.7
Mood : romantic, warm, nostalgic
Quality
Entropy : 6.59
Noise : 104
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant image errors.
Conclusion
The results indicate that the generative AI model performed well in understanding and executing camera positions and shot composition, but struggled with achieving the desired aesthetic. Here’s a breakdown:
- Camera Position: The model scored a 0.42, which falls slightly below the “good” range of 0.5 to 0.75. This suggests that while the model generally understood the camera positions described in the prompt, there were some discrepancies between the intended and actual camera angles in the generated image.
- Shot Analysis: The model scored a 0.58, placing it within the “good” range. This indicates that the model was able to successfully translate the shot descriptions in the prompt into the generated image, demonstrating a good understanding of scene composition.
- Aesthetic Analysis: The model scored a 0.07, which is significantly lower than the “very good” range of -0.2 to 0.1. This suggests that the generated image’s aesthetic deviated considerably from the expected aesthetic described in the prompt. The model may have struggled to capture the desired mood, style, or visual elements.
Overall, the model shows promise in understanding and executing camera positions and shot composition, but needs improvement in achieving the desired aesthetic.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://fal.ai/models/fal-ai/flux-pro/api