AI Struggles to Capture Dramatic Style in Images with Imagen-v3-fast
The ‘dramatic style’ is a powerful tool in visual storytelling, evoking strong emotions and immersing viewers in a world of heightened tension and intrigue. It often relies on high-contrast lighting, contrasting colors, and dynamic camera angles to create a sense of grandeur and impact. Replicating this style in AI-generated images is challenging, because the model must not only render the technical aspects of the scene but also capture the intended emotional impact.
Created with: imagen-v3-fast
A Knight’s Hope Amidst the Storm
A solitary knight, cloaked in darkness, stands on a windswept cliff, gazing out at a tempestuous sea. The dramatic lighting, with a sliver of light piercing the storm clouds, evokes a sense of melancholy, hope, and the overwhelming power of nature.
Prompt
dramatic-styles Color Contrast: Epic, determined ; A lone hero standing on a clifftop; wide shot; heroism; a vast, stormy sea with dark, ominous clouds; cinematic
Characteristic
Shot : A lone figure, a knight clad in a dark cloak, stands on a rocky cliff overlooking a vast, stormy sea. The sky is dark with storm clouds, but a sliver of light breaks through on the horizon.
Aesthetic Score : 0.6
Mood : melancholy, dramatic, hopeful
Quality
Entropy : 6.93
Noise : 88
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.80
Image errors : The sea looks a bit artificial, with repetitive textures. The lighting seems a bit too harsh and the clouds lack natural detail. The figure is a bit stiff and lacks realistic proportions.
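The article does not say how the Entropy figure is computed. One common definition, assumed here, is the Shannon entropy of the 8-bit grayscale histogram, which ranges from 0 bits (a completely flat image) to 8 bits (all 256 gray levels equally likely). Values such as 6.93 would fit that range. A minimal sketch under that assumption, using plain lists of pixel values instead of a real image file:

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a sequence of gray-level values.

    Assumption: this mirrors a typical histogram-based image entropy;
    the article does not state which formula it actually used.
    """
    total = len(pixels)
    if total == 0:
        raise ValueError("pixels must not be empty")
    counts = Counter(pixels)
    # H = sum over gray levels of p * log2(1 / p)
    return sum((c / total) * math.log2(total / c) for c in counts.values())


# Hypothetical toy inputs, not data from the article.
flat = [128] * 1024            # one gray level only
ramp = list(range(256)) * 4    # every gray level equally often

print(shannon_entropy(flat))
print(shannon_entropy(ramp))
```

Under this definition, a higher value simply means the gray levels are spread more evenly across the histogram; it says nothing by itself about whether an image looks good.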
Shadows Dance in the Ancient Corridor
Three figures stand bathed in a single shaft of light, their forms stark against the dark, intricately carved stone walls of an ancient corridor. The atmosphere is thick with mystery and suspense, as long shadows stretch across the floor, hinting at secrets hidden within the darkness.
Prompt
dramatic-styles Color Contrast: Mysterious, adventurous ; A group of adventurers exploring a dark, ancient temple; medium shot; adventure; glowing, mystical murals on the walls; cinematic
Characteristic
Shot : Three figures stand in a dark, ancient corridor with stone walls and intricate carvings. A shaft of light illuminates the characters from above, casting long shadows on the floor.
Aesthetic Score : 0.7
Mood : mysterious, ominous, suspenseful
Quality
Entropy : 6.51
Noise : 79
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to be generated by AI and has some minor errors in texture and lighting. The figures are slightly pixelated and the shadows are not perfectly aligned.
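The Prompt Clip Score is not defined in the article either. CLIP-style scores are usually the cosine similarity between an image embedding and a text embedding, so the sketch below assumes that definition. The embedding vectors are hypothetical placeholders; a real pipeline would obtain them from a CLIP model.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("vectors must be non-zero")
    return dot / (norm_a * norm_b)


# Hypothetical 3-dimensional embeddings, for illustration only.
image_embedding = [0.2, 0.1, 0.4]
prompt_embedding = [0.1, 0.3, 0.2]

print(round(cosine_similarity(image_embedding, prompt_embedding), 2))
```

If the scores here are computed this way, they measure how closely the image matches the wording of the prompt, not how convincing the dramatic style is.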
The Rhythm of Code: Hands Fly Across a Backlit Keyboard
A close-up shot captures the focused energy of a coder, their hands dancing across a backlit keyboard. Colorful reflections from the computer screen paint the scene with a digital glow, hinting at the creative process unfolding before our eyes.
Prompt
dramatic-styles Color Contrast: Intense, focused ; A gamer’s hands on a keyboard, illuminated by the vibrant colors of a futuristic cityscape on the screen; close-up; gaming; a dark, shadowy room; cinematic
Characteristic
Shot : A person’s hands are typing on a backlit keyboard in front of a computer screen with colorful lights reflecting on the surface.
Aesthetic Score : 0.6
Mood : focused, techy, digital
Quality
Entropy : 6.33
Noise : 32
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is a slight blur in the background, which could be due to motion blur or a shallow depth of field.
A Solitary Figure Contemplates the Vastness of Nature
A single figure stands amidst a breathtaking landscape, their gaze fixed on a majestic mountain range. The dramatic sky, filled with large, puffy clouds, adds to the sense of awe and wonder. This serene scene evokes feelings of contemplation and hope, inviting viewers to connect with the beauty and vastness of the natural world.
Prompt
dramatic-styles Color Contrast: Awe-inspiring, peaceful ; A lone traveler standing in front of a majestic mountain range; long shot; tourism; a bright, clear blue sky with fluffy white clouds; cinematic
Characteristic
Shot : A solitary figure stands in a vast landscape, gazing at a mountain range under a dramatic sky with large, puffy clouds.
Aesthetic Score : 0.8
Mood : serene, contemplative, hopeful
Quality
Entropy : 6.87
Noise : 74
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image appears to be rendered with a slightly stylized and artificial look, likely due to being AI-generated. The edges of the clouds are very smooth and the color palette is quite uniform.
Golden Sunset Picnic: A Family’s Moment of Joy
A heartwarming scene of a family enjoying a picnic on the beach at sunset. The warm glow of the setting sun bathes the scene in a romantic and peaceful atmosphere, capturing a moment of pure happiness.
Prompt
dramatic-styles Color Contrast: Happy, nostalgic ; A family enjoying a sunset picnic on a beach; medium shot; travel; the warm, golden light of the setting sun reflecting on the water; cinematic
Characteristic
Shot : A family of three is having a picnic on the beach at sunset. The parents are sitting with their backs to the camera, and their daughter is sitting in front of them, looking at the camera.
Aesthetic Score : 0.7
Mood : happy, romantic, peaceful
Quality
Entropy : 6.53
Noise : 57
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly underexposed, and the colors are a bit muted. There is some noise in the background.
Superhero Soars Above the Storm
A powerful superhero takes flight against a backdrop of a dramatic sunset and lightning, capturing the essence of epic heroism and hope.
Prompt
dramatic-styles Color Contrast: Powerful, inspiring ; A superhero soaring through the air, silhouetted against a bright, colorful cityscape; wide shot; heroism; a dark, stormy sky with lightning strikes; cinematic
Characteristic
Shot : A superhero in flight above a city skyline, with a dramatic sunset and lightning in the background
Aesthetic Score : 0.7
Mood : epic, heroic, hopeful
Quality
Entropy : 6.81
Noise : 65
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.80
Image errors : The lightning in the background appears slightly artificial, and the city skyline lacks detail.
Into the Green Unknown: A Journey to the Waterfall
Two figures venture deep into a lush jungle, their path illuminated by the ethereal glow of a cascading waterfall. The scene evokes a sense of mystery, tranquility, and adventure, as the light plays on the verdant foliage and the figures disappear into the unknown.
Prompt
dramatic-styles Color Contrast: Dangerous, exciting ; A group of explorers navigating a dense, dark jungle; medium shot; adventure; a bright, sunlit clearing with a waterfall in the distance; cinematic
Characteristic
Shot : Two figures walk towards a waterfall in a lush, green jungle. The path is framed by large trees and vines.
Aesthetic Score : 0.8
Mood : mysterious, tranquil, adventurous
Quality
Entropy : 6.44
Noise : 86
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.90
Image errors : Some slight artifacts and blurriness in the figures and the path, particularly around the edges.
Lost in the Glow: A Moment of Intense Focus
A young man, bathed in vibrant blue and orange light, is completely absorbed in the screen before him. His headphones isolate him, creating an atmosphere of intense concentration and serious purpose. The dramatic lighting highlights his unwavering focus, capturing a moment of pure dedication.
Prompt
dramatic-styles Color Contrast: Focused, determined ; A gamer’s face illuminated by the screen, showing intense concentration; close-up; gaming; a dark, dimly lit room with neon lights; cinematic
Characteristic
Shot : A young man wearing headphones is looking intensely at a screen. He is illuminated with blue and orange light.
Aesthetic Score : 0.6
Mood : focused, serious, intense
Quality
Entropy : 6.46
Noise : 49
Prompt Clip Score : 0.37
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is some blur around the man’s head. The image is slightly overexposed.
City Lights, City Dreams: A Rooftop Romance
A couple silhouetted against the twinkling cityscape, their love story unfolding under a sky full of stars. This romantic scene captures the magic of a shared moment, bathed in the warm glow of city lights.
Prompt
dramatic-styles Color Contrast: Romantic, lively ; A couple standing on a balcony overlooking a bustling city; medium shot; tourism; a vibrant, colorful cityscape with twinkling lights; cinematic
Characteristic
Shot : A couple stands on a rooftop overlooking a city at night. The city is lit up with lights, and there are many buildings in the background. The couple is looking out at the city, and they appear to be in love.
Aesthetic Score : 0.7
Mood : romantic, dreamy, magical
Quality
Entropy : 6.52
Noise : 78
Prompt Clip Score : 0.37
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some artifacts and errors, such as the blurring of the city lights in the background. The couple’s silhouettes are also slightly distorted.
Warmth and Wonder: A Cozy Campfire Gathering
Experience the tranquility of a peaceful night in the woods as a group of four, comprising two adults and two children, huddle around a vibrant campfire. The fire’s warm glow illuminates their faces, creating an intimate and cozy atmosphere that encapsulates the essence of togetherness and peace.
Prompt
dramatic-styles Color Contrast: Warm, intimate ; A family gathered around a campfire, sharing stories and laughter; medium shot; family; a dark, starry night sky with a glowing fire; cinematic
Characteristic
Shot : A group of four people, two adults and two children, are gathered around a campfire in the woods at night. The fire is burning brightly and casting a warm glow on the faces of the people.
Aesthetic Score : 0.7
Mood : cozy, warm, peaceful
Quality
Entropy : 6.14
Noise : 55
Prompt Clip Score : 0.36
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has a bit of noise, especially in the darker areas. There is also a slight halo effect around the fire.
Conclusion
The results suggest that the generative AI model performed only moderately on camera position and shot analysis, and struggled most with aesthetic analysis.
Here’s a breakdown:
- Camera Position: The model scored 0.4, which is considered okay. The camera positions in the generated images differed somewhat from those specified in the prompts.
- Shot Analysis: The model scored 0.54, which is also considered okay. The model captured the scene described in the prompts to some extent, but not fully.
- Aesthetic Analysis: The model scored 0.31, which is considered pretty bad. The generated images had a noticeably different aesthetic from what the prompts called for.
Overall, the model seems to be better at following the technical aspects of a prompt (camera position and shot type) than the artistic aspects (aesthetic).
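The per-image figures reported above can be averaged for a quick overview. The sketch below only reuses numbers listed in this article. The 0.5 cutoff that splits images into "likely AI" and "unlikely AI" groups is an arbitrary assumption for illustration, and these averages are not the Camera Position, Shot Analysis, or Aesthetic Analysis scores, whose calculation is not described.

```python
from statistics import mean
from typing import NamedTuple


class ImageStats(NamedTuple):
    title: str
    aesthetic: float
    entropy: float
    noise: int
    clip: float
    ai_likelihood: float


# Values copied from the ten image records in this article (titles shortened).
IMAGES = [
    ImageStats("Knight", 0.6, 6.93, 88, 0.33, 0.80),
    ImageStats("Ancient corridor", 0.7, 6.51, 79, 0.28, 0.80),
    ImageStats("Backlit keyboard", 0.6, 6.33, 32, 0.34, 0.20),
    ImageStats("Mountain range", 0.8, 6.87, 74, 0.31, 0.90),
    ImageStats("Sunset picnic", 0.7, 6.53, 57, 0.35, 0.10),
    ImageStats("Superhero", 0.7, 6.81, 65, 0.33, 0.80),
    ImageStats("Jungle waterfall", 0.8, 6.44, 86, 0.29, 0.90),
    ImageStats("Gamer close-up", 0.6, 6.46, 49, 0.37, 0.20),
    ImageStats("Rooftop couple", 0.7, 6.52, 78, 0.37, 0.90),
    ImageStats("Campfire", 0.7, 6.14, 55, 0.36, 0.10),
]


def summarize(rows):
    """Mean of each numeric field across all images."""
    return {
        "aesthetic": mean([r.aesthetic for r in rows]),
        "entropy": mean([r.entropy for r in rows]),
        "noise": mean([r.noise for r in rows]),
        "clip": mean([r.clip for r in rows]),
        "ai_likelihood": mean([r.ai_likelihood for r in rows]),
    }


def noise_by_ai_group(rows, cutoff=0.5):
    """Mean Noise for images above and below an assumed AI-likelihood cutoff."""
    high = [r.noise for r in rows if r.ai_likelihood >= cutoff]
    low = [r.noise for r in rows if r.ai_likelihood < cutoff]
    return mean(high), mean(low)


for name, value in summarize(IMAGES).items():
    print(f"{name}: {value:.3f}")

high_noise, low_noise = noise_by_ai_group(IMAGES)
print(f"mean noise, likely AI: {high_noise:.2f}")
print(f"mean noise, unlikely AI: {low_noise:.2f}")
```

With these inputs, the mean Aesthetic Score is 0.69 and the mean Prompt Clip Score is about 0.33. The six images rated 0.80 or higher for Likelihood of AI average a Noise value of about 78, against about 48 for the four rated 0.20 or lower. With only ten images, this pattern is suggestive rather than conclusive.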