AI Struggles to Capture the 'Dramatic' Aesthetic with Leonardo-ai
- 9 minutes read - 1832 wordsTable of Contents
The ‘dramatic’ aesthetic is a powerful tool in visual storytelling, often used to evoke strong emotions and create a sense of grandeur. It’s characterized by elements like dramatic lighting, contrasting colors, and dynamic compositions. This style is frequently employed in film, photography, and even video games to enhance the impact of a scene. However, replicating this aesthetic in AI-generated images presents a unique challenge. This blog post explores the results of an experiment that tested an AI model’s ability to capture the ‘dramatic’ aesthetic, highlighting its strengths and weaknesses.
Created with: leonardo-ai
Silhouetted Rider Against a Dramatic Sunset
A lone rider on horseback is silhouetted against a breathtaking sunset over a desolate landscape, evoking a sense of melancholy, solitude, and adventure. The vastness of the scene emphasizes the rider’s isolation and the mystery surrounding their journey.
Prompt
Stylized: Epic and melancholic ; A lone warrior; wide shot; Heroism; A desolate battlefield with a setting sun; cinematic
Characteristic
Shot : A lone rider on horseback rides through a desolate, dry, and rocky landscape towards the horizon, where the sun is setting behind distant mountains.
Aesthetic Score : 0.7
Mood : melancholy, epic, lonely
Quality
Entropy : 6.45
Noise : 90
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are some slight artifacts and blurriness in the image, especially in the background. The rider and the horse are also slightly blurry, which could be intentional.
Unveiling the Secrets of a Hidden Treasure
A mysterious, dark cave holds a treasure chest overflowing with gold coins. The dramatic lighting casts an air of intrigue, beckoning you to uncover the secrets within.
Prompt
Stylized: Excitement and wonder ; A treasure chest overflowing with gold; close-up; Adventure; A dark and mysterious cave; cinematic
Characteristic
Shot : A treasure chest overflowing with gold coins, resting open in a dark and mysterious cave.
Aesthetic Score : 0.7
Mood : mysterious, adventurous, treasure
Quality
Entropy : 6.30
Noise : 91
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.60
Image errors : There are some slight artifacts and blurry edges around the gold coins.
Cyberpunk Warrior on the Edge
A woman in futuristic armor stands poised on the edge of a bustling cyberpunk city, her gaze locked on the viewer. The neon-drenched cityscape and her determined stance hint at a dangerous mission about to unfold. This captivating scene blends technology and nature, creating a world of mystery and intrigue.
Prompt
Stylized: Triumphant and futuristic ; A player’s avatar, a powerful warrior, standing triumphantly; medium shot; Gaming; A vibrant and futuristic cityscape; cinematic
Characteristic
Shot : A female character in a futuristic cityscape. She is wearing a black and gold suit with cyberpunk armor. The background features brightly lit neon signs and towering skyscrapers. A street lined with cars is visible in the foreground.
Aesthetic Score : 0.8
Mood : futuristic, cyberpunk, edgy
Quality
Entropy : 6.76
Noise : 94
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.90
Image errors : There are no visible artifacts or errors in the image.
Cityscape Sunset: A Serene Symphony of Light and Shadow
Experience the breathtaking beauty of a city skyline bathed in the golden hues of sunset. Tall skyscrapers pierce the sky, reflected in the shimmering river below. Streetlights illuminate the urban landscape, creating a mesmerizing interplay of light and shadow. This panoramic view captures the serene and atmospheric essence of city life at its most captivating.
Prompt
Stylized: Energetic and lively ; A panoramic view of a bustling city; long shot; Tourism; A vibrant and colorful cityscape; cinematic
Characteristic
Shot : A cityscape with skyscrapers illuminated at sunset, capturing the vibrant city life during golden hour.
Aesthetic Score : 0.8
Mood : urban, vibrant, energetic
Quality
Entropy : 6.65
Noise : 108
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has slight noise and artifacts, particularly in the shadows and highlights.
Silhouetted Against the Sunset: A Moment of Contemplation in the Vast Desert
A lone figure stands on a sand dune, bathed in the warm glow of a desert sunset. The sky explodes with vibrant hues of orange, pink, and blue, while the endless dunes stretch out in every direction. This serene scene evokes a sense of vastness, isolation, and contemplation, capturing the beauty and tranquility of the desert landscape.
Prompt
Stylized: Serene and contemplative ; A lone traveler gazing at a breathtaking sunset; medium shot; Travel; A vast desert landscape; cinematic
Characteristic
Shot : A lone figure stands on a dune, looking out at a vast desert landscape at sunset.
Aesthetic Score : 0.7
Mood : serene, contemplative, vast
Quality
Entropy : 6.70
Noise : 88
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : None
Sun-Kissed Joy: A Family’s Walk Through Happiness
A heartwarming scene unfolds as a family strolls through a sun-drenched park. The father holds his daughter’s hand, while the mother looks back with a smile. The daughter, full of laughter, runs ahead, capturing the essence of carefree joy. The warm sunlight filtering through the leaves creates a beautiful and inviting atmosphere, reflecting the love and happiness shared by this family.
Prompt
Stylized: Joyful and heartwarming ; A family laughing and playing in a park; medium shot; Family; A sunny and idyllic park setting; cinematic
Characteristic
Shot : A family of three is walking in a park on a sunny day. The parents are smiling and looking at each other, while the little girl is running ahead of them with her arms outstretched. The trees are green and lush, and the sun is shining brightly.
Aesthetic Score : 0.7
Mood : joyful, playful, happy
Quality
Entropy : 6.75
Noise : 106
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable artifacts or errors.
Man vs. Nature: A Dramatic Seascape of Power and Insignificance
Two figures stand precariously on a cliff edge, dwarfed by the raw power of a stormy sea. Large waves crash against the shore, creating a dramatic contrast that evokes a sense of awe and insignificance. This powerful seascape captures the raw beauty and untamed nature of the ocean.
Prompt
Stylized: Dramatic and powerful ; A lone figure standing on a cliff overlooking a vast ocean; long shot; Heroism; A stormy sea with dramatic clouds; cinematic
Characteristic
Shot : Two people stand on a cliff overlooking a turbulent ocean. Dark, ominous clouds dominate the sky. The waves crash against the rocky shore.
Aesthetic Score : 0.8
Mood : dramatic, foreboding, powerful
Quality
Entropy : 6.67
Noise : 92
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable artifacts or errors
Unveiling Secrets in the Shadows
A dimly lit wooden table holds the promise of adventure. An old map, a weathered wooden box, and other intriguing objects lie scattered beneath a flickering candle chandelier. The low light and faded colors of the map create an atmosphere of mystery and intrigue, beckoning you to uncover the secrets hidden within.
Prompt
Stylized: Intriguing and mysterious ; A map with pins marking locations of hidden treasures; close-up; Adventure; A dimly lit room with antique furniture; cinematic
Characteristic
Shot : A dimly lit room with a large map spread out on a wooden table, surrounded by antique furniture and decorations. The map appears to be a historical one with various pins inserted.
Aesthetic Score : 0.7
Mood : mysterious, vintage, adventurous
Quality
Entropy : 6.32
Noise : 88
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : None
Hunter’s Focus: A Moment of Anticipation
A woman clad in leather armor stands poised in a verdant forest, her bow drawn and arrow aimed. The intensity in her gaze and the tension in her stance create a palpable sense of anticipation, hinting at the dramatic moment that is about to unfold.
Prompt
Stylized: Intense and focused ; A player’s character, a skilled archer, aiming at a target; close-up; Gaming; A dark and mysterious forest; cinematic
Characteristic
Shot : A young woman dressed in a leather tunic and bracers is aiming a bow and arrow in a forest setting. The background is blurred, suggesting a shallow depth of field, and the lighting is soft and diffused.
Aesthetic Score : 0.75
Mood : intense, focused, adventurous
Quality
Entropy : 6.35
Noise : 80
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image appears slightly overexposed, resulting in some loss of detail in the darker areas, particularly in the background.
A Romantic Evening Under the Warm Glow of Candlelight
Experience the perfect romantic dinner with your loved one in a cozy restaurant, illuminated by the soft light of candles and the warm glow of the restaurant’s lights. The intimate and cozy atmosphere, with a high aesthetic score of 0.8, sets the perfect mood for a memorable evening.
Prompt
Stylized: Social and celebratory ; A group of friends enjoying a meal at a restaurant with a view; medium shot; Tourism; A bustling city street with vibrant lights; cinematic
Characteristic
Shot : A couple is enjoying a romantic dinner at an outdoor restaurant in the evening. They are sitting at a table with candles and drinks, and the restaurant is lit with warm lights. The background is a bustling street with other diners.
Aesthetic Score : 0.7
Mood : romantic, cozy, warm
Quality
Entropy : 6.64
Noise : 95
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major errors, but some noise might be visible in the background, possibly due to low light conditions
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic style. Here’s a breakdown:
- Camera Position: The model scored 0.35, which is below the “good” range of 0.5 to 0.75. This suggests that the model didn’t quite capture the intended camera position as described in the prompt.
- Shot Analysis: The model scored 0.585, which falls within the “good” range. This indicates that the model was able to understand the scene and create a shot that was relatively close to what was described in the prompt.
- Aesthetic Analysis: The model scored 0.0, which is significantly below the “very good” range of -0.2 to 0.1. This means that the generated image didn’t match the expected aesthetic style as described in the prompt.
Overall, the model seems to be better at understanding the scene and shot composition than it is at capturing the desired aesthetic style.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://leonardo.ai