AI's Artistic Journey: Capturing Scenes, Missing the Mood with Leonardo-ai
The world of AI image generation is rapidly evolving, with models capable of creating stunning visuals based on text prompts. However, achieving a perfect balance between technical accuracy and artistic expression remains a challenge. This blog post examines the results of an experiment that tested the capabilities of a generative AI model in capturing various scenes and aesthetics. While the model demonstrated proficiency in understanding camera positions and shot composition, it struggled to translate the intended aesthetic into the generated images. This discrepancy highlights the ongoing quest for AI to truly understand and replicate the nuances of human artistic expression.
Created with: leonardo-ai
Solitude and Wonder: A Hiker’s View of a Fog-Shrouded Mountain Range
A lone hiker stands on a mountain ridge, taking in the breathtaking view of a valley blanketed in fog and snow-capped peaks. The scene evokes a sense of serenity, adventure, and the vastness of nature. The dramatic effect of the fog and the lone figure emphasizes the solitude and beauty of the moment.
Prompt
poses interactive-pose: Determined, hopeful, adventurous ; A lone adventurer; wide shot; Adventure; Majestic mountain range with a winding path leading to a hidden valley; cinematic
Characteristic
Shot : A lone hiker walks on a mountain ridge overlooking a valley shrouded in fog with snow-capped mountains in the background.
Aesthetic Score : 0.8
Mood : serene, adventurous, majestic
Quality
Entropy : 6.61
Noise : 104
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : No notable errors.
Friends Gather for a Night of Games and Laughter
A group of friends enjoys a lively board game session in a cozy, dimly lit room. The warm glow of the screen in the background adds to the intimate atmosphere, highlighting the joy and energy of their playful interaction.
Prompt
poses interactive-pose: Excited, focused, competitive ; A group of friends; medium shot; Gaming; A dimly lit room with a large screen displaying a video game, surrounded by controllers and snacks; cinematic
Characteristic
Shot : Three friends are playing a board game in a dimly lit room. A television is behind them, showing a video game scene. The friends are all smiling and laughing, and the atmosphere is one of fun and camaraderie.
Aesthetic Score : 0.7
Mood : playful, friendly, happy
Quality
Entropy : 6.04
Noise : 91
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly overexposed, but this is not a major issue. There is a slight graininess to the image, which may be due to the lighting conditions.
Batman at Sunset: A Silhouette of Power
A brooding Batman stands silhouetted against a vibrant sunset cityscape, capturing the hero’s dramatic and isolated nature. The lighting creates a sense of mystery and power, leaving you wanting to know what’s next in this epic scene.
Prompt
poses interactive-pose: Confident, powerful, heroic ; A superhero; close-up; Heroism; A cityscape with towering buildings and a dramatic sunset in the background; cinematic
Characteristic
Shot : A superhero, possibly Batman, is standing on a rooftop at sunset, overlooking a city skyline.
Aesthetic Score : 0.7
Mood : serious, dramatic, heroic
Quality
Entropy : 6.94
Noise : 90
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is a little blurry in some areas, particularly the background.
Love and Laughter in the Heart of India
A heartwarming scene unfolds in a bustling Indian street market, where a woman and a young boy share a moment of pure joy. The woman’s loving embrace and the boy’s infectious smile radiate warmth and happiness, capturing the essence of life’s simple pleasures amidst the vibrant chaos.
Prompt
poses interactive-pose: Happy, joyful, curious ; A family; medium shot; Tourism; A bustling marketplace with colorful stalls and vibrant street performers; cinematic
Characteristic
Shot : A woman and a young boy are standing in a bustling street market in India. The woman is smiling and has her arm around the boy, who is also smiling. The background is filled with vendors and shoppers.
Aesthetic Score : 0.8
Mood : happy, heartwarming, authentic
Quality
Entropy : 6.87
Noise : 102
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant artifacts or errors are visible.
A Winding Road to Hope
A serene and open valley unfolds before you, with a winding asphalt road leading towards the horizon. The rolling hills and grassy meadows create a sense of journey and possibility, inspiring a hopeful mood.
Prompt
poses interactive-pose: Free, adventurous, contemplative ; A traveler; close-up; Travel; A scenic landscape with rolling hills, a clear blue sky, and a winding road leading to the horizon; cinematic
Characteristic
Shot : A winding asphalt road leads through a valley of rolling green hills, curving slightly towards the viewer.
Aesthetic Score : 0.7
Mood : serene, open, hopeful
Quality
Entropy : 6.71
Noise : 98
Prompt Clip Score : 0.18
AI Evaluation
Likelihood of AI : 0.10
Image errors : No apparent errors.
Five Women Ignite the Stage with Energetic Dance
A captivating performance unfolds as five young women command the stage, their movements illuminated by vibrant blue and white spotlights. The stark contrast between the dark stage and the bright lights amplifies their energy and confidence, creating a powerful and dramatic visual spectacle.
Prompt
poses interactive-pose: Energetic, expressive, joyful ; A group of dancers; wide shot; Groups; A brightly lit stage with a vibrant backdrop, showcasing a performance; cinematic
Characteristic
Shot : Five young women in colorful outfits are dancing on a stage. There are blue lights shining on them.
Aesthetic Score : 0.7
Mood : energetic, confident, lively
Quality
Entropy : 6.63
Noise : 99
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no noticeable artifacts or errors in the image.
Sun-Dappled Serenity: A Hiker’s Journey Through a Mystical Forest
A lone hiker finds peace and wonder amidst the sunbeams filtering through a lush, vibrant forest. The tranquil atmosphere and dramatic lighting create a magical experience, highlighting the beauty of nature and the hiker’s solitary journey.
Prompt
poses interactive-pose: Calm, peaceful, introspective ; A lone hiker; medium shot; Adventure; A dense forest with towering trees and dappled sunlight filtering through the leaves; cinematic
Characteristic
Shot : A lone hiker walks through a dense forest, sunlight streaming through the trees, creating a path of light. Moss covers the ground and tree roots.
Aesthetic Score : 0.8
Mood : serene, tranquil, adventurous
Quality
Entropy : 6.67
Noise : 112
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable errors.
Mystery and Fun: A Night of Board Games Under Dim Lights
Four friends gather around a table in a dimly lit room, their laughter and playful banter filling the air as they engage in a spirited board game. The mysterious lighting adds an intriguing element to the scene, creating a sense of cozy intimacy and playful suspense.
Prompt
poses interactive-pose: Fun, playful, competitive ; A group of friends; close-up; Gaming; A dimly lit room with a table covered in board games and snacks; cinematic
Characteristic
Shot : A group of young adults are gathered around a table playing a board game. The room is dimly lit with a blue overhead light and the table is lit with yellow light. The table has a checkered board game on it with pieces in the middle. The group looks to be having fun and laughing.
Aesthetic Score : 0.6
Mood : fun, playful, casual
Quality
Entropy : 6.05
Noise : 93
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no noticeable errors or artifacts in this image.
Silhouettes of Love at Sunset
A couple embraces on a golden-hour beach, their silhouettes painted against the fiery sunset. The scene evokes a sense of intimacy and romance, enhanced by the gentle waves and warm glow of the evening light.
Prompt
poses interactive-pose: Romantic, intimate, peaceful ; A couple; close-up; Tourism; A romantic sunset over a beach with the ocean waves crashing in the background; cinematic
Characteristic
Shot : A couple is embracing on a beach at sunset. The couple is silhouetted against the setting sun, with the waves crashing in the background.
Aesthetic Score : 0.7
Mood : romantic, serene, peaceful
Quality
Entropy : 6.95
Noise : 98
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly overexposed, causing some loss of detail in the highlights. The image is also slightly blurry, especially in the background.
Passionate Performance Under the Spotlight
A dynamic lead singer, bathed in dramatic lighting, fronts a captivating live performance. The energy is palpable, creating a sense of excitement and drama.
Prompt
poses interactive-pose: Energetic, passionate, inspiring ; A group of musicians; wide shot; Groups; A concert stage with a large crowd cheering in the background; cinematic
Characteristic
Shot : A band performing on stage in front of a dark background with spotlights. The lead singer is in the center, spread-legged, arms outstretched. Other band members are in the background, playing instruments and singing.
Aesthetic Score : 0.6
Mood : energetic, passionate, dramatic
Quality
Entropy : 6.74
Noise : 102
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are minor artifacts visible in the background, particularly around the lights. Some image noise might be present. The image might be a little underexposed.
Conclusion
The results show that the generative AI model handled shot analysis well and came close to the “good” range on camera position, but struggled with aesthetic analysis.
Here’s a breakdown:
- Camera Position: The model scored 0.46, which is slightly below the “good” range of 0.5 to 0.75. This suggests that the model’s ability to accurately interpret and reproduce camera positions in the prompt is decent, but could be improved.
- Shot Analysis: The model scored 0.53, which falls within the “good” range. This indicates that the model is generally able to understand the scene described in the prompt and create a shot that aligns with it.
- Aesthetic Analysis: The model scored 0.06, the lowest of the three scores. The “very good” range is given as -0.2 to 0.1, which 0.06 numerically falls inside, so the stated range does not match the verdict. The result is nevertheless read as the generated images’ aesthetic deviating significantly from the aesthetic the prompts called for.
Overall, the model demonstrates a reasonable grasp of camera positions and shot composition, but needs improvement in generating images that match the desired aesthetic.
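To make the numbers in this post easier to check, here is a minimal Python sketch that averages the per-image Prompt Clip Score and Aesthetic Score values listed above and tests each summary score against the range quoted for it. The variable and function names are hypothetical, and the sketch assumes the “good” range of 0.5 to 0.75 applies to shot analysis as well as camera position, since the post only states it once. It does not reproduce how the scores themselves were computed, which the post does not describe.

```python
from statistics import mean

# Per-image values copied from the ten image sections above, in order.
clip_scores = [0.24, 0.26, 0.26, 0.21, 0.18, 0.25, 0.26, 0.26, 0.28, 0.22]
aesthetic_scores = [0.8, 0.7, 0.7, 0.8, 0.7, 0.7, 0.8, 0.6, 0.7, 0.6]

# Summary scores from the conclusion, each paired with the only range
# quoted for it: (score, range label, low, high).
# Assumption: shot analysis uses the same "good" range as camera position.
summary = {
    "camera_position": (0.46, "good", 0.5, 0.75),
    "shot_analysis": (0.53, "good", 0.5, 0.75),
    "aesthetic_analysis": (0.06, "very good", -0.2, 0.1),
}


def in_range(score: float, low: float, high: float) -> bool:
    """Return True when score lies inside the inclusive range [low, high]."""
    return low <= score <= high


def range_checks() -> dict:
    """Map each summary metric to whether its score is inside its quoted range."""
    return {
        name: in_range(score, low, high)
        for name, (score, _label, low, high) in summary.items()
    }


if __name__ == "__main__":
    print(f"Mean Prompt Clip Score: {mean(clip_scores):.3f}")
    print(f"Mean Aesthetic Score:   {mean(aesthetic_scores):.2f}")
    for name, inside in range_checks().items():
        score, label, low, high = summary[name]
        verdict = "inside" if inside else "outside"
        print(f"{name}: {score} is {verdict} the '{label}' range [{low}, {high}]")
```

With the values listed in this post, the mean Prompt Clip Score comes out at roughly 0.24 and the mean Aesthetic Score at roughly 0.71. The range check places camera position just outside its quoted range and the other two scores inside theirs.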