AI Captures the Scene, But Misses the Mood with Leonardo-ai
- 9 minutes read - 1717 wordsTable of Contents
In the realm of artificial intelligence, image generation has emerged as a captivating field. Generative AI models, trained on vast datasets of images and text, can create stunning visuals based on textual prompts. However, the ability to capture the nuances of aesthetic style remains a challenge. This blog post delves into the results of an experiment that tested the capabilities of a generative AI model in creating images that align with specific aesthetic parameters. We explore the model’s performance in terms of camera position, shot analysis, and aesthetic analysis, highlighting its strengths and areas for improvement. By understanding the limitations and potential of these models, we can gain valuable insights into the future of AI-powered image creation.
Created with: leonardo-ai
A Handshake for the Future: Astronauts Mark a New Era of Space Exploration
Two astronauts stand on a rocky surface, their hands clasped in a gesture of hope and progress. Against a backdrop of a distant planet and a starry night sky, this iconic handshake symbolizes the dawn of a new era in space exploration. The scene evokes a sense of epic adventure, futuristic possibilities, and the boundless potential of humanity’s journey among the stars.
Prompt
poses holding-hands: Hopeful, determined, camaraderie ; Two astronauts; wide shot; heroism; the vastness of space with stars and planets in the background; cinematic
Characteristic
Shot : Two astronauts in spacesuits shaking hands on a desolate planet with a distant star field background
Aesthetic Score : 0.7
Mood : hopeful, optimistic, peaceful
Quality
Entropy : 6.73
Noise : 104
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some minor artifacts, such as a slight blurring around the edges of the astronauts. The planet is a bit blurry in the background and looks a bit ‘flat’ - could have more detail or a ‘glow’ to it
Lost in the Jungle’s Embrace: A Journey of Wonder and Mystery
Three adventurers trek through a vibrant jungle, bathed in sunlight that paints the scene with magic. The light beams create an ethereal atmosphere, highlighting the figures as they explore the unknown. This image captures the essence of adventure, serenity, and the mystical allure of nature.
Prompt
poses holding-hands: Excited, adventurous, trusting ; A group of explorers; medium shot; adventure; a dense jungle with sunlight filtering through the canopy; cinematic
Characteristic
Shot : Three people are hiking in a lush green tropical forest. The sunlight is filtering through the trees, creating a beautiful and serene atmosphere.
Aesthetic Score : 0.7
Mood : serene, adventurous, hopeful
Quality
Entropy : 6.72
Noise : 112
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major errors are visible in this image, it’s well-composed.
The Intensity of the Game Shines Through
Two young men are locked in a fierce video game battle, their focus unwavering in the dimly lit room. The dramatic lighting amplifies the tension and competitive spirit, creating a captivating scene of pure gaming passion.
Prompt
poses holding-hands: Focused, competitive, collaborative ; Two gamers; close-up; gaming; a brightly lit gaming setup with glowing screens and controllers; cinematic
Characteristic
Shot : Two young men are playing video games in a dimly lit room. The man in the foreground is wearing headphones and typing on a keyboard.
Aesthetic Score : 0.6
Mood : focused, intense, competitive
Quality
Entropy : 6.38
Noise : 94
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant image errors were detected.
Love Blooms on a Cobblestone Street
A couple strolls hand-in-hand down a charming cobblestone street, their smiles radiating joy and love. The warm lighting and inviting atmosphere create a picture of pure romance and happiness.
Prompt
poses holding-hands: Romantic, happy, adventurous ; A couple; medium shot; tourism; a picturesque cityscape with iconic landmarks in the background; cinematic
Characteristic
Shot : A young couple is walking hand-in-hand down a cobblestone street in a European city. The buildings on either side of the street are old and have a rustic charm.
Aesthetic Score : 0.7
Mood : romantic, happy, carefree
Quality
Entropy : 6.88
Noise : 102
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors
A Father and Daughter’s Journey Towards Hope
A tranquil scene of a father and daughter walking hand-in-hand towards a snow-capped mountain range. The vastness of the mountains creates a sense of adventure and hope, while the small figures evoke a feeling of vulnerability and the beauty of shared experiences.
Prompt
poses holding-hands: Joyful, connected, adventurous ; A family; long shot; travel; a scenic mountain range with a winding road leading to the peak; cinematic
Characteristic
Shot : A father and daughter walk down a road in the mountains with backpacks on. The mountains are snow capped in the background.
Aesthetic Score : 0.8
Mood : peaceful, adventurous, hopeful
Quality
Entropy : 6.65
Noise : 104
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image appears to be well-exposed with no noticeable artifacts or errors.
Friendship in Full Bloom: Laughter and Joy at a Festive Gathering
This vibrant image captures the essence of friendship, with four friends walking together, laughing and radiating joy. The scene evokes a sense of celebration, with vibrant colors and infectious laughter painting a picture of pure happiness.
Prompt
poses holding-hands: Happy, celebratory, connected ; A group of friends; medium shot; groups; a vibrant festival with colorful decorations and music; cinematic
Characteristic
Shot : Four friends are walking together, laughing and smiling, against a colorful backdrop of a festival or market.
Aesthetic Score : 0.7
Mood : happy, playful, festive
Quality
Entropy : 6.92
Noise : 96
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant errors.
A Hiker’s Journey: Finding Serenity Amidst Majestic Peaks
A lone hiker traverses a mountain path, their small figure dwarfed by the towering peaks. The scene evokes a sense of serenity, adventure, and contemplation, as the hiker journeys towards a snowy summit in the distance.
Prompt
poses holding-hands: Determined, courageous, triumphant ; A lone hiker; close-up; heroism; a breathtaking mountain vista with clouds swirling below; cinematic
Characteristic
Shot : A lone hiker walks on a mountain trail, with a stunning view of snow-capped peaks and cloudy sky in the background
Aesthetic Score : 0.8
Mood : serene, adventurous, inspiring
Quality
Entropy : 6.80
Noise : 101
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors.
Childhood Joy: Two Girls Running Free on a Sunny Playground
A heartwarming scene of two young girls, hand-in-hand, running with pure joy on a vibrant playground. The sandbox, colorful play structure, and lush green trees create a backdrop of carefree fun. This image captures the essence of childhood innocence and the simple pleasures of life.
Prompt
poses holding-hands: Playful, innocent, carefree ; Two children; close-up; adventure; a playground with swings, slides, and a sandbox; cinematic
Characteristic
Shot : Two young girls are running and holding hands in a playground with colorful structures in the background
Aesthetic Score : 0.7
Mood : playful, carefree, joyful
Quality
Entropy : 6.86
Noise : 104
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : No major errors, but the lighting is slightly uneven and there is some noise in the background.
Energetic Performance Under Bright Spotlights
A band takes the stage, the singer front and center, bathed in the glow of powerful spotlights. The scene is electric with energy and the promise of a lively performance.
Prompt
poses holding-hands: Passionate, connected, expressive ; A group of musicians; medium shot; groups; a dimly lit stage with spotlights shining on them; cinematic
Characteristic
Shot : A band is playing on stage with a dark background. There are spotlights on them. The musicians are dressed in casual attire and are interacting with each other.
Aesthetic Score : 0.6
Mood : energetic, lively, joyous
Quality
Entropy : 6.36
Noise : 96
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : There is some noise in the image, and the shadows are a bit harsh. The contrast is a bit high, which makes the image look a bit flat.
Sunset Romance in the Desert
A couple strolls hand-in-hand towards a breathtaking desert sunset, their love story unfolding against a backdrop of warm hues and endless possibilities. The scene evokes a sense of adventure, hope, and romantic bliss.
Prompt
poses holding-hands: Romantic, adventurous, hopeful ; A couple; long shot; travel; a vast desert landscape with a setting sun in the distance; cinematic
Characteristic
Shot : A couple is walking hand in hand away from the camera into the desert, backlit by the sunset.
Aesthetic Score : 0.7
Mood : romantic, adventurous, hopeful
Quality
Entropy : 6.79
Noise : 103
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant errors.
Conclusion
The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis.
Here’s a breakdown:
- Camera Position: The model scored 0.51, which falls within the “good” range (0.5 to 0.75). This indicates that the model was able to accurately capture the camera positions described in the prompt.
- Shot Analysis: The model scored 0.555, also within the “good” range. This suggests that the model understood the scene described in the prompt and was able to create an image that reflected that understanding.
- Aesthetic Analysis: The model scored 0.07, which is significantly lower than the “very good” range (-0.2 to 0.1). This indicates that the generated image did not match the expected aesthetic as closely as it did with the camera position and shot analysis.
Overall, the model demonstrates a good understanding of camera positions and scene descriptions, but needs improvement in generating images that meet the desired aesthetic.