AI Captures the Essence, But Misses the Details: A Study in Aesthetic Style with Leonardo-ai

AI's Artistic Journey: Capturing the 'style-aesthetic' but Missing the Mark on Details with Leonardo-ai

Contents

The ‘style-aesthetic’ is a powerful tool in visual storytelling, evoking specific emotions and atmospheres. It’s characterized by dramatic lighting, evocative color palettes, and carefully composed shots. This style is often used in film, photography, and even video games to create immersive and impactful experiences. In this blog post, we explore the results of an experiment testing an AI model’s ability to generate images based on the ‘style-aesthetic’ and specific scene descriptions. We’ll delve into the model’s strengths and weaknesses, highlighting its success in capturing the desired aesthetic while revealing its struggles with accurately representing camera positions and scene details. Join us as we explore the fascinating world of AI image generation and its potential to revolutionize visual storytelling.

Created with: leonardo-ai

A Solitary Journey Towards Hope

A lone figure traverses a desolate plain, their small form dwarfed by the vastness of the landscape. The setting sun casts a dramatic glow on the distant mountain range, offering a glimmer of hope amidst the solitude.

A Solitary Journey Towards Hope

Prompt

Impressionist: Epic, hopeful ; A lone figure, silhouetted against the setting sun; wide shot; Heroism; A vast, desolate landscape with a lone mountain in the distance; cinematic

Characteristic

Shot : A lone figure walks towards a mountain range as the sun sets behind it, casting a warm glow over the landscape.

Aesthetic Score : 0.8

Mood : serene, hopeful, contemplative

Quality

Entropy : 6.74

Noise : 91

Prompt Clip Score : 0.32

AI Evaluation

Likelihood of AI : 0.20

Image errors : No noticeable errors, the image appears to be well-exposed and processed.

Lost in Time: A Compass Beckons Adventure

A vintage map unfolds beneath the warm glow of a lamp, revealing a weathered compass pointing towards an unknown destination. The scene evokes a sense of nostalgia, adventure, and mystery, inviting you to explore the past and dream of new horizons.

Lost in Time: A Compass Beckons Adventure

Prompt

Impressionist: Mysterious, adventurous ; A weathered map, partially obscured by shadows, with a compass needle pointing towards a distant, unknown land; close-up; Adventure; A dimly lit room with flickering candlelight; cinematic

Characteristic

Shot : An old compass lies on an antique map, with a vintage telescope in the background.

Aesthetic Score : 0.8

Mood : vintage, mysterious, adventurous

Quality

Entropy : 6.73

Noise : 87

Prompt Clip Score : 0.29

AI Evaluation

Likelihood of AI : 0.20

Image errors : The lighting is slightly uneven, creating some shadows that are a bit too harsh.

In the Zone: Gamer’s Hand Grips Controller with Focused Intensity

A close-up shot captures the intensity of a gamer’s focus as their hand grips a controller with glowing red buttons. The computer screen in the background displays a vibrant video game, adding to the sense of immersion and engagement. The image evokes a mood of seriousness and determination, highlighting the player’s complete absorption in the virtual world.

In the Zone: Gamer’s Hand Grips Controller with Focused Intensity

Prompt

Impressionist: Intense, focused ; A player’s hand, gripping a joystick, with the screen reflecting the vibrant colors of a virtual world; close-up; Gaming; A dimly lit room with a computer screen glowing brightly; cinematic

Characteristic

Shot : A close-up shot of a person’s hands holding a gaming controller, with a computer screen showing a game in the background. The room is lit with vibrant, colorful lights.

Aesthetic Score : 0.6

Mood : intense, focused, playful

Quality

Entropy : 6.17

Noise : 71

Prompt Clip Score : 0.31

AI Evaluation

Likelihood of AI : 0.20

Image errors : The image appears slightly blurry in the background, and the lighting is a bit artificial, making the scene feel slightly unnatural.

Lost in the Labyrinth of Color: A Street Market’s Enchanting Depth

A vibrant street market in a foreign land unfolds before you, its narrow alleyways teeming with life. A lone figure walks through the heart of the bustling scene, bathed in the play of light and shadow. Brightly colored flowers and fabrics create a tapestry of exotic beauty, inviting you to explore the hidden depths of this captivating place.

Lost in the Labyrinth of Color: A Street Market’s Enchanting Depth

Prompt

Impressionist: Exuberant, curious ; A bustling marketplace, filled with vibrant colors and exotic goods, with a lone traveler gazing in wonder; wide shot; Tourism; A bustling marketplace with vibrant colors and exotic goods; cinematic

Characteristic

Shot : A bustling street market in a foreign city, likely in South Asia. People are walking through the market, buying goods from vendors. There are colorful flowers and vegetables on display, as well as many other items.

Aesthetic Score : 0.7

Mood : busy, vibrant, exotic

Quality

Entropy : 6.77

Noise : 112

Prompt Clip Score : 0.31

AI Evaluation

Likelihood of AI : 0.10

Image errors : No noticeable artifacts or errors. The image is clean and well-composed.

Nostalgic Journey Through Rolling Hills

A green train glides through picturesque countryside, evoking a sense of peace and nostalgia. The gentle motion of the train adds a touch of excitement to the serene landscape, with a charming house and rolling hills completing the idyllic scene.

Nostalgic Journey Through Rolling Hills

Prompt

Impressionist: Nostalgic, romantic ; A train speeding through a picturesque countryside, with blurred landscapes and fleeting glimpses of towns and villages; long shot; Travel; A picturesque countryside with rolling hills and lush greenery; cinematic

Characteristic

Shot : A green train travelling on a railway track through a hilly countryside. The train is moving towards the right side of the frame, while the camera follows its movement.

Aesthetic Score : 0.8

Mood : serene, peaceful, nostalgic

Quality

Entropy : 6.83

Noise : 115

Prompt Clip Score : 0.30

AI Evaluation

Likelihood of AI : 0.10

Image errors : No significant artifacts or errors are visible. The image appears to be well-processed with minimal noise.

Golden Hour Breakfast: A Grandparent’s Love Shines Through

A heartwarming scene unfolds as a grandfather and granddaughter share a cozy breakfast bathed in the soft glow of morning sunlight. The intimate setting and warm lighting create a sense of love and connection, capturing a special moment between generations.

Golden Hour Breakfast: A Grandparent’s Love Shines Through

Prompt

Impressionist: Intimate, heartwarming ; A family gathered around a table, sharing a meal, with warm, golden light illuminating their faces; medium shot; Family; A cozy kitchen with a warm, inviting atmosphere; cinematic

Characteristic

Shot : A grandfather and granddaughter are having breakfast together in a warm, inviting kitchen. The light is soft and the colors are muted, creating a peaceful atmosphere.

Aesthetic Score : 0.7

Mood : warm, cozy, intimate

Quality

Entropy : 6.79

Noise : 90

Prompt Clip Score : 0.31

AI Evaluation

Likelihood of AI : 0.10

Image errors : There are some minor imperfections in the image, such as slight blurriness around the edges and some noise in the shadows. However, these are not overly distracting.

Silhouetted Against the Sunset: A Moment of Contemplation

A lone figure stands on a clifftop, dwarfed by the vast ocean and the setting sun. The warm glow of the sunset paints the sky in vibrant hues, creating a serene and contemplative mood. This image captures the power and beauty of nature, leaving the viewer with a sense of awe and wonder.

Silhouetted Against the Sunset: A Moment of Contemplation

Prompt

Impressionist: Solitary, contemplative ; A lone figure, standing on a cliff overlooking a vast ocean, with the sun setting in the distance; medium shot; Heroism; A vast ocean with a dramatic sunset; cinematic

Characteristic

Shot : A lone figure standing on a cliff overlooking the ocean at sunset, the sun is setting over the horizon, casting a golden glow over the water

Aesthetic Score : 0.8

Mood : serene, contemplative, peaceful

Quality

Entropy : 6.73

Noise : 99

Prompt Clip Score : 0.29

AI Evaluation

Likelihood of AI : 0.20

Image errors : No major errors detected, slightly oversaturated

Campfire Camaraderie: A Moment of Warmth in the Wilderness

Four friends gather around a crackling campfire, their laughter echoing through the dark forest. The warm glow of the flames illuminates their faces, creating a sense of intimacy and adventure. This cozy scene captures the essence of camaraderie and the magic of shared experiences in nature.

Campfire Camaraderie: A Moment of Warmth in the Wilderness

Prompt

Impressionist: Warm, camaraderie ; A group of adventurers, silhouetted against a blazing campfire, sharing stories and laughter; medium shot; Adventure; A dark forest with a flickering campfire; cinematic

Characteristic

Shot : Four men are sitting around a campfire in a forest, they are all looking at each other and smiling, the fire is warm and inviting, and the forest is dark and mysterious.

Aesthetic Score : 0.7

Mood : cozy, friendly, adventurous

Quality

Entropy : 5.79

Noise : 93

Prompt Clip Score : 0.36

AI Evaluation

Likelihood of AI : 0.20

Image errors : No visible artifacts or errors

Lost in the Digital World: A Moment of Intense Focus

A young man sits captivated by his computer screen, his face illuminated by the glow, reflecting a mood of intense focus and seriousness. The low light adds a dramatic effect, highlighting the intensity of his engagement with the digital world.

Lost in the Digital World: A Moment of Intense Focus

Prompt

Impressionist: Engrossed, focused ; A close-up of a player’s face, illuminated by the screen, with a mix of excitement and concentration; close-up; Gaming; A dimly lit room with a computer screen glowing brightly; cinematic

Characteristic

Shot : A young man is sitting in front of a computer screen, looking focused and engrossed in his work. He is in a dimly lit room, with only the glow of the screen illuminating his face.

Aesthetic Score : 0.6

Mood : focused, intense, techy

Quality

Entropy : 5.84

Noise : 74

Prompt Clip Score : 0.31

AI Evaluation

Likelihood of AI : 0.20

Image errors : There are minor errors in the image, namely, the blurred background looks almost like an over-sharpening effect is being used. The subject’s right hand looks blurred and not entirely real.

A Vibrant Stroll: Love in the Rain

Experience the romantic charm of a couple sharing an umbrella on a narrow street, surrounded by colorful buildings and shops. The recent rain has left the street glistening, while the bright blue sky with clouds adds a whimsical touch to their vibrant journey.

A Vibrant Stroll: Love in the Rain

Prompt

Impressionist: Energetic, vibrant ; A panoramic view of a bustling city, with vibrant colors and a sense of movement, with a lone traveler walking through the streets; wide shot; Tourism; A bustling city with vibrant colors and a sense of movement; cinematic

Characteristic

Shot : A couple walks under an umbrella down a narrow, cobblestone street lined with shops and colorful buildings in a European city.

Aesthetic Score : 0.8

Mood : romantic, nostalgic, urban

Quality

Entropy : 6.85

Noise : 115

Prompt Clip Score : 0.30

AI Evaluation

Likelihood of AI : 0.30

Image errors : No noticeable errors or artifacts.

Conclusion

The results show that the generative AI model performed well in terms of understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:

  • Camera Position: The model scored 0.35, which is considered below average. This suggests that the model didn’t accurately translate the camera position described in the prompt into the generated image.
  • Shot Analysis: The model scored 0.5, which is considered average. This indicates that the model was able to understand the scene described in the prompt to a reasonable degree, but there’s room for improvement.
  • Aesthetic Analysis: The model scored 0.07, which is considered very good. This means that the generated image closely matched the expected aesthetic style, despite the other shortcomings.

Overall, the model demonstrates a mixed performance. While it excels at capturing the desired aesthetic, it struggles with accurately representing the camera position and scene details.

Sources: