AI's Artistic Journey: A Glimpse into the 'style-aesthetic' Challenge with Leonardo-ai

The ‘style-aesthetic’ of an image shapes its emotional impact and overall message. It covers elements such as color palette, lighting, composition, and choice of camera angle. This post examines how well a generative AI model captures that ‘style-aesthetic’ when translating text prompts into images. We walk through ten examples and look at where the model succeeds or falls short in camera position, shot composition, and the intended aesthetic.

Created with: leonardo-ai

Solitude and Majesty: A Hiker’s Sunrise Above the Clouds

A lone hiker stands on a rocky mountain peak, silhouetted against a breathtaking sea of clouds at sunrise. The scene evokes a sense of serenity, tranquility, and the majestic power of nature. The dramatic effect of the hiker’s small figure against the vast expanse of clouds emphasizes the beauty and solitude of this awe-inspiring moment.

Prompt

Abstract: Epic, triumphant ; A lone figure standing on a mountain peak; wide shot; Heroism; a vast, swirling sea of clouds; cinematic

Characteristic

Shot : A lone hiker stands on a rocky mountain peak overlooking a vast sea of clouds. The sky is a soft blue and the clouds are a beautiful white, with hints of gold from the setting sun. The hiker is silhouetted against the bright clouds, giving the image a sense of scale and solitude.

Aesthetic Score : 0.8

Mood : tranquil, serene, inspiring

Quality

Entropy : 6.75

Noise : 92

Prompt Clip Score : 0.28

AI Evaluation

Likelihood of AI : 0.10

Image errors : The image is free of artifacts and errors.
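
The post does not say how the Entropy value is computed. One common choice, assumed here purely for illustration, is the Shannon entropy of the 8-bit grayscale histogram, which ranges from 0 bits (a single flat tone) to 8 bits (all 256 levels equally likely). Under that assumption, values between roughly 6 and 7 would indicate a wide tonal spread. The function name `shannon_entropy` and the toy pixel lists are hypothetical.

```python
import math
from collections import Counter

def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a flat sequence of 8-bit gray levels.

    Assumption: this is one common definition of image entropy, not
    necessarily the one used to produce the scores in this post.
    """
    counts = Counter(pixels)
    total = len(pixels)
    # p * log2(1 / p) summed over every gray level that occurs
    return sum((n / total) * math.log2(total / n) for n in counts.values())

flat = [128] * 1024              # a single gray level everywhere
uniform = list(range(256)) * 4   # every gray level equally frequent

print(shannon_entropy(flat))     # 0.0
print(shannon_entropy(uniform))  # 8.0
```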

A Hand Reaches Towards the Cosmic Unknown

A mysterious hand extends towards a swirling black hole, evoking a sense of wonder and the vastness of the cosmos. This surreal scene, bathed in darkness, invites contemplation of the unknown and the mysteries that lie beyond our reach.

Prompt

Abstract: Mysterious, exciting ; A hand reaching out to grasp a shimmering, ethereal portal; close-up; Adventure; a swirling vortex of colors; cinematic

Characteristic

Shot : A hand reaches out towards a black hole in space, with swirling clouds of gas and dust surrounding it.

Aesthetic Score : 0.6

Mood : mysterious, dramatic, awe-inspiring

Quality

Entropy : 6.34

Noise : 103

Prompt Clip Score : 0.31

AI Evaluation

Likelihood of AI : 0.80

Image errors : The image appears to have some minor artifacts and blurriness, particularly around the edges.

Neon Dreams: A Cyberpunk Cityscape

Dive into a futuristic metropolis bathed in vibrant neon light. From a towering vantage point, witness the intricate web of streets and towering structures, where mystery and wonder intertwine in this cyberpunk dreamscape.

Prompt

Abstract: Energetic, futuristic ; A pixelated landscape with glowing, abstract figures; medium shot; Gaming; a digital, neon-lit cityscape; cinematic

Characteristic

Shot : A futuristic cityscape with tall, slender buildings illuminated by neon lights in various shades of pink and blue.

Aesthetic Score : 0.8

Mood : cyberpunk, futuristic, vibrant

Quality

Entropy : 6.41

Noise : 101

Prompt Clip Score : 0.33

AI Evaluation

Likelihood of AI : 0.90

Image errors : There are some minor artifacts in the image, particularly in the shadows. The neon lights are a bit too saturated and could be slightly toned down.

A Single Bloom in the Desert’s Embrace

A vibrant flower defies the harshness of a cracked desert landscape, offering a glimmer of hope and resilience amidst the desolate beauty. Mountains rise in the distance, adding to the sense of vastness and solitude.

Prompt

Abstract: Hopeful, melancholic ; A single, vibrant flower blooming in a desolate, cracked landscape; close-up; Tourism; a surreal, otherworldly desert; cinematic

Characteristic

Shot : A single purple flower blooming in the middle of a dry cracked desert landscape. A hazy distant mountain range can be seen in the background, with a bright blue sky above.

Aesthetic Score : 0.8

Mood : solitude, resilience, hope

Quality

Entropy : 6.91

Noise : 89

Prompt Clip Score : 0.31

AI Evaluation

Likelihood of AI : 0.30

Image errors : No visible artifacts or errors in the image.

City Lights Dance in a Long Exposure Symphony

A vibrant city street comes alive at night in this long exposure photograph. Streaks of car lights paint the asphalt, while the blur of pedestrians captures the energy and dynamism of urban life.

Prompt

Abstract: Dynamic, chaotic ; A blurred, kaleidoscopic image of a bustling city street; long shot; Travel; a whirlwind of colors and movement; cinematic

Characteristic

Shot : A city street with blurred lights and people, looking like a motion blur. The road is wet, with a yellow taxi in the distance.

Aesthetic Score : 0.7

Mood : dynamic, urban, mysterious

Quality

Entropy : 6.82

Noise : 109

Prompt Clip Score : 0.32

AI Evaluation

Likelihood of AI : 0.50

Image errors : The image is heavily blurred, making it difficult to see details. There is some noise in the image.

Silhouettes of Hope: A Family’s Journey into the Sunset

A serene and nostalgic image captures a family of four walking into the setting sun, their silhouettes painted against the fiery sky. The scene evokes feelings of love, family, and the passage of time, leaving a sense of hope and optimism in its wake.

Prompt

Abstract: Hopeful, nostalgic ; A silhouette of a family holding hands, walking towards a glowing, abstract sun; medium shot; Family; a warm, golden sunset; cinematic

Characteristic

Shot : A family of four walks away from the camera, silhouetted against a setting sun.

Aesthetic Score : 0.8

Mood : romantic, hopeful, serene

Quality

Entropy : 6.14

Noise : 78

Prompt Clip Score : 0.33

AI Evaluation

Likelihood of AI : 0.10

Image errors : No visible errors; the image appears to be well-exposed and sharp.

The Eye That Watches

A giant eye peers through a cracked window, reflecting a city skyline. Its gaze is both mesmerizing and unsettling, leaving you wondering what secrets it holds. This surreal and eerie image evokes a sense of mystery and intrigue, as if the eye itself is watching you.

Prompt

Abstract: Intense, suspenseful ; A single, abstract eye peering through a cracked, distorted window; close-up; Heroism; a dark, ominous cityscape; cinematic

Characteristic

Shot : Close-up of a giant eye made of wood and glass, reflecting a city skyline.

Aesthetic Score : 0.7

Mood : mysterious, eerie, urban

Quality

Entropy : 5.85

Noise : 99

Prompt Clip Score : 0.28

AI Evaluation

Likelihood of AI : 0.30

Image errors : No visible errors or artifacts.

Dreamscape of Swirling Lights

A mesmerizing scene of vibrant, swirling lights in the sky, resembling a cosmic nebula, casts a dreamy glow over a vast, futuristic landscape. The abstract, circular pattern of the lights and the dark, flat terrain create a sense of awe and wonder, transporting viewers to a psychedelic, otherworldly realm.

Prompt

Abstract: Intense, exhilarating ; A swirling vortex of colors and shapes representing a chaotic, digital world; wide shot; Gaming; a vibrant, neon-lit landscape; cinematic

Characteristic

Shot : Abstract landscape with swirling light streaks in the sky, appearing as a cosmic storm.

Aesthetic Score : 0.8

Mood : psychedelic, futuristic, otherworldly

Quality

Entropy : 6.59

Noise : 115

Prompt Clip Score : 0.32

AI Evaluation

Likelihood of AI : 0.90

Image errors : The light streaks in the sky exhibit slight aliasing and lack of smooth gradients, hinting at a digital creation.

Nature’s Fury Unleashed: A Dramatic Seascape of Power and Peril

Witness the raw power of nature in this dramatic seascape. Choppy waves crash against a rugged cliffside, set against a backdrop of a stormy sky. The scene evokes a sense of intensity and foreboding, capturing the beauty and danger of the wild ocean.

Prompt

Abstract: Solitary, contemplative ; A lone, abstract figure standing on a cliff overlooking a vast, swirling ocean; wide shot; Travel; a stormy, dramatic seascape; cinematic

Characteristic

Shot : A dramatic seascape with a dark, stormy sky looming over a rocky coastline and choppy waves crashing against the cliffs.

Aesthetic Score : 0.7

Mood : dramatic, moody, intense

Quality

Entropy : 6.51

Noise : 80

Prompt Clip Score : 0.31

AI Evaluation

Likelihood of AI : 0.10

Image errors : No noticeable artifacts or errors.

Golden Hour Wave

A serene and peaceful scene of a wave breaking, framing the setting sun in a golden glow. The wave’s shape creates a natural frame, highlighting the beauty of the sunset.

Prompt

Abstract: Sentimental, reflective ; A series of overlapping, abstract shapes representing a family’s journey through life; medium shot; Family; a warm, nostalgic glow; cinematic

Characteristic

Shot : A close-up view of a wave crest, showing a sunset through the hollow of the wave.

Aesthetic Score : 0.8

Mood : tranquil, serene, majestic

Quality

Entropy : 6.46

Noise : 96

Prompt Clip Score : 0.24

AI Evaluation

Likelihood of AI : 0.20

Image errors : There are no noticeable errors in the image. The resolution is high and the colors are well-balanced.
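
To compare the ten images at a glance, the per-image figures listed above can be averaged. The sketch below is a small Python summary: the values are copied from this post, while the dictionary layout and key names are only illustrative.

```python
from statistics import mean

# Per-image values copied from the entries above, in order of appearance.
metrics = {
    "aesthetic_score": [0.8, 0.6, 0.8, 0.8, 0.7, 0.8, 0.7, 0.8, 0.7, 0.8],
    "entropy": [6.75, 6.34, 6.41, 6.91, 6.82, 6.14, 5.85, 6.59, 6.51, 6.46],
    "noise": [92, 103, 101, 89, 109, 78, 99, 115, 80, 96],
    "prompt_clip_score": [0.28, 0.31, 0.33, 0.31, 0.32, 0.33, 0.28, 0.32, 0.31, 0.24],
    "likelihood_of_ai": [0.10, 0.80, 0.90, 0.30, 0.50, 0.10, 0.30, 0.90, 0.10, 0.20],
}

# Mean of each metric across the ten images, rounded for display.
summary = {name: round(mean(values), 3) for name, values in metrics.items()}

for name, value in summary.items():
    print(f"{name}: mean {value}")
```

With these values the mean Aesthetic Score is 0.75, the mean Prompt Clip Score is about 0.30, and the mean Likelihood of AI is 0.42.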

Conclusion

The results show that the model handled scene composition reasonably well but fell short on camera positions and on the desired aesthetic. Here’s a breakdown:

  • Camera Position: The model scored 0.4, which is below the “good” range of 0.5 to 0.75. This suggests that the model didn’t always accurately translate the intended camera positions from the prompt into the generated image.
  • Shot Analysis: The model scored 0.62, which falls within the “good” range. This indicates that the model generally understood the scene described in the prompt and created a shot that was somewhat consistent with it.
  • Aesthetic Analysis: The model scored 0.005, reported against a “very good” range of -0.2 to 0.1. The evaluation treats this as a considerable deviation from the aesthetic expected from the prompt, although 0.005 numerically lies inside the range as written, so the threshold or the score may be misstated here.
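
The range checks in the breakdown above can be written out as simple interval tests. This is a minimal sketch: the scores and bounds are taken from the post as written, the “good” range of 0.5 to 0.75 is assumed to apply to Shot Analysis as well as Camera Position, and the helper `in_range` and the tuple layout are hypothetical.

```python
def in_range(score, low, high):
    """Return True when score lies inside the closed interval [low, high]."""
    return low <= score <= high

# (metric, score, range label, low, high), values as stated in the post.
# Assumption: Shot Analysis uses the same "good" range as Camera Position.
results = [
    ("Camera Position", 0.4, "good", 0.5, 0.75),
    ("Shot Analysis", 0.62, "good", 0.5, 0.75),
    ("Aesthetic Analysis", 0.005, "very good", -0.2, 0.1),
]

for metric, score, label, low, high in results:
    verdict = "inside" if in_range(score, low, high) else "outside"
    print(f"{metric}: {score} is {verdict} the '{label}' range [{low}, {high}]")
```

Under this literal reading of the stated bounds, only Camera Position falls outside its range.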

Overall: While the model shows some ability to understand scene composition, it struggles to accurately capture the desired camera positions and aesthetic. This suggests that the model needs further training to improve its ability to translate prompts into visually appealing and accurate images.