AI's Artistic Journey: A Glimpse into the 'style-aesthetic' Challenge with Leonardo-ai
The ‘style-aesthetic’ is a crucial aspect of visual storytelling, influencing the emotional impact and overall message of an image. It encompasses elements like color palettes, lighting, composition, and even the choice of camera angles. This blog post explores the challenges of capturing this ‘style-aesthetic’ using a generative AI model, analyzing its performance in translating textual prompts into visually appealing and accurate images. We’ll delve into specific examples, highlighting the model’s strengths and weaknesses in understanding camera positions, shot composition, and the desired aesthetic. Join us as we explore the fascinating world of AI-generated art and the ongoing quest for achieving artistic accuracy.
Created with: leonardo-ai
Solitude and Majesty: A Hiker’s Sunrise Above the Clouds
A lone hiker stands on a rocky mountain peak, silhouetted against a breathtaking sea of clouds at sunrise. The scene evokes a sense of serenity, tranquility, and the majestic power of nature. The dramatic effect of the hiker’s small figure against the vast expanse of clouds emphasizes the beauty and solitude of this awe-inspiring moment.
Prompt
Abstract: Epic, triumphant ; A lone figure standing on a mountain peak; wide shot; Heroism; a vast, swirling sea of clouds; cinematic
Characteristic
Shot : A lone hiker stands on a rocky mountain peak overlooking a vast sea of clouds. The sky is a soft blue and the clouds are a beautiful white, with hints of gold from the setting sun. The hiker is silhouetted against the bright clouds, giving the image a sense of scale and solitude.
Aesthetic Score : 0.8
Mood : tranquil, serene, inspiring
Quality
Entropy : 6.75
Noise : 92
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is free of artifacts and errors.
A Hand Reaches Towards the Cosmic Unknown
A mysterious hand extends towards a swirling black hole, evoking a sense of wonder and the vastness of the cosmos. This surreal scene, bathed in darkness, invites contemplation of the unknown and the mysteries that lie beyond our reach.
Prompt
Abstract: Mysterious, exciting ; A hand reaching out to grasp a shimmering, ethereal portal; close-up; Adventure; a swirling vortex of colors; cinematic
Characteristic
Shot : A hand reaches out towards a black hole in space, with swirling clouds of gas and dust surrounding it.
Aesthetic Score : 0.6
Mood : mysterious, dramatic, awe-inspiring
Quality
Entropy : 6.34
Noise : 103
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to have some minor artifacts and blurriness, particularly around the edges.
Neon Dreams: A Cyberpunk Cityscape
Dive into a futuristic metropolis bathed in vibrant neon light. From a towering vantage point, witness the intricate web of streets and towering structures, where mystery and wonder intertwine in this cyberpunk dreamscape.
Prompt
Abstract: Energetic, futuristic ; A pixelated landscape with glowing, abstract figures; medium shot; Gaming; a digital, neon-lit cityscape; cinematic
Characteristic
Shot : A futuristic cityscape with tall, slender buildings illuminated by neon lights in various shades of pink and blue.
Aesthetic Score : 0.8
Mood : cyberpunk, futuristic, vibrant
Quality
Entropy : 6.41
Noise : 101
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.90
Image errors : There are some minor artifacts in the image, particularly in the shadows. The neon lights are a bit too saturated and could be slightly toned down.
A Single Bloom in the Desert’s Embrace
A vibrant flower defies the harshness of a cracked desert landscape, offering a glimmer of hope and resilience amidst the desolate beauty. Mountains rise in the distance, adding to the sense of vastness and solitude.
Prompt
Abstract: Hopeful, melancholic ; A single, vibrant flower blooming in a desolate, cracked landscape; close-up; Tourism; a surreal, otherworldly desert; cinematic
Characteristic
Shot : A single purple flower blooming in the middle of a dry cracked desert landscape. A hazy distant mountain range can be seen in the background, with a bright blue sky above.
Aesthetic Score : 0.8
Mood : solitude, resilience, hope
Quality
Entropy : 6.91
Noise : 89
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.30
Image errors : No visible artifacts or errors in the image.
City Lights Dance in a Long Exposure Symphony
A vibrant city street comes alive at night in this long exposure photograph. Streaks of car lights paint the asphalt, while the blur of pedestrians captures the energy and dynamism of urban life.
Prompt
Abstract: Dynamic, chaotic ; A blurred, kaleidoscopic image of a bustling city street; long shot; Travel; a whirlwind of colors and movement; cinematic
Characteristic
Shot : A city street with lights and people rendered as motion blur. The road is wet, with a yellow taxi in the distance.
Aesthetic Score : 0.7
Mood : dynamic, urban, mysterious
Quality
Entropy : 6.82
Noise : 109
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.50
Image errors : The image is heavily blurred, making it difficult to see details. There is some noise in the image.
Silhouettes of Hope: A Family’s Journey into the Sunset
A serene and nostalgic image captures a family of four walking into the setting sun, their silhouettes painted against the fiery sky. The scene evokes feelings of love, family, and the passage of time, leaving a sense of hope and optimism in its wake.
Prompt
Abstract: Hopeful, nostalgic ; A silhouette of a family holding hands, walking towards a glowing, abstract sun; medium shot; Family; a warm, golden sunset; cinematic
Characteristic
Shot : A family of four walks away from the camera, silhouetted against a setting sun.
Aesthetic Score : 0.8
Mood : romantic, hopeful, serene
Quality
Entropy : 6.14
Noise : 78
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible errors; the image appears to be well exposed and sharp.
The Eye That Watches
A giant eye peers through a cracked window, reflecting a city skyline. Its gaze is both mesmerizing and unsettling, leaving you wondering what secrets it holds. This surreal and eerie image evokes a sense of mystery and intrigue, as if the eye itself is watching you.
Prompt
Abstract: Intense, suspenseful ; A single, abstract eye peering through a cracked, distorted window; close-up; Heroism; a dark, ominous cityscape; cinematic
Characteristic
Shot : Close-up of a giant eye made of wood and glass, reflecting a city skyline.
Aesthetic Score : 0.7
Mood : mysterious, eerie, urban
Quality
Entropy : 5.85
Noise : 99
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.30
Image errors : No visible errors or artifacts.
Dreamscape of Swirling Lights
A mesmerizing scene of vibrant, swirling lights in the sky, resembling a cosmic nebula, casts a dreamy glow over a vast, futuristic landscape. The abstract, circular pattern of the lights and the dark, flat terrain create a sense of awe and wonder, transporting viewers to a psychedelic, otherworldly realm.
Prompt
Abstract: Intense, exhilarating ; A swirling vortex of colors and shapes representing a chaotic, digital world; wide shot; Gaming; a vibrant, neon-lit landscape; cinematic
Characteristic
Shot : Abstract landscape with swirling light streaks in the sky, appearing as a cosmic storm.
Aesthetic Score : 0.8
Mood : psychedelic, futuristic, otherworldly
Quality
Entropy : 6.59
Noise : 115
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.90
Image errors : The light streaks in the sky exhibit slight aliasing and lack of smooth gradients, hinting at a digital creation.
Nature’s Fury Unleashed: A Dramatic Seascape of Power and Peril
Witness the raw power of nature in this dramatic seascape. Choppy waves crash against a rugged cliffside, set against a backdrop of a stormy sky. The scene evokes a sense of intensity and foreboding, capturing the beauty and danger of the wild ocean.
Prompt
Abstract: Solitary, contemplative ; A lone, abstract figure standing on a cliff overlooking a vast, swirling ocean; wide shot; Travel; a stormy, dramatic seascape; cinematic
Characteristic
Shot : A dramatic seascape with a dark, stormy sky looming over a rocky coastline and choppy waves crashing against the cliffs.
Aesthetic Score : 0.7
Mood : dramatic, moody, intense
Quality
Entropy : 6.51
Noise : 80
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Golden Hour Wave
A serene and peaceful scene of a wave breaking, framing the setting sun in a golden glow. The wave’s shape creates a natural frame, highlighting the beauty of the sunset.
Prompt
Abstract: Sentimental, reflective ; A series of overlapping, abstract shapes representing a family’s journey through life; medium shot; Family; a warm, nostalgic glow; cinematic
Characteristic
Shot : A close-up view of a wave crest, showing a sunset through the hollow of the wave.
Aesthetic Score : 0.8
Mood : tranquil, serene, majestic
Quality
Entropy : 6.46
Noise : 96
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no noticeable errors in the image. The resolution is high and the colors are well-balanced.
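Taken together, the per-image numbers above are easier to compare once they are aggregated. The following is a minimal sketch, not the pipeline used to produce this post: the `IMAGES` list and the `summarize` helper are illustrative names, and the values are copied by hand from the ten entries above.

```python
from statistics import mean

# (title, aesthetic score, prompt CLIP score, likelihood of AI),
# copied by hand from the ten entries above.
IMAGES = [
    ("Solitude and Majesty", 0.8, 0.28, 0.10),
    ("A Hand Reaches Towards the Cosmic Unknown", 0.6, 0.31, 0.80),
    ("Neon Dreams", 0.8, 0.33, 0.90),
    ("A Single Bloom in the Desert's Embrace", 0.8, 0.31, 0.30),
    ("City Lights Dance in a Long Exposure Symphony", 0.7, 0.32, 0.50),
    ("Silhouettes of Hope", 0.8, 0.33, 0.10),
    ("The Eye That Watches", 0.7, 0.28, 0.30),
    ("Dreamscape of Swirling Lights", 0.8, 0.32, 0.90),
    ("Nature's Fury Unleashed", 0.7, 0.31, 0.10),
    ("Golden Hour Wave", 0.8, 0.24, 0.20),
]


def summarize(rows):
    """Return mean scores and the title with the lowest prompt CLIP score."""
    weakest = min(rows, key=lambda r: r[2])
    return {
        "mean_aesthetic": round(mean(r[1] for r in rows), 3),
        "mean_clip": round(mean(r[2] for r in rows), 3),
        "mean_ai_likelihood": round(mean(r[3] for r in rows), 3),
        "weakest_prompt_match": weakest[0],
    }


if __name__ == "__main__":
    for key, value in summarize(IMAGES).items():
        print(f"{key}: {value}")
```

On these values, the lowest prompt CLIP score belongs to Golden Hour Wave, whose prompt describes a family’s journey through life while the generated shot shows a breaking wave.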
Conclusion
The results show that the generative AI model handled scene composition and the requested aesthetic well, but was weaker at reproducing the camera positions named in the prompts. Here’s a breakdown:
- Camera Position: The model scored 0.4, which is below the “good” range of 0.5 to 0.75. The generated images did not always reflect the camera position requested in the prompt.
- Shot Analysis: The model scored 0.62, which falls within the “good” range. The model generally understood the scene described in the prompt and produced a shot that was broadly consistent with it.
- Aesthetic Analysis: The model scored 0.005, which lies inside the “very good” range of -0.2 to 0.1. By this measure, the aesthetic of the generated images stayed close to the aesthetic expected from the prompts.
Overall: The model captures scene composition and the intended aesthetic reasonably well, but it struggles to reproduce the requested camera positions. This suggests that the model needs further training on camera framing before it can translate prompts into consistently accurate images.
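The comparison of each reported score against its range can be checked mechanically. This is a minimal sketch under stated assumptions: `Band` and `place` are hypothetical names, and the “good” range for Shot Analysis is assumed to be the same 0.5 to 0.75 range quoted for Camera Position, since the post does not state it separately.

```python
from typing import NamedTuple


class Band(NamedTuple):
    """A named score range with inclusive bounds."""
    label: str
    low: float
    high: float


def place(score: float, band: Band) -> str:
    """Say whether a score is below, within, or above a band."""
    if score < band.low:
        return "below"
    if score > band.high:
        return "above"
    return "within"


GOOD = Band("good", 0.5, 0.75)
VERY_GOOD = Band("very good", -0.2, 0.1)

# Scores as reported in the conclusion.
REPORTED = {
    "Camera Position": (0.4, GOOD),
    # Assumption: Shot Analysis uses the same "good" range as Camera Position.
    "Shot Analysis": (0.62, GOOD),
    "Aesthetic Analysis": (0.005, VERY_GOOD),
}

if __name__ == "__main__":
    for name, (score, band) in REPORTED.items():
        print(f"{name}: {score} is {place(score, band)} the "
              f"'{band.label}' range [{band.low}, {band.high}]")
```

Treating the bounds as inclusive is a design choice made here for simplicity; the post does not say how boundary values are handled.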