AI's Artistic Journey: Capturing the Essence of Style-Aesthetic with Leonardo-ai
- 10 minutes read - 1946 wordsTable of Contents
The ‘style-aesthetic’ is a powerful tool in visual storytelling, encompassing the mood, tone, and overall visual language of a scene. It’s about more than just the subject matter; it’s about how the scene is presented, the emotions it evokes, and the message it conveys. This blog post explores how a generative AI model navigates this complex concept, analyzing its ability to capture the essence of style-aesthetic across a range of scenes.
Created with: leonardo-ai
Silhouetted Mystery at Sunset
A lone figure, shrouded in a long coat and hood, stands in a field, their silhouette stark against the fiery hues of a setting sun. The scene evokes a sense of mystery, loneliness, and contemplation, leaving the viewer to ponder the figure’s story.
Prompt
Avant-garde: Epic, melancholic ; A lone figure, silhouetted against a blazing sunset; long shot; Heroism; A vast, desolate landscape; cinematic
Characteristic
Shot : A solitary figure in a long coat and hood stands silhouetted against a vibrant sunset. The figure is facing away from the viewer and appears to be gazing at the horizon in a desolate, grassy landscape.
Aesthetic Score : 0.6
Mood : melancholy, mysterious, contemplative
Quality
Entropy : 6.23
Noise : 78
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image appears to be slightly overexposed, with the sunset being the most prominent feature. It could be due to the JPEG compression, but the image is lacking in the details.
Reaching for the Unknown: A Hand Enters a World of Wonder
A hand stretches out towards a swirling vortex of vibrant colors, hinting at a mystical journey. The image evokes a sense of awe and anticipation, as if a threshold is being crossed into a realm of magic and possibility.
Prompt
Avant-garde: Surreal, mysterious ; A hand reaching out from a swirling vortex of light; close-up; Adventure; A kaleidoscope of colors and abstract shapes; cinematic
Characteristic
Shot : A hand reaching out towards a swirling vortex of colorful abstract shapes, possibly representing a portal or a cosmic phenomenon.
Aesthetic Score : 0.7
Mood : mysterious, ethereal, abstract
Quality
Entropy : 6.62
Noise : 100
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : The hand appears slightly blurry and lacks detail, possibly due to a lack of focus or post-processing.
Lost in the Neon Labyrinth: A Cyberpunk Silhouette
A lone figure, cloaked in a high-tech suit, stands silhouetted against a vibrant cyberpunk metropolis bathed in neon light. The scene evokes a sense of isolation and wonder, hinting at a story of mystery and intrigue in a futuristic world.
Prompt
Avant-garde: Nostalgic, futuristic ; A pixelated character, rendered in a retro 8-bit style, standing on a precipice overlooking a digital cityscape; medium shot; Gaming; A neon-lit, futuristic cityscape; cinematic
Characteristic
Shot : A lone figure in a futuristic spacesuit stands on a rooftop, overlooking a neon-lit cityscape with towering skyscrapers and glowing lines.
Aesthetic Score : 0.7
Mood : futuristic, mysterious, cyberpunk
Quality
Entropy : 6.47
Noise : 85
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to have been generated with AI and the details are slightly blurry in certain areas.
A Suitcase Full of Memories, Waiting for Departure
A vintage suitcase sits alone on a bustling train platform, its worn leather and faded labels whispering tales of journeys past. The blurry background hints at the rush of the station, while the suitcase stands as a poignant symbol of loneliness and anticipation. A melancholic mood hangs in the air, tinged with nostalgia for what has been and excitement for what lies ahead.
Prompt
Avant-garde: Lonely, evocative ; A single, weathered suitcase, abandoned on a deserted train platform; close-up; Tourism; A misty, atmospheric train station; cinematic
Characteristic
Shot : A vintage suitcase placed on the tracks of an empty train station platform, with a hazy background and a subtle sense of loneliness.
Aesthetic Score : 0.7
Mood : melancholy, solitude, nostalgic
Quality
Entropy : 6.75
Noise : 89
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors
Gloomy Reflections in a Broken City
A low angle shot captures the desolate beauty of a cracked cobblestone street, where a puddle reflects the towering buildings above. The mood is heavy with urban decay and a sense of loneliness.
Prompt
Avant-garde: Disorienting, dreamlike ; A pair of feet walking on a cracked, abstract pavement; low-angle shot; Travel; A distorted, surreal cityscape; cinematic
Characteristic
Shot : A city street with cracked pavement and a puddle reflecting the buildings in the background. The city is lit by the sun and the buildings are visible in the distance.
Aesthetic Score : 0.6
Mood : urban, gritty, neglected
Quality
Entropy : 6.86
Noise : 105
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : None
A Glimpse into the Past: A Family’s Intimate Gathering by Candlelight
Experience the intimacy and drama of a bygone era as a family of three shares a quiet moment around a candlelit table. Set in a dimly lit room adorned with antique furnishings, this scene offers a glimpse into a historical or period piece, where every face tells a story.
Prompt
Avant-garde: Intimate, mysterious ; A family gathered around a flickering candle, their faces obscured by shadows; close-up; Family; A dimly lit, antique room; cinematic
Characteristic
Shot : A family sits at a table illuminated by candles, in a dimly lit room with antique furniture and paintings. The mood is subdued and intimate.
Aesthetic Score : 0.7
Mood : intimate, somber, nostalgic
Quality
Entropy : 5.47
Noise : 81
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible artifacts or errors in the image.
A Single Red Balloon in a White Room: A Moment of Hope and Isolation
A minimalist scene featuring a lone red balloon with a trailing ribbon floating in a stark white room. The image evokes a sense of quiet contemplation, with the balloon’s presence and isolation creating a stark contrast against the white backdrop. The trailing ribbon suggests a journey and hints at a glimmer of hope.
Prompt
Avant-garde: Hopeful, symbolic ; A single, red balloon floating against a stark, white background; close-up; Heroism; A minimalist, abstract setting; cinematic
Characteristic
Shot : A single red balloon with a ribbon is floating in the air against a white background. The balloon is slightly deflated and the ribbon is lying on the ground.
Aesthetic Score : 0.6
Mood : minimalistic, lonely, simple
Quality
Entropy : 5.85
Noise : 63
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.10
Image errors : None.
Ready to Play: A Retro Gamer’s Dream
A close-up of a hand gripping a classic gamepad, poised before a vintage CRT television displaying vibrant, abstract visuals. The warm glow and blurred background evoke a sense of nostalgic anticipation, transporting you back to the golden age of gaming.
Prompt
Avant-garde: Nostalgic, introspective ; A hand holding a vintage game controller, the screen reflecting a distorted, pixelated world; close-up; Gaming; A dimly lit, retro-themed room; cinematic
Characteristic
Shot : A close-up shot of a hand holding a black video game controller. The controller is in focus, while the background is blurry. In the background, there is a vintage TV turned on. The scene is set in a dimly lit room. The lighting is warm and inviting, creating a sense of nostalgia and excitement.
Aesthetic Score : 0.7
Mood : nostalgic, gaming, focused
Quality
Entropy : 6.28
Noise : 76
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly grainy and the colors are a bit muted. There is some noise visible in the shadows, and the focus is slightly soft on the background.
A Hiker’s Encounter with a Mystical Vortex in the Sky
A lone hiker stands in awe on a mountain peak, gazing up at a swirling cloud formation that resembles a gateway to another realm. The scene evokes a sense of mystery, serenity, and awe-inspiring wonder.
Prompt
Avant-garde: Sublime, awe-inspiring ; A lone figure standing on a mountain peak, their silhouette framed by a swirling vortex of clouds; long shot; Adventure; A dramatic, mountainous landscape; cinematic
Characteristic
Shot : A lone hiker stands on a mountain peak, silhouetted against a breathtaking spiral of clouds in the sky. The clouds create a sense of mystery and wonder, while the mountains and the vast expanse of clouds below add to the sense of grandeur.
Aesthetic Score : 0.8
Mood : mysterious, awe-inspiring, contemplative
Quality
Entropy : 6.84
Noise : 98
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors
Eiffel Tower Under Siege: Chaotic Collage Captures City’s Desperation
A disturbing and intense collage depicts a city under attack, with the iconic Eiffel Tower at its center. The chaotic scene, filled with contrasting images, evokes a sense of impending doom and leaves a lasting impression of the city’s struggle.
Prompt
Avant-garde: Energetic, disorienting ; A series of fragmented, overlapping images, depicting different aspects of travel and tourism; montage; Tourism; A chaotic, abstract collage; cinematic
Characteristic
Shot : A collage of various images, with the Eiffel Tower prominently displayed in the center, surrounded by other images of buildings, cityscapes, and nature scenes.
Aesthetic Score : 0.6
Mood : chaotic, vibrant, surreal
Quality
Entropy : 6.80
Noise : 110
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.90
Image errors : There are some slight blurring artifacts and inconsistencies in the edges of the collage, especially where the different images meet.
Conclusion
The results show that the generative AI model performed well in terms of understanding the scene and camera position, but struggled with the aesthetic. Here’s a breakdown:
- Camera Position: The model scored 0.3, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.53, which is considered good. This indicates that the model was able to understand the scene described in the prompt and create a shot that was relatively close to the intended one.
- Aesthetic Analysis: The model scored 0.14, which is considered very good. This means that the generated image’s aesthetic was very close to the expected aesthetic described in the prompt.
Overall, the model seems to be better at understanding the scene and achieving the desired aesthetic than accurately capturing the camera position.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://leonardo.ai