AI's Artistic Struggle: Capturing the 'Style-Aesthetic' with Leonardo-ai
The ‘style-aesthetic’ is a powerful tool in visual storytelling, allowing artists to evoke specific emotions and atmospheres. It encompasses everything from lighting and color palette to composition and overall mood. But can AI truly understand and replicate these nuances? This blog post explores that question through ten generated images, each prompted in the French New Wave style, and highlights where the model succeeds and where it falls short in replicating a specific artistic style.
Created with: leonardo-ai
Silhouettes of Solitude: A Figure Walks into the Sunset
A lone figure, shrouded in mystery, traverses a desolate desert landscape as the sun dips below the horizon. The warm glow of the setting sun casts long shadows, creating a sense of melancholy and intrigue. The vastness of the desert and the silhouette of the figure walking into the sunset amplify the feeling of solitude and mystery.
Prompt
French New Wave: epic, melancholic ; A lone figure, silhouetted against a setting sun; long shot; heroism; a vast, empty desert landscape; cinematic
Characteristic
Shot : A lone figure in a long coat and hat walks away from the camera across a vast desert landscape at sunset.
Aesthetic Score : 0.7
Mood : mysterious, desolate, contemplative
Quality
Entropy : 6.31
Noise : 86
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is slight noise in the image, particularly in the sky. Light post-processing could address it.
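The post does not say how the Entropy value is computed. One common convention, assumed here, is the Shannon entropy of an 8-bit grayscale histogram, which ranges from 0 bits (a single flat tone) to 8 bits (all 256 levels equally likely). The sketch below illustrates that definition on made-up pixel lists; the function name and sample data are hypothetical and are not taken from the tool used for this post.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a sequence of pixel intensities."""
    pixels = list(pixels)
    total = len(pixels)
    if total == 0:
        raise ValueError("pixels must not be empty")
    counts = Counter(pixels)
    # Sum p * log2(1/p) over every intensity level that actually occurs.
    return sum((c / total) * math.log2(total / c) for c in counts.values())


# Hypothetical sample data, not pixels from the images in this post.
flat_image = [128] * 1024              # a single gray level everywhere
uniform_image = list(range(256)) * 4   # every 8-bit level equally often

print(shannon_entropy(flat_image))     # lowest possible entropy
print(shannon_entropy(uniform_image))  # highest possible entropy for 8-bit data
```

If the reported figures follow this convention, a value such as 6.31 would point to a fairly wide spread of tones, well short of the 8-bit maximum.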
A Journey Through Time: Unraveling the Secrets of a Faded Coastline
This weathered map, likely of the Mediterranean, whispers tales of forgotten voyages and bygone eras. Its faded colors and worn edges evoke a sense of nostalgia, inviting you to explore the mysteries hidden within its lines. The stark contrast between the map’s faded hues and the dark wood surface creates an intriguing atmosphere, beckoning you to delve into its historical depths.
Prompt
French New Wave: intriguing, suspenseful ; A close-up of a weathered map, with a finger tracing a route; medium shot; adventure; a cluttered, dimly lit room; cinematic
Characteristic
Shot : A close-up of a vintage map, likely depicting the Mediterranean Sea and surrounding regions. The map has a worn and aged appearance with faded colors and a textured surface.
Aesthetic Score : 0.7
Mood : nostalgic, historical, adventurous
Quality
Entropy : 6.93
Noise : 92
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant errors or artifacts observed.
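The Prompt Clip Score reported for each image is not defined in the post. CLIP-based scores are commonly built on the cosine similarity between a text embedding of the prompt and an image embedding of the result, so the sketch below shows that calculation under the assumption that something similar is used here. The vectors are tiny made-up placeholders rather than real CLIP outputs, and `cosine_similarity` is a hypothetical helper, not part of any library.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("vectors must be non-zero")
    return dot / (norm_a * norm_b)


# Placeholder embeddings; a real CLIP model produces vectors with hundreds of dimensions.
prompt_embedding = [0.2, 0.7, 0.1, 0.4]
image_embedding = [0.6, 0.1, 0.5, 0.3]

score = cosine_similarity(prompt_embedding, image_embedding)
print(round(score, 2))
```

With real CLIP embeddings, raw similarities for well-matched prompt and image pairs often sit far below 1, which would be consistent with the 0.28 to 0.36 range reported in this post if this is the method behind it.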
Neon Dreams: A Gamer’s Focus
A nostalgic scene of a young man immersed in a retro arcade game. Vibrant neon lights illuminate the player, creating a sense of excitement and anticipation. The blurred background emphasizes the player’s focused hands, drawing you into the moment.
Prompt
French New Wave: intense, energetic ; A hand holding a joystick, fingers moving rapidly; close-up; gaming; a neon-lit arcade with flashing screens; cinematic
Characteristic
Shot : A person playing an arcade game in a dimly lit arcade setting. The focus is on the person’s hands on the control panel, with the arcade cabinet and other people in the background out of focus.
Aesthetic Score : 0.7
Mood : retro, nostalgic, focused
Quality
Entropy : 6.26
Noise : 83
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly underexposed and the colors are a bit muted. There is some noise in the shadows.
Lost in Parisian Dreams: A Moment of Melancholy at the Eiffel Tower
A young woman stands before the iconic Eiffel Tower, her gaze turned back over her shoulder, hinting at a bittersweet longing. The low angle and shallow depth of field create a sense of mystery, drawing the viewer into her melancholic reverie. This Parisian scene captures the essence of romance and wistful contemplation.
Prompt
French New Wave: romantic, nostalgic ; A young woman, her face filled with wonder, gazing at the Eiffel Tower; medium shot; tourism; a bustling Parisian street; cinematic
Characteristic
Shot : A young woman is standing in front of the Eiffel Tower, looking at the camera with a stoic expression. The sun is setting behind her, casting a warm glow over the scene.
Aesthetic Score : 0.8
Mood : mysterious, alluring, pensive
Quality
Entropy : 6.81
Noise : 87
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible artifacts or errors
Tranquil Journey Through Golden Fields
A peaceful scene captured from a train window, showcasing a vast field of golden wheat, a line of trees, and a blue sky with clouds. The composition creates a sense of depth and perspective, inviting you to escape into the tranquil beauty of the moment.
Prompt
French New Wave: reflective, contemplative ; A train speeding through a countryside landscape, with a lone figure looking out the window; long shot; travel; a vibrant, sun-drenched field; cinematic
Characteristic
Shot : A view out of a train window on a sunny day, looking out over a field of golden wheat and a line of trees in the distance
Aesthetic Score : 0.6
Mood : tranquil, calm, nostalgic
Quality
Entropy : 6.48
Noise : 94
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant errors in the image.
Family Joy: A Moment of Shared Laughter and Love
This heartwarming scene captures a family of three sharing a meal and laughter. The warm lighting and genuine smiles create a sense of intimacy and happiness, making it a perfect representation of family bonding.
Prompt
French New Wave: intimate, heartwarming ; A family gathered around a table, sharing a meal, with laughter and conversation; medium shot; family; a warm, inviting kitchen; cinematic
Characteristic
Shot : Three people are sitting at a table, eating and talking. The scene is warm and inviting, with soft lighting and a relaxed atmosphere.
Aesthetic Score : 0.7
Mood : cozy, warm, intimate
Quality
Entropy : 6.86
Noise : 97
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : None
Lost in the City’s Labyrinth
A young man, shrouded in shadows, navigates a bustling marketplace with a worried expression. The narrow alleyway and low lighting create a sense of tension and mystery, suggesting he’s trying to blend in while hiding something.
Prompt
French New Wave: urgent, dramatic ; A young man, his face etched with determination, running through a crowded marketplace; medium shot; heroism; a chaotic, bustling market; cinematic
Characteristic
Shot : A young man with a worried expression walks through a crowded marketplace in a European city. The scene is set in the early 20th century: the man wears period clothing, and the architecture and marketplace are reminiscent of that era. The image is shot from a low angle, giving the man a sense of importance and power.
Aesthetic Score : 0.7
Mood : dramatic, intense, suspenseful
Quality
Entropy : 6.76
Noise : 94
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image is slightly grainy and the lighting is somewhat flat, which gives it a vintage look.
Lost in the Stars: A Golden Compass Beckons
A close-up of a golden compass with a starburst design, set against a dark background, evokes a sense of mystery and intrigue. The shallow depth of field draws the viewer’s eye to the intricate details of the compass, hinting at a hidden world waiting to be explored.
Prompt
French New Wave: mysterious, suspenseful ; A close-up of a compass needle spinning, pointing towards an unknown destination; close-up; adventure; a dimly lit, mysterious room; cinematic
Characteristic
Shot : Close-up of an ornate compass with a golden needle pointing north.
Aesthetic Score : 0.8
Mood : mysterious, antique, timeless
Quality
Entropy : 5.85
Noise : 73
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible errors
The Sound of Focus: A Glimpse into a Music Production Session
Three young men, bathed in the blue and green glow of a dimly lit room, huddle around a mixing board, their intense focus hinting at a creative process brimming with suspense and intrigue. This scene captures the raw energy and dedication of music production, leaving viewers eager to know what sonic masterpiece is being crafted.
Prompt
French New Wave: intense, focused ; A group of friends huddled around a computer screen, their faces illuminated by the glow; medium shot; gaming; a dimly lit, cluttered room; cinematic
Characteristic
Shot : Three young men are working on a soundboard in a dimly lit room. The room appears to be a home studio, with a cluttered desk and a framed picture on the wall.
Aesthetic Score : 0.7
Mood : intense, focused, mysterious
Quality
Entropy : 6.01
Noise : 81
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible artifacts or errors in the image.
Silhouettes of Love: A Romantic Sunset Stroll
A couple walks hand-in-hand down a cobblestone street in a European city, their silhouettes bathed in the golden glow of the setting sun. The scene evokes a sense of romance, nostalgia, and intimacy, capturing the magic of a shared moment.
Prompt
French New Wave: romantic, nostalgic ; A couple walking hand-in-hand along a cobblestone street, their silhouettes framed by the setting sun; long shot; tourism; a romantic, picturesque town; cinematic
Characteristic
Shot : A couple walks down a cobblestone street in a European city, silhouetted by the setting sun.
Aesthetic Score : 0.7
Mood : romantic, nostalgic, peaceful
Quality
Entropy : 6.78
Noise : 106
Prompt Clip Score : 0.36
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable image artifacts or errors
Conclusion
Across the ten images, the generative AI model handled scene content and camera position well but struggled with the requested French New Wave aesthetic. Here’s a breakdown:
- Camera Position: The model scored 0.48, which is considered good. The camera positions in the generated images closely matched the prompts’ instructions.
- Shot Analysis: The model scored 0.57, also considered good. The model captured the shot type and composition described in the prompts.
- Aesthetic Analysis: The model scored 0.05, which is poor. The aesthetic style of the generated images deviated significantly from the style specified in the prompts.
Overall, the model demonstrates a good understanding of scene and camera position but needs improvement in capturing the intended aesthetic style.
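For readers who want to compare the ten images side by side, the per-image metrics listed above can be summarized with a few lines of Python. This is only a convenience sketch: the numbers are copied from the result blocks in this post, in order of appearance, while the dictionary keys and the `summarize` helper are names chosen here for illustration. These per-image figures are separate from the three scores in the breakdown above, whose calculation the post does not describe.

```python
from statistics import mean

# Values copied from the ten result blocks above, in order of appearance.
results = {
    "aesthetic_score": [0.7, 0.7, 0.7, 0.8, 0.6, 0.7, 0.7, 0.8, 0.7, 0.7],
    "entropy": [6.31, 6.93, 6.26, 6.81, 6.48, 6.86, 6.76, 5.85, 6.01, 6.78],
    "noise": [86, 92, 83, 87, 94, 97, 94, 73, 81, 106],
    "prompt_clip_score": [0.31, 0.28, 0.32, 0.35, 0.32, 0.32, 0.32, 0.29, 0.32, 0.36],
    "likelihood_of_ai": [0.20, 0.20, 0.10, 0.10, 0.20, 0.20, 0.10, 0.10, 0.10, 0.10],
}


def summarize(values):
    """Return the minimum, mean (rounded to 3 places), and maximum of a metric."""
    return {"min": min(values), "mean": round(mean(values), 3), "max": max(values)}


summary = {name: summarize(values) for name, values in results.items()}

for name, stats in summary.items():
    print(f"{name:18} min={stats['min']:<6} mean={stats['mean']:<6} max={stats['max']}")
```

The per-image Aesthetic Score averages 0.71, while the Aesthetic Analysis score in the breakdown is 0.05. The two appear to measure different things: how pleasing an image looks on its own versus how closely it matches the style named in the prompt.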
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://leonardo.ai