AI Captures the Essence of 'Style-Aesthetic' but Struggles with Scene Composition: An Experiment with Scenario
A ‘style-aesthetic’ is the visual language of a particular style, one that evokes specific emotions and associations. It is often used in film, photography, and art to create a distinct atmosphere and immerse viewers in a particular world. This post looks at how well a generative AI model captures that visual language. We review an experiment in which a model was given ten ‘Vintage’ style-aesthetic prompts, and we analyze where the generated images match the prompt and where they fall short.
Created with: Scenario
A Lone Figure in the Desert: A Journey Begins
A woman stands silhouetted against the warm glow of the desert sunset, her gaze fixed on the horizon. The scene evokes a sense of adventure, mystery, and hope, hinting at a journey of self-discovery and the challenges that lie ahead.
Prompt
Vintage: Epic, adventurous, hopeful ; A lone, weathered explorer; medium shot; Adventure; a vast, sun-drenched desert landscape with ancient ruins in the distance; cinematic
Characteristic
Shot : A woman with long brown hair stands in a desert landscape, wearing a beige shirt, and a brown leather strap across her chest. There are sand dunes and rock formations in the background. The sky is a pale blue and the sun is setting.
Aesthetic Score : 0.7
Mood : mysterious, adventurous, hopeful
Quality
Entropy : 6.60
Noise : 81
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable errors.
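The post does not say how the Entropy figure is computed. Values between 6.5 and 6.9 would be consistent with the Shannon entropy of an 8-bit grayscale histogram, which tops out at 8 bits, so the sketch below assumes that definition. The function name and the toy pixel lists are hypothetical and are not taken from the experiment.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a flat sequence of grayscale values.

    Assumption: the post's "Entropy" is a histogram entropy like this one.
    """
    if not pixels:
        raise ValueError("pixels must not be empty")
    total = len(pixels)
    counts = Counter(pixels)
    # Sum p * log2(1 / p) over every tone that occurs in the image.
    return sum((c / total) * math.log2(total / c) for c in counts.values())


# Hypothetical toy "images" as flat lists of 8-bit grayscale values.
flat_image = [128] * 64                 # a single tone, no variation
two_tone_image = [0] * 32 + [255] * 32  # two equally likely tones
full_range_image = list(range(256))     # every tone exactly once

print(shannon_entropy(flat_image))        # 0.0
print(shannon_entropy(two_tone_image))    # 1.0
print(shannon_entropy(full_range_image))  # 8.0
```

Under this assumption, a higher value means the tones are spread more evenly across the available range, not that the image is better.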
Candlelight and Suspense: A Board Game Night Unfolds
Four children huddle around a table, their faces illuminated by flickering candlelight as they engage in a board game. The cozy atmosphere is punctuated by a sense of anticipation and suspense, hinting at the intensity of the game and the secrets it might hold.
Prompt
Vintage: Nostalgic, intimate, playful ; A group of children playing a board game; close-up; Gaming; a dimly lit room with a worn wooden table and flickering candlelight; cinematic
Characteristic
Shot : A group of four children are playing a board game around a table in a dimly lit room. The room has a cozy and vintage atmosphere with warm lighting, wooden furniture, and a framed picture on the wall.
Aesthetic Score : 0.7
Mood : cozy, nostalgic, playful
Quality
Entropy : 6.71
Noise : 90
Prompt Clip Score : 0.37
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible artifacts or errors in the image.
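The Prompt Clip Score is not defined in the post. CLIP-based prompt scores are commonly built on the cosine similarity between a text embedding of the prompt and an image embedding of the result, so the sketch below assumes that reading. No CLIP model is called here: the two short vectors are made-up stand-ins for real embeddings, and `cosine_similarity` is a hypothetical helper.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("vectors must be non-zero")
    return dot / (norm_a * norm_b)


# Made-up embeddings for illustration only (real CLIP vectors are much longer).
prompt_embedding = [0.2, 0.9, 0.1, 0.4]
image_embedding = [0.7, 0.1, 0.6, 0.2]

score = cosine_similarity(prompt_embedding, image_embedding)
print(f"prompt/image similarity: {score:.2f}")
```

If the score is a raw cosine similarity, values in the 0.30 to 0.37 range seen in this post should be compared against each other and not against a maximum of 1.0.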
A Moment of Vintage Elegance on the Platform
A young woman, clad in a stylish blue coat and hat, stands patiently on a train platform, her gaze fixed on the approaching train. The steam rising from the tracks adds a touch of nostalgia and drama to this elegant scene, transporting us to a bygone era.
Prompt
Vintage: Romantic, adventurous, hopeful ; A young woman in a vintage dress standing on a train platform; long shot; Travel; a bustling train station with steam locomotives and vintage luggage; cinematic
Characteristic
Shot : A woman in a blue coat and hat stands on a train platform with a train in the background. There is a small amount of smoke on the ground.
Aesthetic Score : 0.7
Mood : mysterious, elegant, vintage
Quality
Entropy : 6.77
Noise : 95
Prompt Clip Score : 0.36
AI Evaluation
Likelihood of AI : 0.40
Image errors : The image is slightly blurry, and the smoke looks a bit artificial. The lighting is uneven.
Heroic Rescue: Firefighter Saves Child from Burning Building
A dramatic image captures the bravery of a firefighter, clad in full gear, as they carry a child to safety from a blazing inferno. The scene is both harrowing and hopeful, highlighting the selflessness of those who risk their lives to protect others.
Prompt
Vintage: Dramatic, heroic, suspenseful ; A firefighter carrying a child through a burning building; close-up; Heroism; a smoky, chaotic scene with flames and debris; cinematic
Characteristic
Shot : A firefighter, wearing a helmet and a fireproof jacket, is rescuing a child from a burning building. The firefighter is holding the child in her arms, and they are both looking at the flames.
Aesthetic Score : 0.7
Mood : dramatic, intense, hopeful
Quality
Entropy : 6.86
Noise : 96
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors.
Cozy Campfire Night Under the Moonlit Sky
A group of friends gather around a crackling campfire, bathed in the warm glow of the flames and the ethereal light of the moon. The scene evokes a sense of peace, nostalgia, and the magic of a night spent in the wilderness.
Prompt
Vintage: Warm, nostalgic, peaceful ; A family gathered around a campfire; wide shot; Family; a serene forest setting with stars twinkling in the night sky; cinematic
Characteristic
Shot : A group of people are gathered around a campfire in front of a cabin in the woods. It is night and the stars are out. The scene is lit by the campfire and the glow from the cabin.
Aesthetic Score : 0.8
Mood : cozy, nostalgic, peaceful
Quality
Entropy : 6.56
Noise : 109
Prompt Clip Score : 0.36
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image appears to be digitally painted and exhibits a certain amount of artificiality in the shading and textures, particularly in the cabin and the trees.
Classic Car Adventure: A Scenic Mountain Drive
Experience the thrill of a classic car journey through a breathtaking mountain pass. The car speeds towards you, capturing the essence of adventure and nostalgia against a backdrop of serene beauty.
Prompt
Vintage: Romantic, adventurous, nostalgic ; A vintage car driving down a winding mountain road; long shot; Tourism; a scenic mountain landscape with lush forests and snow-capped peaks; cinematic
Characteristic
Shot : A vintage car drives down a mountain road towards a snowy peak in the distance. The road is flanked by lush green trees and grass. The sky is a clear, bright blue, with a few wispy clouds.
Aesthetic Score : 0.7
Mood : serene, nostalgic, adventurous
Quality
Entropy : 6.78
Noise : 108
Prompt Clip Score : 0.36
AI Evaluation
Likelihood of AI : 0.80
Image errors : Slight blurriness in the foreground and background, possibly from over-sharpening. The sky appears unnatural and slightly overexposed.
Taking Flight: A Retro Adventure in the Clouds
A confident woman, sporting aviator sunglasses and a white shirt, gazes out from the cockpit of an airplane. The high-contrast lighting and dynamic composition create a sense of excitement and intrigue, capturing the spirit of retro adventure.
Prompt
Vintage: Exhilarating, adventurous, free ; A pilot in a vintage biplane soaring through the clouds; close-up; Adventure; a breathtaking view of a vast, blue sky with fluffy white clouds; cinematic
Characteristic
Shot : A woman wearing aviator sunglasses looks out of an airplane window. The sun is shining and the clouds are fluffy. The woman is wearing a white collared shirt and a brown leather strap.
Aesthetic Score : 0.8
Mood : dreamy, nostalgic, adventurous
Quality
Entropy : 6.72
Noise : 94
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors.
Soldiers March Through a City’s Ashes
A haunting image of soldiers silhouetted against smoke, walking through a war-torn city. The scene evokes a sense of grim determination and the weight of destruction.
Prompt
Vintage: Grim, heroic, determined ; A group of soldiers marching through a war-torn city; medium shot; Heroism; a desolate cityscape with rubble and smoke; cinematic
Characteristic
Shot : A group of soldiers in military uniform walk down a war-torn street. The buildings on either side are heavily damaged, and smoke rises in the distance.
Aesthetic Score : 0.75
Mood : war-torn, somber, dramatic
Quality
Entropy : 6.79
Noise : 105
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant image errors; however, the image seems slightly overexposed in the background.
Elegant Romance: A Timeless Dance in the Opulent Ballroom
Experience the epitome of elegance and romance as a couple dances gracefully in a lavish ballroom. Adorned with chandeliers and exquisite decor, the scene is set for a timeless moment. The couple, dressed in formal attire, exudes passion and intimacy, their expressions reflecting a deep connection. The warm colors and shallow depth of field enhance the mood, creating a captivating image that evokes a sense of timeless romance.
Prompt
Vintage: Romantic, elegant, nostalgic ; A couple dancing in a vintage ballroom; close-up; Tourism; a grand ballroom with chandeliers and elegant guests; cinematic
Characteristic
Shot : A couple is dancing in a grand ballroom. The man is wearing a black tuxedo and the woman is wearing a light brown gown. The room is decorated with chandeliers and there are other guests in the background.
Aesthetic Score : 0.8
Mood : romantic, elegant, timeless
Quality
Entropy : 6.71
Noise : 96
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurred in the background, likely due to the use of a wide aperture. There are also some minor artifacts in the background.
Lost in a World of Maps and Memories
A young boy, bathed in the warm glow of candlelight, pores over a map in a rustic setting. The scene evokes a sense of nostalgia, thoughtfulness, and curiosity, as the boy’s face is illuminated by the flickering flames, highlighting his intense concentration.
Prompt
Vintage: Curious, adventurous, hopeful ; A young boy gazing at a vintage map; close-up; Adventure; a dimly lit room with a worn wooden table and a flickering candlelight; cinematic
Characteristic
Shot : A young boy is sitting at a table, studying a map. The scene is lit by candles and has a warm, inviting atmosphere.
Aesthetic Score : 0.7
Mood : nostalgic, curious, contemplative
Quality
Entropy : 6.89
Noise : 98
Prompt Clip Score : 0.37
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors in the image. The lighting is well balanced and there is no grain or noise. The resolution is sufficient for high-quality reproduction.
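The per-image metrics reported for the ten images can be summarized with a few lines of Python. The values below are copied from the metric blocks in this post; the shortened titles, the `ImageResult` type, and its field names are labels introduced here for illustration.

```python
from statistics import mean
from typing import NamedTuple


class ImageResult(NamedTuple):
    title: str
    aesthetic: float
    entropy: float
    noise: int
    clip: float
    ai_likelihood: float


# Values copied from the metric blocks above; titles shortened for readability.
results = [
    ImageResult("A Lone Figure in the Desert", 0.70, 6.60, 81, 0.30, 0.10),
    ImageResult("Candlelight and Suspense", 0.70, 6.71, 90, 0.37, 0.10),
    ImageResult("Vintage Elegance on the Platform", 0.70, 6.77, 95, 0.36, 0.40),
    ImageResult("Heroic Rescue", 0.70, 6.86, 96, 0.33, 0.20),
    ImageResult("Cozy Campfire Night", 0.80, 6.56, 109, 0.36, 0.70),
    ImageResult("Classic Car Adventure", 0.70, 6.78, 108, 0.36, 0.80),
    ImageResult("Taking Flight", 0.80, 6.72, 94, 0.33, 0.20),
    ImageResult("Soldiers March", 0.75, 6.79, 105, 0.32, 0.20),
    ImageResult("Elegant Romance", 0.80, 6.71, 96, 0.34, 0.20),
    ImageResult("Maps and Memories", 0.70, 6.89, 98, 0.37, 0.20),
]

fields = ["aesthetic", "entropy", "noise", "clip", "ai_likelihood"]
# Average each metric across the ten images, rounded to three decimals.
summary = {f: round(mean(getattr(r, f) for r in results), 3) for f in fields}

# The image the evaluator rated most likely to be AI-generated.
most_ai_like = max(results, key=lambda r: r.ai_likelihood)

for name, value in summary.items():
    print(f"mean {name}: {value}")
print(f"highest AI likelihood: {most_ai_like.title} ({most_ai_like.ai_likelihood})")
```

Averages over ten images are only a rough summary, so they are best read alongside the individual blocks.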
Conclusion
The results show that the generative AI model fell short on camera positions and scene composition but excelled at capturing the desired aesthetic. Here’s a breakdown:
- Camera Position: The model scored 0.45, which falls below the “good” range of 0.5 to 0.75. This suggests that the model did not reliably translate the camera positions requested in the prompts into the generated images.
- Shot Analysis: The model scored 0.4, also below the “good” range. This indicates that the model had some difficulty understanding the scenes described in the prompts and translating them into visually coherent shots.
- Aesthetic Analysis: The model scored 0.01, which is well within the “very good” range of -0.2 to 0.1. This means the generated images closely matched the aesthetic style specified in the prompts.
Overall, while the model struggled with accurately capturing camera positions and scene composition, it excelled in achieving the desired aesthetic. This suggests that the model might be better at understanding and implementing stylistic elements than it is at interpreting spatial relationships.
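The range comparisons in the breakdown can be checked with a short sketch. The scores and range limits are the ones quoted in this post; treating the limits as inclusive is an assumption, and `within` is a hypothetical helper name.

```python
def within(score, low, high):
    """True when score lies in the closed interval [low, high].

    Assumption: the ranges quoted in the post include their end points.
    """
    return low <= score <= high


GOOD = (0.5, 0.75)                 # "good" range for camera position and shot analysis
VERY_GOOD_AESTHETIC = (-0.2, 0.1)  # "very good" range for aesthetic analysis

reported = {
    "Camera Position": (0.45, GOOD),
    "Shot Analysis": (0.40, GOOD),
    "Aesthetic Analysis": (0.01, VERY_GOOD_AESTHETIC),
}

verdicts = {
    name: within(score, *bounds) for name, (score, bounds) in reported.items()
}

for name, ok in verdicts.items():
    print(f"{name}: {'inside' if ok else 'outside'} its target range")
```

Only the aesthetic score lands inside its target range, which matches the conclusion above.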