AI's Artistic Eye: Capturing the 'style-aesthetic' with Mixed Results with Imagen-v3
- 9 minutes read - 1837 wordsTable of Contents
The ‘style-aesthetic’ is a captivating artistic approach that emphasizes the visual mood and atmosphere of an image over literal representation. It’s often used in film, photography, and digital art to evoke specific emotions and create a unique visual experience. This style relies heavily on color palettes, lighting, composition, and abstract elements to convey a desired feeling or theme. Think of the iconic opening scenes of a film like ‘Blade Runner’ or the surreal landscapes in a Salvador Dali painting. These are examples of how ‘style-aesthetic’ can be used to create powerful and memorable visual experiences.
Created with: imagen-v3
A Lone Rider’s Silhouette Against the Setting Sun
A solitary rider on horseback traverses a vast desert landscape, silhouetted against a vibrant orange sunset. The scene evokes a sense of solitude, adventure, and hope, with the dramatic effect of the silhouette highlighting the rider’s determination in the face of the unknown.
Prompt
style-aesthetic Avant-garde: Epic, melancholic ; A lone figure, silhouetted against a blazing sunset; long shot; Heroism; A vast, desolate landscape; cinematic
Characteristic
Shot : A lone rider on horseback silhouetted against a vibrant orange sunset, riding across a vast desert landscape.
Aesthetic Score : 0.6
Mood : solitude, adventure, hope
Quality
Entropy : 6.68
Noise : 68
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image is somewhat blurry, particularly in the foreground. The silhouette of the rider and horse are not very detailed.
Hope Emerges from the Abstract
A hand reaches out from a swirling vortex of vibrant colors, symbolizing hope and possibility amidst the unknown. The mysterious and surreal scene evokes a sense of wonder and anticipation.
Prompt
style-aesthetic Avant-garde: Surreal, mysterious ; A hand reaching out from a swirling vortex of light; close-up; Adventure; A kaleidoscope of colors and abstract shapes; cinematic
Characteristic
Shot : A hand reaching out of a swirling, abstract background of vibrant colors.
Aesthetic Score : 0.7
Mood : mysterious, surreal, hopeful
Quality
Entropy : 6.13
Noise : 87
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.90
Image errors : The background appears slightly blurry and the hand has some minor aliasing artifacts
A Lone Figure in the Neon Labyrinth
A solitary figure stands on a rocky precipice, gazing out over a futuristic city bathed in neon lights and purple haze. The city, rendered in a striking pixel art style, features towering structures and floating platforms connected by bridges. This cyberpunk scene evokes a sense of mystery and isolation, leaving the viewer to ponder the figure’s story and the secrets hidden within the city’s neon glow.
Prompt
style-aesthetic Avant-garde: Nostalgic, futuristic ; A pixelated character, rendered in a retro 8-bit style, standing on a precipice overlooking a digital cityscape; medium shot; Gaming; A neon-lit, futuristic cityscape; cinematic
Characteristic
Shot : A lone figure stands on a rocky outcropping overlooking a futuristic city, bathed in neon lights and purple haze. The city is heavily stylized and built in a pixel art style, featuring towering structures and floating platforms connected by bridges.
Aesthetic Score : 0.6
Mood : cyberpunk, mysterious, futuristic
Quality
Entropy : 6.25
Noise : 64
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has a very noticeable pixel art style, which might not be appealing to all viewers. Some of the pixelated elements, such as the figure’s hair and the city’s buildings, appear jagged and could benefit from smoothing.
Lost in the Fog: A Suitcase Whispers of Journeys Past
An old suitcase sits forlornly on a train platform, shrouded in a thick fog. The scene evokes a sense of loneliness, nostalgia, and abandonment, leaving viewers to ponder the stories hidden within the weathered leather.
Prompt
style-aesthetic Avant-garde: Lonely, evocative ; A single, weathered suitcase, abandoned on a deserted train platform; close-up; Tourism; A misty, atmospheric train station; cinematic
Characteristic
Shot : An old suitcase left on a train platform, with fog filling the background.
Aesthetic Score : 0.6
Mood : lonely, nostalgic, abandoned
Quality
Entropy : 6.62
Noise : 96
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.20
Image errors : No major image errors are visible.
A Walk Through the Ruins: Desolation and Ominous Silence
A solitary figure walks down a cracked, deserted street, the buildings lining the path standing as silent witnesses to an unknown catastrophe. The scene evokes a sense of desolation and foreboding, hinting at a dystopian future or the aftermath of a devastating event.
Prompt
style-aesthetic Avant-garde: Disorienting, dreamlike ; A pair of feet walking on a cracked, abstract pavement; low-angle shot; Travel; A distorted, surreal cityscape; cinematic
Characteristic
Shot : A person walking down a deserted street with cracked pavement. The street is lined with buildings on both sides. The scene has an ominous atmosphere, perhaps suggesting the aftermath of a disaster or a dystopian future.
Aesthetic Score : 0.5
Mood : ominous, deserted, apocalyptic
Quality
Entropy : 6.50
Noise : 94
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some artifacts, particularly in the buildings and the pavement. The lighting is also somewhat unrealistic. The details in the city buildings look somewhat unrealistic and the blurring effect used to create the depth is not well executed. The cracked pavement has a repetitive pattern and the texture is not convincingly rendered.
Whispers in the Shadows: A Candlelit Moment of Mystery
Three women gather in a dimly lit room, their faces illuminated by the flickering glow of a candle. The atmosphere is thick with suspense, inviting you to unravel the secrets hidden in their gazes.
Prompt
style-aesthetic Avant-garde: Intimate, mysterious ; A family gathered around a flickering candle, their faces obscured by shadows; close-up; Family; A dimly lit, antique room; cinematic
Characteristic
Shot : Three women in a dimly lit room, the youngest holding a candle in her hand, the light illuminating their faces.
Aesthetic Score : 0.7
Mood : mysterious, suspenseful, intimate
Quality
Entropy : 4.82
Noise : 58
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is no noticeable image error, the quality is good.
Red Balloon, White Space: A Minimalist Moment
A single red balloon floats against a stark white backdrop, creating a simple yet powerful image. The balloon’s vibrant color and solitary presence evoke a sense of playfulness and contemplation, inviting viewers to pause and appreciate the beauty of minimalism.
Prompt
style-aesthetic Avant-garde: Hopeful, symbolic ; A single, red balloon floating against a stark, white background; close-up; Heroism; A minimalist, abstract setting; cinematic
Characteristic
Shot : A single red balloon against a white background.
Aesthetic Score : 0.6
Mood : simple, playful, minimalist
Quality
Entropy : 5.84
Noise : 28
Prompt Clip Score : 0.36
AI Evaluation
Likelihood of AI : 0.70
Image errors : Slight blurriness around the edges of the balloon, suggesting it could be a digitally generated image.
Lost in the Pixels: A Nostalgic Journey Back to Retro Gaming
A close-up shot captures the essence of retro gaming, with warm lighting illuminating the hands of a player immersed in a classic CRT television game. The scene evokes a sense of nostalgia and cozy comfort, transporting viewers back to a simpler time of pixelated adventures.
Prompt
style-aesthetic Avant-garde: Nostalgic, introspective ; A hand holding a vintage game controller, the screen reflecting a distorted, pixelated world; close-up; Gaming; A dimly lit, retro-themed room; cinematic
Characteristic
Shot : A close-up of a person playing a retro video game on a CRT television.
Aesthetic Score : 0.8
Mood : nostalgic, retro, cozy
Quality
Entropy : 6.02
Noise : 67
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.90
Image errors : There are no significant artifacts or errors in the image.
Lost in the Clouds: A Hiker’s Moment of Awe
A solitary figure stands on a mountain peak, dwarfed by swirling clouds above. The misty valley below adds to the dramatic and mysterious atmosphere, creating a sense of awe and wonder.
Prompt
style-aesthetic Avant-garde: Sublime, awe-inspiring ; A lone figure standing on a mountain peak, their silhouette framed by a swirling vortex of clouds; long shot; Adventure; A dramatic, mountainous landscape; cinematic
Characteristic
Shot : A lone hiker stands on a mountain peak, gazing at a swirling vortex of clouds above. The valley below is filled with mist, creating a dramatic and mysterious atmosphere.
Aesthetic Score : 0.7
Mood : mysterious, dramatic, awe-inspiring
Quality
Entropy : 6.91
Noise : 83
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.60
Image errors : Slight artifacts and blurriness in some areas, particularly in the clouds and the distant mountains.
Lost in the Fragments: A Chaotic Collage of Disjointed Reality
This abstract collage, composed of fragmented images of buildings, landscapes, and a bridge, evokes a sense of disorientation and unease. The chaotic arrangement creates a feeling of being lost or overwhelmed, leaving the viewer questioning the nature of reality itself.
Prompt
style-aesthetic Avant-garde: Energetic, disorienting ; A series of fragmented, overlapping images, depicting different aspects of travel and tourism; montage; Tourism; A chaotic, abstract collage; cinematic
Characteristic
Shot : A collage of various images, some of which depict buildings, landscapes, and a bridge. The images are arranged in a chaotic fashion and are fragmented.
Aesthetic Score : 0.3
Mood : chaotic, disorienting, abstract
Quality
Entropy : 6.62
Noise : 102
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are visible artifacts and errors in the image. The seams between the different images are not well-defined, and there are some areas where the image appears to be pixelated. There is also color bleed from the different images.
Conclusion
The results show that the generative AI model performed well in terms of understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.25, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.445, which is also below average. This indicates that the model didn’t fully understand the scene described in the prompt and didn’t create an image that accurately reflects it.
- Aesthetic Analysis: The model scored 0.20, which is considered very good. This means that the generated image closely matched the expected aesthetic style described in the prompt.
Overall, the model seems to be better at capturing the desired aesthetic style than understanding the scene and camera position. This suggests that the model might need further training to improve its ability to interpret and translate complex scene descriptions into visual representations.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://deepmind.google/technologies/imagen-3/