AI's Artistic Struggle: Capturing the 'style-aesthetic' with Dall-e-3
The ‘style-aesthetic’ is a visual style that blends abstract elements, vibrant colors, and dramatic compositions to evoke wonder and intrigue. It is common in fantasy, science fiction, and surreal art, where the goal is visually striking, emotionally impactful imagery, and it is characterized by bold colors, dynamic shapes, and a sense of depth and mystery. Because the style is difficult to capture with traditional photography or illustration, it makes a useful challenge for generative AI models.
Created with: dall-e-3
Solitude and Wonder: A Lone Figure Contemplates the Vastness of the Clouds
A breathtaking scene of a solitary figure standing on a mountain peak, gazing out at a sea of clouds with a distant sun breaking through. The vastness of the clouds evokes a sense of awe and wonder, while the lone figure emphasizes the feeling of solitude and contemplation. This serene and inspiring image captures the beauty of nature and the power of introspection.
Prompt
Abstract: Epic, triumphant ; A lone figure standing on a mountain peak; wide shot; Heroism; a vast, swirling sea of clouds; cinematic
Characteristic
Shot : A lone figure stands on the peak of a mountain, looking out over a vast sea of clouds. The sun is rising in the distance, casting a warm glow over the scene.
Aesthetic Score : 0.8
Mood : serene, inspirational, hopeful
Quality
Entropy : 6.52
Noise : 107
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.60
Image errors : The clouds have a slightly artificial texture. There is some noise in the image, especially in the shadows.
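The article does not say how the Entropy value is computed. One common reading, assumed here, is the Shannon entropy of the image's 8-bit grayscale histogram, measured in bits: 0 for a single flat tone, 8 when all 256 levels are equally frequent. Under that reading, values around 6.5 would indicate a fairly wide tonal spread. A minimal sketch of that calculation follows; it works on a plain list of pixel values rather than a real image file, and the function name `shannon_entropy` is illustrative only.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy (in bits) of a sequence of pixel intensity values."""
    total = len(pixels)
    if total == 0:
        raise ValueError("pixels must not be empty")
    counts = Counter(pixels)
    # H = sum(p * log2(1 / p)) over the intensity levels that actually occur.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


# Assumed toy inputs: a single-tone image and an even spread over 256 levels.
flat = [128] * 1000
uniform = list(range(256)) * 4
print(shannon_entropy(flat))     # 0.0
print(shannon_entropy(uniform))  # 8.0
```

If the reported values were produced this way, every image in this gallery sits between these two extremes, closer to the upper bound.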
A Hand Reaches Towards the Cosmic Abyss
A mesmerizing scene unfolds as a hand stretches out towards a swirling black hole, enveloped by a vibrant nebula. The image evokes a sense of awe and wonder, highlighting the vastness of space and the irresistible power of the black hole. This mystical and otherworldly composition invites contemplation of the universe’s mysteries.
Prompt
Abstract: Mysterious, exciting ; A hand reaching out to grasp a shimmering, ethereal portal; close-up; Adventure; a swirling vortex of colors; cinematic
Characteristic
Shot : A hand reaches out towards a swirling black hole, surrounded by a nebula of colorful gas and stars.
Aesthetic Score : 0.7
Mood : mysterious, cosmic, awe-inspiring
Quality
Entropy : 6.72
Noise : 113
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.90
Image errors : The hand appears slightly blurred and the colors are a bit oversaturated, giving the image a somewhat artificial feel.
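The prompts in this gallery all share one layout: a style prefix before a colon, then six semicolon-separated fields. The sketch below splits a prompt into those fields so the requested shot type (relevant to the camera position result in the conclusion) can be read off directly. The field names (`moods`, `subject`, `shot`, `theme`, `setting`, `finish`) are labels chosen for illustration; the article does not name them.

```python
from dataclasses import dataclass


@dataclass
class PromptParts:
    style: str    # text before the first colon, e.g. "Abstract"
    moods: list   # comma-separated mood words
    subject: str
    shot: str     # camera framing, e.g. "close-up"
    theme: str
    setting: str
    finish: str   # trailing modifier, e.g. "cinematic"


def parse_prompt(prompt):
    """Split 'Style: moods ; subject; shot; theme; setting; finish' into parts."""
    style, _, rest = prompt.partition(":")
    fields = [field.strip() for field in rest.split(";")]
    if len(fields) != 6:
        raise ValueError(f"expected 6 semicolon-separated fields, got {len(fields)}")
    moods, subject, shot, theme, setting, finish = fields
    return PromptParts(
        style=style.strip(),
        moods=[mood.strip() for mood in moods.split(",")],
        subject=subject,
        shot=shot,
        theme=theme,
        setting=setting,
        finish=finish,
    )


parts = parse_prompt(
    "Abstract: Mysterious, exciting ; A hand reaching out to grasp a shimmering, "
    "ethereal portal; close-up; Adventure; a swirling vortex of colors; cinematic"
)
print(parts.shot)   # close-up
print(parts.moods)  # ['Mysterious', 'exciting']
```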
A Glimpse into the Future: Ethereal Figure Walks the Path of Progress
In a futuristic cityscape bathed in vibrant light, a glowing figure walks across circuit-board-like ground. The scene evokes a sense of wonder, mystery, and hope for the future, with the distant city symbolizing progress and possibility.
Prompt
Abstract: Energetic, futuristic ; A pixelated landscape with glowing, abstract figures; medium shot; Gaming; a digital, neon-lit cityscape; cinematic
Characteristic
Shot : A futuristic cityscape with glowing lights and streaks of light coming down from the sky. It is an abstract, neon-lit scene that appears to be computer-generated imagery (CGI) or concept art. The image features two figures in the foreground that appear to be ethereal or spectral in nature.
Aesthetic Score : 0.7
Mood : dreamy, futuristic, surreal
Quality
Entropy : 6.95
Noise : 117
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image appears to be generated by AI and has a slight blurriness in certain areas, particularly in the background cityscape. This blurriness could be due to the AI’s rendering process.
A Solitary Bloom in the Desert’s Embrace
A single, vibrant flower defies the harshness of a cracked desert landscape, bathed in the golden light of a setting sun. Towering mountains loom in the distance, creating a surreal and hopeful scene that speaks to resilience and survival.
Prompt
Abstract: Hopeful, melancholic ; A single, vibrant flower blooming in a desolate, cracked landscape; close-up; Tourism; a surreal, otherworldly desert; cinematic
Characteristic
Shot : A single vibrant flower grows in a barren desert landscape, with towering cliffs in the background. The ground is cracked and dry, and the air is hazy with dust.
Aesthetic Score : 0.7
Mood : solitude, resilience, hope
Quality
Entropy : 6.72
Noise : 106
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image appears to be slightly over-saturated and the textures could be more realistic.
Urban Pulse: Capturing the Energy of City Life
A vibrant city street comes alive with motion blur, showcasing the fast-paced energy of urban life. Tall buildings line the sidewalks as a crowd of people rushes by, creating a dynamic and captivating scene.
Prompt
Abstract: Dynamic, chaotic ; A blurred, kaleidoscopic image of a bustling city street; long shot; Travel; a whirlwind of colors and movement; cinematic
Characteristic
Shot : A busy city street with a crowd of people walking towards the camera. The street is lined with tall buildings, and the image is blurred to create a sense of motion.
Aesthetic Score : 0.6
Mood : busy, energetic, urban
Quality
Entropy : 6.77
Noise : 115
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image is slightly overexposed, and the blurring effect is a bit too strong.
A Family’s Journey Towards Hope
Silhouetted against a warm, beige backdrop, a family of four walks towards a radiant, glowing circle. The scene evokes a sense of optimism and hope, suggesting a brighter future awaits them. The mysterious glow of the circle adds an element of wonder, leaving viewers to ponder the destination and the promise it holds.
Prompt
Abstract: Hopeful, nostalgic ; A silhouette of a family holding hands, walking towards a glowing, abstract sun; medium shot; Family; a warm, golden sunset; cinematic
Characteristic
Shot : A family of five walking towards a bright, circular light source.
Aesthetic Score : 0.7
Mood : hopeful, optimistic, bright
Quality
Entropy : 5.83
Noise : 52
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image has a slight grainy texture, which may be intentional for a vintage aesthetic.
The Watcher in the Ruins
A haunting image of a giant eye peering through a shattered window, overlooking a sprawling cityscape. The scene evokes a sense of darkness, mystery, and unsettling anticipation, leaving the viewer questioning what lies beyond the broken glass.
Prompt
Abstract: Intense, suspenseful ; A single, abstract eye peering through a cracked, distorted window; close-up; Heroism; a dark, ominous cityscape; cinematic
Characteristic
Shot : A giant eye looking through a broken window at a cityscape, possibly New York, with a bridge in the foreground and the sun setting behind the skyline.
Aesthetic Score : 0.7
Mood : mysterious, eerie, suspenseful
Quality
Entropy : 6.24
Noise : 102
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.80
Image errors : The glass shards are slightly jagged and unnatural looking. The cityscape and eye are somewhat blurry, which could be due to over-sharpening or post-processing.
A Dreamlike Portal to the Future
A digital artist captures the essence of a futuristic city with a swirling portal in the background, creating a dreamlike and magical scene. The portal’s depth and mystery draw the viewer’s eye to the center of the image, leaving them wondering what lies beyond.
Prompt
Abstract: Intense, exhilarating ; A swirling vortex of colors and shapes representing a chaotic, digital world; wide shot; Gaming; a vibrant, neon-lit landscape; cinematic
Characteristic
Shot : A futuristic city with a swirling vortex of light in the background, rendered on a digital drawing tablet
Aesthetic Score : 0.7
Mood : cyberpunk, futuristic, mystical
Quality
Entropy : 6.85
Noise : 121
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.90
Image errors : Some blurriness and artifacts are visible
Lost in the Vortex: A Moment of Contemplation
A solitary figure stands on a rocky precipice, gazing out at a turbulent sea. Above, a swirling vortex of clouds casts an ethereal glow, creating a scene of awe and mystery. This image evokes a sense of profound contemplation and a journey into the unknown.
Prompt
Abstract: Solitary, contemplative ; A lone, abstract figure standing on a cliff overlooking a vast, swirling ocean; wide shot; Travel; a stormy, dramatic seascape; cinematic
Characteristic
Shot : A lone figure stands on a rocky outcrop in a surreal landscape. The sky is filled with swirling clouds that form a vortex above him, with a bright light shining through the center.
Aesthetic Score : 0.7
Mood : mystical, dramatic, introspective
Quality
Entropy : 6.66
Noise : 104
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.90
Image errors : No visible errors, smooth transitions
Silhouettes of Hope: A Family’s Journey into the Digital Unknown
A family of eight walks in silhouette against a backdrop of swirling, vibrant patterns, hinting at a future filled with both mystery and promise. This evocative image captures the essence of technological advancement and the enduring power of family bonds.
Prompt
Abstract: Sentimental, reflective ; A series of overlapping, abstract shapes representing a family’s journey through life; medium shot; Family; a warm, nostalgic glow; cinematic
Characteristic
Shot : A family of 8 silhouetted in front of a colorful abstract background made of squares and swirls.
Aesthetic Score : 0.7
Mood : mystical, hopeful, futuristic
Quality
Entropy : 6.85
Noise : 84
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has a slight blurriness and some pixelation, particularly in the background.
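To compare the ten images at a glance, the per-image numbers reported above can be averaged. The sketch below only restates values already listed in this gallery (in gallery order) and uses Python's standard library; nothing in it is new measurement.

```python
from statistics import mean

# (aesthetic score, entropy, noise, prompt CLIP score, likelihood of AI),
# copied from the ten entries above in gallery order.
images = [
    (0.8, 6.52, 107, 0.29, 0.60),
    (0.7, 6.72, 113, 0.29, 0.90),
    (0.7, 6.95, 117, 0.29, 0.90),
    (0.7, 6.72, 106, 0.30, 0.90),
    (0.6, 6.77, 115, 0.32, 0.80),
    (0.7, 5.83, 52, 0.32, 0.70),
    (0.7, 6.24, 102, 0.29, 0.80),
    (0.7, 6.85, 121, 0.31, 0.90),
    (0.7, 6.66, 104, 0.30, 0.90),
    (0.7, 6.85, 84, 0.26, 0.80),
]

labels = ["Aesthetic Score", "Entropy", "Noise", "Prompt Clip Score", "Likelihood of AI"]
# zip(*images) turns the rows into one column per metric.
averages = {label: mean(column) for label, column in zip(labels, zip(*images))}

for label, value in averages.items():
    print(f"{label}: {value:.3f}")
```

The averages come out to roughly 0.70 (aesthetic), 6.61 (entropy), 102.1 (noise), 0.297 (prompt CLIP score), and 0.82 (likelihood of AI). The family silhouette image stands out with the lowest entropy (5.83) and noise (52) of the set.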
Conclusion
The results indicate that the generative AI model understood the described scenes reasonably well, but struggled with camera position and with achieving the desired aesthetic. Here’s a breakdown:
- Camera Position: The model scored 0.25, which is below the “good” range of 0.5 to 0.75. This suggests the model didn’t accurately translate the camera position described in the prompt into the generated image.
- Shot Analysis: The model scored 0.63, which falls within the “good” range. This indicates the model understood the scene described in the prompt and produced a shot that aligns with it to a decent degree.
- Aesthetic Analysis: The model scored 0.08 against a “very good” range of -0.2 to 0.1. Read literally, 0.08 lies inside that range, so the range or the score may be misreported here. The analysis nevertheless treats the result as a significant gap between the expected and actual aesthetic, meaning the model likely struggled to capture the desired visual style.
Overall, the model shows promise in understanding scene descriptions, but needs improvement in translating camera positions and in achieving the desired aesthetic.
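The range comparisons in the breakdown can be checked mechanically. The sketch below assumes closed intervals and that Shot Analysis uses the same “good” range of 0.5 to 0.75 quoted for Camera Position; the helper name `in_range` is illustrative. Note that with the ranges exactly as quoted, the aesthetic score of 0.08 evaluates as inside -0.2 to 0.1, so how that score is scaled matters for the interpretation.

```python
def in_range(score, low, high):
    """True when score lies within the closed interval [low, high]."""
    return low <= score <= high


# (score, range low, range high) as quoted in the conclusion.
checks = {
    "Camera Position": (0.25, 0.5, 0.75),     # "good" range
    "Shot Analysis": (0.63, 0.5, 0.75),       # "good" range (assumed identical)
    "Aesthetic Analysis": (0.08, -0.2, 0.1),  # "very good" range
}

for name, (score, low, high) in checks.items():
    verdict = "inside" if in_range(score, low, high) else "outside"
    print(f"{name}: {score} is {verdict} [{low}, {high}]")
```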