AI's Artistic Struggle: Capturing the Essence of 'Dramatic' with Midjourney
- 9 minutes read - 1819 wordsTable of Contents
The ‘dramatic’ aesthetic is often characterized by strong contrasts, heightened emotions, and a focus on visual impact. It’s a style commonly used in film, photography, and even literature to evoke powerful feelings and create memorable scenes. But how well can AI understand and translate this aesthetic into visual outputs? This blog post explores the results of a test that aimed to assess AI’s ability to capture the ‘dramatic’ style, revealing both its strengths and limitations.
Created with: midjourney
A Moment of Solitude at Sunset
A lone figure stands in a vast field, bathed in the warm glow of the setting sun. The scene evokes a sense of peace and contemplation, with the small figure against the expansive landscape highlighting themes of solitude and longing.
Prompt
Dogme 95: Epic, hopeful ; A lone figure, silhouetted against a setting sun; long shot; Heroism; A vast, desolate landscape; cinematic
Characteristic
Shot : A single person is standing in a large field facing the sun as it sets.
Aesthetic Score : 0.6
Mood : melancholy, solitary, contemplative
Quality
Entropy : 6.33
Noise : 97
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly overexposed, and the person’s silhouette is not very distinct.
The Grip of Determination
A close-up shot captures the intensity of a hand gripping a rope, the rocky cliff face blurred in the background. The shallow depth of field and tight framing create a sense of suspense and focus, highlighting the climber’s unwavering resolve.
Prompt
Dogme 95: Suspenseful, thrilling ; A hand reaching out to grasp a rope ladder dangling from a cliff face; close-up; Adventure; A rocky, treacherous mountainside; cinematic
Characteristic
Shot : A close-up of a hand gripping a rope against a rocky background, possibly on a cliff.
Aesthetic Score : 0.6
Mood : intense, suspenseful, determined
Quality
Entropy : 6.70
Noise : 100
Prompt Clip Score : 0.36
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry, and the lighting is a bit uneven.
Silhouetted in the Glow: A Gamer’s Focus in the Dark
A shadowy figure hunches over a brightly lit computer screen, their face obscured by the glow. The dim room adds to the sense of mystery and tension, highlighting the intense focus of the gamer.
Prompt
Dogme 95: Intense, focused ; A player’s hands frantically manipulating a joystick, their face illuminated by the screen; medium shot; Gaming; A dimly lit room with a computer monitor glowing brightly; cinematic
Characteristic
Shot : A person is playing a video game in a dimly lit room, with the monitor in the background, the subject is in silhouette, only hands are visible and a portion of the head.
Aesthetic Score : 0.4
Mood : dark, mysterious, intense
Quality
Entropy : 5.43
Noise : 75
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is some grain in the image. The image is slightly blurry.
A Bird’s Eye View of Chaos: Life in an Indian Street Market
Experience the vibrant energy of a bustling Indian street market from a unique perspective. This high-angle shot captures the narrow, crowded streets, overflowing with colorful stalls selling spices, sweets, and other goods. The scene is alive with activity, as people navigate the market, browsing, chatting, and enjoying the lively atmosphere.
Prompt
Dogme 95: Energetic, lively ; A bustling marketplace, filled with vibrant colors and exotic goods; wide shot; Tourism; A crowded street in a foreign city; cinematic
Characteristic
Shot : A bustling market in India. The image shows a narrow street lined with stalls selling a variety of goods, including spices, jewelry, and clothing. The street is crowded with people, many of whom are vendors or customers. There are also a number of brightly colored decorations and lights hanging from the stalls.
Aesthetic Score : 0.7
Mood : busy, vibrant, exotic
Quality
Entropy : 6.67
Noise : 123
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some minor artifacts in the image, especially in the shadows and darker areas. The image is also slightly grainy, but this is likely due to the age of the photograph.
Tranquil Journey Through Rolling Hills
A serene view of a train track winding through a grassy countryside, with rolling hills in the background. The motion blur captures the feeling of speed and adventure as you journey through this tranquil landscape.
Prompt
Dogme 95: Nostalgic, contemplative ; A train speeding through a countryside landscape, blurring the scenery; long shot; Travel; Rolling hills and fields passing by; cinematic
Characteristic
Shot : A view of a train track winding through a green rolling countryside, the camera is moving forward, the motion blur suggests a fast speed, the light is soft and warm as if the sun is rising or setting
Aesthetic Score : 0.6
Mood : serene, peaceful, nostalgic
Quality
Entropy : 6.70
Noise : 109
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image has some motion blur, which is intentional but could be slightly less pronounced, it also has some graininess
Intimate Dinner Gathering Bathed in Warm Light
A cozy and inviting home setting is captured in this dimly lit dinner scene. Four individuals gather around a table, their conversation shrouded in mystery and intimacy. The low lighting creates a sense of warmth and closeness, highlighting the familial bond shared by the group.
Prompt
Dogme 95: Warm, intimate ; A family gathered around a dinner table, sharing a meal and laughter; medium shot; Family; A cozy, well-worn kitchen; cinematic
Characteristic
Shot : A group of four people are seated around a dining table, illuminated by the warm glow of a hanging lamp and a candle. They appear to be engaged in conversation, enjoying a meal and each other’s company.
Aesthetic Score : 0.7
Mood : warm, intimate, cozy
Quality
Entropy : 5.68
Noise : 91
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some minor noise is visible in the shadows.
A Child’s Silent Longing
A close-up portrait captures the melancholy in a child’s eyes, their cheek bathed in soft light, creating a sense of wistful vulnerability. The out-of-focus hair adds to the dreamy, pensive mood.
Prompt
Dogme 95: Sad, poignant ; A single tear rolling down a child’s cheek as they watch their parents argue; close-up; Family; A dimly lit living room; cinematic
Characteristic
Shot : Close-up of a young girl’s face with light shining on her cheek, she appears to be looking down and her eyes are closed, she is wearing a neutral expression.
Aesthetic Score : 0.7
Mood : melancholy, introspective, wistful
Quality
Entropy : 6.21
Noise : 88
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no noticeable image errors
Mystical Firelight in the Deep Woods
A group of men huddle around a crackling campfire, their faces illuminated by the dancing flames. The surrounding forest is shrouded in darkness, creating a sense of mystery and adventure. The contrast between the warm light and the cool shadows adds a dramatic touch to this cozy scene.
Prompt
Dogme 95: Joyful, communal ; A group of friends huddled together around a campfire, sharing stories and laughter; medium shot; Adventure; A dark forest with flickering flames; cinematic
Characteristic
Shot : A group of men are gathered around a campfire in a dark forest.
Aesthetic Score : 0.7
Mood : mysterious, atmospheric, tranquil
Quality
Entropy : 5.67
Noise : 90
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some slight noise visible in the background, not distracting.
Solitude and Storm: A Figure Contemplates the Unrelenting Sea
A lone figure sits perched on a cliff, their silhouette stark against the tumultuous backdrop of a stormy sea. The crashing waves and dramatic sky evoke a sense of melancholy and contemplation, highlighting the power of nature and the fragility of human existence.
Prompt
Dogme 95: Awe-inspiring, contemplative ; A lone traveler gazing out at a vast ocean, their face filled with wonder; long shot; Travel; A dramatic coastline with crashing waves; cinematic
Characteristic
Shot : A lone figure sits on a rocky cliff overlooking a vast, stormy ocean. The sky is overcast and the waves are crashing against the shore.
Aesthetic Score : 0.7
Mood : melancholy, contemplative, dramatic
Quality
Entropy : 6.60
Noise : 98
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors.
A Hand Holds Time’s Faded Memories
A photograph, worn and faded, depicts a doorway in a crumbling building. Held in a hand, it evokes a sense of melancholy and nostalgia, the blurred background hinting at a life lived and lost. The image speaks of time’s passage and the enduring power of memory.
Prompt
Dogme 95: Melancholy, nostalgic ; A hand holding a worn photograph, the image blurred and faded; close-up; Family; A cluttered attic filled with old memories; cinematic
Characteristic
Shot : A hand holds up a black and white photograph in front of a messy cluttered room that looks like it may have been damaged by fire. The photo is of a doorway leading to a garden with debris in front of it.
Aesthetic Score : 0.6
Mood : melancholy, somber, nostalgic
Quality
Entropy : 6.69
Noise : 88
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable errors.
Conclusion
The analysis shows that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis. Here’s a breakdown:
Camera Position:
- Score: 0.4
- Interpretation: This score falls below the “good” range of 0.5 to 0.75. It suggests that the model didn’t perfectly capture the intended camera position described in the prompt.
Shot Analysis:
- Score: 0.56
- Interpretation: This score falls within the “good” range. It indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it to a decent degree.
Aesthetic Analysis:
- Score: 0.14
- Interpretation: This score is significantly higher than the “very good” range of -0.2 to 0.1. It suggests that the generated image’s aesthetic deviated significantly from the expected aesthetic described in the prompt.
Overall:
The model demonstrates a good understanding of camera position and shot composition, but struggles to accurately capture the desired aesthetic. This suggests that the model might need further training to better understand and translate aesthetic concepts into visual outputs.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://midjourney.com