AI's Artistic Struggle: Capturing the Perfect Shot with Midjourney
- 10 minutes read - 1958 wordsTable of Contents
In the realm of artistic expression, capturing the perfect shot is paramount. This involves not only the composition of the scene but also the perspective from which it is viewed. Dramatic facial expressions, often used in film and photography, rely heavily on the camera’s position to convey emotion and intensity. This analysis delves into the capabilities of a generative AI model in understanding and implementing these crucial elements, showcasing its strengths and areas for improvement.
Created with: midjourney
Lost in the City Rain
A single eye, caught in the downpour, reflects the blurred glow of urban life. The rain washes away clarity, leaving behind a sense of mystery and introspection.
Prompt
Anger Anger, frustration, and a hint of tears: Despair and rage ; close-up; eye-level; Single Person; Rain pouring down; streetlights; cinematic
Characteristic
Shot : Close-up of a person’s eye in the rain, with blurry city lights in the background.
Aesthetic Score : 0.8
Mood : mysterious, atmospheric, contemplative
Quality
Entropy : 6.55
Noise : 90
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.70
Image errors : The raindrops appear slightly artificial, and the bokeh effect is somewhat overdone.
Unleashing Fury: A Comic Book Masterpiece
This heavily stylized image captures a monstrous, muscular figure with a red cape, roaring against a fiery backdrop. The intense expression, dynamic pose, and chaotic energy create a powerful and dramatic scene, reminiscent of classic comic book art.
Prompt
Anger Gritted teeth, narrowed eyes, and a fierce expression: Fury and determination ; A superhero, fists clenched, facing down a horde of villains; eye-level; Hero; A crumbling cityscape, smoke and debris filling the air; cinematic
Characteristic
Shot : A large, muscular, red-clad figure with a grotesque face, likely a monster or demon, is charging through a city during a fiery apocalypse.
Aesthetic Score : 0.7
Mood : dark, apocalyptic, chaotic
Quality
Entropy : 6.61
Noise : 97
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some minor artifacts and blurriness, particularly around the edges.
Chaos in the Office: Man Unleashes Fury in Explosive Outburst
A tense moment captured in a chaotic office setting. A man in a white shirt and tie screams in rage, his fist clenched, as papers fly around him. The dynamic composition, with the man’s outstretched arm and the flying papers, creates a sense of immediacy and chaos, highlighting the intensity of the moment.
Prompt
Anger Red face, veins bulging, and a look of pure anger: Frustration and rage ; A man, slamming his fist on a table, surrounded by scattered papers; eye-level; Normal Person; A cluttered office, with a window showing a stormy sky; cinematic
Characteristic
Shot : A man in a white shirt is furiously yelling and punching through a window. Papers are flying everywhere around him.
Aesthetic Score : 0.7
Mood : intense, violent, chaotic
Quality
Entropy : 6.45
Noise : 102
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.60
Image errors : Some minor artifacts and blurriness in the background. The edges of the image are slightly jagged.
The Frustration is Palpable: Gamer’s Rage Captured in a Single Shot
This dimly lit scene captures the raw emotion of a gamer’s frustration. The man’s intense expression, directed straight at the viewer, creates a powerful sense of drama. Empty soda cans scattered around him tell a story of hours spent battling virtual foes, culminating in this explosive moment of defeat.
Prompt
Anger Scowling, teeth gritted, and a look of intense frustration: Frustration and rage ; A gamer, throwing his headset on the floor, surrounded by empty energy drink cans; eye-level; Gamer; A dimly lit room, with a computer screen displaying a game in progress; cinematic
Characteristic
Shot : A man is yelling at a computer screen. He’s wearing headphones and has his hands on the keyboard. He’s surrounded by empty cans of soda. The scene is set in a dark room.
Aesthetic Score : 0.6
Mood : intense, angry, chaotic
Quality
Entropy : 6.32
Noise : 96
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some minor artifacts, particularly around the edges of the man’s head and the computer screen. There’s some aliasing visible in the image.
Terror in the Green Light
A woman’s face, bathed in an eerie green glow, contorts in a silent scream. Sweat beads on her forehead, amplifying the raw terror etched on her features. The close-up shot captures the intensity of her fear, leaving the viewer breathless.
Prompt
Anger Tears streaming down her face, eyes wide with anger: Despair and rage ; A woman, screaming into the void, her face contorted in anger; close-up; Single Person; A dark, empty room, with only a single flickering light; cinematic
Characteristic
Shot : A close-up shot of a woman’s face with a dramatic expression, her mouth open in a scream and her eyes wide with fear. The lighting is dark and moody, with shadows obscuring much of her face. There is a sense of urgency and tension in the image.
Aesthetic Score : 0.3
Mood : intense, dark, dramatic
Quality
Entropy : 5.83
Noise : 109
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image is blurry and noisy, particularly around the edges. This suggests low-quality compression or a low-resolution source image.
A City in Flames: One Figure Stands Alone
A solitary figure silhouetted against a backdrop of burning buildings, capturing the raw despair and destruction of an apocalyptic cityscape. The image evokes a powerful sense of loss and the fragility of civilization.
Prompt
Anger A determined look, with a hint of sadness in their eyes: Anger and determination ; A hero, standing on a rooftop, overlooking a city in flames; eye-level; Hero; A fiery inferno engulfing the city, with smoke billowing into the sky; cinematic
Characteristic
Shot : A lone figure stands silhouetted against a backdrop of a city engulfed in flames. Smoke billows from the inferno, casting a dramatic orange glow over the scene.
Aesthetic Score : 0.6
Mood : apocalyptic, somber, ominous
Quality
Entropy : 6.85
Noise : 102
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.90
Image errors : The flames and smoke appear somewhat repetitive and lack natural variation. The figure’s silhouette is somewhat generic and lacks detail.
Red Hot Fury: Couple’s Explosive Restaurant Argument
A tense and dramatic scene unfolds as a couple erupts in a heated argument at a restaurant. Their faces contorted in anger, fists clenched, and contrasting colors of red and blue heighten the intensity of the moment. The close proximity and facing bodies of the couple add to the dramatic effect, leaving the viewer on the edge of their seat.
Prompt
Anger Red faces, clenched fists, and angry words being exchanged: Frustration and rage ; A couple, arguing in a crowded restaurant, their voices raised in anger; eye-level; Normal People; A bustling restaurant, with other diners looking on; cinematic
Characteristic
Shot : A man and woman are engaged in an intense argument, likely in a restaurant setting. They are both visibly upset and angry.
Aesthetic Score : 0.7
Mood : dramatic, tense, confrontational
Quality
Entropy : 6.63
Noise : 124
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has some minor artifacts in the background, especially in the area of the ceiling and some blurring around the woman’s hair.
Screaming in the Digital Age: A Portrait of Raw Emotion
A man’s face contorted in a scream, bathed in dramatic blue and red light, captures the intensity of a digital age struggle. The close-up perspective draws the viewer into the raw emotion of the moment, leaving a lasting impression.
Prompt
Anger A look of pure rage, with veins bulging in his forehead: Frustration and rage ; A gamer, smashing his keyboard in a fit of rage; close-up; Gamer; A dimly lit room, with a computer screen displaying a game over screen; cinematic
Characteristic
Shot : A man is sitting at a computer in a dimly lit room. He is yelling in frustration, looking directly at the camera with his mouth open and teeth bared. He has blood dripping on his forehead. The scene has an ominous feel.
Aesthetic Score : 0.3
Mood : intense, aggressive, chaotic
Quality
Entropy : 6.21
Noise : 53
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.40
Image errors : There are no notable artifacts or errors in the image.
Rain-Soaked Fury: A Man’s Face in Black and White
A close-up shot captures the raw emotion of a man’s face, bathed in the downpour of rain. The stark black and white cinematography intensifies the dramatic mood, highlighting his angry expression. The low-key lighting adds to the sense of intensity, creating a powerful and unforgettable image.
Prompt
Anger A look of intense anger, with a hint of sadness in his eyes: Despair and rage ; A man, standing in the rain, his face obscured by the downpour; eye-level; Single Person; A dark, deserted street, with only the sound of rain and thunder; cinematic
Characteristic
Shot : A close up shot of a man’s face in the rain, with a focus on his intense eyes.
Aesthetic Score : 0.7
Mood : dramatic, intense, mysterious
Quality
Entropy : 5.33
Noise : 90
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no noticeable image artifacts or errors.
Blood and Fury: A Close-Up of a Warrior’s Rage
A gritty, intense portrait captures the raw emotion of a bearded warrior amidst a chaotic battle. The close-up framing and exaggerated facial expressions convey a fierce determination, while the smoky background adds to the sense of urgency and chaos.
Prompt
Anger A fierce expression, with a look of determination in their eyes: Anger and determination ; A hero, standing on a battlefield, surrounded by fallen enemies; eye-level; Hero; A battlefield littered with bodies, with smoke and dust filling the air; cinematic
Characteristic
Shot : A close-up of a warrior in the middle of a battlefield, covered in blood, and screaming. It seems to be a still from a movie.
Aesthetic Score : 0.7
Mood : intense, dramatic, violent
Quality
Entropy : 5.70
Noise : 97
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.30
Image errors : The background seems overly blurred, almost unrealistic.
Conclusion
The results show that the generative AI model performed well in understanding the scene and creating an image with the desired aesthetic, but struggled with accurately capturing the camera position. Here’s a breakdown:
- Camera Position: The model scored 0.2, indicating a significant difference between the intended camera position in the prompt and the actual camera position in the generated image. This suggests the model needs improvement in understanding and implementing camera positioning instructions.
- Shot Analysis: The model scored 0.57, which falls within the “good” range. This means the model was able to understand the scene described in the prompt and create an image that reflects it reasonably well.
- Aesthetic Analysis: The model scored 0.2, which is considered “very good”. This indicates that the generated image closely matched the desired aesthetic described in the prompt.
Overall, the model demonstrates a strong ability to understand the scene and create aesthetically pleasing images. However, it needs improvement in accurately capturing the intended camera position.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://midjourney.com