AI Captures the Essence of Dramatic Poses, But Struggles with Camera Work with Midjourney
- 9 minutes read - 1845 wordsTable of Contents
Dramatic poses are a powerful tool in visual storytelling, conveying emotions and narratives through body language and composition. These poses often involve strong silhouettes, dynamic angles, and a sense of heightened emotion. In this experiment, we tasked an AI model with generating images based on descriptions of dramatic poses and scenes. The results reveal a fascinating interplay between the model’s understanding of aesthetics and its ability to accurately capture camera angles and scene details.
Created with: midjourney
Silhouettes of Hope: A Lone Figure Walks into the Sunset
A melancholic yet hopeful scene unfolds as a solitary figure walks into the setting sun, their silhouette casting a mysterious and longing presence against the fiery sky. The tall grass whispers secrets, adding to the enigmatic atmosphere.
Prompt
close-up close-up: epic, determined ; A lone figure, silhouetted against a blazing sunset; close-up; heroism; a vast, desolate landscape; cinematic
Characteristic
Shot : A lone figure walks into the setting sun, silhouetted against a fiery sky. The scene is set in a grassy field, with the sun sinking below the horizon in the distance.
Aesthetic Score : 0.6
Mood : melancholic, contemplative, hopeful
Quality
Entropy : 6.32
Noise : 48
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry, particularly around the edges.
Unveiling Secrets: A Finger Points the Way on an Antique Map
A close-up shot captures the essence of exploration and discovery. An old, worn book reveals a faded map, with a finger pointing at a specific location. The muted lighting and vintage charm evoke a sense of mystery and adventure, leaving you wondering what secrets lie hidden within.
Prompt
close-up close-up: intrigued, adventurous ; A weathered map, its edges frayed, with a finger tracing a route; close-up; adventure; a dimly lit room filled with antique maps and globes; cinematic
Characteristic
Shot : A close-up shot of an open book with a hand pointing at a map inside. There is another book or globe out of focus in the background.
Aesthetic Score : 0.7
Mood : mysterious, nostalgic, adventurous
Quality
Entropy : 6.91
Noise : 86
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant errors in the image. The lighting is slightly uneven, which could be addressed with post-processing.
Cyberpunk Typing: A Close-Up on the Edge of Tomorrow
A glowing green keyboard, neon lights blurring in the background, and a pair of hands furiously typing - this image captures the intensity and focus of a cyberpunk world. The close-up shot emphasizes the act of typing, highlighting the importance of information and technology in this futuristic setting.
Prompt
close-up close-up: intense, focused ; A gamer’s hands, fingers flying across a keyboard, eyes glued to the screen; close-up; gaming; a dimly lit room with neon lights reflecting on the screen; cinematic
Characteristic
Shot : Close up shot of a person’s hands typing on a keyboard in a dimly lit room with neon lights.
Aesthetic Score : 0.6
Mood : dark, intense, futuristic
Quality
Entropy : 6.36
Noise : 61
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some blurring and graininess, particularly around the edges.
Capturing the Majesty: A Moment of Serene Adventure
A lone figure stands amidst the clouds, camera in hand, capturing the breathtaking panorama of a majestic mountain range. The lens draws you into the scene, inviting you to experience the serenity and adventure of this breathtaking landscape.
Prompt
close-up close-up: awe-inspiring, wonder ; A hand holding a camera, capturing a breathtaking vista; close-up; tourism; a panoramic view of a mountain range with clouds swirling below; cinematic
Characteristic
Shot : A person is holding a camera in front of a mountain landscape with clouds. The camera is in focus and the mountains are blurred in the background.
Aesthetic Score : 0.7
Mood : serene, adventurous, contemplative
Quality
Entropy : 6.46
Noise : 78
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some minor artifacts and compression artifacts, particularly around the camera.
Whispers of Adventure: A Worn Backpack Holds Memories
A passport and notebook rest within a well-loved brown backpack, hinting at journeys taken and stories yet to be written. The soft focus and muted colors evoke a sense of nostalgia and the promise of exciting adventures to come.
Prompt
close-up close-up: nostalgic, adventurous ; A passport, open to a page with a stamp from a foreign country; close-up; travel; a cluttered backpack overflowing with travel essentials; cinematic
Characteristic
Shot : A passport and a notebook are resting in a brown backpack. The lighting is soft and warm, casting shadows around the objects.
Aesthetic Score : 0.7
Mood : nostalgic, travel, anticipation
Quality
Entropy : 6.32
Noise : 103
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors in the image. The image appears to be well-processed and sharp.
The Warmth of Connection
A group of friends gather around a crackling campfire, their hands reaching out to the flames. The scene evokes a sense of warmth, coziness, and shared community, with the fire’s glow highlighting the intimacy and connection between them.
Prompt
close-up close-up: warm, connected ; A group of hands, clasped together in a circle, symbolizing unity; close-up; groups; a campfire burning brightly in the background; cinematic
Characteristic
Shot : A group of people are gathered around a campfire, with their hands outstretched towards the flames. The warm glow of the fire illuminates their faces and creates a sense of unity and togetherness.
Aesthetic Score : 0.8
Mood : warm, cozy, communal
Quality
Entropy : 6.13
Noise : 96
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors, but the image is a bit dark overall. The fire might be slightly overexposed
A Tear Tells a Story: The Weight of Emotion
A close-up shot captures the raw vulnerability of a young man, his face etched with sadness as a single tear traces a path down his cheek. The image evokes a sense of profound emotion, leaving the viewer to ponder the story behind his distress.
Prompt
close-up close-up: tragic, poignant ; A single tear rolling down a hero’s cheek, reflecting the weight of their sacrifice; close-up; heroism; a battlefield littered with fallen comrades; cinematic
Characteristic
Shot : A close-up of a young man’s face, with his eyes closed and a single tear rolling down his cheek. He is likely experiencing great sadness or emotional distress.
Aesthetic Score : 0.6
Mood : sad, melancholic, somber
Quality
Entropy : 6.72
Noise : 109
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.70
Image errors : The tear appears somewhat artificially rendered, lacking the natural blurring and reflection one would expect. The lighting on the face also seems overly contrasted.
Lost in the Wilderness: A Compass Beckons
A vintage compass, resting on a bed of moss and leaves, points the way through a sunlit forest. The shallow depth of field creates a sense of mystery and adventure, inviting you to explore the unknown.
Prompt
close-up close-up: uncertain, suspenseful ; A compass needle spinning wildly, pointing in all directions; close-up; adventure; a dense jungle with sunlight filtering through the canopy; cinematic
Characteristic
Shot : A vintage compass lies on the forest floor, bathed in warm sunlight streaming through the trees. The background is a soft blur of green foliage and light.
Aesthetic Score : 0.7
Mood : mysterious, adventurous, calm
Quality
Entropy : 6.72
Noise : 61
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.80
Image errors : Some slight blurring on the edges of the compass and the leaves around it. It seems like an artificial depth of field effect rather than real depth.
Nostalgic Arcade Moment: A Hand Presses the Red Button
A close-up shot captures the essence of retro gaming with a hand pressing a red button on an arcade game. The shallow depth of field emphasizes the action, drawing you into the moment. Bright lights blur in the background, adding to the nostalgic and playful mood.
Prompt
close-up close-up: exhilarated, competitive ; A joystick, gripped tightly in a gamer’s hand, as they navigate a virtual world; close-up; gaming; a brightly lit arcade with flashing lights and sounds; cinematic
Characteristic
Shot : A hand pressing a red button on a video game arcade machine, the background is blurry and out of focus, with neon lights.
Aesthetic Score : 0.7
Mood : nostalgic, retro, playful
Quality
Entropy : 6.81
Noise : 111
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry and has some noise. There are some artifacts around the edges of the screen.
A Message in the Luggage Tag
A close-up shot captures a luggage tag, its message shrouded in mystery, attached to a black suitcase. The blurred background hints at a bustling airport or train station, fueling anticipation for the journey ahead. The shallow depth of field draws you in, inviting you to decipher the secrets held within the tag.
Prompt
close-up close-up: hopeful, anticipatory ; A luggage tag, with a handwritten note attached, signifying a journey to a new destination; close-up; travel; a bustling airport terminal with people rushing around; cinematic
Characteristic
Shot : A close-up shot of a luggage tag attached to a suitcase, with a blurry background of lights and people in a public space.
Aesthetic Score : 0.5
Mood : minimalistic, travel, anticipation
Quality
Entropy : 6.76
Noise : 67
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : Slight chromatic aberration visible around the edges, the overall image is a bit blurry.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.45, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.56, which is considered average. This indicates that the model was able to understand the scene described in the prompt, but not exceptionally well.
- Aesthetic Analysis: The model scored 0.08, which is considered very good. This means that the generated image closely matched the expected aesthetic style.
Overall, the model seems to be better at understanding the aesthetic style than the camera position and scene. It might be helpful to provide more specific instructions regarding camera angles and shot types in future prompts to improve the model’s performance in these areas.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://midjourney.com