AI's Artistic Struggle: Capturing the Scene, Not the Pose with Midjourney
- 9 minutes read - 1873 wordsTable of Contents
In the realm of artificial intelligence, generative models are pushing the boundaries of creativity. These models can generate images, text, and even music based on user prompts. However, achieving accuracy and capturing the nuances of human intention remains a challenge. This blog post examines the results of a generative AI model tasked with creating images based on specific scenes and poses, highlighting the model’s strengths and weaknesses in translating prompts into visual representations. We’ll explore the concept of ‘dramatic style poses’ and how they are used in various contexts, providing examples to illustrate the complexities involved.
Created with: midjourney
A Hiker’s Moment of Majesty
A lone hiker stands triumphantly on a mountain peak, arms outstretched, taking in the breathtaking view of a misty valley and towering mountains. The scene evokes a sense of tranquility, adventure, and awe, highlighting the smallness of humanity against the vastness of nature.
Prompt
standing-tall Standing tall, arms outstretched, facing the horizon: Determined, hopeful, awe-inspiring ; Lone adventurer; wide shot; Adventure; Majestic mountain range with a vast, clear sky; cinematic
Characteristic
Shot : A lone hiker stands with arms outstretched atop a rocky ridge, overlooking a vast mountain valley bathed in morning light.
Aesthetic Score : 0.8
Mood : serene, inspiring, contemplative
Quality
Entropy : 6.65
Noise : 103
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Silhouette of Courage: Soldier Walks into the Heart of the Fire
A lone soldier, cloaked in camouflage, walks away from the viewer towards a raging fire and billowing smoke. Debris flies through the air, creating a chaotic backdrop to this dramatic scene. The soldier’s silhouette against the light evokes a sense of isolation and vulnerability, highlighting the tension and somber mood of the war zone.
Prompt
standing-tall Standing tall, holding a weapon, looking towards the enemy: Brave, defiant, resolute ; Soldier standing on a battlefield; medium shot; Heroism; Smoke and debris from a recent explosion; cinematic
Characteristic
Shot : A lone soldier, silhouetted against a smoky, apocalyptic backdrop, walks towards the viewer with a weapon in hand. In the background, fires blaze, hinting at the devastation around him.
Aesthetic Score : 0.7
Mood : dramatic, somber, desolate
Quality
Entropy : 6.59
Noise : 105
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.80
Image errors : Some minor artifacts are present in the smoke and flames, particularly in the background.
Victory Dance in the Neon Glow
Three friends celebrate a hard-earned victory in a video game, their joy radiating in a dimly lit room bathed in colorful light. The scene captures the raw excitement and triumph of their achievement, with shadows adding a touch of mystery to the moment.
Prompt
standing-tall Standing tall, arms raised in victory, cheering: Joyful, triumphant, celebratory ; Group of friends celebrating a victory in a video game; close-up; Gaming; Neon lights and glowing screens of a gaming setup; cinematic
Characteristic
Shot : Three people are standing in front of computer screens, their backs to the camera, arms raised in victory. The background is a blur of neon lights.
Aesthetic Score : 0.7
Mood : excited, vibrant, energetic
Quality
Entropy : 6.33
Noise : 97
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image has some minor artifacts, such as slight blurring in the background.
Silhouetted Serenity: A Moment of Tranquility on the Cliffside
A man stands in quiet contemplation on a cliff overlooking a vast expanse of blue water. The silhouette against the sky and water creates a sense of peace and solitude, capturing a moment of serene beauty.
Prompt
standing-tall Standing tall, arms crossed, gazing at the view: Awe-struck, contemplative, peaceful ; Tourist standing on a cliff overlooking a breathtaking view; long shot; Tourism; Scenic landscape with rolling hills and a sparkling ocean; cinematic
Characteristic
Shot : A solitary figure stands on a cliff overlooking a vast, blue ocean. The coastline curves gently in the distance, and a few wispy clouds drift across the pale sky.
Aesthetic Score : 0.7
Mood : tranquil, contemplative, serene
Quality
Entropy : 6.54
Noise : 119
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has a noticeable vintage filter applied, which may not be desirable for all viewers. The colors are slightly washed out and the texture is grainy.
Silhouettes of Love Against a Fiery Sunset
A couple, hand in hand, stands silhouetted against a breathtaking sunset on the deck of a ship. Their gaze is fixed on a distant island, symbolizing hope and a shared future. The romantic and serene mood is heightened by the dramatic effect of their silhouettes against the fiery sky, creating a powerful image of love and connection.
Prompt
standing-tall Standing tall, holding hands, looking at the sunset: Romantic, adventurous, hopeful ; Couple standing on a ship’s deck; medium shot; Travel; Sunset over the ocean with a silhouette of a distant island; cinematic
Characteristic
Shot : A couple silhouetted against a sunset on a cruise ship deck. The sea stretches out before them, and a small island is visible in the distance.
Aesthetic Score : 0.7
Mood : romantic, dreamy, hopeful
Quality
Entropy : 6.75
Noise : 95
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are no major artifacts or errors visible in the image.
Energetic Dance Routine Under the Spotlight
A group of young women exude confidence and youthful energy as they perform a dynamic dance routine on stage, bathed in spotlights and a smoky atmosphere. The image captures the excitement and vibrancy of their performance.
Prompt
standing-tall Standing tall, synchronized movements, radiating confidence: Energetic, passionate, expressive ; Group of dancers performing on a stage; wide shot; Groups; Bright stage lights and a cheering audience; cinematic
Characteristic
Shot : A group of female dancers performing on a stage with red lights. The main dancer is front and center, while the rest of the dancers are arranged in a line behind her.
Aesthetic Score : 0.7
Mood : energetic, playful, confident
Quality
Entropy : 6.64
Noise : 106
Prompt Clip Score : 0.19
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some minor image artifacts are present, particularly in the background, but the overall quality is good.
A Giant Leap: Astronaut Plants American Flag on the Moon
A poignant image captures the moment an astronaut stands on the lunar surface, holding the American flag, with Earth a distant blue orb in the background. The scene evokes feelings of nostalgia, patriotism, and hope, highlighting the scale of human achievement and the enduring spirit of exploration.
Prompt
standing-tall Standing tall, holding a flag, gazing at Earth: Awe-inspiring, futuristic, surreal ; Astronaut standing on the surface of the moon; long shot; Adventure; Cratered lunar landscape with Earth in the distance; cinematic
Characteristic
Shot : An astronaut in a spacesuit stands on a lunar surface, holding an American flag. A large Earth is visible in the background.
Aesthetic Score : 0.7
Mood : patriotic, hopeful, contemplative
Quality
Entropy : 5.63
Noise : 113
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image is slightly blurry and the colors are a bit muted.
Firefighter Bravely Battles Blaze in Dramatic Scene
A powerful image captures the intensity of a fire as a firefighter, silhouetted against the flames, stands ready to battle the blaze. The scene is both dramatic and heroic, highlighting the courage of those who risk their lives to protect others.
Prompt
standing-tall Standing tall, holding a hose, facing the fire: Brave, determined, selfless ; Firefighter standing in front of a burning building; medium shot; Heroism; Flames and smoke billowing from the building; cinematic
Characteristic
Shot : A firefighter stands in front of a burning building with hoses and equipment. The fire is intense and the building is engulfed in flames. The firefighter is silhouetted against the fire.
Aesthetic Score : 0.7
Mood : dramatic, intense, heroic
Quality
Entropy : 6.83
Noise : 101
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.80
Image errors : There is some noise in the image and some artifacts in the smoke. The fire is a little too uniform and doesn’t have the chaotic nature of a real fire.
Champion’s Smile: Gamer Celebrates Victory in a Blaze of Lights
A young, bearded gamer beams with joy as he holds aloft his trophy, bathed in the colorful glow of a celebratory backdrop. The selective focus and dramatic lighting capture the excitement and triumph of his victory.
Prompt
standing-tall Standing tall, holding the trophy aloft, smiling: Triumphant, proud, accomplished ; Gamer holding a trophy after winning a tournament; close-up; Gaming; Crowd cheering and flashing cameras; cinematic
Characteristic
Shot : A young man, a gamer, holding a trophy in a dimly lit room with a crowd in the background. The lighting is soft and warm.
Aesthetic Score : 0.7
Mood : joyful, celebratory, triumphant
Quality
Entropy : 6.33
Noise : 102
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some minor artifacts in the image, particularly in the background.
Awe-Inspiring Mountaintop View: A Moment of Serenity
Three figures stand silhouetted against a breathtaking panorama of snow-capped peaks, capturing the essence of peace and wonder amidst the vastness of nature.
Prompt
standing-tall Standing tall, arms around each other, looking at the view: Joyful, united, adventurous ; Family standing on a mountain peak; wide shot; Travel; Panoramic view of snow-capped mountains and a clear blue sky; cinematic
Characteristic
Shot : Three people, two men and a woman, stand on a rocky mountain peak overlooking a snow-capped mountain range in the distance, with a cloudy sky above.
Aesthetic Score : 0.8
Mood : serene, adventurous, contemplative
Quality
Entropy : 6.19
Noise : 89
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : No notable errors in the image.
Conclusion
The results show that the generative AI model performed well in understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.35, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.46, which is also below average. This indicates that the model didn’t fully understand the scene described in the prompt and didn’t create an image that accurately reflects it.
- Aesthetic Analysis: The model scored 0.05, which is considered very good. This means that the generated image closely matched the expected aesthetic style.
Overall, the model seems to be better at capturing the desired aesthetic than understanding the scene and camera position. This suggests that the model might need further training to improve its ability to interpret and translate prompts into accurate visual representations.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://midjourney.com