AI Captures the Scene, But Misses the Feeling with Scenario
Dramatic poses are a powerful tool in visual storytelling, used to convey emotion, action, and character. They often involve dynamic angles, strong silhouettes, and a sense of movement. This experiment aimed to test an AI model’s ability to generate images that capture the essence of these dramatic poses. The results reveal both strengths and weaknesses in the model’s capabilities.
Created with: scenario
Silhouetted Serenity: A Moment of Contemplation at Sunset
A lone figure stands on a rocky peak, bathed in the warm glow of a setting sun. The vast mountain range stretches out before them, creating a scene of peaceful solitude and hopeful contemplation. The silhouette of the figure against the sunset evokes a sense of quiet reflection and connection with the natural world.
Prompt
poses low-angle: inspiring, triumphant ; A lone figure standing atop a mountain peak, silhouetted against the rising sun; wide shot; heroism; majestic mountain range with clouds swirling below; cinematic
Characteristic
Shot : A lone figure stands on a mountaintop, silhouetted against a breathtaking sunset. The sky is a vibrant orange, and the sun is a large, glowing orb. The figure’s presence emphasizes the vastness of the landscape.
Aesthetic Score : 0.7
Mood : tranquil, serene, contemplative
Quality
Entropy : 6.25
Noise : 82
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.90
Image errors : The rendering of the sky appears slightly flat, lacking depth and texture. Some areas of the mountains exhibit a lack of detail.
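The article does not say how the Entropy figure under Quality is computed. One common reading, assumed here, is the Shannon entropy of the image's 8-bit grayscale histogram, which ranges from 0 bits (a flat, single-tone image) to 8 bits (every intensity equally likely). A minimal Python sketch under that assumption follows; `shannon_entropy` is a hypothetical helper, and the pixel lists stand in for a decoded image.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Return the Shannon entropy, in bits, of a list of pixel intensities.

    Assumption: this mirrors a histogram-based entropy metric; the tool
    used in the article may compute its value differently.
    """
    total = len(pixels)
    if total == 0:
        return 0.0
    counts = Counter(pixels)  # histogram: intensity -> number of pixels
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Illustrative inputs, not data from the article.
flat_image = [128] * 1000           # one tone only
two_tone_image = [0, 255] * 500     # two tones, equally frequent
full_range_image = list(range(256)) # every 8-bit intensity exactly once

flat_entropy = shannon_entropy(flat_image)
two_tone_entropy = shannon_entropy(two_tone_image)
full_range_entropy = shannon_entropy(full_range_image)
```

Under this reading, the values of 6.18 to 6.86 reported for the images in this experiment would sit in the upper part of the 0 to 8 scale, which fits scenes with a wide spread of tones.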
Uncharted Territory: A Journey into the Unknown
A group of intrepid explorers, clad in rugged attire, venture deep into a lush jungle, their path leading towards a colossal, moss-covered stone structure. The air hums with anticipation and mystery, promising adventure and untold discoveries.
Prompt
poses low-angle: mysterious, adventurous ; A group of explorers navigating a dense jungle, their faces illuminated by the light of their headlamps; medium shot; adventure; lush green foliage and ancient ruins in the background; cinematic
Characteristic
Shot : A group of explorers dressed in jungle attire are standing in a dense, lush forest. There is a large stone structure in the background, partially overgrown with vegetation. The explorers are looking up at the structure, perhaps in awe or anticipation.
Aesthetic Score : 0.7
Mood : adventurous, mysterious, hopeful
Quality
Entropy : 6.67
Noise : 98
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some minor artifacts, particularly around the edges of the characters, but they are not particularly noticeable.
Lost in Thought, Embracing the Future
A young woman, lost in contemplation, holds a device in her hand. The blurred background and dramatic lighting create a sense of mystery and intrigue, hinting at a world of technological possibilities.
Prompt
poses low-angle: intense, focused ; A gamer’s hands intensely manipulating a controller, their face illuminated by the glow of the monitor; close-up; gaming; a vibrant, futuristic cityscape projected on the screen; cinematic
Characteristic
Shot : A young woman with blue eyes and pink hair is wearing a black headset. The focus is on her face as she looks off to the side, holding a futuristic-looking device in her hand. The background is blurred and features a colorful neon glow.
Aesthetic Score : 0.8
Mood : futuristic, mysterious, introspective
Quality
Entropy : 6.86
Noise : 90
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image is slightly oversharpened, resulting in a halo effect around the subject’s hair and edges of the device. There are also some minor artifacts around the background lights.
A Timeless Monument in a Grand European Square
A statue of a man in a formal pose stands proudly atop a pedestal in a European city square, surrounded by other sculptures and a stately building. The scene evokes a sense of grandeur, history, and a touch of somber reflection.
Prompt
poses low-angle: awe-inspiring, historical ; A towering statue of a historical figure, viewed from the perspective of a tourist looking up in awe; wide shot; tourism; a bustling city square with other tourists and vendors; cinematic
Characteristic
Shot : A statue of a man in a suit and hat stands atop a monument in a city square. The statue is surrounded by other figures and is in front of a large building. The sky is blue and there are clouds in the background.
Aesthetic Score : 0.7
Mood : grand, historical, urban
Quality
Entropy : 6.70
Noise : 92
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable errors.
Lost in the Vastness: A Woman Finds Solitude in the Desert
A solitary figure in a long brown dress and black hat stands amidst endless sand dunes, her gaze fixed on the horizon. The vastness of the desert creates a sense of isolation and peace, highlighting the smallness of humanity against the backdrop of nature’s grandeur.
Prompt
poses low-angle: solitude, contemplative ; A lone traveler gazing out at a vast desert landscape, their back to the camera; medium shot; travel; endless sand dunes stretching out to the horizon; cinematic
Characteristic
Shot : A lone woman in a long brown dress and black hat stands on a sand dune in the desert, looking out at the horizon. The sky is a pale blue with some clouds.
Aesthetic Score : 0.7
Mood : lonely, contemplative, serene
Quality
Entropy : 6.18
Noise : 73
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Confetti Showers and Smiles: Capturing the Joy of Celebration
A vibrant scene of young women celebrating, with confetti raining down and smiles all around. The image captures the infectious energy and joy of a festive gathering, whether it’s a party or a concert. The bright colors and dynamic composition create a visually appealing and uplifting experience.
Prompt
poses low-angle: joyful, celebratory ; A group of friends celebrating a victory, their arms raised in the air, viewed from the perspective of someone standing below; wide shot; groups; a brightly lit party scene with confetti and balloons; cinematic
Characteristic
Shot : A group of young women are celebrating and having fun, surrounded by confetti. The main subject is looking up and laughing, with her arms raised in the air. The lighting is warm and inviting.
Aesthetic Score : 0.7
Mood : joyful, celebratory, vibrant
Quality
Entropy : 6.80
Noise : 97
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable image errors.
One Man Against the Flames: Firefighter Battles Massive Blaze
A lone firefighter stands defiant amidst a raging inferno, the towering flames and billowing smoke creating a dramatic backdrop of danger and intensity. The scene captures the raw power of the fire and the courage of the firefighter facing it head-on.
Prompt
poses low-angle: intense, heroic ; A lone firefighter battling a raging inferno, their silhouette framed against the flames; medium shot; heroism; a burning building with smoke billowing into the sky; cinematic
Characteristic
Shot : A firefighter standing in the middle of a street engulfed in flames. The flames are billowing up in the background, and the buildings on either side of the street are on fire. The firefighter is wearing a helmet and protective gear, and he is looking towards the flames.
Aesthetic Score : 0.6
Mood : dramatic, intense, danger
Quality
Entropy : 6.85
Noise : 115
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image is slightly blurry, but this is likely an artistic choice. Some of the details in the buildings are not fully resolved. The fire looks a bit unnatural and cartoony. There are also some artifacts from the image processing in the fire.
Precariously Balanced: A Climber’s View of Majestic Grandeur
A climber hangs precariously from a sheer cliff face, gazing out at a breathtaking canyon. The winding river below and the vast expanse of the landscape create a dramatic and adventurous scene, evoking a sense of awe and danger.
Prompt
poses low-angle: thrilling, adventurous ; A group of adventurers rappelling down a sheer cliff face, their ropes dangling below; medium shot; adventure; a breathtaking view of a mountain range and a valley below; cinematic
Characteristic
Shot : A rock climber hangs from a cliff face with a stunning valley and river below.
Aesthetic Score : 0.8
Mood : adventurous, daring, awe-inspiring
Quality
Entropy : 6.69
Noise : 109
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no major artifacts or errors visible in the image.
Lost in the Neon Glow: A Gamer’s Focus
A young woman, bathed in the vibrant hues of neon, is completely engrossed in her gaming session. The dimly lit room, filled with monitors and equipment, creates an atmosphere of intense focus and futuristic immersion. The dramatic lighting highlights her determination, leaving the viewer with a sense of mystery and intrigue.
Prompt
poses low-angle: immersive, fantastical ; A gamer’s hands deftly navigating a virtual world, their fingers flying across the keyboard; close-up; gaming; a vibrant, fantasy world displayed on the monitor; cinematic
Characteristic
Shot : A young woman wearing a headset is gaming in a futuristic-looking room with bright lights. She is focused on the screen in front of her and is typing on a keyboard.
Aesthetic Score : 0.7
Mood : intense, focused, futuristic
Quality
Entropy : 6.78
Noise : 87
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some slight artifacts, particularly in the woman’s hair. The lighting on the woman’s face is a little flat and unrealistic.
Mystical Sunset at the Ancient Temple
A group of women in flowing dresses stand before a majestic sandstone temple, bathed in the warm glow of the setting sun. The intricate carvings and dramatic lighting create a sense of mystery and wonder, inviting you to explore this serene and adventurous scene.
Prompt
poses low-angle: awe-inspiring, historical ; A group of tourists standing in awe before a magnificent ancient temple, their faces illuminated by the setting sun; wide shot; tourism; a sprawling temple complex with intricate carvings and statues; cinematic
Characteristic
Shot : Five women stand in front of an ancient stone temple with elaborate architecture, likely located in Southeast Asia. The setting sun casts a warm glow over the scene.
Aesthetic Score : 0.7
Mood : mysterious, contemplative, serene
Quality
Entropy : 6.79
Noise : 98
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some slight noise is visible in the sky, but it is not overly distracting.
Conclusion
The results show that the generative AI model performed well on camera position and shot analysis but struggled with aesthetic analysis.
Here’s a breakdown:
- Camera Position: The model scored 0.53, which falls within the “good” range (0.5 to 0.75), though near its lower bound. This indicates that the model generally captured the camera positions described in the prompts.
- Shot Analysis: The model scored 0.59, also within the “good” range. This suggests that the model understood the scenes described in the prompts and created images that reflected that understanding.
- Aesthetic Analysis: The model scored 0.27, which falls well outside the “very good” range (-0.2 to 0.1). This indicates that the generated images did not match the expected aesthetic as closely as they matched the camera position and shot descriptions.
Overall, the model demonstrates a good understanding of camera positions and scene composition but needs improvement in capturing the desired aesthetic.
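For readers who want to compare the ten images side by side, the per-image metrics listed in the entries above can be collected and averaged. The Python sketch below copies those values as reported; the field names, the shortened titles, and the 0.5 cutoff for "likely AI" are assumptions made for illustration. These averages are not the three scores quoted in the breakdown, whose calculation the article does not describe.

```python
from statistics import mean

# (title, aesthetic, entropy, noise, clip_score, likelihood_of_ai),
# copied from the ten image entries in article order.
IMAGES = [
    ("Silhouetted Serenity", 0.7, 6.25, 82, 0.26, 0.90),
    ("Uncharted Territory", 0.7, 6.67, 98, 0.34, 0.80),
    ("Lost in Thought", 0.8, 6.86, 90, 0.26, 0.80),
    ("A Timeless Monument", 0.7, 6.70, 92, 0.24, 0.10),
    ("Lost in the Vastness", 0.7, 6.18, 73, 0.27, 0.10),
    ("Confetti Showers and Smiles", 0.7, 6.80, 97, 0.26, 0.20),
    ("One Man Against the Flames", 0.6, 6.85, 115, 0.27, 0.70),
    ("Precariously Balanced", 0.8, 6.69, 109, 0.29, 0.10),
    ("Lost in the Neon Glow", 0.7, 6.78, 87, 0.27, 0.90),
    ("Mystical Sunset", 0.7, 6.79, 98, 0.33, 0.20),
]

FIELDS = ("aesthetic", "entropy", "noise", "clip_score", "likelihood_of_ai")
AI_THRESHOLD = 0.5  # hypothetical cutoff, not taken from the article


def summarize(images):
    """Return the mean of each metric across all images."""
    return {
        name: mean(row[i + 1] for row in images)
        for i, name in enumerate(FIELDS)
    }


def flagged_as_ai(images, threshold=AI_THRESHOLD):
    """Return titles whose Likelihood of AI meets or exceeds the threshold."""
    return [row[0] for row in images if row[5] >= threshold]


summary = summarize(IMAGES)
flagged = flagged_as_ai(IMAGES)

for name, value in summary.items():
    print(f"{name}: {value:.3f}")
print(f"flagged as likely AI: {len(flagged)} of {len(IMAGES)}")
```

Keeping the rows as plain tuples keeps the sketch short; a dataclass or a CSV file would be a natural next step if more images were added.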
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://www.scenario.com