AI's Artistic Journey: Capturing Poses, But Missing the Shot with Leonardo-ai

AI's Artistic Journey: Capturing Poses, But Missing the Shot with Leonardo-ai

Contents

In the realm of artificial intelligence, the ability to generate images based on textual descriptions is a rapidly evolving field. This blog post delves into the results of an AI model tasked with generating images based on scene descriptions and poses. The model demonstrates a remarkable ability to capture the aesthetic style of the desired poses, but it struggles with accurately capturing camera positions and shot composition. This suggests that while the model has made significant progress in understanding the visual elements of a scene, it still requires further training to fully grasp the nuances of camera angles and shot types. We will explore the model’s performance, analyzing its strengths and weaknesses, and discuss potential areas for improvement.

Created with: leonardo-ai

A Lone Figure Walks Towards the Storm

A hooded figure strides along a dusty desert road, their destination a looming, ominous storm. The dramatic contrast of light and dark, the eerie silence, and the figure’s mysterious purpose create a sense of foreboding and intrigue.

A Lone Figure Walks Towards the Storm

Prompt

poses running: determined, hopeful ; A lone figure in a tattered cloak; wide shot; Heroism; a desolate wasteland with a storm brewing in the distance; cinematic

Characteristic

Shot : A lone figure in a hooded cloak walks down a dirt road towards a distant figure. A dark, ominous storm cloud looms over the vast, flat landscape in the background.

Aesthetic Score : 0.8

Mood : dramatic, foreboding, mysterious

Quality

Entropy : 6.60

Noise : 97

Prompt Clip Score : 0.29

AI Evaluation

Likelihood of AI : 0.10

Image errors : No noticeable artifacts or errors.

Adventure Awaits: Young Man Explores Ancient Ruins

A young man, brimming with excitement, races towards the camera on a dirt path through a lush jungle. Behind him, an ancient stone building with weathered steps beckons, promising secrets and untold stories. The vibrant colors and adventurous mood capture the spirit of exploration and discovery.

Adventure Awaits: Young Man Explores Ancient Ruins

Prompt

poses running: excited, curious ; A young adventurer with a backpack; medium shot; Adventure; a lush jungle with ancient ruins in the background; cinematic

Characteristic

Shot : A young man is running on a dirt path in a jungle setting, with a large stone structure behind him. He is carrying a backpack. The path is lined with lush greenery on both sides.

Aesthetic Score : 0.7

Mood : adventure, energetic, excited

Quality

Entropy : 6.89

Noise : 109

Prompt Clip Score : 0.33

AI Evaluation

Likelihood of AI : 0.10

Image errors : No visible artifacts or errors in the image.

In the Zone: A Gamer’s Intense Focus Under Neon Lights

A young man, headphones on, is completely immersed in a video game. The dimly lit room, punctuated by the bright glow of multiple monitors, creates a dramatic atmosphere that captures the intensity of his focus. This image embodies the thrill and dedication of competitive gaming.

In the Zone: A Gamer’s Intense Focus Under Neon Lights

Prompt

poses running: intense, focused ; A gamer’s hands on a keyboard and mouse; close-up; Gaming; a brightly lit gaming room with a monitor displaying a virtual world; cinematic

Characteristic

Shot : A young man wearing headphones is sitting in a dimly lit room, focused on his computer with three monitors displaying a video game.

Aesthetic Score : 0.6

Mood : intense, focused, digital

Quality

Entropy : 6.56

Noise : 94

Prompt Clip Score : 0.28

AI Evaluation

Likelihood of AI : 0.20

Image errors : There is some noise in the shadows, especially around the edges of the monitors and the keyboard. The lighting seems slightly artificial and uneven.

Vibrant Life in an Indian Marketplace

A bustling marketplace in India comes alive with color and energy. The narrow street is lined with colorful buildings, and the warm light creates a sense of depth and dimension. The image captures the lively atmosphere of the market, with people walking and shopping, and flags and banners hanging overhead.

Vibrant Life in an Indian Marketplace

Prompt

poses running: energetic, joyful ; A group of tourists running through a bustling marketplace; long shot; Tourism; a vibrant marketplace with colorful stalls and vendors; cinematic

Characteristic

Shot : A bustling marketplace in India, with people walking and shopping. The scene is bright and colorful, with a sense of energy and excitement.

Aesthetic Score : 0.6

Mood : energetic, vibrant, lively

Quality

Entropy : 6.92

Noise : 111

Prompt Clip Score : 0.32

AI Evaluation

Likelihood of AI : 0.10

Image errors : No visible artifacts

Chasing Horizons: A Romantic Beach Run

Experience the joy and freedom of a couple running towards the turquoise ocean on a white sandy beach, with a blue sky filled with white clouds above. The scene exudes romance, carefree happiness, and a sense of hope as they chase the horizon together.

Chasing Horizons: A Romantic Beach Run

Prompt

poses running: romantic, carefree ; A couple running hand-in-hand along a beach; medium shot; Travel; a beautiful beach with turquoise water and white sand; cinematic

Characteristic

Shot : A couple is running on a beach towards the ocean. The woman is wearing a blue dress and the man is wearing blue swim trunks. The sun is setting in the background and the sky is a beautiful blue.

Aesthetic Score : 0.7

Mood : romantic, happy, carefree

Quality

Entropy : 6.10

Noise : 98

Prompt Clip Score : 0.32

AI Evaluation

Likelihood of AI : 0.20

Image errors : No apparent artifacts

Friends Running Through Golden Sunlight

Capture the joy of friendship and fitness with this image of four friends running through a grassy park, bathed in golden light. Their laughter and energy create a sense of movement and happiness, making it a perfect representation of a healthy and energetic lifestyle.

Friends Running Through Golden Sunlight

Prompt

poses running: happy, playful ; A group of friends running through a park; wide shot; Groups; a sunny park with green grass and trees; cinematic

Characteristic

Shot : A group of four young adults running together in a park. It’s a sunny day and the sky is blue with fluffy clouds.

Aesthetic Score : 0.7

Mood : joyful, energetic, healthy

Quality

Entropy : 6.87

Noise : 104

Prompt Clip Score : 0.30

AI Evaluation

Likelihood of AI : 0.10

Image errors : No noticeable artifacts or errors.

Superhero in Motion: Blurred Speed and Heroic Power

A dynamic scene captures a superhero running through a city at dusk, the blur of motion and dramatic lighting emphasizing their speed and heroic power.

Superhero in Motion: Blurred Speed and Heroic Power

Prompt

poses running: powerful, confident ; A superhero in a bright costume; close-up; Heroism; a city skyline with skyscrapers and flashing lights; cinematic

Characteristic

Shot : A superhero in a red and blue costume runs through a city at night.

Aesthetic Score : 0.6

Mood : dramatic, action, heroic

Quality

Entropy : 6.66

Noise : 98

Prompt Clip Score : 0.33

AI Evaluation

Likelihood of AI : 0.30

Image errors : The image is slightly blurry and the colors are a bit washed out.

A Lone Hiker Embraces the Majesty of a Snowy Mountain

This serene and inspiring image captures the essence of adventure as a lone hiker traverses a snowy path towards a majestic snow-capped mountain. The vastness of the landscape, with its mix of snow and brown grasses, creates a sense of scale and isolation, leaving the viewer feeling both humbled and inspired.

A Lone Hiker Embraces the Majesty of a Snowy Mountain

Prompt

poses running: determined, adventurous ; A lone explorer running through a snow-covered mountain pass; long shot; Adventure; a majestic mountain range with snow-capped peaks; cinematic

Characteristic

Shot : A lone hiker walks up a snowy path towards a majestic, snow-capped mountain peak. The scenery is vast and expansive with rolling hills and valleys.

Aesthetic Score : 0.8

Mood : serene, adventurous, inspiring

Quality

Entropy : 6.74

Noise : 104

Prompt Clip Score : 0.25

AI Evaluation

Likelihood of AI : 0.20

Image errors : No significant image errors

Stone-Clad Terror: Monster Rampages Through Futuristic City

A chilling scene unfolds as a monstrous, stone-like creature tears through a futuristic cityscape. The creature’s menacing pose and the terrified crowd in the background create a palpable sense of urgency and danger. This dark, futuristic image evokes a chilling mood, leaving viewers on the edge of their seats.

Stone-Clad Terror: Monster Rampages Through Futuristic City

Prompt

poses running: immersive, exciting ; A gamer’s avatar running through a virtual world; close-up; Gaming; a vibrant and detailed virtual world with fantastical creatures; cinematic

Characteristic

Shot : A reptilian humanoid creature runs through a futuristic, neon-lit city, with a crowd of blurred figures in the background.

Aesthetic Score : 0.7

Mood : dark, futuristic, suspenseful

Quality

Entropy : 6.65

Noise : 98

Prompt Clip Score : 0.28

AI Evaluation

Likelihood of AI : 1.00

Image errors : There are some minor artifacts and inconsistencies in the creature’s anatomy and the background, particularly in the blur and the lighting.

Family Fun on a Country Road: Joy and Determination in Every Step

A heartwarming scene of a family running down a country road, radiating happiness and energy. The parents, running side-by-side, beam with joy as they watch their smiling child in the stroller. This image captures the essence of family bonding and the simple pleasures of life.

Family Fun on a Country Road: Joy and Determination in Every Step

Prompt

poses running: happy, carefree ; A family running along a scenic road; medium shot; Travel; a winding road with rolling hills and a picturesque countryside; cinematic

Characteristic

Shot : A family of four is running down a country road. The father and mother are running on either side of the road. A young girl is sitting in a jogging stroller, being pushed by the mother.

Aesthetic Score : 0.7

Mood : happy, active, family

Quality

Entropy : 6.87

Noise : 103

Prompt Clip Score : 0.32

AI Evaluation

Likelihood of AI : 0.20

Image errors : No significant errors detected. The colors appear natural, and there are no distracting artifacts.

Conclusion

The results show that the generative AI model performed well in terms of understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:

  • Camera Position: The model scored 0.46, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
  • Shot Analysis: The model scored 0.58, which is considered average. This indicates that the model was able to understand the scene in the prompt reasonably well, but there might be some discrepancies between the intended shot and the generated image.
  • Aesthetic Analysis: The model scored 0.08, which is considered very good. This means that the generated image closely matched the expected aesthetic style, despite the other shortcomings.

Overall, the model seems to be better at understanding the aesthetic style than the camera position and shot composition. This suggests that the model might need further training to improve its ability to accurately interpret and implement camera positions and shot types.

Sources: