AI's Artistic Struggle: Capturing the Essence of Poses with Freepik
Generating images from textual descriptions is a rapidly evolving area of artificial intelligence. This blog post presents the results of an experiment in which an AI model was asked to create images from prompts that specify poses and scenes. The model handles shot composition reasonably well and comes close on camera position, but it struggles to match the desired aesthetic, which highlights the ongoing challenges in AI’s artistic development. The sections below show ten generated images with their prompts and evaluation metrics, then analyze the model’s strengths and weaknesses and discuss the implications for the future of AI-generated art.
Created with: freepik
Knight of the Storm
A lone knight in full armor stands on a rocky precipice, silhouetted against a stormy sky. Lightning illuminates the distant city below, creating a dramatic and melancholic scene. The knight’s pose suggests a moment of tension or anticipation, hinting at an epic tale unfolding.
Prompt
poses dutch-angle: determined, heroic, hopeful ; A lone knight, standing tall on a hilltop overlooking a besieged city; wide shot; heroism; a dramatic, stormy sky with flashes of lightning; cinematic
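Every prompt in this post follows the same semicolon-separated layout, so it can be split into its parts programmatically. The sketch below is one way to do that in Python; the field names (`technique`, `moods`, `subject`, `shot`, `category`, `background`, `style`) are labels invented here for illustration and are not Freepik terminology.

```python
from dataclasses import dataclass


@dataclass
class PosePrompt:
    # All field names are hypothetical labels chosen for this sketch.
    technique: str        # text before the colon, e.g. "poses dutch-angle"
    moods: list[str]      # comma-separated moods after the colon
    subject: str
    shot: str
    category: str
    background: str
    style: str


def parse_pose_prompt(prompt: str) -> PosePrompt:
    """Split a prompt of the form used in this post into named parts."""
    head, *rest = [part.strip() for part in prompt.split(";")]
    if len(rest) != 5:
        raise ValueError(f"expected 6 ';'-separated parts, got {len(rest) + 1}")
    technique, _, mood_text = head.partition(":")
    moods = [m.strip() for m in mood_text.split(",") if m.strip()]
    subject, shot, category, background, style = rest
    return PosePrompt(technique.strip(), moods, subject, shot,
                      category, background, style)


knight = parse_pose_prompt(
    "poses dutch-angle: determined, heroic, hopeful ; A lone knight, standing "
    "tall on a hilltop overlooking a besieged city; wide shot; heroism; "
    "a dramatic, stormy sky with flashes of lightning; cinematic"
)
print(knight.shot, "|", knight.moods)
```

Splitting the prompt this way makes it easy to compare the requested shot type with what the generated image actually shows.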
Characteristic
Shot : A lone knight stands on a rocky hilltop, overlooking a city illuminated by night. A storm rages overhead, with lightning bolts illuminating the sky.
Aesthetic Score : 0.7
Mood : epic, dramatic, heroic
Quality
Entropy : 6.64
Noise : 60
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.80
Image errors : The knight’s armor appears somewhat blurry, as do the city lights. The lightning bolts are somewhat artificial and lacking in naturalism. Some of the textures, particularly those of the grass, are very noticeable and feel digital.
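The post does not say how the Entropy and Prompt Clip Score values were computed. A common reading, assumed here, is that Entropy is the Shannon entropy of the 8-bit pixel intensity histogram (so at most 8 bits) and that the Clip Score is the cosine similarity between a text embedding and an image embedding. The sketch below shows both formulas using only the standard library; the pixel list and the embedding vectors are stand-ins, not output from a real model.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a flat sequence of pixel intensities."""
    total = len(pixels)
    counts = Counter(pixels)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Stand-in data: every 8-bit intensity appears exactly once.
uniform_pixels = list(range(256))
print(shannon_entropy(uniform_pixels))

# Stand-in embeddings (real CLIP embeddings have hundreds of dimensions).
text_vec = [1.0, 0.0, 1.0]
image_vec = [1.0, 1.0, 0.0]
print(cosine_similarity(text_vec, image_vec))
```

Under this assumption, entropy values around 6.6 to 6.8 would indicate images that use most of the available tonal range.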
Silhouettes in the Forest: A Glimpse of Hope
Five figures stand silhouetted against the backdrop of a dense forest, their gazes fixed on a distant, radiant light source. Rays of light pierce the darkness, creating a sense of mystery and adventure. This image evokes a feeling of hope, suggesting a journey towards a brighter future.
Prompt
poses dutch-angle: adventurous, mysterious, awe-inspiring ; A group of explorers, silhouetted against the setting sun, standing at the edge of a vast, unexplored jungle; medium shot; adventure; lush green foliage and towering trees; cinematic
Characteristic
Shot : A group of five people stand in silhouette in a dense forest, looking towards a hazy sun breaking through the trees.
Aesthetic Score : 0.7
Mood : mysterious, adventurous, hopeful
Quality
Entropy : 6.72
Noise : 78
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image is slightly overexposed and there is some noise in the shadows.
Lost in the Code: A Moment of Intense Focus
A young man, illuminated by the glow of his computer screens, sits engrossed in his work. The dimly lit room adds a dramatic touch, highlighting his concentration and the immersive nature of his task.
Prompt
poses dutch-angle: intense, focused, competitive ; A gamer, intensely focused on a screen, fingers flying across a keyboard; close-up; gaming; a brightly lit room with gaming peripherals and posters; cinematic
Characteristic
Shot : A young man wearing headphones is seated in front of his computer, typing on a keyboard. The room is dimly lit, with only the glow of the computer screens illuminating the scene.
Aesthetic Score : 0.6
Mood : focused, intense, concentrated
Quality
Entropy : 6.79
Noise : 56
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some minor artifacts in the image, particularly around the edges of the screens. The lighting is a bit uneven, with some areas appearing too dark.
A Parisian Love Story Unfolds at Dusk
In the heart of Paris, a couple shares an intimate moment at a quaint cafe, with the iconic Eiffel Tower standing tall in the background. As the sun sets, their love story unfolds amidst the romantic Parisian atmosphere.
Prompt
poses dutch-angle: romantic, nostalgic, joyful ; A couple, hand-in-hand, gazing out at the Eiffel Tower from a Parisian cafe; medium shot; tourism; bustling Parisian streets with charming cafes and shops; cinematic
Characteristic
Shot : A couple is sitting at a cafe table in Paris, with the Eiffel Tower in the background.
Aesthetic Score : 0.7
Mood : romantic, cozy, Parisian
Quality
Entropy : 6.84
Noise : 71
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some minor artifacts in the image, particularly around the edges of the Eiffel Tower.
A Hiker’s Journey Through a Snowy Mountain Valley
Experience the tranquility and adventure of a lone hiker traversing a narrow path through a snowy mountain valley. The vastness of the mountains and the clear blue sky create a sense of peace and inspiration, while the hiker’s perspective emphasizes the solitude of the journey.
Prompt
poses dutch-angle: free-spirited, adventurous, inspiring ; A backpacker, walking along a winding mountain path, with breathtaking views of snow-capped peaks; medium shot; travel; a rugged mountain landscape with clear blue skies; cinematic
Characteristic
Shot : A lone hiker walks along a path through a mountain valley, with snow-capped peaks in the distance.
Aesthetic Score : 0.8
Mood : tranquil, adventurous, inspiring
Quality
Entropy : 6.72
Noise : 73
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Cheers to Friendship and Good Times!
A group of friends raise their glasses in a dimly lit restaurant, bathed in warm candlelight. The close-up shot captures the joy and camaraderie of the moment, creating a feeling of warmth and connection.
Prompt
poses dutch-angle: joyful, celebratory, connected ; A group of friends, laughing and celebrating, raising their glasses in a toast; medium shot; groups; a lively bar or restaurant with warm lighting and festive decorations; cinematic
Characteristic
Shot : A group of friends are toasting with drinks at a dimly lit restaurant. There are candles on the table, and the warm lighting creates a cozy and festive atmosphere.
Aesthetic Score : 0.7
Mood : joyful, celebratory, friendly
Quality
Entropy : 6.67
Noise : 55
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : The lighting is slightly uneven, and there are some minor artifacts around the edges of the image.
A Moment of Wonder: Astronaut Gazes at Earth from Space
A lone astronaut, lost in contemplation, looks out of a spacecraft window at the breathtaking sight of Earth. The blue planet hangs in the distance, bathed in sunlight, with swirling clouds and shimmering oceans. This image evokes a sense of awe, hope, and the vastness of the universe.
Prompt
poses dutch-angle: awe-inspiring, contemplative, hopeful ; A lone astronaut, gazing out at the Earth from a space station window; close-up; heroism; the vastness of space with stars and planets in the background; cinematic
Characteristic
Shot : An astronaut looks out the window of a spacecraft at Earth from space.
Aesthetic Score : 0.7
Mood : solitude, wonder, awe
Quality
Entropy : 6.70
Noise : 67
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to have been generated by AI, with some artificial features such as the astronaut’s helmet and the Earth.
Conquering the Cliffside: A Breathtaking View of Nature’s Majesty
Experience the thrill of adventure as climbers descend a towering cliff face, overlooking a breathtaking valley with a winding river and a majestic waterfall. This epic scene captures the raw power and beauty of nature, leaving you in awe of its grandeur.
Prompt
poses dutch-angle: exciting, daring, adventurous ; A group of adventurers, rappelling down a steep cliff face, with a breathtaking view of a valley below; wide shot; adventure; a dramatic mountain landscape with waterfalls and lush vegetation; cinematic
Characteristic
Shot : A group of climbers are rappelling down a cliff face, overlooking a valley with a river and waterfall.
Aesthetic Score : 0.8
Mood : adventurous, awe-inspiring, dramatic
Quality
Entropy : 6.66
Noise : 93
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Triumphant Moment Captured in a Burst of Light
A young man basks in the glow of victory, holding aloft a trophy as a cheering crowd blurs into a sea of joyous faces. Dramatic lighting and a sense of motion capture the energy and excitement of this celebratory moment.
Prompt
poses dutch-angle: triumphant, celebratory, exciting ; A gamer, celebrating a victory, holding up a trophy; close-up; gaming; a brightly lit stage with cheering crowds and flashing lights; cinematic
Characteristic
Shot : A young man in a blue jersey is holding a golden trophy high above his head. He is smiling broadly and looks genuinely excited. Behind him, a crowd of people is cheering and celebrating with him. The scene is lit by spotlights and the atmosphere is electric.
Aesthetic Score : 0.7
Mood : joyful, celebratory, triumphant
Quality
Entropy : 6.77
Noise : 56
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image does not have any visible artifacts or errors.
Sunset Stroll: A Family’s Tranquil Moment on the Beach
A heartwarming scene unfolds as a family of three walks along a sandy beach towards the ocean during a breathtaking sunset. The father leads the way, followed by his wife and daughter, their silhouettes bathed in the warm glow of the setting sun. The peaceful atmosphere and loving bond between them create a truly tranquil moment, captured in this beautiful image.
Prompt
poses dutch-angle: peaceful, heartwarming, nostalgic ; A family, standing on a beach, watching the sunset over the ocean; medium shot; travel; a serene beach with golden sand and turquoise waters; cinematic
Characteristic
Shot : A family of three is walking on a sandy beach at sunset, away from the camera and towards the setting sun.
Aesthetic Score : 0.7
Mood : happy, peaceful, hopeful
Quality
Entropy : 6.56
Noise : 56
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.30
Image errors : No noticeable errors or artifacts in the image.
Conclusion
The results show that the generative AI model performed reasonably well on shot analysis and came close to the “good” range on camera position, but struggled with aesthetic analysis. Here’s a breakdown:
Camera Position:
- Score: 0.45
- Interpretation: This score falls just below the “good” range of 0.5 to 0.75. It suggests that the model came close to, but did not fully capture, the intended camera position described in the prompt.
Shot Analysis:
- Score: 0.52
- Interpretation: This score falls within the “good” range of 0.5 to 0.75. It indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it to a decent degree.
Aesthetic Analysis:
- Score: 0.06
- Interpretation: At 0.06, this score falls inside the quoted “very good” range of -0.2 to 0.1 rather than above it. Even so, a value this close to zero suggests that the generated image’s aesthetic corresponded only weakly to the expected aesthetic described in the prompt.
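Each of the three scores can be checked against its quoted range with a few lines of Python. This is only a sketch: the scores and range bounds are copied from the breakdown above, while the function name and the choice to treat both endpoints as inclusive are assumptions.

```python
def within(score, low, high):
    """True if score lies in the closed interval [low, high] (inclusive by assumption)."""
    return low <= score <= high


# (name, score, range low, range high), copied from the breakdown above.
breakdown = [
    ("Camera Position", 0.45, 0.5, 0.75),
    ("Shot Analysis", 0.52, 0.5, 0.75),
    ("Aesthetic Analysis", 0.06, -0.2, 0.1),
]

range_check = {name: within(score, low, high)
               for name, score, low, high in breakdown}

for name, inside in range_check.items():
    print(f"{name}: {'inside' if inside else 'outside'} the quoted range")
```

Only the camera position score lands outside its quoted range; the shot analysis and aesthetic analysis scores both fall inside theirs.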
Overall:
The model demonstrates a reasonable understanding of shot composition and comes close on camera position, but struggles to match the desired aesthetic. This suggests that the model might need further training to better understand and translate aesthetic preferences into visual outputs.
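For a per-image view alongside these overall scores, the metrics listed under each image can be averaged. The sketch below uses the values reported above with shortened titles; the 0.5 cut-off for flagging an image as likely AI-generated is an arbitrary threshold chosen for illustration, not one used in the experiment.

```python
from statistics import mean

# (short title, aesthetic score, prompt CLIP score, likelihood of AI),
# copied from the ten image cards in this post.
results = [
    ("Knight of the Storm", 0.7, 0.30, 0.80),
    ("Silhouettes in the Forest", 0.7, 0.29, 0.30),
    ("Lost in the Code", 0.6, 0.28, 0.20),
    ("A Parisian Love Story", 0.7, 0.29, 0.20),
    ("A Hiker's Journey", 0.8, 0.23, 0.10),
    ("Cheers to Friendship", 0.7, 0.25, 0.20),
    ("A Moment of Wonder", 0.7, 0.27, 0.80),
    ("Conquering the Cliffside", 0.8, 0.26, 0.10),
    ("Triumphant Moment", 0.7, 0.24, 0.10),
    ("Sunset Stroll", 0.7, 0.26, 0.30),
]

AI_THRESHOLD = 0.5  # assumption: illustrative cut-off, not from the experiment

summary = {
    "aesthetic": round(mean(r[1] for r in results), 3),
    "clip": round(mean(r[2] for r in results), 3),
    "likelihood_of_ai": round(mean(r[3] for r in results), 3),
}
flagged = [title for title, _, _, ai in results if ai >= AI_THRESHOLD]

print(summary)
print("Flagged as likely AI:", flagged)
```

Across the ten images the averages come out to roughly 0.71 for aesthetic score, 0.267 for prompt CLIP score, and 0.31 for likelihood of AI, and the two images above the illustrative threshold are the knight and the astronaut.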
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://www.freepik.com