AI's Artistic Struggle: Capturing the Essence of Poses with Freepik
- 9 minutes read - 1823 wordsTable of Contents
In the realm of artificial intelligence, the ability to generate images based on textual descriptions is a rapidly evolving field. This blog post delves into an experiment where an AI model was tasked with creating images based on specific poses and scenes. While the model demonstrated proficiency in understanding camera positions and shot types, it fell short in capturing the intended aesthetic, highlighting the ongoing challenges in AI’s artistic capabilities. This exploration sheds light on the complexities of translating human artistic vision into the digital realm, showcasing both the strengths and limitations of current AI technology.
Created with: freepik
Silhouetted Against the Sunset: A Moment of Contemplation on the Mountain Peak
A lone hiker stands on a mountain peak, bathed in the warm glow of the setting sun. The vast valley below stretches out before them, a river winding its way through the landscape. The scene evokes a sense of serenity, contemplation, and the inspiring vastness of nature.
Prompt
poses over-the-shoulder: epic, hopeful ; A lone adventurer, silhouetted against a setting sun; wide shot; Adventure; a vast, rugged mountain range; cinematic
Characteristic
Shot : A lone hiker stands on a mountain peak, looking out over a vast valley at sunset.
Aesthetic Score : 0.8
Mood : serene, contemplative, adventurous
Quality
Entropy : 6.71
Noise : 34
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no noticeable artifacts or errors in the image.
Firefighter Bravely Faces Blazing Inferno
A dramatic scene unfolds as a firefighter, clad in full gear, stands defiant against a backdrop of raging flames and billowing smoke. The contrast between the fire’s intensity and the firefighter’s resolute expression highlights the danger and their unwavering courage.
Prompt
poses over-the-shoulder: intense, dramatic ; A firefighter, helmet gleaming, facing a raging inferno; medium shot; Heroism; a burning building with smoke billowing; cinematic
Characteristic
Shot : A firefighter in full gear, standing in front of a burning building, looking at the flames.
Aesthetic Score : 0.7
Mood : serious, dramatic, heroic
Quality
Entropy : 6.90
Noise : 53
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.20
Image errors : no significant errors
Lost in the Code: A Portrait of Focus and Intensity
A young man, eyes locked on the camera, is immersed in his work. The blurred background and dramatic lighting create a sense of intensity and focus, highlighting the power of concentration in the digital age.
Prompt
poses over-the-shoulder: focused, intense ; A gamer, eyes glued to the screen, fingers flying across the keyboard; close-up; Gaming; a brightly lit gaming setup with flashing lights; cinematic
Characteristic
Shot : A young man is sitting in front of a computer, wearing headphones and looking at the camera. He is typing on a keyboard.
Aesthetic Score : 0.6
Mood : intense, focused, serious
Quality
Entropy : 6.70
Noise : 49
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly blurry, and there are some artifacts in the background.
Parisian Elegance: A Timeless Moment at the Eiffel Tower
Experience the enchanting allure of Paris as a young woman in a beige trench coat gracefully poses in front of the iconic Eiffel Tower. With a captivating glance over her shoulder, she invites you to join her in this romantic and elegant scene, surrounded by lush trees and the vibrant energy of the city.
Prompt
poses over-the-shoulder: joyful, awe-inspired ; A tourist, camera in hand, gazing at the Eiffel Tower; medium shot; Tourism; a bustling Parisian street with the Eiffel Tower in the background; cinematic
Characteristic
Shot : A young woman is standing in front of the Eiffel Tower in Paris, France. She is wearing a tan trench coat and is looking over her shoulder at the camera. The Eiffel Tower is in the background and there are trees on either side of the woman. There are also many people walking around in the background.
Aesthetic Score : 0.7
Mood : romantic, dreamy, Parisian
Quality
Entropy : 6.76
Noise : 53
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : There is a slight blur in the image. It might be due to camera movement or motion blur.
Sunset Serenity on the Beach
A woman finds peace and tranquility as she gazes out at the ocean at sunset, with swaying palm trees framing the scene. The dramatic hues of the sky enhance the sense of calm and contemplation.
Prompt
poses over-the-shoulder: peaceful, contemplative ; A backpacker, gazing out at a breathtaking sunset over the ocean; wide shot; Travel; a serene beach with palm trees and turquoise water; cinematic
Characteristic
Shot : A woman stands on a beach, silhouetted against a sunset. Palm trees and a body of water are visible.
Aesthetic Score : 0.75
Mood : serene, contemplative, peaceful
Quality
Entropy : 6.48
Noise : 54
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no obvious errors in the image. There may be minor noise in some areas, but it is not distracting.
Campfire Tales Under a Starry Sky
A group of friends gather around a crackling campfire, their laughter echoing under a vast, star-filled sky. The warm glow of the flames creates a cozy atmosphere, fostering a sense of intimacy and connection as they share stories and enjoy the beauty of the night.
Prompt
poses over-the-shoulder: warm, nostalgic ; A group of friends, laughing and sharing stories, around a campfire; medium shot; Groups; a campsite under a starry night sky; cinematic
Characteristic
Shot : A group of friends are gathered around a campfire at night, enjoying each other’s company. The scene is set in a forest, with a tent in the background and a starry sky above. The warm glow of the fire creates a cozy and inviting atmosphere.
Aesthetic Score : 0.75
Mood : relaxed, cozy, happy
Quality
Entropy : 6.41
Noise : 53
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are some minor artifacts in the sky, particularly around the stars. The image also appears to be slightly overexposed.
The Intensity of Scientific Discovery
Two women in lab coats, their faces etched with focus, peer into a microscope. The image captures the seriousness and dedication of scientific research, highlighting the intensity and importance of their work.
Prompt
poses over-the-shoulder: focused, determined ; A scientist, peering through a microscope, engrossed in her research; close-up; Heroism; a laboratory filled with scientific equipment; cinematic
Characteristic
Shot : Two women in lab coats are looking through a microscope in a laboratory. It appears to be a modern laboratory with lots of white and silver.
Aesthetic Score : 0.7
Mood : serious, focused, professional
Quality
Entropy : 6.93
Noise : 59
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible artifacts or errors.
Soaring High: A Pilot’s Moment of Serenity
A female pilot, her gaze fixed on the endless expanse of clouds below, embodies focus, serenity, and the thrill of adventure. The vastness of the sky and her determined expression evoke a sense of awe and the freedom of flight.
Prompt
poses over-the-shoulder: exhilarating, adventurous ; A pilot, gripping the controls, soaring through the clouds; wide shot; Adventure; a cockpit with a view of the vast, blue sky; cinematic
Characteristic
Shot : A young woman wearing a headset and sunglasses sits in the cockpit of a small plane, looking out the window at the clouds below.
Aesthetic Score : 0.8
Mood : serene, focused, determined
Quality
Entropy : 6.63
Noise : 73
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some noise is visible in the image, especially in the shadows.
The Art of Plating: A Chef’s Focused Precision
Witness the meticulous artistry of a chef in a pristine kitchen as they plate a dish with focused precision. Warm lighting and a close-up shot create an intimate atmosphere, building anticipation for the culinary masterpiece to come.
Prompt
poses over-the-shoulder: passionate, artistic ; A chef, meticulously plating a dish, surrounded by the aromas of fresh ingredients; close-up; Tourism; a bustling kitchen in a gourmet restaurant; cinematic
Characteristic
Shot : A chef is plating a dish in a professional kitchen. The chef is wearing a white chef’s coat and is carefully arranging food on a white plate. The scene is lit by warm overhead lighting.
Aesthetic Score : 0.7
Mood : professional, focused, culinary
Quality
Entropy : 6.84
Noise : 51
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible artifacts or errors in the image.
Silhouetted Hikers Witness a Breathtaking Sunset Panorama
A group of hikers stand on a mountain ridge, their figures silhouetted against a vibrant sunset. The vast panorama of mountains stretches before them, creating a sense of awe and adventure. The warm glow of the setting sun casts a tranquil mood over the scene.
Prompt
poses over-the-shoulder: triumphant, inspiring ; A group of hikers, silhouetted against a mountain peak, reaching the summit; wide shot; Groups; a majestic mountain range with a breathtaking view; cinematic
Characteristic
Shot : A group of six hikers silhouetted against a dramatic mountain range at sunset.
Aesthetic Score : 0.8
Mood : serene, adventurous, hopeful
Quality
Entropy : 6.34
Noise : 42
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors or artifacts
Conclusion
The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis. Here’s a breakdown:
Camera Position:
- Score: 0.45
- Interpretation: This score falls below the “good” range of 0.5 to 0.75. It suggests that the model didn’t perfectly capture the intended camera position described in the prompt.
Shot Analysis:
- Score: 0.52
- Interpretation: This score falls within the “good” range of 0.5 to 0.75. It indicates that the model was able to understand the scene described in the prompt and create a shot that aligns with it to a decent degree.
Aesthetic Analysis:
- Score: 0.03
- Interpretation: This score is significantly below the “very good” range of -0.2 to 0.1. It suggests that the generated image’s aesthetic deviated considerably from the expected aesthetic described in the prompt. This could mean the model struggled to capture the desired style, mood, or visual elements.
Overall:
The model demonstrates a decent ability to understand and implement camera positions and shot descriptions. However, it needs improvement in capturing the intended aesthetic of the image.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://www.freepik.com