AI's Artistic Struggle: Capturing the Essence of Poses with Dall-e-3
Generating images from textual descriptions is a rapidly evolving area of artificial intelligence. This blog post describes an experiment in which an AI model was asked to create images from prompts that specify a pose, a shot type, and a scene. The model handled shot analysis well and landed just under the target range for camera position, but the evaluation found that it struggled to convey the intended aesthetic. The results illustrate the ongoing challenges in AI’s artistic capabilities and the need for further development in understanding and implementing aesthetic preferences.
Created with: dall-e-3
Silhouetted Against the Sunset: A Moment of Solitude in the Desert
A lone figure stands on a rocky cliff, bathed in the warm glow of the setting sun. The vast desert valley stretches out below, with distant mountains silhouetted against the horizon. The scene evokes a sense of tranquility, awe, and contemplation, highlighting the figure’s isolation and the vastness of the landscape.
Prompt
poses over-the-shoulder: epic, hopeful ; A lone adventurer, silhouetted against a setting sun; wide shot; Adventure; a vast, rugged mountain range; cinematic
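Every prompt in this experiment follows the same semicolon-separated layout, so it can be split into parts mechanically. The sketch below is one way to do that. The field names (`pose`, `moods`, `subject`, `shot`, `theme`, `setting`, `style`) and the `PosePrompt` class are assumed labels chosen for illustration, not names taken from the pipeline that produced these images.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PosePrompt:
    # All field names are hypothetical labels for the prompt's slots.
    pose: str
    moods: List[str]
    subject: str
    shot: str
    theme: str
    setting: str
    style: str


def parse_prompt(prompt: str) -> PosePrompt:
    """Split a 'poses <pose>: <moods> ; subject; shot; theme; setting; style' prompt."""
    head, *rest = [part.strip() for part in prompt.split(";")]
    if len(rest) != 5:
        raise ValueError("expected six semicolon-separated parts")

    # The first part looks like "poses over-the-shoulder: epic, hopeful".
    pose_part, _, mood_part = head.partition(":")
    pose = pose_part.strip()
    if pose.startswith("poses"):
        pose = pose[len("poses"):].strip()
    moods = [m.strip() for m in mood_part.split(",") if m.strip()]

    subject, shot, theme, setting, style = rest
    return PosePrompt(pose, moods, subject, shot, theme, setting, style)


example = parse_prompt(
    "poses over-the-shoulder: epic, hopeful ; A lone adventurer, silhouetted "
    "against a setting sun; wide shot; Adventure; a vast, rugged mountain range; "
    "cinematic"
)
print(example.pose, example.moods, example.shot)
```

Splitting the prompt this way makes it easier to compare what was requested (here, an over-the-shoulder pose in a wide shot) with what the shot description reports.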
Characteristic
Shot : A solitary figure stands on a rocky outcrop overlooking a vast desert valley. The sun is setting in the distance, casting long shadows across the landscape.
Aesthetic Score : 0.7
Mood : tranquil, majestic, awe-inspiring
Quality
Entropy : 6.36
Noise : 76
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears slightly overexposed, which leaves the highlights in the sky too bright and lacking in detail. Some textures appear blurry, and the resolution is limited.
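The post does not say how the Entropy value is computed. A common definition for images is the Shannon entropy of the grayscale histogram, which ranges from 0 bits (a flat, single-tone image) to 8 bits (all 256 levels equally likely) for 8-bit pixels. The sketch below assumes that definition; the function name `shannon_entropy` is illustrative, and the input is assumed to be a flat sequence of grayscale values.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Return the Shannon entropy, in bits, of a sequence of pixel values.

    Assumption: this is the histogram-based definition; the tool used
    for the scores in this post may compute entropy differently.
    """
    counts = Counter(pixels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no pixels given")
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


flat = [128] * 1000            # a single gray level everywhere
spread = list(range(256)) * 4  # every 8-bit level equally often

print(shannon_entropy(flat))
print(shannon_entropy(spread))
```

Under this definition, higher values indicate a wider and more even spread of tones, so entropy says something about tonal variety but nothing about whether the image matches the prompt.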
Heroic Silhouette: Firefighter Faces the Blaze
A firefighter, silhouetted against the fiery inferno, stands resolute in the face of danger. The dramatic scene captures the intensity and heroism of those who battle blazes.
Prompt
poses over-the-shoulder: intense, dramatic ; A firefighter, helmet gleaming, facing a raging inferno; medium shot; Heroism; a burning building with smoke billowing; cinematic
Characteristic
Shot : A firefighter in full gear standing in front of a burning building, looking at the flames.
Aesthetic Score : 0.6
Mood : intense, dramatic, heroic
Quality
Entropy : 6.58
Noise : 86
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image appears to be slightly blurred, possibly due to motion.
In the Zone: Gamer’s Intensity Under Neon Lights
A young woman, bathed in blue and purple light, is locked in a battle of wits with her game. The close-up shot captures her intense focus, creating a palpable sense of tension and anticipation. The blurred background emphasizes her singular dedication to the moment.
Prompt
poses over-the-shoulder: focused, intense ; A gamer, eyes glued to the screen, fingers flying across the keyboard; close-up; Gaming; a brightly lit gaming setup with flashing lights; cinematic
Characteristic
Shot : A young woman wearing a headset is concentrating on gaming, typing on a keyboard with colorful lights. The background is blurred with some colorful lights, suggesting she is in a gaming room or similar environment.
Aesthetic Score : 0.7
Mood : intense, focused, determined
Quality
Entropy : 6.66
Noise : 91
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are some artifacts around the woman’s hair and on the keyboard. The colors are a bit over-saturated.
Capturing Parisian Joy: A Selfie at the Eiffel Tower
A man with a backpack beams with excitement as he takes a selfie in front of the iconic Eiffel Tower. The setting evokes a sense of adventure and joy, capturing the spirit of travel and exploration.
Prompt
poses over-the-shoulder: joyful, awe-inspired ; A tourist, camera in hand, gazing at the Eiffel Tower; medium shot; Tourism; a bustling Parisian street with the Eiffel Tower in the background; cinematic
Characteristic
Shot : A man is standing in the street in Paris, looking up at the Eiffel Tower. He is holding a camera and is laughing. There are buildings and trees in the background.
Aesthetic Score : 0.6
Mood : joyful, playful, excited
Quality
Entropy : 6.75
Noise : 106
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.70
Image errors : Some of the textures look slightly artificial, such as the man’s hair and the Eiffel Tower.
Sunset Serenade: A Moment of Wanderlust on the Beach
A young woman, bathed in the golden hues of sunset, stands on a pristine beach, her backpack hinting at adventures to come. Palm trees sway gently in the background, creating a serene and romantic atmosphere. The warm glow of the setting sun casts a dramatic and alluring light, capturing the essence of wanderlust and the promise of new horizons.
Prompt
poses over-the-shoulder: peaceful, contemplative ; A backpacker, gazing out at a breathtaking sunset over the ocean; wide shot; Travel; a serene beach with palm trees and turquoise water; cinematic
Characteristic
Shot : A woman with long brown hair and a backpack stands on a beach at sunset, looking over her shoulder at the camera.
Aesthetic Score : 0.7
Mood : serene, hopeful, adventurous
Quality
Entropy : 6.74
Noise : 86
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable image artifacts or errors.
Campfire Laughter Under a Starry Sky
A group of friends gather around a crackling campfire, their laughter echoing under a breathtaking starry sky. The warmth of the fire and the joy in their faces create a scene of pure contentment and connection.
Prompt
poses over-the-shoulder: warm, nostalgic ; A group of friends, laughing and sharing stories, around a campfire; medium shot; Groups; a campsite under a starry night sky; cinematic
Characteristic
Shot : A group of friends gathered around a campfire under a starry night sky. They are laughing and enjoying each other’s company. There is a tent in the background.
Aesthetic Score : 0.75
Mood : joyful, relaxed, warm
Quality
Entropy : 6.62
Noise : 111
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has some minor artifacts in the background and some noise in the shadow areas.
Unveiling the Secrets: A Scientist’s Focused Gaze
A woman in a lab coat, her expression intense, peers through a microscope. The blurred background of medical equipment adds a layer of mystery to this dramatic scene, hinting at the secrets being uncovered within the lab.
Prompt
poses over-the-shoulder: focused, determined ; A scientist, peering through a microscope, engrossed in her research; close-up; Heroism; a laboratory filled with scientific equipment; cinematic
Characteristic
Shot : A woman in a lab coat is looking through a microscope. She is surrounded by test tubes and other lab equipment. The image is framed by a monitor screen.
Aesthetic Score : 0.7
Mood : focused, serious, scientific
Quality
Entropy : 6.79
Noise : 96
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image has some artifacts, particularly around the edges of the monitor screen. The lighting is also uneven.
Soaring Through Serenity: A Pilot’s Journey Above the Clouds
Experience the thrill and tranquility of flight as a young woman navigates her small plane through a breathtaking field of white puffy clouds. The sun shines brightly, the sky is a vibrant blue, and the perspective from the cockpit creates a sense of awe and wonder. This image captures the adventurous spirit, serene beauty, and empowering feeling of soaring above the world.
Prompt
poses over-the-shoulder: exhilarating, adventurous ; A pilot, gripping the controls, soaring through the clouds; wide shot; Adventure; a cockpit with a view of the vast, blue sky; cinematic
Characteristic
Shot : A woman is flying a small plane through the clouds. She is wearing a headset and is focused on her task. The clouds are bright and fluffy, and the sky is a beautiful blue.
Aesthetic Score : 0.7
Mood : adventurous, determined, free
Quality
Entropy : 6.74
Noise : 111
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.80
Image errors : The clouds and sky appear to be somewhat blurry.
Mastering the Art of Dessert: A Chef’s Focused Elegance
A female chef in a hijab meticulously plates a dessert in a professional kitchen, bathed in dramatic lighting that highlights her focused expression and the intricate details of her creation. The scene exudes a sense of professionalism and elegance, capturing the artistry of culinary mastery.
Prompt
poses over-the-shoulder: passionate, artistic ; A chef, meticulously plating a dish, surrounded by the aromas of fresh ingredients; close-up; Tourism; a bustling kitchen in a gourmet restaurant; cinematic
Characteristic
Shot : A female chef, wearing a hijab, is meticulously decorating a plate of food in a professional kitchen. The kitchen is lit by warm, yellow lights, creating a sense of intimacy and focus. The scene is punctuated by steam and a sense of controlled chaos, emphasizing the chef’s concentration and artistry.
Aesthetic Score : 0.8
Mood : focused, intimate, artistic
Quality
Entropy : 6.67
Noise : 99
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.30
Image errors : The image suffers from some minor artifacts, particularly visible in the steam and the chef’s clothing. The steam appears a little pixelated and the texture of the chef’s hijab is slightly unnatural.
Silhouettes of Adventure: Hikers Conquer the Sunset
A breathtaking scene of hikers silhouetted against a majestic mountain range at sunset. The dramatic composition emphasizes the vastness of nature and the sense of accomplishment felt by the hikers as they reach the summit.
Prompt
poses over-the-shoulder: triumphant, inspiring ; A group of hikers, silhouetted against a mountain peak, reaching the summit; wide shot; Groups; a majestic mountain range with a breathtaking view; cinematic
Characteristic
Shot : A group of hikers stand on a mountaintop, silhouetted against a bright sun, with a majestic mountain range in the background.
Aesthetic Score : 0.7
Mood : inspirational, adventurous, hopeful
Quality
Entropy : 6.21
Noise : 106
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some minor artifacts in the sky and the mountains, but they are not particularly noticeable.
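To view the ten images side by side, the per-image numbers listed above can be averaged. The sketch below copies the Aesthetic Score, Prompt Clip Score, and Likelihood of AI from each entry; the short labels and variable names are made up for illustration. These averages are a separate summary and are not the camera position, shot, or aesthetic analysis scores reported in the conclusion.

```python
from statistics import mean

# (label, aesthetic score, prompt CLIP score, likelihood of AI),
# copied from the ten entries in this post. Labels are illustrative.
results = [
    ("desert",      0.70, 0.25, 0.80),
    ("firefighter", 0.60, 0.29, 0.70),
    ("gamer",       0.70, 0.26, 0.80),
    ("eiffel",      0.60, 0.29, 0.70),
    ("beach",       0.70, 0.31, 0.10),
    ("campfire",    0.75, 0.32, 0.10),
    ("scientist",   0.70, 0.29, 0.70),
    ("pilot",       0.70, 0.26, 0.80),
    ("chef",        0.80, 0.25, 0.30),
    ("hikers",      0.70, 0.29, 0.80),
]

summary = {
    "aesthetic": mean(r[1] for r in results),
    "clip": mean(r[2] for r in results),
    "ai_likelihood": mean(r[3] for r in results),
}
best_clip = max(results, key=lambda r: r[2])[0]
best_aesthetic = max(results, key=lambda r: r[1])[0]

for name, value in summary.items():
    print(f"{name}: {value:.3f}")
print("highest CLIP score:", best_clip)
print("highest aesthetic score:", best_aesthetic)
```

The averages come out at roughly 0.695 for the aesthetic score, 0.281 for the prompt CLIP score, and 0.58 for the likelihood of AI. The campfire image has the highest CLIP score and the chef image the highest aesthetic score.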
Conclusion
The results show that the generative AI model performed well on shot analysis, fell slightly short on camera position, and was judged to struggle with aesthetic analysis. Here’s a breakdown:
Camera Position:
- Score: 0.45
- Interpretation: This score falls just below the “good” range of 0.5 to 0.75. It suggests that the model didn’t fully capture the intended camera position described in the prompt.
Shot Analysis:
- Score: 0.51
- Interpretation: This score is within the “good” range, indicating the model successfully understood and implemented the scene described in the prompt.
Aesthetic Analysis:
- Score: 0.06
- Interpretation: At 0.06, this score sits near the upper end of the quoted “very good” range of -0.2 to 0.1 rather than above it. The evaluation nonetheless read it as a difference between the expected aesthetic and the actual aesthetic of the generated images, which could mean the model struggled to capture the desired style or mood.
Overall:
While the model’s shot composition scored within the “good” range and its camera position fell just short of it, the evaluation found that it did not fully capture the intended aesthetic. This suggests that the model might need further training to better understand and implement aesthetic preferences.
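The range comparisons in the breakdown can be checked with a few lines of code. This is a minimal sketch that tests only whether each reported score lies inside its quoted range. It assumes the “good” range for shot analysis is the same 0.5 to 0.75 quoted for camera position, and it treats range endpoints as inclusive; the names `in_range` and `reported` are illustrative.

```python
def in_range(score, low, high):
    """Return True if score lies inside [low, high], endpoints included."""
    return low <= score <= high


# metric -> (reported score, (low, high) of the range quoted in the post)
reported = {
    "camera_position": (0.45, (0.5, 0.75)),
    # Assumption: shot analysis shares the 0.5 to 0.75 "good" range.
    "shot_analysis": (0.51, (0.5, 0.75)),
    "aesthetic_analysis": (0.06, (-0.2, 0.1)),
}

verdicts = {
    name: in_range(score, low, high)
    for name, (score, (low, high)) in reported.items()
}

for name, ok in verdicts.items():
    print(f"{name}: {'inside' if ok else 'outside'} the quoted range")
```

The check is purely numeric: it reports camera position as outside its range and the other two scores as inside theirs, and it says nothing about how the ranges were chosen or how the scores should be interpreted.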