AI's Artistic Eye: Capturing Emotion, Not the Scene with Dall-e-3
The ability to generate realistic and evocative images from text prompts is a powerful tool with vast potential applications. However, recent research suggests that current generative AI models still face challenges in accurately translating complex visual information, particularly when it comes to scene descriptions and camera positions. This blog post explores these challenges and examines the potential for future advancements in AI’s ability to understand and translate visual information.
Created with: dall-e-3
A Lone Figure Braces Against the Storm
A solitary man stands defiant on a windswept cliff, facing a tempestuous sea. Lightning cracks across the horizon, casting an ominous glow on the scene. A flock of birds takes flight, adding to the sense of impending drama and unease.
Prompt
facial-expressions Hope: Determined, resilient, facing adversity ; A lone figure standing on a clifftop overlooking a vast, stormy sea; eye-level; Single Person; Dramatic, stormy sky with crashing waves; cinematic
Characteristic
Shot : A lone figure stands on a cliff overlooking a stormy sea, with a lightning strike in the distance and birds flying overhead.
Aesthetic Score : 0.7
Mood : dramatic, melancholic, ominous
Quality
Entropy : 6.35
Noise : 101
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.80
Image errors : The lightning strike looks slightly artificial and the birds appear to be awkwardly placed.
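The post does not say how the Entropy value in each Quality block is computed. One common reading, assumed here purely for illustration, is the Shannon entropy (in bits) of the image's 8-bit grayscale histogram, which ranges from 0 for a flat image to 8 for a perfectly uniform histogram. The function name `shannon_entropy` and the flat list of pixel values are hypothetical and are not taken from the tooling behind these results.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy in bits of a flat sequence of pixel intensities.

    Assumption: this mirrors a histogram-based image entropy; the metric
    used in the post may be defined differently.
    """
    total = len(pixels)
    if total == 0:
        raise ValueError("pixels must not be empty")
    counts = Counter(pixels)
    # Sum p * log2(p) over every intensity that actually occurs.
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Toy inputs: a flat image, a two-tone image, and a uniform 8-bit histogram.
flat = [128] * 64
two_tone = [0] * 32 + [255] * 32
uniform = list(range(256))

print(shannon_entropy(flat))
print(shannon_entropy(two_tone))
print(shannon_entropy(uniform))
```

Under this assumption, the values between 6.03 and 6.90 reported in this post would indicate images with a fairly wide spread of tones, with the darker candle and gamer scenes at the lower end.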
Firefighter’s Heroic Rescue Amidst Blazing Inferno
A dramatic scene unfolds as a firefighter bravely carries a child through a burning building, their determined faces a beacon of hope amidst the flames and smoke. The image captures the urgency and danger of the situation, while also highlighting the resilience and heroism of those on the front lines.
Prompt
facial-expressions Hope: Brave, selfless, courageous ; A firefighter carrying a child through a burning building; eye-level; Hero; Smoke and flames engulfing the background; cinematic
Characteristic
Shot : A firefighter is carrying a young boy through a burning building. The scene is chaotic, with flames and smoke billowing in the background. The firefighter is focused on her task, and the boy looks scared but safe.
Aesthetic Score : 0.6
Mood : intense, heroic, hopeful
Quality
Entropy : 6.79
Noise : 105
Prompt Clip Score : 0.37
AI Evaluation
Likelihood of AI : 0.70
Image errors : The fire and smoke look a bit artificial, and the lighting is a bit harsh. The image is a little over-saturated. The boy’s face is a bit unnatural.
A Moment of Hope in the Desert
A young woman, her face veiled, plants a sapling in a vast expanse of sand. The low angle and soft light create a sense of intimacy, inviting viewers to share in her quiet act of hope and renewal.
Prompt
facial-expressions Hope: Optimistic, hopeful, believing in a better future ; A young woman planting a tree in a barren wasteland; eye-level; Normal Person; Dusty, desolate landscape with a single, hopeful green sprout; cinematic
Characteristic
Shot : A young woman in a headscarf plants a sapling in the ground. The background is a field of similar saplings, implying the woman is part of a larger reforestation effort. The scene is lit with a soft, golden light.
Aesthetic Score : 0.7
Mood : hopeful, determined, calm
Quality
Entropy : 6.90
Noise : 99
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is well-composed, but there are some minor inconsistencies in the lighting and shadows, particularly on the woman’s right hand.
Victory Dance! Friends Celebrate Epic Win with Joyful Energy
Capture the pure joy of victory as a group of friends celebrate their win with infectious smiles and energetic cheers. Dramatic lighting highlights their faces, amplifying the excitement of the moment.
Prompt
facial-expressions Hope: Excited, triumphant, feeling a sense of accomplishment ; A gamer celebrating a victory with their team, their faces illuminated by the glow of the monitor; eye-level; Gamer; A dimly lit room with gaming peripherals and posters on the walls; cinematic
Characteristic
Shot : A group of friends are playing video games in a dimly lit room with a blue light shining on them. They are all cheering and looking up at the screen. The room is decorated with video game posters and artwork.
Aesthetic Score : 0.6
Mood : excited, energetic, celebratory
Quality
Entropy : 6.85
Noise : 97
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image has some slight blurring, especially on the people’s faces. The lighting is a little uneven, and the colors are not very saturated.
Shadows and Secrets: A Candlelit Mystery
A single candle casts a warm glow on a wooden table, illuminating a scene of quiet mystery. The figure of a person, shrouded in darkness, looms in the background, hinting at a story waiting to be told. This image evokes a sense of somber intrigue, leaving the viewer to wonder what secrets lie hidden in the shadows.
Prompt
facial-expressions Hope: Hopeful, comforting, a beacon of light in the darkness ; A single candle burning brightly in a dark room; eye-level; Single Person; Shadows and darkness surrounding the candle; cinematic
Characteristic
Shot : A single burning candle on a wooden table. The background is dark and blurry, suggesting a figure in the distance. Scattered lights create a soft, ethereal glow.
Aesthetic Score : 0.7
Mood : mysterious, atmospheric, intriguing
Quality
Entropy : 6.03
Noise : 71
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has a slight digital blur and a bit of noise, particularly in the background. The lights in the background have an unnatural, almost digital, effect.
A Moment of Joy: Doctor Smiles Down at Newborn Baby
A heartwarming scene unfolds in a hospital room as a doctor, bathed in a halo of light, cradles a newborn baby. The doctor’s smile and the medical equipment in the background create a sense of hope and joy, capturing the essence of new life and the miracle of birth.
Prompt
facial-expressions Hope: Joyful, hopeful, a symbol of new beginnings ; A doctor holding a newborn baby in their arms; eye-level; Hero; A sterile hospital room with medical equipment in the background; cinematic
Characteristic
Shot : A female doctor is holding a newborn baby in a hospital room, illuminated by a bright light source above, with medical equipment and other medical personnel in the background.
Aesthetic Score : 0.8
Mood : joyful, hopeful, heartwarming
Quality
Entropy : 6.72
Noise : 92
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.30
Image errors : There are some minor artifacts in the background, particularly around the medical equipment. The lighting is slightly overexposed in areas, leading to a loss of detail in the doctor’s hair.
Hope Blooms at the Dinner Table
A heartwarming scene of togetherness unfolds in a sunlit kitchen, where a group of people gather around a table, sharing a meal and a sense of hope. The warm light and vibrant greenery create a feeling of optimism and connection, reminding us of the power of shared moments.
Prompt
facial-expressions Hope: Warm, comforting, a sense of belonging ; A group of friends sharing a meal together in a cozy kitchen; eye-level; Normal People; Warm, inviting kitchen with sunlight streaming through the window; cinematic
Characteristic
Shot : A group of people are gathered around a table in a rustic kitchen, enjoying a meal. The sun is setting outside the window, casting a warm glow over the scene.
Aesthetic Score : 0.7
Mood : warm, hopeful, togetherness
Quality
Entropy : 6.61
Noise : 99
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image appears a bit over-processed. There are some artifacts visible in the shadows and highlights.
In the Zone: Gamer’s Intense Focus Under Dark Lighting
A young man, his face etched with concentration, grips his controller as he battles through a virtual world. The dimly lit room adds to the intensity of the moment, creating a sense of high stakes and thrilling gameplay.
Prompt
facial-expressions Hope: Determined, focused, persevering ; A gamer overcoming a difficult challenge in a video game, their face showing determination and focus; eye-level; Gamer; A brightly lit room with a large monitor displaying the game; cinematic
Characteristic
Shot : A young man is playing video games, focusing intently on the screen. The room is dimly lit with a few lamps, creating a moody atmosphere.
Aesthetic Score : 0.7
Mood : intense, focused, dramatic
Quality
Entropy : 6.06
Noise : 81
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.30
Image errors : There are no visible artifacts or errors in the image.
Reaching for the Sun: A Moment of Hope and Inspiration
A solitary figure gazes upwards, arms outstretched, towards a vibrant blue sky dotted with fluffy clouds. The sun shines brightly, casting golden rays of light that illuminate the scene. This image evokes a sense of hope, inspiration, and the boundless possibilities that lie ahead.
Prompt
facial-expressions Hope: Free, hopeful, a symbol of liberation ; Soaring through blue sky; eye-level; Single Person; Vast, open sky with fluffy white clouds; cinematic
Characteristic
Shot : A person with dark skin and a white tank top is looking up into a bright blue sky with fluffy white clouds. A single sunbeam shines from the top of the image.
Aesthetic Score : 0.6
Mood : hopeful, uplifting, peaceful
Quality
Entropy : 6.39
Noise : 91
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.80
Image errors : The image appears to be generated by AI, and the person’s face appears distorted. There are no visible artifacts.
Silhouetted Against Hope: A Gathering at Sunset
A large group stands united, their faces turned towards a radiant sunset. The blinding light, a symbol of hope and optimism, casts long shadows, creating a powerful image of unity and shared purpose.
Prompt
facial-expressions Hope: United, hopeful, facing the future together ; A group of people standing together, arms linked, facing a bright sunrise; eye-level; Heroes; A vast, open field with a golden sunrise in the background; cinematic
Characteristic
Shot : A large group of people standing back to back, looking at a bright, golden sunset over a field. They are holding hands.
Aesthetic Score : 0.7
Mood : hopeful, spiritual, uplifting
Quality
Entropy : 6.78
Noise : 96
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has some minor blurriness in the back. The field in the distance is a bit flat.
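The per-image numbers listed above can be averaged to get a rough overall picture. The sketch below transcribes the five numeric metrics from each of the ten entries. The short labels, the `IMAGES` table, and the `summarize` helper are illustrative names only, and simple means are an assumption rather than the method behind the scores in the conclusion.

```python
from statistics import mean

# (label, aesthetic, entropy, noise, prompt_clip, likelihood_ai)
# Values transcribed from the ten entries in this post; labels are shortened.
IMAGES = [
    ("Storm",       0.7, 6.35, 101, 0.26, 0.80),
    ("Firefighter", 0.6, 6.79, 105, 0.37, 0.70),
    ("Desert",      0.7, 6.90,  99, 0.27, 0.20),
    ("Victory",     0.6, 6.85,  97, 0.29, 0.80),
    ("Candle",      0.7, 6.03,  71, 0.21, 0.90),
    ("Doctor",      0.8, 6.72,  92, 0.31, 0.30),
    ("Dinner",      0.7, 6.61,  99, 0.33, 0.70),
    ("Gamer",       0.7, 6.06,  81, 0.26, 0.30),
    ("Sun",         0.6, 6.39,  91, 0.21, 0.80),
    ("Sunset",      0.7, 6.78,  96, 0.29, 0.90),
]
FIELDS = ("aesthetic", "entropy", "noise", "prompt_clip", "likelihood_ai")


def summarize(images):
    """Return the mean of each metric column across all images."""
    columns = zip(*(row[1:] for row in images))
    return {name: mean(col) for name, col in zip(FIELDS, columns)}


summary = summarize(IMAGES)
for name, value in summary.items():
    print(f"{name}: {value:.2f}")

# Image whose output matched its prompt least, by Prompt Clip Score.
lowest = min(IMAGES, key=lambda row: row[4])
highest = max(IMAGES, key=lambda row: row[4])
print("lowest prompt_clip:", lowest[0], lowest[4])
print("highest prompt_clip:", highest[0], highest[4])
```

On these figures the mean Aesthetic Score is 0.68 and the mean Prompt Clip Score is 0.28, with the firefighter image scoring highest on prompt match (0.37) and the candle and open-sky images lowest (0.21 each).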
Conclusion
The results show that the generative AI model matched the intended aesthetic well, but struggled to reproduce the scene and camera position described in the prompts. Here’s a breakdown:
- Camera Position: The model scored 0.25, which is below the “good” range of 0.5 to 0.75. This indicates that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.485, which is also below the “good” range. This suggests that the model didn’t fully understand the scene described in the prompt and didn’t create an image that accurately reflects it.
- Aesthetic Analysis: The model scored 0.11, which sits at the upper edge of the “very good” range of -0.2 to 0.1 (marginally above the stated bound). This means the aesthetic of the generated images closely matched the expected aesthetic described in the prompts.
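The comparisons in the breakdown above can be written as a simple range check. This is a minimal sketch: the `in_range` helper and the `REPORTED` table are hypothetical names, and it assumes that Shot Analysis uses the same “good” range of 0.5 to 0.75 as Camera Position, which the post implies but does not state outright.

```python
def in_range(score, low, high):
    """True when score lies inside the inclusive interval [low, high]."""
    return low <= score <= high


# (score, range low, range high, range label) as quoted in the breakdown.
REPORTED = {
    "camera_position":    (0.25,  0.5,  0.75, "good"),
    "shot_analysis":      (0.485, 0.5,  0.75, "good"),       # range assumed
    "aesthetic_analysis": (0.11,  -0.2, 0.1,  "very good"),
}

results = {}
for metric, (score, low, high, label) in REPORTED.items():
    inside = in_range(score, low, high)
    # Signed distance to the nearest bound; 0.0 when the score is inside.
    gap = 0.0 if inside else (score - low if score < low else score - high)
    results[metric] = (inside, round(gap, 3))
    print(f"{metric}: {score} in {label} range [{low}, {high}]? {inside} (gap {gap:+.3f})")
```

With a strict inclusive check, Camera Position misses the “good” range by 0.25 and Shot Analysis by 0.015, while the Aesthetic score of 0.11 lands 0.01 above the quoted upper bound of 0.1, so it is best read as sitting at the edge of the “very good” range.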
Overall, the model seems to be better at understanding the desired aesthetic than the scene and camera position. This suggests that the model might need further training to improve its ability to interpret and translate camera positions and scene descriptions into visual representations.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://openai.com/index/dall-e-3/