AI's Artistic Eye: Capturing Emotion, But Missing the Scene with Stable-diffusion
- 10 minutes read - 1961 wordsTable of Contents
In the realm of artificial intelligence, generative models are rapidly pushing the boundaries of creativity. These models, trained on vast datasets of images and text, can generate stunning visuals based on user prompts. However, while they excel in capturing aesthetics and camera angles, they often struggle with understanding the nuances of scene composition. This blog post delves into the performance of a new generative AI model, analyzing its strengths and weaknesses in capturing facial expressions and scene understanding. We’ll explore how the model excels in creating visually appealing images while highlighting its limitations in grasping the complexities of scene descriptions. Through this analysis, we aim to shed light on the potential of this technology and its future development.
Created with: stability-ai-core
A Solitary Figure Contemplates the Fury of the Sea
A lone figure stands defiant against the raw power of nature, silhouetted against a stormy sky as massive waves crash against the rocky shore. The scene evokes a sense of awe and vulnerability, highlighting the dramatic contrast between human fragility and the untamed forces of the ocean.
Prompt
facial-expressions Hope: Determined, resilient, facing adversity ; A lone figure standing on a clifftop overlooking a vast, stormy sea; eye-level; Single Person; Dramatic, stormy sky with crashing waves; cinematic
Characteristic
Shot : A man standing on a cliff overlooking a stormy sea with large waves crashing against the shore. The sky is dark and overcast, and there is a sense of danger and drama in the air.
Aesthetic Score : 0.8
Mood : dramatic, powerful, ominous
Quality
Entropy : 6.63
Noise : 81
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors
Heroic Rescue Amidst the Flames
A firefighter, clad in full gear, bravely carries a small child through a burning building, the flames and smoke creating a dramatic backdrop. The image captures the heroism and urgency of the situation, highlighting the vulnerability of the child and the firefighter’s unwavering commitment to saving lives.
Prompt
facial-expressions Hope: Brave, selfless, courageous ; A firefighter carrying a child through a burning building; eye-level; Hero; Smoke and flames engulfing the background; cinematic
Characteristic
Shot : A firefighter is walking through a burning building, carrying a child in his arms. There are flames and smoke in the background.
Aesthetic Score : 0.7
Mood : heroic, dramatic, tense
Quality
Entropy : 6.78
Noise : 76
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.30
Image errors : The smoke in the background looks slightly unnatural and the fire flames appear slightly over-exposed
A Single Seed of Hope in a Barren Land
A young woman plants a sapling amidst a desolate landscape, a poignant reminder of the fragility of life and the urgent need for environmental protection. The image evokes a sense of melancholy, hope, and contemplation, highlighting the stark contrast between life and death.
Prompt
facial-expressions Hope: Optimistic, hopeful, believing in a better future ; A young woman planting a tree in a barren wasteland; eye-level; Normal Person; Dusty, desolate landscape with a single, hopeful green sprout; cinematic
Characteristic
Shot : A young woman is planting a sapling in a barren landscape, surrounded by dead trees. The scene is likely intended to convey a message about environmental destruction and the need for hope and action.
Aesthetic Score : 0.6
Mood : melancholic, hopeful, environmental
Quality
Entropy : 6.82
Noise : 73
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant artifacts or errors in the image.
The Glow of Competition: Friends United in the Digital Arena
A dimly lit room pulsates with the energy of a gaming session. Friends, faces illuminated by the screen’s glow, are locked in intense competition, their focus unwavering. The low light and screen reflections create a palpable sense of excitement and camaraderie.
Prompt
facial-expressions Hope: Excited, triumphant, feeling a sense of accomplishment ; A gamer celebrating a victory with their team, their faces illuminated by the glow of the monitor; eye-level; Gamer; A dimly lit room with gaming peripherals and posters on the walls; cinematic
Characteristic
Shot : A collage of images depicting a group of friends playing video games in a dimly lit room. The images are arranged in a grid-like pattern, and each image features a different aspect of the gaming experience, such as the players themselves, their controllers, and the game screens. The overall mood of the collage is one of excitement and camaraderie.
Aesthetic Score : 0.6
Mood : excitement, fun, camaraderie
Quality
Entropy : 6.16
Noise : 72
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some of the images are slightly blurry, and the colors are a bit muted. There are also some artifacts and noise present in the images.
A Candle in the Darkness: Hope Amidst Mystery
A young woman, her face illuminated by a single flickering candle, stands in a shadowy room. The darkness amplifies the mystery surrounding her, while the candle’s light suggests a glimmer of hope. This evocative image captures a moment of somber reflection, leaving the viewer to ponder the story behind her gaze.
Prompt
facial-expressions Hope: Hopeful, comforting, a beacon of light in the darkness ; A single candle burning brightly in a dark room; eye-level; Single Person; Shadows and darkness surrounding the candle; cinematic
Characteristic
Shot : A young woman is holding a lit candle in a dark room, looking at the camera with a serious expression.
Aesthetic Score : 0.7
Mood : mysterious, somber, introspective
Quality
Entropy : 4.13
Noise : 51
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible errors
A Moment of Joy in the Delivery Room
A heartwarming image captures a smiling nurse cradling a newborn baby in a hospital room. The scene radiates joy and hope, though the lack of background detail leaves room for a more impactful composition.
Prompt
facial-expressions Hope: Joyful, hopeful, a symbol of new beginnings ; A doctor holding a newborn baby in their arms; eye-level; Hero; A sterile hospital room with medical equipment in the background; cinematic
Characteristic
Shot : A nurse in a hospital room is holding a newborn baby. The nurse is smiling and looking at the camera. There are two other medical personnel in the background. One is a woman in a blue scrubs. The other is a man in a blue hat.
Aesthetic Score : 0.7
Mood : joyful, caring, hopeful
Quality
Entropy : 6.80
Noise : 64
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.10
Image errors : The baby’s face is slightly blurred. The image is also slightly overexposed.
Warm Kitchen Gathering: Friends Share a Meal in the Golden Light
A heartwarming scene unfolds in a kitchen bathed in natural light. Four friends gather around a table, enjoying a meal together. The atmosphere is happy, warm, and inviting, capturing the essence of shared joy and connection.
Prompt
facial-expressions Hope: Warm, comforting, a sense of belonging ; A group of friends sharing a meal together in a cozy kitchen; eye-level; Normal People; Warm, inviting kitchen with sunlight streaming through the window; cinematic
Characteristic
Shot : Four friends are sitting at a kitchen table, having a meal. The light is warm and inviting, and the people are all laughing and smiling. It looks like a very pleasant and enjoyable occasion.
Aesthetic Score : 0.7
Mood : warm, joyful, friendly
Quality
Entropy : 6.72
Noise : 77
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant errors detected.
The Focused Gamer
A young man, headphones on and eyes fixed on the screen, embodies the intensity of focus required for gaming. The dark background and his serious expression create a sense of determination, highlighting the immersive world he’s engrossed in.
Prompt
facial-expressions Hope: Determined, focused, persevering ; A gamer overcoming a difficult challenge in a video game, their face showing determination and focus; eye-level; Gamer; A brightly lit room with a large monitor displaying the game; cinematic
Characteristic
Shot : A young man wearing a headset sits in front of a computer screen, likely playing a video game. The room is dimly lit, and the only other elements in the frame are computer monitors and a desk.
Aesthetic Score : 0.6
Mood : focused, serious, determined
Quality
Entropy : 6.55
Noise : 64
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some slight noise and graininess in the image, particularly in the darker areas.
Smiling at the Sky: A Moment of Joy and Hope
A man basks in the sunshine, his smile reaching for the fluffy white clouds above. The scene evokes a sense of happiness, carefree abandon, and hopeful optimism.
Prompt
facial-expressions Hope: Free, hopeful, a symbol of liberation ; Soaring through blue sky; eye-level; Single Person; Vast, open sky with fluffy white clouds; cinematic
Characteristic
Shot : A man is looking up at the sky with a big smile on his face. The sky is blue with white clouds.
Aesthetic Score : 0.7
Mood : happy, hopeful, carefree
Quality
Entropy : 6.43
Noise : 61
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant errors, the image is sharp and well-exposed.
Silhouettes of Hope: A Sunset Moment of Determination
Four figures stand resolute against the backdrop of a golden sunset, their arms crossed and gazes fixed on the horizon. The warm glow of the setting sun casts long shadows across the field, emphasizing their silhouettes and creating a sense of hope and optimism. This image captures a moment of shared purpose and unwavering resolve.
Prompt
facial-expressions Hope: United, hopeful, facing the future together ; A group of people standing together, arms linked, facing a bright sunrise; eye-level; Heroes; A vast, open field with a golden sunrise in the background; cinematic
Characteristic
Shot : Five people, four men and one woman, standing in a field with their arms crossed, looking toward the sunset. The field is a golden yellow.
Aesthetic Score : 0.6
Mood : serious, hopeful, determined
Quality
Entropy : 6.62
Noise : 68
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.10
Image errors : None
Conclusion
The results show that the generative AI model performed well in terms of aesthetics and camera position, but struggled with understanding the scene in the prompt.
Here’s a breakdown:
- Aesthetic Analysis: The model achieved a score of 0.1, which falls within the “very good” range of -0.2 to 0.1. This means the generated image closely matched the expected aesthetic described in the prompt.
- Camera Position Analysis: The model scored 0.1, which is considered “good” as it falls within the range of 0.5 to 0.75. This indicates that the model was able to accurately capture the camera position described in the prompt.
- Shot Analysis: The model scored 0.43, which is considered “good” as it falls within the range of 0.5 to 0.75. However, this score is closer to the lower end of the range, suggesting that the model may have had some difficulty understanding the scene described in the prompt.
Overall, the model demonstrated a strong ability to create images that align with the desired aesthetic and camera position. However, it could benefit from further development to improve its understanding of scene composition and shot types.
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://stability.ai