AI's Artistic Eye: Capturing Emotion, Not Camera Angles with Freepik
In the realm of artificial intelligence, generative models are rapidly pushing the boundaries of creativity. These models can generate images, text, and even music, often mimicking human artistic expression. However, while these models excel in capturing certain aspects of artistic creation, they still struggle with others. This blog post delves into the fascinating world of generative AI and its ability to create images with specific camera angles and shot compositions, using facial expressions as a case study. We’ll explore how these models can capture the essence of human emotion while grappling with the technicalities of visual storytelling.
Created with: Freepik
Lost in the Neon Rain
A solitary figure in a black raincoat stands amidst the shimmering reflections of a wet, neon-lit city street. The dramatic silhouette evokes a sense of isolation and mystery, capturing a melancholic mood in the heart of urban life.
Prompt
facial-expressions Realization: Melancholy, introspective ; A lone figure; eye-level; Single Person; a bustling city street at night, with neon signs and rain reflecting on the wet pavement; cinematic
Characteristic
Shot : A woman in a black coat stands in the middle of a rain-soaked street in a neon-lit city. The reflections of the lights on the wet pavement create a mesmerizing and colorful effect. The mood is contemplative and lonely.
Aesthetic Score : 0.8
Mood : lonely, contemplative, mysterious
Quality
Entropy : 6.60
Noise : 81
Prompt Clip Score : 0.28
AI Evaluation
Likelihood of AI : 0.70
Image errors : Some of the reflections in the water look a bit too perfect and artificial.
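The prompt above, like the others in this post, appears to follow a fixed semicolon-separated pattern: an emotion list, a subject, a camera position or shot type, a category, a setting, and a style. The sketch below splits a prompt into those parts. The field names (`emotions`, `subject`, `shot`, `category`, `setting`, `style`) and the `parse_prompt` helper are labels assumed here for illustration; the post itself does not name the fields.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ParsedPrompt:
    # Field names are assumptions made for this sketch, not terms from the post.
    emotions: List[str]
    subject: str
    shot: str
    category: str
    setting: str
    style: str


def parse_prompt(prompt: str) -> ParsedPrompt:
    """Split one prompt into its six semicolon-separated fields."""
    fields = [part.strip() for part in prompt.split(";")]
    if len(fields) != 6:
        raise ValueError(f"expected 6 fields, got {len(fields)}")
    head, subject, shot, category, setting, style = fields
    # The first field looks like
    # "facial-expressions Realization: Melancholy, introspective".
    _, _, emotion_text = head.partition(":")
    emotions = [e.strip().lower() for e in emotion_text.split(",") if e.strip()]
    return ParsedPrompt(emotions, subject, shot, category, setting, style)


example = (
    "facial-expressions Realization: Melancholy, introspective ; A lone figure; "
    "eye-level; Single Person; a bustling city street at night, with neon signs "
    "and rain reflecting on the wet pavement; cinematic"
)
parsed = parse_prompt(example)
print(parsed.shot, parsed.category, parsed.emotions)
```

Separating the requested shot type from the rest of the prompt makes it easier to compare what was asked for with the "Shot" description reported for each image.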
Superman’s Golden Hour: A Hero Stands Tall
Witness the Man of Steel in all his glory, bathed in the golden light of sunset as he surveys the city from atop a towering skyscraper. This dramatic scene captures the power and heroism of Superman, leaving a lasting impression.
Prompt
facial-expressions Realization: Triumphant, awe-inspiring ; A superhero, standing atop a skyscraper; wide shot; Hero; a sprawling cityscape bathed in the golden light of sunset; cinematic
Characteristic
Shot : A superhero, clad in golden armor, stands on the edge of a skyscraper, overlooking a city skyline. The sky is a vibrant orange, suggesting a sunrise or sunset.
Aesthetic Score : 0.7
Mood : heroic, powerful, dramatic
Quality
Entropy : 6.77
Noise : 60
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.90
Image errors : The image has a somewhat artificial look, especially in the superhero’s armor. The cityscape is also a bit generic and lacks detail.
A Moment of Solitude and Sorrow
A young woman sits alone at a kitchen table, her face etched with sadness. Surrounded by untouched bowls of food, she seems lost in her thoughts, creating a poignant image of loneliness and melancholy.
Prompt
facial-expressions Realization: Disillusioned, resigned ; A young woman, sitting at a kitchen table; close-up; Normal People; a cluttered kitchen, with dishes piled in the sink and a half-eaten meal on the table; cinematic
Characteristic
Shot : A young woman sits at a kitchen table in a dimly lit room, looking despondent. The table is set with bowls of food, and there are several empty bowls stacked next to her.
Aesthetic Score : 0.6
Mood : melancholy, contemplative, lonely
Quality
Entropy : 6.78
Noise : 53
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors
Lost in the Code, Fueled by Pizza
A young man sits at his desk, bathed in the soft glow of his computer screen. His focused expression and the low-light setting create a sense of intensity, while the pizza in front of him suggests a casual, late-night coding session. The image captures the essence of dedication and immersion in the digital world.
Prompt
facial-expressions Realization: Intense, focused ; A gamer, hunched over a computer screen; close-up; Gamer; a dimly lit room, with flashing lights from the monitor and empty pizza boxes scattered around; cinematic
Characteristic
Shot : A young man is sitting at a desk, looking thoughtfully at a computer monitor. There is pizza on the desk, suggesting he’s enjoying a meal while gaming or working. The room is dimly lit, creating a cozy and somewhat intimate atmosphere.
Aesthetic Score : 0.6
Mood : relaxed, contemplative, focused
Quality
Entropy : 6.71
Noise : 51
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly grainy, which is likely due to the low-light conditions. The pizza appears slightly out of focus.
Lost in Thought: A Moment of Contemplation on the Subway
A man stands amidst the bustling chaos of a subway train, his expression lost in thought. The cool, bluish lighting and shallow depth of field create a moody and atmospheric scene, drawing attention to his isolation and contemplation.
Prompt
facial-expressions Realization: Lost, alienated ; A man, walking through a crowded train station; eye-level; Single Person; a sea of faces, all rushing in different directions; cinematic
Characteristic
Shot : A young man in a brown coat stands in a crowded subway car, looking ahead with a contemplative expression.
Aesthetic Score : 0.7
Mood : pensive, moody, introspective
Quality
Entropy : 6.85
Noise : 60
Prompt Clip Score : 0.26
AI Evaluation
Likelihood of AI : 0.10
Image errors : No noticeable artifacts or errors.
Superman Stands Tall Amidst Chaos
A powerful image captures Superman amidst the ruins of a destroyed city, his determined gaze fixed on the horizon. The massive explosion in the background underscores the danger and urgency of the situation, while Superman’s unwavering stance suggests he’s ready to face any challenge.
Prompt
facial-expressions Realization: Determined, resolute ; A superhero, standing in the middle of a battle; wide shot; Hero; a chaotic scene of destruction and explosions, with enemies closing in; cinematic
Characteristic
Shot : A superhero, presumably Superman, stands in a war-torn cityscape with explosions in the background. He is focused and determined.
Aesthetic Score : 0.7
Mood : epic, dramatic, heroic
Quality
Entropy : 6.84
Noise : 63
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.80
Image errors : The background looks a bit artificial. The explosions and the cityscape feel slightly unrealistic.
Candlelit Dinner with Friends: A Night of Laughter and Joy
Capture the warmth and intimacy of a shared meal with friends. This scene features a dimly lit room, a table laden with delicious food, and happy faces illuminated by candlelight. The mood is cozy and inviting, perfect for evoking feelings of togetherness and joy.
Prompt
facial-expressions Realization: Nostalgic, heartwarming ; A family, gathered around a dinner table; medium shot; Normal People; a warm and inviting kitchen, with the aroma of home-cooked food filling the air; cinematic
Characteristic
Shot : A group of friends enjoying a meal together in a warm and inviting dining room. The table is set with plates of food and drinks, and there is a candle in the center of the table. The friends are all smiling and laughing, and the atmosphere is relaxed and enjoyable.
Aesthetic Score : 0.7
Mood : warm, cozy, joyful
Quality
Entropy : 6.84
Noise : 66
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible errors
Lost in the Digital World: A Young Man’s Pensive Focus
A dimly lit room, cluttered with gaming gear, sets the stage for a young man’s intense concentration as he stares at his computer screen. The lighting and composition create an air of mystery and intrigue, hinting at a story waiting to unfold.
Prompt
facial-expressions Realization: Defeated, frustrated ; A gamer, staring at a blank screen; close-up; Gamer; a dimly lit room, with the only light coming from the monitor, which is now displaying a game over message; cinematic
Characteristic
Shot : A young man is sitting at a desk in front of two computer monitors. He is looking at the screen to his right with a somewhat concerned expression. The image is lit by soft warm light coming from the desk lamp behind him, and the computer monitors are glowing blue and orange. The image has a somewhat melancholic feel.
Aesthetic Score : 0.7
Mood : melancholic, contemplative, introspective
Quality
Entropy : 6.47
Noise : 40
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : No notable errors or artifacts
Silhouette of Serenity: A Woman Contemplates the Sunset
A solitary figure in a blue dress stands on a cliff, silhouetted against the vibrant hues of a setting sun. The scene evokes a sense of peace and contemplation, capturing the beauty of a moment of quiet reflection.
Prompt
facial-expressions Realization: Reflective, contemplative ; A woman, standing on a cliff overlooking the ocean; eye-level; Single Person; a vast expanse of blue water stretching out to the horizon, with the sun setting in the distance; cinematic
Characteristic
Shot : A woman in a blue dress is standing on a cliff overlooking the ocean at sunset.
Aesthetic Score : 0.75
Mood : serene, peaceful, contemplative
Quality
Entropy : 6.66
Noise : 55
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image has a slightly blurry background, possibly due to the camera settings. The colors are a bit muted, and the overall image could be sharper.
Hope Amidst the Ruins: Superman Stands Tall in a City’s Ashes
A dramatic scene unfolds as a superhero, likely Superman, surveys the devastation of a destroyed city. The setting sun casts a melancholic glow on the scene, while a plume of smoke rises in the background. Despite the destruction, the superhero’s presence offers a glimmer of hope, creating a powerful and emotional image.
Prompt
facial-expressions Realization: Hopeful, determined ; A superhero, standing in the ruins of a city; wide shot; Hero; a desolate landscape, with smoke rising from the rubble and the sun breaking through the clouds; cinematic
Characteristic
Shot : Superman standing in a destroyed city, looking at a large plume of smoke in the distance. There is a golden light from the sun shining in the distance.
Aesthetic Score : 0.6
Mood : dramatic, heroic, hopeful
Quality
Entropy : 6.82
Noise : 71
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.80
Image errors : The rubble in the foreground looks slightly unrealistic and may be too perfectly placed. The smoke could be less uniform and more textured. The composition could be more interesting.
Conclusion
The results show that the generative AI model did well on aesthetics but struggled with camera position and shot composition. The ratings attached to the scores below imply that lower values indicate a closer match to the prompt. Here’s a breakdown:
- Camera Position: The model scored 0.3, which is considered below average. This suggests that the model didn’t accurately capture the intended camera positions described in the prompt.
- Shot Analysis: The model scored 0.43, which is also below average. This indicates that the model didn’t fully understand the scene described in the prompt and didn’t create the expected shot composition.
- Aesthetic Analysis: The model scored 0.13, which is considered very good. This means that the generated image’s aesthetic closely matched the expected aesthetic described in the prompt.
Overall, the model seems to be better at understanding the desired aesthetic than it is at accurately capturing camera positions and shot composition.
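The per-image metrics reported in this post can also be grouped by prompt category. The following is a minimal sketch, with the Aesthetic Score and Likelihood of AI values transcribed from the image sections above. The `Record` type and the `summarize_by_category` helper are hypothetical names introduced for illustration, and this grouping is not the method behind the conclusion scores.

```python
from collections import defaultdict
from statistics import mean
from typing import Dict, List, NamedTuple


class Record(NamedTuple):
    title: str
    category: str        # fourth field of each prompt
    aesthetic: float     # "Aesthetic Score" as listed in the post
    ai_likelihood: float  # "Likelihood of AI" as listed in the post


RECORDS = [
    Record("Lost in the Neon Rain", "Single Person", 0.8, 0.70),
    Record("Superman's Golden Hour", "Hero", 0.7, 0.90),
    Record("A Moment of Solitude and Sorrow", "Normal People", 0.6, 0.10),
    Record("Lost in the Code, Fueled by Pizza", "Gamer", 0.6, 0.20),
    Record("Lost in Thought", "Single Person", 0.7, 0.10),
    Record("Superman Stands Tall Amidst Chaos", "Hero", 0.7, 0.80),
    Record("Candlelit Dinner with Friends", "Normal People", 0.7, 0.10),
    Record("Lost in the Digital World", "Gamer", 0.7, 0.10),
    Record("Silhouette of Serenity", "Single Person", 0.75, 0.10),
    Record("Hope Amidst the Ruins", "Hero", 0.6, 0.80),
]


def summarize_by_category(records: List[Record]) -> Dict[str, Dict[str, float]]:
    """Average the two scores for each prompt category."""
    groups = defaultdict(list)
    for record in records:
        groups[record.category].append(record)
    return {
        category: {
            "aesthetic": mean(r.aesthetic for r in rows),
            "ai_likelihood": mean(r.ai_likelihood for r in rows),
        }
        for category, rows in groups.items()
    }


for category, stats in summarize_by_category(RECORDS).items():
    print(f"{category}: aesthetic={stats['aesthetic']:.2f}, "
          f"ai_likelihood={stats['ai_likelihood']:.2f}")
```

With only two or three images per category, these averages are descriptive at best and should not be read as statistically meaningful.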
Sources:
- https://dramaresource.com/storytelling/
- https://seedsoftellers.eu/resources/the-body-language-for-young-tellers/
- https://digitalcollections.sit.edu/cgi/viewcontent.cgi?article=1288&context=sandanona&filename=1&type=additional
- https://citeseerx.ist.psu.edu/document?doi=7f842882e9bb1fa2c0e96939bc8d2c37e34e17c0&repid=rep1&type=pdf
- https://www.twinkl.co.uk/search?q=drama+facial+expression
- https://www.freepik.com