AI Captures the Essence, But Misses the Shot: A Study in Aesthetic Style with Freepik
- 9 minutes read - 1832 wordsTable of Contents
The world of AI-generated imagery is constantly evolving, with models becoming increasingly adept at understanding and replicating complex artistic styles. This experiment focused on the ‘style-aesthetic’ category, exploring the model’s ability to capture the essence of a scene while adhering to a specific aesthetic. The results reveal a fascinating dichotomy: while the model excels at achieving the desired aesthetic, it struggles with accurately capturing the intended camera position. This highlights the ongoing challenge of bridging the gap between human intention and AI execution in the realm of visual art.
Created with: freepik
Silhouetted in Solitude: A Figure Contemplates the Vast Desert
A lone figure, shrouded in a long coat, stands silhouetted against the fiery hues of a desert sunset. Their gaze is fixed on the horizon, lost in contemplation amidst the vastness of the landscape. The scene evokes a sense of solitude, mystery, and the profound beauty of isolation.
Prompt
French New Wave: epic, melancholic ; A lone figure, silhouetted against a setting sun; long shot; heroism; a vast, empty desert landscape; cinematic
Characteristic
Shot : A lone figure in a long coat stands in a desert at sunset, looking out at the vast, rolling sand dunes. The sun is setting in the distance, casting a warm glow over the landscape.
Aesthetic Score : 0.7
Mood : solitude, contemplation, vastness
Quality
Entropy : 6.21
Noise : 69
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image appears to be slightly overexposed, particularly in the sky.
Unveiling the Secrets of the Past: A Hand Points to an Intriguing Destination
A weathered hand, reaching out from the foreground, points to a specific location on an old, faded map. The map, spread out on a table, whispers tales of forgotten adventures and mysterious places. The scene evokes a sense of mystery, adventure, and nostalgia, leaving you wondering what secrets lie hidden within the map’s faded lines.
Prompt
French New Wave: intriguing, suspenseful ; A close-up of a weathered map, with a finger tracing a route; medium shot; adventure; a cluttered, dimly lit room; cinematic
Characteristic
Shot : A hand is pointing at a map with a pen or pencil. The map is old and worn.
Aesthetic Score : 0.5
Mood : mysterious, vintage, suspenseful
Quality
Entropy : 6.82
Noise : 73
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is some noise and grain in the image. The lighting could be improved.
The Thrill of the Arcade: A Close-Up on Nostalgia
A nostalgic glimpse into the world of arcade gaming, captured in a close-up shot of a hand maneuvering a joystick. The vibrant lights and graphics of the surrounding games create a sense of excitement and playful energy, transporting us back to a time of pure, unadulterated fun.
Prompt
French New Wave: intense, energetic ; A hand holding a joystick, fingers moving rapidly; close-up; gaming; a neon-lit arcade with flashing screens; cinematic
Characteristic
Shot : A close-up shot of a person’s hand holding a joystick on an arcade game, with the neon glow of the arcade cabinets in the background.
Aesthetic Score : 0.8
Mood : nostalgic, playful, retro
Quality
Entropy : 6.54
Noise : 66
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors.
Lost in the City of Lights: A Moment of Melancholy at the Eiffel Tower
A young woman stands before the iconic Eiffel Tower, her gaze fixed on the viewer. The warm glow of the city lights creates a dreamy atmosphere, while the bokeh effect adds a sense of mystery and intrigue. This image captures a moment of quiet contemplation, tinged with both romance and melancholy.
Prompt
French New Wave: romantic, nostalgic ; A young woman, her face filled with wonder, gazing at the Eiffel Tower; medium shot; tourism; a bustling Parisian street; cinematic
Characteristic
Shot : A young woman stands in front of the Eiffel Tower at dusk. She is looking directly at the camera with a serious expression.
Aesthetic Score : 0.8
Mood : romantic, melancholic, wistful
Quality
Entropy : 6.81
Noise : 64
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.30
Image errors : No visible artifacts or errors
Tranquil Journey: A Glimpse of Rural Life from a Train Window
A serene view from a train window captures the essence of a tranquil journey. Rolling hills and a field stretch out before you, while a passing train on a rural track adds a sense of movement and immediacy to the scene. The perspective from the window evokes a feeling of peaceful travel and connection to the passing landscape.
Prompt
French New Wave: reflective, contemplative ; A train speeding through a countryside landscape, with a lone figure looking out the window; long shot; travel; a vibrant, sun-drenched field; cinematic
Characteristic
Shot : View from a train window, looking out at a passing train and a rolling countryside landscape
Aesthetic Score : 0.7
Mood : tranquil, nostalgic, journey
Quality
Entropy : 6.69
Noise : 86
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors.
Family Dinner: A Moment of Warmth and Laughter
A heartwarming scene of a family gathered around a table, enjoying a delicious meal. The warm lighting and genuine smiles create a cozy and inviting atmosphere, capturing the joy of shared moments.
Prompt
French New Wave: intimate, heartwarming ; A family gathered around a table, sharing a meal, with laughter and conversation; medium shot; family; a warm, inviting kitchen; cinematic
Characteristic
Shot : A family is gathered around a table eating a meal. It is likely dinner as the lighting is warm and the table is set with a meal.
Aesthetic Score : 0.7
Mood : cozy, happy, warm
Quality
Entropy : 6.83
Noise : 63
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no noticeable image errors. The image is sharp, well-exposed, and the colors are vibrant.
Fear and Determination in the Parisian Streets
A young man races through a bustling Parisian street, his face etched with a mix of fear and determination. The image captures a moment of intense suspense, with the blur of the crowd and the man’s urgent expression creating a palpable sense of chaos and urgency.
Prompt
French New Wave: urgent, dramatic ; A young man, his face etched with determination, running through a crowded marketplace; medium shot; heroism; a chaotic, bustling market; cinematic
Characteristic
Shot : A young man is running through a crowded street in Paris. The street is narrow and lined with buildings. The man is wearing a blue shirt and brown pants. He has a backpack on his shoulders.
Aesthetic Score : 0.7
Mood : dramatic, tense, suspenseful
Quality
Entropy : 6.90
Noise : 79
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : Some of the people in the background are slightly blurred, indicating that the camera was moving during the shot, but the effect is deliberate and does not detract from the image.
A Compass to Adventure: Nostalgic Charm in Every Detail
This vintage compass, captured in a close-up with soft, warm lighting, evokes a sense of nostalgia and adventure. The depth of field draws your eye to the intricate details of the compass, highlighting its significance as a symbol of exploration and discovery.
Prompt
French New Wave: mysterious, suspenseful ; A close-up of a compass needle spinning, pointing towards an unknown destination; close-up; adventure; a dimly lit, mysterious room; cinematic
Characteristic
Shot : A close-up shot of an antique compass with a golden needle, resting on a wooden surface. The background is blurred and features warm, golden lights.
Aesthetic Score : 0.8
Mood : classic, mysterious, vintage
Quality
Entropy : 6.40
Noise : 62
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible errors.
Intense Focus: A Group Hangs on the Edge of Their Seats
A group of young adults huddle around a computer, their faces illuminated by the dim screen. The atmosphere is thick with tension and anticipation as they watch something unfold, their eyes glued to the display. What are they witnessing? What secrets lie hidden in the digital world?
Prompt
French New Wave: intense, focused ; A group of friends huddled around a computer screen, their faces illuminated by the glow; medium shot; gaming; a dimly lit, cluttered room; cinematic
Characteristic
Shot : A group of young people are gathered around a computer screen, watching something with intensity.
Aesthetic Score : 0.7
Mood : suspenseful, mysterious, focused
Quality
Entropy : 6.54
Noise : 65
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some slight blurring around the edges of the image, particularly noticeable on the computer screen.
Sunset Stroll: A Romantic Evening in a European City
A couple walks hand-in-hand down a charming cobblestone street, bathed in the warm glow of a setting sun. The silhouette effect created by the backlighting adds to the romantic and nostalgic atmosphere of this picturesque scene.
Prompt
French New Wave: romantic, nostalgic ; A couple walking hand-in-hand along a cobblestone street, their silhouettes framed by the setting sun; long shot; tourism; a romantic, picturesque town; cinematic
Characteristic
Shot : A couple walks hand-in-hand down a cobblestone street in a European city. The sun is setting in the background, casting long shadows.
Aesthetic Score : 0.7
Mood : romantic, serene, nostalgic
Quality
Entropy : 6.85
Noise : 90
Prompt Clip Score : 0.36
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly overexposed, and the shadows are a bit too harsh. The background is a bit distracting.
Conclusion
The results indicate that the generative AI model performed well in terms of understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.4, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.51, which is considered average. This indicates that the model was able to understand the scene in the prompt reasonably well, but there might be some discrepancies between the intended shot and the generated image.
- Aesthetic Analysis: The model scored 0.07, which is considered very good. This means that the generated image closely matched the expected aesthetic style, despite the other shortcomings.
Overall, the model shows promise in understanding the scene and achieving the desired aesthetic, but needs improvement in accurately capturing the intended camera position.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://www.freepik.com