AI Captures the Essence, But Misses the Angle: A Look at Generative AI's Aesthetic Prowess with Freepik
- 9 minutes read - 1747 wordsTable of Contents
The world of generative AI is rapidly evolving, with models capable of creating stunning and realistic images from text prompts. However, achieving a perfect match between the intended scene and the generated image remains a challenge. This article explores the strengths and weaknesses of generative AI in capturing specific aesthetics, camera angles, and scene composition. We’ll analyze the results of a test using various scene descriptions, focusing on the model’s ability to understand and translate the desired aesthetic style, camera position, and shot analysis. Through this analysis, we’ll gain insights into the current capabilities and limitations of generative AI in creating visually compelling and accurate images.
Created with: freepik
Solitude Amidst Majesty: A Lone Figure Contemplates the Vastness of Nature
A breathtaking scene unfolds as a solitary figure stands on a mountain peak, gazing out at a sea of clouds and a majestic mountain range. The serene sky and fluffy clouds create a sense of awe and wonder, while the lone figure adds a touch of contemplation and solitude, highlighting the power and beauty of the natural world.
Prompt
Minimalist: Epic, triumphant ; Lone figure standing on a mountain peak; wide shot; Heroism; Dramatic sky with clouds; cinematic
Characteristic
Shot : A lone figure stands on a mountain peak overlooking a vast landscape of clouds and mountains.
Aesthetic Score : 0.8
Mood : serene, majestic, contemplative
Quality
Entropy : 6.69
Noise : 61
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.70
Image errors : The clouds in the foreground appear slightly artificial and lack texture.
A Compass Points to Adventure
A rustic still life featuring a compass, leather pouch, and bag, capturing the essence of vintage exploration. The shallow depth of field draws your eye to the intricate details of the compass, hinting at journeys yet to be taken.
Prompt
Minimalist: Intriguing, mysterious ; A single, weathered compass; close-up; Adventure; Dusty, worn leather bag; cinematic
Characteristic
Shot : A close-up of a compass, a leather bag, and a small leather pouch on a wooden table.
Aesthetic Score : 0.7
Mood : rustic, adventurous, vintage
Quality
Entropy : 6.86
Noise : 73
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.10
Image errors : No visible errors in the image.
Lost in the Game: A Moment of Focused Intensity
A dimly lit room, the glow of the TV screen illuminating the player’s hands gripping the controller. This image captures the focused intensity of a gamer lost in the world of their favorite game, with a playful undercurrent of excitement.
Prompt
Minimalist: Focused, intense ; A pair of hands holding a joystick; close-up; Gaming; Blurred background of a vibrant video game screen; cinematic
Characteristic
Shot : A person is holding a video game controller, with a TV screen in the background. The TV screen is blurry, but the lights of the room are in focus.
Aesthetic Score : 0.5
Mood : focused, intense, playful
Quality
Entropy : 6.66
Noise : 50
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors
A Suitcase Full of Memories on a Cobblestone Street
A vintage suitcase rests on a cobblestone street in a charming European town, evoking a sense of nostalgia and loneliness. The camera points towards the end of the street, hinting at a journey taken and memories left behind. The suitcase, a symbol of forgotten dreams, adds a touch of melancholy to the scene.
Prompt
Minimalist: Nostalgic, hopeful ; A lone suitcase on a cobblestone street; medium shot; Tourism; A quaint, European town in the background; cinematic
Characteristic
Shot : A vintage suitcase sitting in the middle of a cobblestone street in a European town, the buildings on either side of the street are out of focus. The street leads to a vanishing point in the distance.
Aesthetic Score : 0.7
Mood : lonely, nostalgic, melancholy
Quality
Entropy : 6.73
Noise : 85
Prompt Clip Score : 0.37
AI Evaluation
Likelihood of AI : 0.20
Image errors : None
Finding Peace on the Sandy Shore
A serene image of footprints leading towards the ocean, capturing the essence of tranquility and relaxation. The camera’s perspective from above emphasizes the calm and peaceful mood of the scene.
Prompt
Minimalist: Serene, liberating ; A pair of feet walking on a sandy beach; low-angle shot; Travel; Vast ocean and horizon in the background; cinematic
Characteristic
Shot : A person is lying on the beach with their feet in the sand, looking at a man walking away from them on the beach. The sand is white and the water is blue.
Aesthetic Score : 0.6
Mood : peaceful, calm, contemplative
Quality
Entropy : 6.26
Noise : 67
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image is slightly overexposed. The shadows are a little too dark and the light is not as evenly distributed as it could be.
A Handshake of Hope in a Peaceful Park
Two hands meet in a gesture of connection, captured in a moment of peace and hope. The shallow depth of field draws the viewer’s attention to the handshake, emphasizing the intimacy and shared feeling between the individuals.
Prompt
Minimalist: Warm, loving ; A hand holding a child’s hand; close-up; Family; A blurred background of a park or playground; cinematic
Characteristic
Shot : Two hands shaking in a park with a blurred background of trees and a path.
Aesthetic Score : 0.7
Mood : warm, friendly, hopeful
Quality
Entropy : 6.49
Noise : 48
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable errors
A Single Red Rose, A Whisper of Romance
A solitary red rose rests delicately upon a pair of worn leather gloves, evoking a sense of vintage romance and quiet intimacy. The composition draws the eye to the rose, creating a sense of mystery and longing.
Prompt
Minimalist: Romantic, symbolic ; A single, red rose; close-up; Heroism; A weathered, worn leather glove; cinematic
Characteristic
Shot : A single red rose is held in a pair of brown leather gloves, which are resting on a wooden surface. The composition is simple and elegant, with the rose being the main focal point.
Aesthetic Score : 0.7
Mood : romantic, elegant, classic
Quality
Entropy : 6.76
Noise : 69
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no visible errors in the image. The quality is good and the colors are well-balanced.
Ready for Adventure: A Vintage Camera Awaits
A nostalgic scene of a vintage camera resting on an open map, hinting at a journey yet to be taken. The camera’s presence evokes a sense of anticipation and the thrill of exploring uncharted territories. The accompanying books in the background add to the feeling of adventure and travel.
Prompt
Minimalist: Intriguing, adventurous ; A map with a single pin marking a destination; close-up; Adventure; A worn, leather-bound journal; cinematic
Characteristic
Shot : A vintage camera resting on an open map book, with a brown leather bound book in the background.
Aesthetic Score : 0.7
Mood : nostalgic, adventurous, vintage
Quality
Entropy : 6.86
Noise : 84
Prompt Clip Score : 0.32
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible errors or artifacts.
Urban Beats: Where Style Meets Sound
Sleek black headphones with vibrant blue LEDs illuminate a dark table, reflecting the cityscape’s nocturnal glow. A modern, futuristic aesthetic is amplified by the dramatic lighting and intriguing reflections, creating a captivating visual experience.
Prompt
Minimalist: Immersive, futuristic ; A pair of headphones with a cityscape reflected in the earcups; close-up; Gaming; A dimly lit room with a computer screen in the background; cinematic
Characteristic
Shot : A pair of black headphones with blue lights are resting on a sleek black table with a blurred city skyline in the background.
Aesthetic Score : 0.6
Mood : sleek, futuristic, modern
Quality
Entropy : 6.89
Noise : 56
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.70
Image errors : The reflections in the table appear to be slightly distorted, giving away the possibility of being generated with AI
Capturing Tranquility: A Vintage Camera Awaits the Perfect Shot
A vintage camera rests on a rock, overlooking a serene river valley. The shallow depth of field draws your eye to the camera, hinting at the anticipation of capturing the breathtaking scenery. This image evokes a sense of tranquility, nostalgia, and adventure, inviting you to imagine the stories waiting to be told.
Prompt
Minimalist: Nostalgic, adventurous ; A vintage camera with a viewfinder showing a breathtaking landscape; close-up; Tourism; A vibrant, colorful landscape in the background; cinematic
Characteristic
Shot : A vintage camera is placed on a rock in a scenic valley setting. The valley is characterized by rolling hills, lush vegetation, and a winding river flowing through it. The camera is positioned in the foreground, with the valley extending into the background.
Aesthetic Score : 0.7
Mood : tranquil, serene, nostalgic
Quality
Entropy : 6.65
Noise : 66
Prompt Clip Score : 0.36
AI Evaluation
Likelihood of AI : 0.20
Image errors : The image has slight artifacts around the edges and some noise in the darker areas.
Conclusion
The results indicate that the generative AI model performed well in terms of understanding the scene and camera position, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.4, which is considered below average. This suggests that the model didn’t accurately translate the camera position described in the prompt into the generated image.
- Shot Analysis: The model scored 0.54, which is considered average. This indicates that the model was able to understand the scene described in the prompt to a reasonable degree, but there might be some discrepancies between the intended and generated shot.
- Aesthetic Analysis: The model scored 0.08, which is considered very good. This means that the generated image closely matched the expected aesthetic style, despite the other shortcomings.
Overall, the model shows promise in understanding the scene and achieving the desired aesthetic, but needs improvement in accurately translating camera positions.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://www.freepik.com