AI's Artistic Struggle: Capturing the Essence of Style with Freepik
- 9 minutes read - 1869 wordsTable of Contents
The world of AI image generation is rapidly evolving, with models capable of creating stunning visuals. However, capturing the essence of a specific aesthetic style remains a challenge. This blog post delves into the complexities of AI’s artistic journey, exploring its strengths and weaknesses in replicating desired aesthetics. We’ll analyze a case study where an AI model successfully captured camera positions and shot types but struggled to accurately portray the intended aesthetic. This analysis highlights the need for further development in AI’s understanding of artistic elements, paving the way for more nuanced and expressive image generation.
Created with: freepik
Superman’s Silhouette Against a Blazing Sunset
A powerful image captures Superman standing tall on a rooftop, his silhouette stark against the vibrant orange and pink sunset. The city lights twinkle below, adding to the heroic and hopeful mood of the scene.
Prompt
Pop art: Epic, hopeful ; A lone superhero, silhouetted against a blazing sunset; wide shot; Heroism; cityscape with towering skyscrapers; cinematic
Characteristic
Shot : Superman stands on a rocky outcrop overlooking a city skyline at sunset. He is silhouetted against the orange sky.
Aesthetic Score : 0.7
Mood : epic, powerful, hopeful
Quality
Entropy : 6.83
Noise : 67
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.90
Image errors : There are some minor artifacts in the image, such as the clouds and the cityscape, that make it look a little bit blurry.
Jungle Adventure Awaits: Friends Explore Ancient Ruins
A group of six friends, radiating smiles and excitement, stand before a majestic jungle temple. Their khaki shorts and t-shirts suggest a spirit of adventure, ready to uncover the secrets hidden within the lush greenery. This image captures the essence of exploration and camaraderie, promising an unforgettable journey.
Prompt
Pop art: Excited, adventurous ; A group of adventurers, their faces painted with determination, standing on the edge of a jungle; medium shot; Adventure; lush green foliage and ancient ruins; cinematic
Characteristic
Shot : A group of six young people are standing in front of a large stone temple, surrounded by lush tropical greenery. They are all dressed in casual clothing and appear to be on an adventure.
Aesthetic Score : 0.6
Mood : adventurous, youthful, explorative
Quality
Entropy : 6.82
Noise : 108
Prompt Clip Score : 0.35
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no noticeable errors in the image.
Lost in the Code: A Young Programmer’s Intense Focus
A dimly lit room, two glowing monitors, and a young man with headphones and glasses, completely absorbed in the world of code. The dramatic lighting highlights his intense focus and the futuristic glow of his headset, capturing the essence of a programmer’s dedication.
Prompt
Pop art: Intense, focused ; A gamer, eyes glued to the screen, fingers flying across the keyboard; close-up; Gaming; neon-lit gaming room with flashing lights; cinematic
Characteristic
Shot : A young man is sitting at his computer, wearing headphones and typing on a keyboard. The room is dimly lit with blue and pink lights.
Aesthetic Score : 0.7
Mood : focused, intense, digital
Quality
Entropy : 6.52
Noise : 72
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.20
Image errors : No visible artifacts or errors.
Parisian Romance: A Silhouette Against the Eiffel Tower
A couple strolls hand-in-hand towards the iconic Eiffel Tower at twilight, their silhouettes framed against the illuminated landmark. The scene evokes a sense of romance, nostalgia, and grandeur, capturing the magic of Paris.
Prompt
Pop art: Romantic, nostalgic ; A couple, hand in hand, gazing at the Eiffel Tower; medium shot; Tourism; bustling Parisian street with vibrant colors; cinematic
Characteristic
Shot : A couple walks hand-in-hand away from the viewer towards the Eiffel Tower at dusk. The image is captured at a slight distance, allowing for a clear view of the iconic landmark.
Aesthetic Score : 0.8
Mood : romantic, dreamy, Parisian
Quality
Entropy : 6.83
Noise : 91
Prompt Clip Score : 0.34
AI Evaluation
Likelihood of AI : 0.10
Image errors : The image contains some slight blurriness around the edges, particularly in the areas surrounding the couple and the Eiffel Tower. This may be due to camera shake or post-processing.
A Hiker’s Dream: Finding Serenity Amidst Majestic Peaks
A lone hiker stands on a mountaintop, map in hand, gazing out at a breathtaking panorama of snow-capped peaks and rolling hills. The vastness of the landscape and the small figure of the hiker create a sense of awe and perspective, capturing the essence of adventure and hope.
Prompt
Pop art: Free, adventurous ; A backpacker, with a map in hand, standing on a mountain peak; wide shot; Travel; breathtaking mountain range with clouds swirling below; cinematic
Characteristic
Shot : A lone hiker stands on a mountain peak, looking at a map with a vast mountain range in the background. The sky is clear and bright with fluffy clouds.
Aesthetic Score : 0.7
Mood : serene, adventurous, contemplative
Quality
Entropy : 6.66
Noise : 77
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.30
Image errors : No visible errors.
Sun-Kissed Laughter: A Moment of Pure Joy
Three friends bask in the warmth of a summer day, their laughter echoing through the park. The vibrant green grass and bright sunshine create a scene of pure happiness and carefree joy.
Prompt
Pop art: Happy, heartwarming ; A family, laughing and playing in a park; medium shot; Family; bright green grass, blooming flowers, and a sunny sky; cinematic
Characteristic
Shot : Three women, two adults and a child, are sitting on a grassy lawn, laughing and enjoying the sunny day. Green trees surround them, creating a picturesque setting.
Aesthetic Score : 0.8
Mood : joyful, carefree, sunny
Quality
Entropy : 6.58
Noise : 94
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant errors detected.
Superman Soars Through the City in a Blaze of Glory
A dynamic and powerful image of Superman flying over a city skyline, leaving colorful smoke trails in his wake. The scene evokes a sense of heroism and excitement, capturing the essence of the iconic superhero in action.
Prompt
Pop art: Dynamic, powerful ; A superhero, leaping through the air, leaving a trail of colorful smoke; dynamic shot; Heroism; cityscape with iconic landmarks; cinematic
Characteristic
Shot : Superman flying through the air over a city with pink and blue smoke trails behind him
Aesthetic Score : 0.7
Mood : heroic, dynamic, hopeful
Quality
Entropy : 6.79
Noise : 68
Prompt Clip Score : 0.30
AI Evaluation
Likelihood of AI : 0.80
Image errors : The smoke trails look slightly artificial and the city skyline is a bit blurry. The costume fabric looks a little plastic.
Lost in the Ethereal Glow: Explorers Venture into a Mysterious Cave
Three explorers brave the darkness of a cavern, their path illuminated by a single light source. The ethereal glow reveals intricate rock formations, creating an atmosphere of mystery and adventure. The play of light and shadow adds a touch of eeriness, hinting at the unknown that lies ahead.
Prompt
Pop art: Suspenseful, thrilling ; A group of adventurers, navigating a treacherous cave; close-up; Adventure; dark and mysterious cave with glowing crystals; cinematic
Characteristic
Shot : Three explorers, wearing helmets and backpacks, are walking through a cave lit by a blue light. The cave is filled with stalactites and stalagmites.
Aesthetic Score : 0.7
Mood : mysterious, adventurous, eerie
Quality
Entropy : 6.67
Noise : 80
Prompt Clip Score : 0.33
AI Evaluation
Likelihood of AI : 0.60
Image errors : The shadows seem a bit unnatural and the texture of the cave walls appears slightly artificial. The explorers’ faces are not clear enough and some details in the background look blurry.
Victory Dance! Gamer Celebrates Triumph with Energetic Cheer
This image captures the pure joy of victory as a young gamer, clad in a vibrant pink and blue t-shirt, throws his fist in the air and beams with excitement. The blurred background of colorful lights suggests a lively gaming competition, adding to the celebratory atmosphere.
Prompt
Pop art: Exuberant, joyful ; A gamer, celebrating a victory with a triumphant fist pump; close-up; Gaming; brightly colored video game interface with flashing lights; cinematic
Characteristic
Shot : A young man in a pink and blue shirt is celebrating a victory, his arm is raised in the air, and he is looking at the camera with a wide smile. He is standing in front of a wall with colorful lights and screens.
Aesthetic Score : 0.6
Mood : joyful, energetic, excited
Quality
Entropy : 6.78
Noise : 68
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.30
Image errors : Slight blurriness on the background
Laughter, Food, and Friendship: Capturing the Joy of a Bustling Market
A vibrant scene unfolds at a bustling outdoor market, where four friends gather around a table laden with colorful dishes. Their laughter and animated conversation paint a picture of shared joy and connection, while the vibrant background of food stalls and bustling crowds adds to the lively atmosphere. This image captures the essence of a moment filled with happiness and the simple pleasures of good company and delicious food.
Prompt
Pop art: Joyful, authentic ; A family, enjoying a delicious meal at a street food stall; medium shot; Travel; vibrant street market with colorful food stalls; cinematic
Characteristic
Shot : A group of friends enjoying a meal at a street food market. They are all smiling and laughing, and the food looks delicious. The image is taken from a slightly elevated angle, looking down on the group. There are vibrant colors, and the image is well composed.
Aesthetic Score : 0.7
Mood : happy, lively, celebratory
Quality
Entropy : 6.80
Noise : 92
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : None
Conclusion
This analysis shows that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis. Here’s a breakdown:
Camera Position:
- Score: 0.4
- Interpretation: This score falls below the “good” range of 0.5 to 0.75. It suggests that the model didn’t perfectly capture the intended camera position described in the prompt.
Shot Analysis:
- Score: 0.47
- Interpretation: This score is within the “good” range, indicating the model successfully understood and implemented the desired shot type from the prompt.
Aesthetic Analysis:
- Score: 0.32
- Interpretation: This score is significantly higher than the “very good” range of -0.2 to 0.1. This means the generated image’s aesthetic deviated considerably from the expected aesthetic described in the prompt.
Overall:
The model demonstrates a good understanding of camera positions and shot types, but struggles to accurately capture the intended aesthetic. This suggests that the model might need further training to better understand and implement aesthetic elements in its generated images.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://www.freepik.com