AI's Artistic Eye: Capturing the Essence, Not the Details with Midjourney
- 9 minutes read - 1814 wordsTable of Contents
The world of generative AI is rapidly evolving, with models capable of creating stunningly realistic images from text prompts. However, the ability to translate complex visual instructions remains a challenge. This blog post examines the results of a recent experiment where a generative AI model was tasked with creating images based on detailed descriptions, highlighting its strengths and weaknesses in capturing the essence of a scene.
Created with: midjourney
Sunrise Triumph: A Hiker’s Silhouette Against a Breathtaking Mountain Panorama
Capture the essence of adventure and serenity with this stunning image. A lone hiker stands triumphantly on a rocky peak, arms outstretched, as the sun rises behind them, painting the sky with vibrant hues. The backlit silhouette against the majestic mountain range evokes a powerful sense of accomplishment and awe, inspiring a sense of wonder and the desire to explore.
Prompt
leaning-back leaning back, arms outstretched: epic, contemplative ; A lone adventurer, silhouetted against a setting sun; wide shot; adventure; vast, rugged mountain range; cinematic
Characteristic
Shot : A lone hiker stands on a mountaintop with arms outstretched, taking in the breathtaking view of a sunlit mountain range with a clear sky and a dramatic sunrise.
Aesthetic Score : 0.8
Mood : inspirational, adventurous, peaceful
Quality
Entropy : 6.63
Noise : 114
Prompt Clip Score : 0.21
AI Evaluation
Likelihood of AI : 0.20
Image errors : No significant artifacts or errors.
Soaring Above the City: A Superhero’s Moment of Triumph
A powerful image captures a superhero standing atop a skyscraper, their cape billowing in the wind as they gaze out over the city. The sun shines brightly, symbolizing hope and the hero’s unwavering determination. This epic scene evokes a sense of power and heroism, leaving viewers inspired by the superhero’s unwavering commitment to justice.
Prompt
leaning-back leaning back, arms crossed: triumphant, powerful ; A superhero, cape billowing in the wind, looking down at a city skyline; medium shot; heroism; bustling cityscape; cinematic
Characteristic
Shot : A superhero, with red cape billowing in the wind, stands on the edge of a tall building overlooking a cityscape. The sun is setting and casting a warm glow over the city.
Aesthetic Score : 0.6
Mood : heroic, hopeful, dramatic
Quality
Entropy : 6.74
Noise : 122
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.80
Image errors : There are some minor artifacts and errors in the image, particularly in the rendering of the cape and the city buildings.
Silhouettes of Love at Sunset
A tranquil scene of four figures silhouetted against the setting sun on a beach, creating a sense of mystery and romance under a swaying palm tree. The warm glow of the sunset bathes the scene in a serene beauty.
Prompt
leaning-back leaning back, relaxed, arms around each other: joyful, carefree ; A group of friends, laughing and relaxing on a beach, watching the sunset; wide shot; tourism; tropical beach with palm trees; cinematic
Characteristic
Shot : Silhouettes of four people sitting on a beach under a palm tree, facing the sunset over the ocean.
Aesthetic Score : 0.7
Mood : tranquil, peaceful, romantic
Quality
Entropy : 5.84
Noise : 108
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible errors in the image. The resolution is good.
Neon Glow, Focused Play: A Gamer’s Sanctuary
A young man relaxes in a dimly lit room, bathed in the vibrant glow of neon lights. He’s engrossed in a video game, his concentration evident in his posture. The scene evokes a sense of calm focus and the allure of a dedicated gaming space.
Prompt
leaning-back leaning back, hands on controller: intense, focused ; A gamer, eyes glued to a screen, leaning back in a gaming chair, surrounded by controllers and snacks; medium shot; gaming; dimly lit room with neon lights; cinematic
Characteristic
Shot : A young man is sitting in a dark room lit by pink and blue lights, holding a video game controller and looking straight at the camera. There are snacks and a TV screen in the background.
Aesthetic Score : 0.6
Mood : intense, focused, serious
Quality
Entropy : 6.70
Noise : 88
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is some noise in the image, particularly in the darker areas.
Golden Hour Reflections: A Moment of Tranquility on the Train
A solitary figure gazes out the window of a moving train, lost in contemplation as the sun sets over a rolling landscape. The soft golden light casts a warm glow, creating a sense of peace and isolation. This image captures a fleeting moment of quiet reflection, leaving the viewer to ponder the man’s thoughts and the beauty of the passing scenery.
Prompt
leaning-back leaning back, head resting on hand: reflective, nostalgic ; A traveler, gazing out of a train window, watching the scenery pass by; medium shot; travel; rolling hills and fields; cinematic
Characteristic
Shot : A man is looking out the window of a train. He is sitting in a seat and his head is resting on his hand.
Aesthetic Score : 0.8
Mood : tranquil, contemplative, melancholic
Quality
Entropy : 5.67
Noise : 102
Prompt Clip Score : 0.24
AI Evaluation
Likelihood of AI : 0.20
Image errors : Minor dust and scratches are visible on the window glass.
Cellists Under the Spotlight: A Dramatic Performance
A low-angle shot captures a group of cellists on stage, bathed in bright spotlights. The intense focus of the musicians and the dramatic lighting create a powerful and captivating scene.
Prompt
leaning-back leaning back, instruments in hand: energetic, passionate ; A group of musicians, performing on stage, bathed in spotlights; wide shot; groups; concert stage with cheering audience; cinematic
Characteristic
Shot : A group of musicians playing instruments on stage in a concert hall. The stage is lit with spotlights.
Aesthetic Score : 0.6
Mood : dramatic, mysterious, intense
Quality
Entropy : 6.17
Noise : 109
Prompt Clip Score : 0.23
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible image errors.
Solitude by the Sea: A Moment of Contemplation
A lone figure finds peace and reflection on a dramatic cliff overlooking the crashing waves. The vastness of the ocean amplifies the sense of solitude and contemplation, creating a powerful and evocative scene.
Prompt
leaning-back leaning back, legs dangling over the edge: solitary, contemplative ; A lone figure, sitting on a cliff edge, looking out at a vast ocean; medium shot; adventure; dramatic coastline with crashing waves; cinematic
Characteristic
Shot : A lone figure sits on a cliff overlooking a vast, stormy ocean.
Aesthetic Score : 0.7
Mood : lonely, dramatic, contemplative
Quality
Entropy : 5.97
Noise : 106
Prompt Clip Score : 0.22
AI Evaluation
Likelihood of AI : 0.20
Image errors : No noticeable artifacts or errors.
Awe-Inspiring View: Astronauts Float Amidst the Vastness of Space
This breathtaking image captures three astronauts adrift in the cosmic void, with Earth serving as a stunning backdrop. The astronauts’ weightless state and the sheer scale of space evoke a profound sense of awe and wonder, reminding us of the incredible beauty and vastness of our universe.
Prompt
leaning-back leaning back, arms outstretched: awe-inspiring, majestic ; A group of astronauts, floating weightlessly in space, looking out at Earth; wide shot; heroism; Earth from space with stars in the background; cinematic
Characteristic
Shot : Three astronauts floating in space, with Earth in the background. The astronauts are in a weightless state, and the Earth looks small and distant.
Aesthetic Score : 0.7
Mood : awe, wonder, isolation
Quality
Entropy : 6.09
Noise : 93
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.80
Image errors : The astronauts are slightly blurred, possibly due to motion blur.
Cozy Campfire Glow in the Dark Forest
A family of four gathers around a crackling campfire, their faces illuminated by the warm flames. The dark forest surrounding them adds a sense of mystery and intimacy to this cozy scene.
Prompt
leaning-back leaning back, relaxed, arms around each other: warm, intimate ; A family, gathered around a campfire, sharing stories and laughter; medium shot; groups; forest clearing with a crackling fire; cinematic
Characteristic
Shot : A family of four is gathered around a campfire in a forest. The parents are sitting with their two children, who are looking at the fire. The image is lit by the firelight and the natural light from the forest.
Aesthetic Score : 0.7
Mood : cozy, warm, family
Quality
Entropy : 6.00
Noise : 97
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no visible errors in the image.
Soaring Above the Clouds: A Pilot’s Breathtaking View
Experience the awe-inspiring beauty of snowy mountains and endless clouds from the cockpit of an airplane. This dramatic scene captures the pilot’s focused gaze as they navigate through the vastness of nature, leaving a sense of peace and wonder.
Prompt
leaning-back leaning back, hands on controls: exhilarating, adventurous ; A pilot, looking out of the cockpit window, flying over a breathtaking landscape; medium shot; travel; mountains and valleys covered in clouds; cinematic
Characteristic
Shot : A pilot in a cockpit is looking out the window at a view of the Himalayas, the mountains are covered in clouds with a soft light coming from the setting sun
Aesthetic Score : 0.8
Mood : serene, majestic, adventurous
Quality
Entropy : 6.66
Noise : 108
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.70
Image errors : The mountains and the clouds appear to be slightly blurry, and the pilot’s face lacks detail
Conclusion
The results show that the generative AI model performed well in understanding the camera position and shot composition, but struggled with the aesthetic aspect. Here’s a breakdown:
- Camera Position: The model scored 0.35, which is considered below average. This suggests that the model didn’t accurately capture the intended camera position described in the prompt.
- Shot Analysis: The model scored 0.44, which is also below average. This indicates that the model didn’t fully understand the scene and its elements as described in the prompt.
- Aesthetic Analysis: The model scored 0.07, which is considered very good. This means the generated image closely matched the expected aesthetic style.
Overall, the model seems to be better at capturing the desired aesthetic than understanding the camera position and shot composition. This suggests that the model might need further training to improve its ability to interpret and translate complex visual instructions.
Sources:
- https://www.writerswrite.co.za/cheat-sheets-for-writing-body-language/
- https://mads3df.wordpress.com/2013/09/04/storytelling-poses/
- https://www.pinterest.com/pegasister890/character-poses/
- https://www.youtube.com/watch?v=udky6ANxWws
- https://maven.com/articles/storytelling-techniques
- https://midjourney.com