AI's Camera Skills: A Long Way to Go with Ideogram-v2

Dramatic camera positions are a powerful tool in filmmaking and photography. They evoke specific emotions, convey grandeur or intimacy, emphasize the scale of a scene, and highlight a character’s vulnerability. A low-angle shot, for example, makes the subject appear powerful and imposing, while a high-angle shot can make the subject appear small and vulnerable. This experiment explored how well an AI model, ideogram-v2, understands and implements such camera positions in its image generation. All ten prompts below request a long shot.

Created with: ideogram-v2

Silhouetted Against Ruin: A Lone Figure in a Post-Apocalyptic Sunset

A solitary figure stands on the precipice of a crumbling building, their silhouette stark against the fiery hues of a post-apocalyptic sunset. The ruins of the city stretch out behind them, a testament to a lost world. This dramatic scene evokes feelings of isolation, loss, and a haunting sense of the unknown.

Prompt

camera-positions Long Shot: Epic, hopeful, determined ; A lone figure, silhouetted against the setting sun, stands atop a crumbling skyscraper; Long shot; Heroism; A cityscape with smoke and fire in the distance; cinematic

Characteristic

Shot : A lone figure stands on the edge of a crumbling building, silhouetted against a sunset in a post-apocalyptic city. The ruins of the city extend behind him.

Aesthetic Score : 0.6

Mood : dramatic, suspenseful, melancholic

Quality

Entropy : 6.59

Noise : 78

Prompt Clip Score : 0.29

AI Evaluation

Likelihood of AI : 0.90

Image errors : The image has some blurring and grain, which might be a stylistic choice but could be refined.
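The article does not state how the Entropy figure is computed. One common definition, assumed here, is the Shannon entropy of the 8-bit grayscale histogram, which ranges from 0 bits (a single flat tone) to 8 bits (all 256 levels equally likely). A minimal sketch under that assumption follows; `shannon_entropy` is a hypothetical helper and the pixel lists are toy data, not values taken from the generated images.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy, in bits, of a sequence of 8-bit grayscale values.

    Assumption: the article's Entropy metric is histogram entropy.
    The source does not confirm this.
    """
    counts = Counter(pixels)
    total = len(pixels)
    entropy = 0.0
    for count in counts.values():
        p = count / total
        entropy -= p * math.log2(p)
    return entropy


# Toy inputs for illustration only.
flat = [128] * 1024              # one gray level everywhere
two_tone = [0, 255] * 512        # two levels, equally likely
uniform = list(range(256)) * 4   # all 256 levels, equally likely

for name, pixels in [("flat", flat), ("two_tone", two_tone), ("uniform", uniform)]:
    print(name, shannon_entropy(pixels))
```

Under this reading, the gallery's values of roughly 5.5 to 6.9 would describe images that use a large part of the available tonal range.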

Battling the Storm: A Boat Braves the Fury of the Sea

A lone boat cuts through towering waves, lightning illuminating the dramatic scene. A distant lighthouse offers a beacon of hope amidst the chaos, highlighting the power of nature and the resilience of the vessel.

Prompt

camera-positions Long Shot: Thrilling, suspenseful, awe-inspiring ; A small boat, dwarfed by towering waves, navigates a raging storm; Long shot; Adventure; A vast, stormy ocean with lightning flashing in the distance; cinematic

Characteristic

Shot : A boat navigates through a stormy sea with massive waves and lightning in the background. A lighthouse is visible in the distance.

Aesthetic Score : 0.7

Mood : dramatic, powerful, intense

Quality

Entropy : 6.74

Noise : 116

Prompt Clip Score : 0.30

AI Evaluation

Likelihood of AI : 0.30

Image errors : Some slight blurring in the background, which could be due to the stormy weather or the camera settings
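The Prompt Clip Score is presumably a CLIP-style similarity between the prompt and the generated image, although the article does not describe the exact computation. Assuming it is the cosine similarity between a text embedding and an image embedding, the arithmetic can be sketched as follows. The three-dimensional vectors are made-up stand-ins, not real CLIP embeddings, and `cosine_similarity` is an illustrative helper.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("a zero vector has no direction")
    return dot / (norm_a * norm_b)


# Hypothetical embeddings, for illustration only.
text_embedding = [1.0, 0.0, 0.0]
image_embedding = [3.0, 4.0, 0.0]

score = cosine_similarity(text_embedding, image_embedding)
print(round(score, 2))  # 0.6
```

If the score is computed this way, a higher value means the image embedding points in a direction closer to the prompt embedding.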

Lost in the Neon Labyrinth: A Surreal Journey into the Digital City

This captivating image transports you to a futuristic cityscape, where glowing neon lines and shapes create a mesmerizing digital landscape. The person wearing a VR headset stands at the heart of this surreal world, inviting you to experience the immersive power of virtual reality.

Prompt

camera-positions Long Shot: Energetic, immersive, futuristic ; A player, surrounded by glowing screens and flashing lights, navigates a complex virtual world; Long shot; Gaming; A futuristic, virtual world; cinematic

Characteristic

Shot : A person wearing a VR headset is standing in a futuristic cityscape made of glowing neon lines and shapes.

Aesthetic Score : 0.7

Mood : futuristic, digital, surreal

Quality

Entropy : 5.50

Noise : 94

Prompt Clip Score : 0.28

AI Evaluation

Likelihood of AI : 0.80

Image errors : The image appears to be slightly blurry and the lighting is uneven. There are some artifacts in the image, such as the lines being slightly jagged.

Awe-Inspiring Ancient Architecture: Tourists Marvel at Historic Stone Building

A group of tourists stand in awe before a majestic ancient stone building, its intricate facade a testament to a bygone era. The composition, with the tourists framed in the foreground and the building dominating the background, evokes a sense of wonder and curiosity about the history and craftsmanship of this remarkable structure.

Prompt

camera-positions Long Shot: Awe-inspiring, curious, nostalgic ; A group of tourists, their faces filled with wonder, stand before a majestic ancient monument; Long shot; Tourism; A sprawling, historical site with intricate carvings and towering structures; cinematic

Characteristic

Shot : A group of tourists standing in front of a large ancient stone building with an intricate facade. The building is set into a hillside and is surrounded by other ancient structures. The tourists are looking up at the building with awe.

Aesthetic Score : 0.6

Mood : awe, wonder, curiosity

Quality

Entropy : 6.93

Noise : 111

Prompt Clip Score : 0.26

AI Evaluation

Likelihood of AI : 0.10

Image errors : There are some minor artifacts and errors in the image, particularly around the edges of the building and the tourists. The lighting is also somewhat uneven.

A Family’s Journey Through a Bustling Asian Market

A family of four walks through a vibrant market, their backs to the camera, bathed in warm backlighting. The scene captures the energy and adventure of their journey, leaving the viewer to imagine their destination and the stories they’ll create along the way.

Prompt

camera-positions Long Shot: Adventurous, lively, hopeful ; A family, their luggage in tow, walks down a bustling street in a foreign city; Long shot; Travel; A vibrant, crowded street market with colorful stalls and exotic goods; cinematic

Characteristic

Shot : A family of four walking through a bustling market, likely in Asia. The camera is behind them, capturing their backs as they walk away from the viewer.

Aesthetic Score : 0.6

Mood : tranquil, busy, adventurous

Quality

Entropy : 6.92

Noise : 105

Prompt Clip Score : 0.29

AI Evaluation

Likelihood of AI : 0.10

Image errors : The image is slightly overexposed and some colors are blown out. There are no significant artifacts or errors.

Lost in the Milky Way: A Child’s Wonder Under the Stars

A young girl stands in awe, gazing up at a breathtaking night sky filled with stars and the Milky Way. The scene evokes a sense of wonder and tranquility, inviting viewers to contemplate the vastness of the universe and their place within it.

Prompt

camera-positions Long Shot: Peaceful, hopeful, nostalgic ; A young girl, her eyes filled with wonder, gazes up at a starry night sky; Long shot; Family; A vast, open field with a starry sky above; cinematic

Characteristic

Shot : A young girl is gazing up at the night sky, filled with stars and the Milky Way. The scene is framed by a line of trees in the distance.

Aesthetic Score : 0.7

Mood : wonder, awe, tranquility

Quality

Entropy : 6.90

Noise : 82

Prompt Clip Score : 0.31

AI Evaluation

Likelihood of AI : 0.20

Image errors : No visible artifacts or errors

A Solitary Figure Against the Majesty of Mountains

A lone hiker stands on a rocky peak, dwarfed by the vast, snow-capped mountain range. The serene scene evokes a sense of awe and wonder, highlighting the hiker’s solitude and the immense scale of nature.

Prompt

camera-positions Long Shot: Inspiring, contemplative, triumphant ; A lone figure, standing on a mountain peak, surveys a breathtaking landscape; Long shot; Heroism; A majestic mountain range with snow-capped peaks and valleys below; cinematic

Characteristic

Shot : A lone hiker stands on a rocky peak overlooking a vast, snow-capped mountain range. The sky is a clear blue with a few fluffy clouds.

Aesthetic Score : 0.8

Mood : serene, contemplative, majestic

Quality

Entropy : 6.76

Noise : 109

Prompt Clip Score : 0.30

AI Evaluation

Likelihood of AI : 0.20

Image errors : No significant artifacts or errors are noticeable. The image appears to be clean and well-exposed.

Lost in the Jungle: A Glimpse of Ancient Secrets

Explore a dense, mysterious jungle where ancient ruins peek through the foliage. This tranquil scene evokes a sense of adventure and wonder, inviting you to uncover the secrets hidden within.

Prompt

camera-positions Long Shot: Intriguing, suspenseful, adventurous ; A group of explorers, their faces etched with determination, navigate a dense jungle; Long shot; Adventure; A lush, overgrown jungle with ancient ruins hidden within; cinematic

Characteristic

Shot : A group of people walks through a dense jungle. Ancient ruins can be glimpsed through the foliage in the background.

Aesthetic Score : 0.7

Mood : mysterious, adventurous, tranquil

Quality

Entropy : 6.63

Noise : 123

Prompt Clip Score : 0.32

AI Evaluation

Likelihood of AI : 0.10

Image errors : No visible artifacts or errors

Immersed in the Fight: A Gamer Faces Down a Virtual Monster

A young man, eyes locked on a towering virtual monster, sits in a futuristic gaming chair. Neon lights illuminate the scene, creating a sense of suspense and excitement as he prepares for battle. This image captures the immersive experience of virtual reality gaming, where the line between reality and fantasy blurs.

Prompt

camera-positions Long Shot: Exciting, immersive, thrilling ; A gamer, immersed in a virtual reality game, battles a giant monster; Long shot; Gaming; A futuristic, neon-lit cityscape with holographic projections of the monster; cinematic

Characteristic

Shot : A young man wearing a VR headset is sitting in a gaming chair, facing a large virtual monster behind him. The scene takes place in a futuristic, neon-lit environment, with a cityscape in the background.

Aesthetic Score : 0.6

Mood : futuristic, suspenseful, immersive

Quality

Entropy : 6.54

Noise : 98

Prompt Clip Score : 0.32

AI Evaluation

Likelihood of AI : 0.70

Image errors : Some areas of the image appear slightly blurry and pixelated, particularly the virtual monster and the cityscape. The lighting is also a bit uneven, with some areas being too bright and others too dark.

Beach Bliss: A Family’s Perfect Summer Moment

Capture the joy of a family vacation with this heartwarming image. Five smiling family members stand against the backdrop of a serene ocean, radiating happiness and relaxation. The casual summer attire and the calm atmosphere create a picture of family bonding and carefree fun.

Prompt

camera-positions Long Shot: Relaxing, joyful, nostalgic ; A family, their faces filled with joy, stands on a beach overlooking a turquoise ocean; Long shot; Family; A pristine beach with white sand and crystal-clear water; cinematic

Characteristic

Shot : A family of five is standing on a beach, smiling at the camera, with the ocean behind them. They are dressed in casual summer clothes.

Aesthetic Score : 0.7

Mood : happy, relaxed, family-oriented

Quality

Entropy : 6.19

Noise : 91

Prompt Clip Score : 0.32

AI Evaluation

Likelihood of AI : 0.20

Image errors : No obvious errors

Conclusion

The results show that the generative AI model fell short on camera positions and shot types, while its aesthetic score landed inside the target range. Here’s a breakdown:

Camera Position:

  • Score: 0.4
  • Interpretation: This score falls below the “good” range of 0.5 to 0.75. It suggests that the model did not reliably capture the camera positions described in the prompts.

Shot Analysis:

  • Score: 0.48
  • Interpretation: This score is also below the “good” range, indicating that the model did not fully reproduce the shot types described in the prompts.

Aesthetic Analysis:

  • Score: 0.02
  • Interpretation: This score falls within the “very good” range of -0.2 to 0.1. It suggests that the generated images’ aesthetics stayed close to the aesthetics described in the prompts.

Overall:

The model still has a long way to go in understanding and implementing camera positions and shot types: both scores sit below the “good” range. The aesthetic score, by contrast, lies within the “very good” range. This suggests that further training should focus on translating camera and shot descriptions, rather than aesthetic descriptions, into visual outputs.
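The reported figures can be cross-checked with a few lines of Python. This is a sketch for verifying the arithmetic, not the evaluation pipeline behind the scores. The helper `in_range` and the variable names are illustrative, and the code assumes that the shot-analysis score is judged against the same “good” range as the camera-position score, which the article implies but does not state.

```python
from statistics import mean

# Per-image values copied from the gallery, in order of appearance.
aesthetic = [0.6, 0.7, 0.7, 0.6, 0.6, 0.7, 0.8, 0.7, 0.6, 0.7]
clip_score = [0.29, 0.30, 0.28, 0.26, 0.29, 0.31, 0.30, 0.32, 0.32, 0.32]
ai_likelihood = [0.90, 0.30, 0.80, 0.10, 0.10, 0.20, 0.20, 0.10, 0.70, 0.20]


def in_range(score, low, high):
    """Return True when score lies inside the closed interval [low, high]."""
    return low <= score <= high


# Range bounds quoted in the conclusion.
GOOD = (0.5, 0.75)
VERY_GOOD = (-0.2, 0.1)

summary = {
    "mean_aesthetic": round(mean(aesthetic), 3),
    "mean_clip_score": round(mean(clip_score), 3),
    "mean_ai_likelihood": round(mean(ai_likelihood), 3),
    "camera_position_good": in_range(0.4, *GOOD),
    "shot_analysis_good": in_range(0.48, *GOOD),  # assumed same range
    "aesthetic_very_good": in_range(0.02, *VERY_GOOD),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

With the article's numbers, both camera-related checks come out False and the aesthetic check comes out True. The per-image means are 0.67 for the Aesthetic Score, 0.299 for the Prompt Clip Score, and 0.36 for the Likelihood of AI.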
