This blog post examines the results of an experiment testing an AI model's ability to understand and implement camera positions in image generation. While the model demonstrates a strong grasp of scene composition and aesthetics, it falls short in accurately translating the camera positions specified in text prompts into the generated images. We analyze the model's performance, highlight its strengths and weaknesses, and discuss the implications for the future of AI in visual storytelling.