This blog post explores how well a generative AI model creates images from text prompts. The model shows an impressive grasp of aesthetics, but it struggles to interpret camera positions and shot descriptions accurately. We examine its performance, analyze its strengths and weaknesses, and discuss the implications for future development.