This blog post explores the capabilities of a generative AI model in creating images based on text prompts. While the model demonstrates proficiency in understanding camera positions and scene descriptions, it struggles to capture the intended aesthetic style, particularly in the portrayal of poses. We delve into the model's performance, analyzing its strengths and weaknesses, and discuss the implications for future development.