This blog post explores the results of an experiment where a generative AI model was tasked with creating images based on scene descriptions. The model demonstrated a good understanding of scene and shot composition, but struggled with accurately capturing the intended camera position and aesthetic, particularly in conveying the essence of poses. We delve into the model's performance, analyzing its strengths and weaknesses, and discuss the challenges of AI in capturing the nuances of human expression and artistic intent.