Steampunk Dreams: A Generative AI's Journey Through a Clockwork World with Flux-dev
- 9 minutes read - 1867 wordsTable of Contents
Steampunk, a subgenre of science fiction, is known for its distinctive aesthetic that blends Victorian-era technology with fantastical elements. This unique style often features intricate clockwork mechanisms, steam-powered vehicles, and a sense of wonder and adventure. In this blog post, we explore the challenges of using generative AI to capture the essence of steampunk, analyzing its ability to create images that embody the style’s key elements.
Created with: flux-dev
Into the Unknown: A Journey Through Ancient Woods
A group of hikers venture through a lush forest, drawn towards a mysterious stone archway. The depth of field and lighting create a sense of wonder and intrigue, inviting you to explore the secrets hidden within this peaceful, adventurous landscape.
Prompt
style-aesthetic Steampunk: Intriguing, adventurous ; A group of adventurers navigating a treacherous jungle; wide shot; Adventure; Lush, overgrown jungle with ancient ruins and steam-powered contraptions.; cinematic
Characteristic
Shot : A group of hikers are walking towards a large, imposing archway surrounded by lush greenery. The scene has a mysterious and ethereal quality due to the hazy atmosphere and the obscured details of the surrounding forest.
Aesthetic Score : 0.6
Mood : mysterious, ethereal, tranquil
Quality
Entropy : 6.82
Noise : 116
Prompt Clip Score : 0.25
AI Evaluation
Likelihood of AI : 0.20
Image errors : There is a slight blurriness in the background and some noise in the shadows. The colors are slightly washed out and lack vibrancy.
Lost in the Fog: A Nostalgic Journey Through Time
A charming street scene bathed in a soft, ethereal fog, leading towards a majestic clock tower in the distance. The atmosphere is thick with nostalgia, inviting you to explore the hidden stories within this captivating image.
Prompt
style-aesthetic Steampunk: Energetic, bustling ; A bustling marketplace filled with exotic goods and steam-powered vehicles; wide shot; Tourism; A vibrant, colorful marketplace with ornate clockwork contraptions and bustling crowds.; cinematic
Characteristic
Shot : A narrow street in a European city with old buildings, a clock tower and many shops. The street is filled with people and the scene is somewhat busy. The lighting is soft and muted with some fog in the background, giving the scene a romantic feel.
Aesthetic Score : 0.7
Mood : nostalgic, romantic, mysterious
Quality
Entropy : 6.80
Noise : 111
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are no obvious artifacts or errors in the image.
Into the Darkness: A Journey Through a Mysterious Cave
A group of adventurers venture deep into a shadowy cave, their path illuminated by flickering lanterns. The contrast between the darkness and the artificial light creates a sense of mystery and intrigue, promising an exciting journey ahead.
Prompt
style-aesthetic Steampunk: Mysterious, adventurous ; A group of explorers navigating a labyrinthine underground city; wide shot; Adventure; A vast, dimly lit underground city with intricate tunnels and glowing crystals.; cinematic
Characteristic
Shot : A group of people walk through a dark and mysterious tunnel, lit only by a few lamps and the faint glow of an unknown light source at the end.
Aesthetic Score : 0.7
Mood : mysterious, eerie, suspenseful
Quality
Entropy : 6.70
Noise : 108
Prompt Clip Score : 0.27
AI Evaluation
Likelihood of AI : 0.30
Image errors : No major errors or artifacts.
A Gathering of Secrets in a Cozy, Mysterious Setting
Three figures huddle around a game board in a dimly lit living room, bathed in the warm glow of lamps. The fireplace crackles in the background, adding to the sense of intimacy and intrigue. The lighting casts long shadows, creating a mysterious atmosphere that draws the viewer into the scene.
Prompt
style-aesthetic Steampunk: Warm, nostalgic ; A family gathered around a crackling fireplace, sharing stories and playing a board game; medium shot; Family; A cozy living room with plush furniture, antique clocks, and warm lighting.; cinematic
Characteristic
Shot : Three people are sitting around a table in a dimly lit room with a fireplace. They are playing a game of chess.
Aesthetic Score : 0.7
Mood : cozy, intimate, mysterious
Quality
Entropy : 6.42
Noise : 90
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : There are some minor artifacts in the image, such as the slight blurriness of the figures and the background, but nothing major.
Lost in the Digital Frontier: A Glimpse into a Futuristic World
A person, enveloped in a VR headset, sits before a computer monitor displaying a mesmerizing futuristic scene. The image evokes a sense of suspense and wonder, as the user is transported to a digital realm brimming with possibilities. The dark, intense mood and dramatic lighting create an otherworldly atmosphere, leaving viewers captivated by the mystery unfolding before them.
Prompt
style-aesthetic Steampunk: Exciting, immersive ; A player’s avatar battling a mechanical beast in a virtual reality game; close-up; Gaming; A futuristic gaming room with holographic displays and advanced controls.; cinematic
Characteristic
Shot : A person wearing a VR headset sits in a chair in front of a computer monitor. Behind them, a metallic robot stands, adding a futuristic element to the scene.
Aesthetic Score : 0.6
Mood : futuristic, mysterious, focused
Quality
Entropy : 6.68
Noise : 78
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : Some of the light sources in the image have a noticeable halo effect, which could be improved.
Nostalgia on Rails: A Steam Locomotive Arrives at a Grand Station
A timeless scene unfolds as a steam locomotive, billowing smoke, pulls into a magnificent, aged train station. The arched window and ornate details create a romantic atmosphere, while the contrast between the dark train and the bright station adds a dramatic touch. This image evokes a sense of nostalgia and the excitement of a bygone era.
Prompt
style-aesthetic Steampunk: Nostalgic, bustling ; A vintage steam train pulling into a bustling station; wide shot; Travel; A grand train station with ornate architecture, steam billowing from the train, and crowds of passengers.; cinematic
Characteristic
Shot : A vintage steam locomotive is pulling into a large train station. There are people waiting on the platform and some are walking away from the locomotive. The steam from the locomotive is billowing up into the air.
Aesthetic Score : 0.7
Mood : nostalgic, romantic, industrial
Quality
Entropy : 6.95
Noise : 100
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.20
Image errors : There are no significant image errors.
Time’s Ticking: A Hand Reaches for a Vintage Clock
A mysterious hand emerges from the shadows, reaching towards a large, intricate clock face. The scene evokes a vintage, industrial atmosphere, leaving viewers with a sense of anticipation and intrigue. What secrets does this clock hold, and what will happen when the hand finally touches it?
Prompt
style-aesthetic Steampunk: Intriguing, focused ; A player’s hand manipulating gears and levers on a complex automaton; close-up; Gaming; A dimly lit workshop filled with intricate machinery and glowing dials.; cinematic
Characteristic
Shot : A hand reaching towards an antique clock face with intricate details and a golden dial. The scene is set in a dimly lit room, with warm lighting.
Aesthetic Score : 0.7
Mood : mysterious, vintage, antique
Quality
Entropy : 6.56
Noise : 81
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.20
Image errors : Slight blurring around the hand and the edges of the clock face, some slight noise present in the shadows.
Lost in the Gears: A Boy’s Fascination with Clockwork
A young boy, captivated by the intricate workings of a golden clockwork device, sits in a workshop bathed in soft light. The image captures a moment of quiet concentration, highlighting the boy’s curiosity and the wonder of mechanical artistry.
Prompt
style-aesthetic Steampunk: Curious, inventive ; A young inventor tinkering with a complex clockwork device; close-up; Heroism; A cluttered workshop filled with tools, gears, and blueprints.; cinematic
Characteristic
Shot : A young boy is sitting at a table, focused intently on a detailed clockwork mechanism he is examining.
Aesthetic Score : 0.7
Mood : curious, focused, playful
Quality
Entropy : 6.77
Noise : 83
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.10
Image errors : No significant errors are noticeable in the image.
Airship Adventure: Soaring Above Snowy Peaks
A vintage airship glides gracefully over a majestic snow-capped mountain range, evoking a sense of dreamy adventure and nostalgia. The clear blue sky and the airship’s presence create a captivating scene of wonder and exploration.
Prompt
style-aesthetic Steampunk: Awe-inspiring, majestic ; A luxurious airship soaring over a breathtaking mountain range; wide shot; Travel; Majestic mountains with snow-capped peaks and a vast, cloudy sky.; cinematic
Characteristic
Shot : A vintage airship is flying over snow-capped mountains in a dreamy landscape.
Aesthetic Score : 0.7
Mood : nostalgic, adventurous, serene
Quality
Entropy : 6.35
Noise : 93
Prompt Clip Score : 0.31
AI Evaluation
Likelihood of AI : 0.80
Image errors : Slight blurring around the edges of the airship, especially in the back. The texture of the airship’s wood is too uniform and doesn’t appear very realistic. The shadows cast by the airship seem slightly off.
Airship Mystery in the Foggy City
A vintage airship hangs suspended above a misty cityscape, its silhouette a stark contrast against the swirling fog. A towering clock tower looms in the background, adding to the sense of mystery and intrigue. This evocative scene captures a vintage aesthetic with a touch of the dramatic.
Prompt
style-aesthetic Steampunk: Epic, determined ; A lone, determined airship pilot; close-up; Heroism; A sprawling cityscape with towering clockwork structures and smoke billowing from chimneys.; cinematic
Characteristic
Shot : An airship flies over a foggy city with a large clock tower in the background.
Aesthetic Score : 0.7
Mood : mysterious, vintage, ethereal
Quality
Entropy : 6.70
Noise : 81
Prompt Clip Score : 0.29
AI Evaluation
Likelihood of AI : 0.70
Image errors : The image appears to have a slight blur, especially on the airship.
Conclusion
The results show that the generative AI model performed well in terms of camera position and shot analysis, but struggled with aesthetic analysis. Here’s a breakdown:
Camera Position:
- Score: 0.5
- Interpretation: This score falls within the “good” range, indicating that the model generally understood and implemented the camera positions described in the prompt.
Shot Analysis:
- Score: 0.58
- Interpretation: This score also falls within the “good” range, suggesting the model was able to grasp the scene and create shots that were generally consistent with the prompt’s description.
Aesthetic Analysis:
- Score: 0.3
- Interpretation: This score is significantly lower than the ideal range of -0.2 to 0.1. This indicates that the generated image’s aesthetic deviated from the expected aesthetic described in the prompt.
Overall:
The model demonstrates a good understanding of camera positions and shot composition. However, it needs improvement in capturing the desired aesthetic style.
Sources:
- https://heartofnoir.com/knowing-noir/aesthetic-of-noir/
- https://www.yellowbrick.co/blog/film/maximizing-the-visual-impact-unveiling-the-art-of-film-aesthetics
- https://www.questjournals.org/jrhss/papers/vol10-issue8/1008255260.pdf
- https://www.jstor.org/stable/3331672
- https://www.cinepoetics.fu-berlin.de/activities/workshops/2020-12-ws/index.html
- https://resource.download.wjec.co.uk/vtc/2016-17/16-17_1-22/eng/Part%201%20What%20is%20Aesthetics.pdf
- https://fal.ai/models/fal-ai/flux/dev/api