Automation and scale are key to AI image generation

Generative image models offer limitless possibilities for content creation, creative expression, and automation of costly and time-consuming tasks. Models such as Stable Diffusion and Midjourney are already transforming content creation in many industries including movies, video games and marketing. They will permeate many more industries over the coming year.

However, the current process of interactively generating a handful of images* is inefficient and limits the value of these models as it limits the chance you will see the best images. For every good image posted on Reddit or Twitter, thousands have been generated and discarded. Who knows how good the images were in the unexplored parts of the model, the unseen other ten thousand images?

We believe that to fully realize the potential of these generative models, they need to be combined with a platform that enables fast iteration on thousands of images by scaling human feedback with further AI tools and traditional automation. This includes automated training, inference parameter space search, ML-enabled image search, filtering, and comparison tools, and integrated assistive models such as resolution-upscaling and detail correction (e.g. “face-fix”).

We are a stealth-mode startup building a platform to solve these challenges. If you are interested in trying a private-alpha version of this platform, please contact us at scaled-diffusion@proton.me

We have been generating tens of thousands of images using our platform to serve a particular vertical. This experience, and the requirements it has exposed, has driven our development. It has also demonstrated a number of aspects that significantly affect the quality of generated images and the occurrence-rate of very high quality images.

Key learnings

As you would expect, we have built our platform to address all of these points and continue to improve it daily.

Bring automation and scale to the problem. You get better results.

scaled-diffusion@proton.me

* In more technical terms, manually exploring the parameter space is inefficient.