it's geared toward a photographic style with an emphasis on balancing realism with creativity, but also has some gems with illustrated or artistic styles if you prompt for them.
This model works great as a plug-and-play model out of the box, but it shines with some workflow optimizations. I've made some suggestions at the end of this post, and you can try them out with my workflows here.
Recommended Settings
In the sample images, second pass is a 1.5x latent upscale, 0.3 to 0.4 denoise, 40 steps. Everything was generated in Comfy.
Sampler: DPM++ 3M SDE
Scheduler: AlignYourSteps
CFG: 3-4 (or use Automatic CFG)
Steps: 30-40
Clip Skip: -2 or -3
Aspect Ratio: 1:1, 2:3, 3:4, 16:9, 21:9, vertical or horizontal
This model works best with natural language style prompting. I've gotten the very best results by separating CLIP-G and CLIP-L, using natural language in CLIP-G and SD 1.5-style keyword based prompting in CLIP-L.
I've created a custom GPT to help with this. By default, it will generate CLIP-G style prompts, but you can optionally ask it for CLIP-L and/or T5 style prompts. The GPT follows my Prompt Pyramid style of prompting, which may not be the best, but it's how I do things.