This is pretty experimental, but can be a lot of fun if you persist past the duds. The idea was to create scenes that have elements that are obviously, yet skillfully photoshopped or inpainted in. Flux wants to fix everything during training, so it fought the concept pretty hard.
The trigger photoshopped was trained in, but you can also just describe two scenes. In a lot of my examples I've described a scene at the top of the image and a separate scene at the bottom. You can also use a phrase like morphing into. Broad items like umbrellas and hats are convenient transition areas between scenes.
Limb disfigurement is fairly prominent. Sorry 'bout that.