Photanima is an experimental finetune of Anima Base v1.0 to see whether it is a viable architecture for photography. Spoiler alert: it totally is.
Turbo LoRA baked in. If you're on a 30-series GPU, I recommend using this with my INT8 Toolkit + INT8 Lazy Torch Compile node for wicked fast gen times. All demo images generated with that combo.
🤓 Technical details
Trained on ~1500 images for 27,500 steps. The dataset is my existing one plus around 100 new images and some caption cleanup. Training took approximately 24 hours on a GeForce RTX 3090.
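For a rough sense of scale, here is a back-of-the-envelope sketch built from the numbers above. The batch size is not stated anywhere in this write-up, so `passes_over_dataset` takes it as a parameter and the default of 1 is purely an assumption for illustration.

```python
# Figures taken from the training notes above (all approximate).
IMAGES = 1500
STEPS = 27_500
HOURS = 24

# Optimizer steps per dataset image, ignoring batch size.
steps_per_image = STEPS / IMAGES           # about 18.33

# Average wall-clock time per step on the single GPU.
seconds_per_step = HOURS * 3600 / STEPS    # about 3.14 s


def passes_over_dataset(batch_size: int = 1) -> float:
    """Approximate number of full passes over the dataset.

    batch_size is hypothetical: the real value is not given in the notes.
    """
    return STEPS * batch_size / IMAGES


if __name__ == "__main__":
    print(f"steps per image:  {steps_per_image:.2f}")
    print(f"seconds per step: {seconds_per_step:.2f}")
    print(f"passes (batch 1): {passes_over_dataset(1):.2f}")
```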
Pros:
Extremely fast.
Extremely good prompt adherence.
Anatomy is pretty stable. If it screws something up, changing your steps by +1/-1 usually fixes it.
Supports up to nearly 2MP with little to no distortion.
At first, I noticed that Photanima's style was inconsistent: it had a tendency to regress toward a cartoony/CGI look as my prompts became more complex. I was able to mostly overcome this by splitting Photanima into constituent content and style blocks, then boosting the style strength to around 4.2 in ComfyUI.
Style is pretty consistent now, but there are some notable drawbacks.
Cons:
There are significant biases from my limited dataset. For example, you have to push your prompts pretty hard to steer the model away from its default facial features/racial biases. Yes, I have a type. I suspect this won't be a big issue for LoRA training.
It struggles with certain artistic terms like silhouette.
Microdetail quality is somewhere between SDXL and ZIT. Honestly, it's really good for a 2B model. Two-step upscaling with Anima doesn't help much, but I'm sure the results would be amazing if you sent a Photanima image to a different model for refinement. Or if that's too much work: just add a little film grain. It does wonders and requires no extra VRAM.
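The film grain tip is easy to try outside of any node graph. Below is a minimal sketch using NumPy, assuming the image is already loaded as a `uint8` array of shape `(H, W)` or `(H, W, C)`. The function name `add_film_grain` and the default `strength` of 0.04 are made up for illustration and are not the settings used for the demo images.

```python
import numpy as np


def add_film_grain(image: np.ndarray, strength: float = 0.04, seed=None) -> np.ndarray:
    """Add monochrome Gaussian grain to a uint8 image.

    strength is the noise standard deviation on a 0..1 scale
    (hypothetical default, tune to taste).
    """
    rng = np.random.default_rng(seed)
    pixels = image.astype(np.float32) / 255.0

    # One noise value per pixel, shared by all channels, so the grain
    # changes brightness only and does not add color speckle.
    noise = rng.normal(0.0, strength, size=pixels.shape[:2]).astype(np.float32)
    if pixels.ndim == 3:
        noise = noise[..., None]

    grained = np.clip(pixels + noise, 0.0, 1.0)
    return (grained * 255.0 + 0.5).astype(np.uint8)


if __name__ == "__main__":
    flat_gray = np.full((64, 64, 3), 128, dtype=np.uint8)
    out = add_film_grain(flat_gray, seed=0)
    print(out.shape, out.dtype, out.min(), out.max())
```

This runs on the CPU after generation, so it costs no extra VRAM. Real film grain is not purely Gaussian, so treat this as a starting point rather than a faithful emulation.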
🛠️ Recommended Settings
8-10 steps with v1.1 Turbo, or ~12 steps with v1.0 Turbo.
Euler sampler.
Simple scheduler.
CFG 1.
Preferred resolution: 832x1216 or 1040x1520.
For maximum realism, begin your prompt with real life photo of...
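As a quick sanity check on the preferred resolutions, the sketch below computes their pixel counts and aspect ratio. Both work out to the same 13:19 portrait ratio and sit under the "nearly 2MP" ceiling mentioned earlier. The helper name `megapixels` is just for illustration.

```python
from math import gcd

PREFERRED_RESOLUTIONS = [(832, 1216), (1040, 1520)]


def megapixels(width: int, height: int) -> float:
    """Pixel count in millions."""
    return width * height / 1_000_000


def aspect_ratio(width: int, height: int) -> tuple:
    """Width:height reduced to lowest terms."""
    divisor = gcd(width, height)
    return (width // divisor, height // divisor)


if __name__ == "__main__":
    for w, h in PREFERRED_RESOLUTIONS:
        ratio = aspect_ratio(w, h)
        print(f"{w}x{h}: {megapixels(w, h):.2f} MP, ratio {ratio[0]}:{ratio[1]}")
```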
Base model settings:
30-50 steps.
Euler sampler.
Simple scheduler.
CFG 4-6.
Use a bunch of fluff tags like masterpiece, score_9, absurdres, best quality, highres, photo \(medium\), real life. Note: do not do this with Turbo.
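To keep the Turbo and base settings from getting mixed up, it can help to write them down as presets. This is only a sketch of the recommendations above as plain Python data: the `SamplerPreset` class, the `build_prompt` helper, and the lowercase `"euler"` / `"simple"` strings are assumptions for illustration, not part of any official Photanima or ComfyUI API.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class SamplerPreset:
    steps: Tuple[int, int]    # recommended (min, max) step count
    sampler: str
    scheduler: str
    cfg: Tuple[float, float]  # recommended (min, max) CFG


# Values copied from the recommended settings above.
TURBO_V1_1 = SamplerPreset(steps=(8, 10), sampler="euler", scheduler="simple", cfg=(1.0, 1.0))
TURBO_V1_0 = SamplerPreset(steps=(12, 12), sampler="euler", scheduler="simple", cfg=(1.0, 1.0))
BASE = SamplerPreset(steps=(30, 50), sampler="euler", scheduler="simple", cfg=(4.0, 6.0))

# Quality tags for the base model only; do not use them with Turbo.
BASE_FLUFF_TAGS = [
    "masterpiece", "score_9", "absurdres", "best quality",
    "highres", r"photo \(medium\)", "real life",
]


def build_prompt(subject: str, turbo: bool = True) -> str:
    """Prefix the subject the way each mode recommends."""
    if turbo:
        return f"real life photo of {subject}"
    return ", ".join(BASE_FLUFF_TAGS + [subject])


if __name__ == "__main__":
    print(build_prompt("a woman hiking at sunrise"))
    print(build_prompt("a woman hiking at sunrise", turbo=False))
```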
🗺️ Roadmap
I'm pretty excited about the potential of Anima, but let's be clear: I'm not claiming that this checkpoint is a "ZIT killer." The correct model to compare this against is SDXL/IL - and I'm confident that Anima can dethrone it with enough community attention.
Directions I'd like to explore next:
There are a handful of Anima "detailer" LoRAs on Civitai. These are not intended for photography, but with enough block pruning, you never know. The right mix could go a long way.
I suspect doubling my dataset to ~3k images would make a big difference, especially if I can collect a wider range of faces, body types, and textures.
I'm eagerly awaiting the release of Anima Turbo 1.0. The current Turbo solution is based on Preview3 and I think it's holding back this model's potential a little.
I'm also looking forward to Anima support in OneTrainer. It will make trying experimental configs a lot less of a hassle compared to kohya-ss. For this v1 run, I stuck with safe values (Prodigy, 1.0 LR, no fancy flags).
Thank you. As always, I look forward to your feedback. Please share the model and upload some images to help it gain traction.
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/53c24124-ab19-840e-6cf2-8392bbd6c9a5.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/f554f08b-99cb-3b50-a94d-10439fef021b.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/edd14609-de0a-489c-43ca-36895ffcadea.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/72bc77e0-7e85-90bd-7cb5-4fadd587627d.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/e171b0b7-777d-9ae4-36e2-51af8b574cb7.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/aadc5a10-a19e-6538-1d17-58fd198ed6e4.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/448d21dd-1f61-b93c-7c8e-432fdccbd737.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/78368c10-e0b7-0bdd-83d6-7ecf5ec1cc49.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/5a8dcc87-eee0-e1f0-413a-22a420df6d84.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/64b53955-ae93-4daf-560c-8f31a9c5c787.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/df709ec8-45d6-ccc5-0595-c3db95aed7d3.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/4163627b-214a-bc19-7b19-a8eece199953.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/fa959dcf-c499-be12-4e54-1f62f5c51ea2.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/e962d174-4e54-88d9-a9a9-5b0fe3efa25f.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/4a86cc5a-138d-c2da-454b-6997a3fb2e08.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/200428b7-60c4-c714-eda8-a6e02f375b2a.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/66b38815-cc35-dda1-4a51-42e18ff459b5.png)
![Photanima [by liftweights] by MacrossManiac on Tensor.Art](https://image.tensorartassets.com/cdn-cgi/image/anim=true,plain=false,w=500,q=85/model_showcase/1002097324480019597/152daba8-b8ff-e985-ce2d-a0df6a988c52.png)