Hunyuan-DiT can perform multi-round multi-modal dialogue with users, generating and refining images according to the context. Hunyuan-DiT sets a new state-of-the-art in Chinese-to-image generation compared with other open-source models.
Languages supported: 🇨🇳 and 🇬🇧 Bilingual generation capabilities and has advantages in Chinese elements understanding, A Hunyuan-DiT can analyze and understand the information in long texts, generating corresponding arts.
Know more of "Hunyuan Dit" model: https://dit.hunyuan.tencent.com/
@Made from own ai generated set, LoRA created for the TensorArt's HunYua event.
@misc{li2024hunyuandit,
title={Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding},
author={Zhimin Li and Jianwei Zhang and Qin Lin and Jiangfeng Xiong and Yanxin Long and Xinchi **** and Yingfang Zhang and Xingchao Liu and Minbin Huang and ****** Xiao and Dayou Chen and Jiajun He and Jiahao Li and Wenyue Li and Chen Zhang and Rongwei Quan and Jianxiang Lu and Jiabin Huang and Xiaoyan Yuan and Xiaoxiao Zheng and Yixuan Li and Jihong Zhang and Chao Zhang and Meng Chen and Jie Liu and Zheng Fang and Weiyan Wang and Jinbao Xue and Yangyu Tao and Jianchen Zhu and Kai Liu and Sihuan Lin and Yifu Sun and Yun Li and Dongdong Wang and Mingtao Chen and Zhichao Hu and Xiao Xiao and Yan Chen and Yuhong Liu and Wei Liu and Di Wang and Yong Yang and Jie Jiang and Qinglin Lu},
year={2024},
eprint={2405.08748},
archivePrefix={arXiv},
primaryClass={cs.CV}
}