Wenxin Yige: Baidu's AI generated content platform
The ERNIE-ViLG samples seem worse than DALL-E 2/Imagen/Parti/Stable Diffusion. Wonder what's going on there - undertrained possibly, n=140m isn't a lot these days, especially for 10b-parameter models.
The ERNIE-ViLG samples seem worse than DALL-E 2/Imagen/Parti/Stable Diffusion. Wonder what's going on there - undertrained possibly, n=140m isn't a lot these days, especially for 10b-parameter models.