论文标题

看起来像魔术:在gan中转移学习以生成新的卡插图

Looks Like Magic: Transfer Learning in GANs to Generate New Card Illustrations

论文作者

Venturelli, Matheus K., Gomes, Pedro H., Wehrmann, Jônatas

论文摘要

在本文中,我们提出了MagicStylegan和Magicstylegan-ADA-最先进的模型stylegan2和stylegan2 ada的化身 - 试验他们将学习到一个相当不同的领域的转移能力:为游戏“魔术:魔术”卡创建新的插图。这是一项具有挑战性的任务,尤其是由于这些插图中存在的各种要素,例如人类,生物,文物和景观 - 更不用说多年来各种艺术家制作的许多艺术风格。为了解决手头的任务,我们介绍了一个名为MTG的新型数据集,其中成千上万的插图来自不同的卡类型,并具有丰富的元数据。最终的集合是一个由无数现实和类似幻想的插图组成的数据集。尽管为了研究多样性的影响,我们还引入了包含特定类型概念的子集,例如森林,岛屿,面部和人类。我们表明,更简单的模型(例如DCGANS)无法在任何环境中学习生成适当的插图。另一方面,我们使用所有建议的子集训练Magicstylegan实例,能够产生高质量的插图。我们执行实验,以了解如何将stylegan2的预训练特征转移到目标域。我们表明,在训练有素的模型中,我们可以找到特定的噪声向量实例,这些实例实际上代表数据集中的真实图像。此外,我们提供定量和定性研究来支持我们的主张,这表明MagicStylegan是生成魔术插图的最新方法。最后,本文重点介绍了有关GAN中转移学习的一些新兴属性,这在生成学习研究中仍然以某种方式探索的领域。

In this paper, we propose MAGICSTYLEGAN and MAGICSTYLEGAN-ADA - both incarnations of the state-of-the-art models StyleGan2 and StyleGan2 ADA - to experiment with their capacity of transfer learning into a rather different domain: creating new illustrations for the vast universe of the game "Magic: The Gathering" cards. This is a challenging task especially due to the variety of elements present in these illustrations, such as humans, creatures, artifacts, and landscapes - not to mention the plethora of art styles of the images made by various artists throughout the years. To solve the task at hand, we introduced a novel dataset, named MTG, with thousands of illustration from diverse card types and rich in metadata. The resulting set is a dataset composed by a myriad of both realistic and fantasy-like illustrations. Although, to investigate effects of diversity we also introduced subsets that contain specific types of concepts, such as forests, islands, faces, and humans. We show that simpler models, such as DCGANs, are not able to learn to generate proper illustrations in any setting. On the other side, we train instances of MAGICSTYLEGAN using all proposed subsets, being able to generate high quality illustrations. We perform experiments to understand how well pre-trained features from StyleGan2 can be transferred towards the target domain. We show that in well trained models we can find particular instances of noise vector that realistically represent real images from the dataset. Moreover, we provide both quantitative and qualitative studies to support our claims, and that demonstrate that MAGICSTYLEGAN is the state-of-the-art approach for generating Magic illustrations. Finally, this paper highlights some emerging properties regarding transfer learning in GANs, which is still a somehow under-explored field in generative learning research.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源