Style Adaptation Based on Image Processing Methods Using Cyclegan

Izdanje: Sinteza 2023 - International Scientific Conference on Information Technology and Data Related Research

DOI: 10.15308/Sinteza-2023-9-16

Oblast: Computer Science and Artificial Intelligence

Stranice: 9-16

Link: https://portal.sinteza.singidunum.ac.rs/paper/901

Apstrakt:
Cycle-Consistent Generative Adversarial Networks (CycleGANs) are able to provide a highly under-constrained mapping between input and output data samples, i.e., source and target data domain, in cases when the aligned dataset is unavailable, in an unsupervised training fashion, using cycle-consistency loss mechanisms. On the other hand, most image-to-image and speech-to-speech translation tasks use the aligned, i.e., paired input-output training datasets. A large amount of data is necessary to train such architectures, while one of the domains could be scarce. Several possible improvements to the original CycleGAN architecture are analysed in this paper for the cases when only a small percentage of training samples are aligned among source and target data domains. A semi-supervised approach is proposed to achieve better translation accuracy and prevent overfitting of the scarce data domain discriminator during initial training iterations. The training database is augmented by adding samples generated by inverse CycleGAN mappings after several training epochs (when the network is sufficiently trained) into the training pool of the discriminator of scarce, i.e., reduced data domain. An additional optimization constraint is also proposed, aligning probability distributions of feature maps belonging to the same-depth neural network layers of direct GAN encoder and inverse GAN decoder, to reinforce resemblance among object representations in various data domains. Significantly better performances are obtained using proposed improvements in both image-to-image and speech-to-speech translation tasks, by observing standard qualitative and quantitative measures, in comparison to the baseline CycleGAN training approach.
Ključne reči: Style Adaptation, Generative Adversarial Networks, Cycle-Consistency, Semi- Supervised Learning, Bootstrapping
Priložene datoteke:

Preuzimanje citata:

BibTeX format
@article{article,
  author  = {B. Popović}, 
  title   = {Style Adaptation Based on Image Processing Methods Using Cyclegan},
  journal = {Sinteza 2023 - International Scientific Conference on Information Technology and Data Related Research},
  year    = 2023,
  pages   = {9-16},
  doi     = {10.15308/Sinteza-2023-9-16}
}
RefWorks Tagged format
RT Conference Proceedings
A1 Branislav Popović
T1 Style Adaptation Based on Image Processing Methods Using Cyclegan
AD Univerzitet Singidunum, Beograd
YR 2023
NO doi: 10.15308/Sinteza-2023-9-16
Unapred formatirani prikaz citata
B. Popović, Style Adaptation Based on Image Processing Methods Using Cyclegan, Univerzitet Singidunum, Beograd, 2023, doi:10.15308/Sinteza-2023-9-16