Style Adaptation Based on Image Processing Methods Using Cyclegan
Style Adaptation Based on Image Processing Methods Using Cyclegan
Autori:
Izdanje: Sinteza 2023 - International Scientific Conference on Information Technology and Data Related Research
DOI: 10.15308/Sinteza-2023-9-16
Oblast: Computer Science and Artificial Intelligence
Stranice: 9-16
Apstrakt:
Cycle-Consistent Generative Adversarial Networks (CycleGANs) are able to provide a highly under-constrained mapping between input and output data samples, i.e., source and target data domain, in cases when the aligned dataset is unavailable, in an unsupervised training fashion, using cycle-consistency loss mechanisms. On the other hand, most image-to-image and speech-to-speech translation tasks use the aligned, i.e., paired input-output training datasets. A large amount of data is necessary to train such architectures, while one of the domains could be scarce. Several possible improvements to the original CycleGAN architecture are analysed in this paper for the cases when only a small percentage of training samples are aligned among source and target data domains. A semi-supervised approach is proposed to achieve better translation accuracy and prevent overfitting of the scarce data domain discriminator during initial training iterations. The training database is augmented by adding samples generated by inverse CycleGAN mappings after several training epochs (when the network is sufficiently trained) into the training pool of the discriminator of scarce, i.e., reduced data domain. An additional optimization constraint is also proposed, aligning probability distributions of feature maps belonging to the same-depth neural network layers of direct GAN encoder and inverse GAN decoder, to reinforce resemblance among object representations in various data domains. Significantly better performances are obtained using proposed improvements in both image-to-image and speech-to-speech translation tasks, by observing standard qualitative and quantitative measures, in comparison to the baseline CycleGAN training approach.
Ključne reči: Style Adaptation, Generative Adversarial Networks, Cycle-Consistency, Semi- Supervised Learning, Bootstrapping
Priložene datoteke:
- US - SINTEZA - 2023 - RAD 2- 9-16 ( veličina: 935,91 KB, broj pregleda: 137 )
Kategorije objave:
Radovi na konferenciji Sinteza 2023, Beograd, Srbija
Zahvaljujemo se što ste preuzeli publikaciju sa portala Singipedia.
Ukoliko želite da se prijavite za obaveštenja o sadržajima iz oblasti ove publikacije, možete nam ostaviti adresu svoje elektronske pošte.
Preuzimanje citata:
BibTeX format
RefWorks Tagged format
Unapred formatirani prikaz citata
BibTeX format
@article{article, author = {B. Popović}, title = {Style Adaptation Based on Image Processing Methods Using Cyclegan}, journal = {Sinteza 2023 - International Scientific Conference on Information Technology and Data Related Research}, year = 2023, pages = {9-16}, doi = {10.15308/Sinteza-2023-9-16} }
RT Conference Proceedings A1 Branislav Popović T1 Style Adaptation Based on Image Processing Methods Using Cyclegan AD Univerzitet Singidunum, Beograd YR 2023 NO doi: 10.15308/Sinteza-2023-9-16
B. Popović, Style Adaptation Based on Image Processing Methods Using Cyclegan, Univerzitet Singidunum, Beograd, 2023, doi:10.15308/Sinteza-2023-9-16