Semantic Segmentation Using Enhanced Super-Resolution Generative Adversarial Network-Synthesized Images

Rajashekar Redd Booreddy, Li Zhang, Kamlesh Mistry

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this research, we conduct image segmentation using DeepLabv3+ and SUIM-Net models based on super-resolution images generated using an Enhanced Super-Resolution Generative Adversarial Network (ESRGAN). Specifically, ESRGAN is used to upscale the resolution of an image up to four times of the original resolution, where a perceptual loss function instead of a regular mean squared error loss is used for image generation. We subsequently compare the effect of these generated super-resolution images against the original low-resolution ones in solving image segmentation tasks. Evaluated using UFO-120 and SUIM datasets, the empirical results indicate the superiority of DeepLabv3+ and SUIM-Net using the ESRGAN-generated super-resolution images in comparison with the models using the original low-resolution ones for tackling semantic segmentation and visual saliency prediction.
Original languageEnglish
Title of host publicationProceedings of 2024 International Conference on Machine Learning and Cybernetics (ICMLC)
Place of PublicationPiscataway, NJ
PublisherIEEE
Pages499-504
Number of pages6
ISBN (Electronic)9798331528041, 9798331528058
DOIs
Publication statusPublished - 23 Sept 2024
Event23rd International Conference on Machine Learning and Cybernetics, ICMLC 2024 - Miyazaki, Japan
Duration: 20 Sept 202423 Sept 2024
Conference number: 23rd
https://www.icmlc.com/ICMLC/welcome.html

Conference

Conference23rd International Conference on Machine Learning and Cybernetics, ICMLC 2024
Abbreviated titleICMLC 2024
Country/TerritoryJapan
Period20/09/2423/09/24
Internet address

Keywords

  • Deep Neural Network
  • Saliency Detection
  • Segmentation
  • Super-resolution

Cite this