Abstract
In this research, we conduct image segmentation using DeepLabv3+ and SUIM-Net models based on super-resolution images generated using an Enhanced Super-Resolution Generative Adversarial Network (ESRGAN). Specifically, ESRGAN is used to upscale the resolution of an image up to four times of the original resolution, where a perceptual loss function instead of a regular mean squared error loss is used for image generation. We subsequently compare the effect of these generated super-resolution images against the original low-resolution ones in solving image segmentation tasks. Evaluated using UFO-120 and SUIM datasets, the empirical results indicate the superiority of DeepLabv3+ and SUIM-Net using the ESRGAN-generated super-resolution images in comparison with the models using the original low-resolution ones for tackling semantic segmentation and visual saliency prediction.
Original language | English |
---|---|
Title of host publication | Proceedings of 2024 International Conference on Machine Learning and Cybernetics (ICMLC) |
Place of Publication | Piscataway, NJ |
Publisher | IEEE |
Pages | 499-504 |
Number of pages | 6 |
ISBN (Electronic) | 9798331528041, 9798331528058 |
DOIs | |
Publication status | Published - 23 Sept 2024 |
Event | 23rd International Conference on Machine Learning and Cybernetics, ICMLC 2024 - Miyazaki, Japan Duration: 20 Sept 2024 → 23 Sept 2024 Conference number: 23rd https://www.icmlc.com/ICMLC/welcome.html |
Conference
Conference | 23rd International Conference on Machine Learning and Cybernetics, ICMLC 2024 |
---|---|
Abbreviated title | ICMLC 2024 |
Country/Territory | Japan |
Period | 20/09/24 → 23/09/24 |
Internet address |
Keywords
- Deep Neural Network
- Saliency Detection
- Segmentation
- Super-resolution