Robust Multi-Class Underwater Waste Detection via CNN and Transformer-Based Object Detection Models
DOI:
https://doi.org/10.15662/IJEETR.2026.0802105Keywords:
Underwater waste detection, deep learning, object detection, YOLOv8, YOLOv9, RT-DETR.Abstract
Marine pollution caused by underwater waste has become a growing environmental concern that threatens aquatic ecosystems and marine biodiversity. Detecting submerged debris in underwater environments remains a difficult task because of challenges such as low visibility, colour distortion, light absorption, and complex backgrounds. Recent advances in deep learning have made it possible to automatically detect underwater objects from visual data with improved accuracy and efficiency
This study investigates the effectiveness of deep learning–based object detection models for multi-class underwater waste detection. A dataset containing fifteen categories of underwater waste objects was obtained from the Roboflow platform. To address common underwater imaging problems, several preprocessing techniques were applied, including red channel enhancement, white balance correction, gamma correction, contrast limited adaptive histogram equalization (CLAHE), bilateral filtering, and image normalization. Three object detection models—YOLOv8, YOLOv9, and RT-DETR—were trained and evaluated using standard detection metrics such as precision, recall, and mean Average Precision (mAP).
This study investigates the effectiveness of deep learning–based object detection models for multi-class underwater waste detection. A dataset containing fifteen categories of underwater waste objects was obtained from the Roboflow platform. To address common underwater imaging problems, several preprocessing techniques were applied, including red channel enhancement, white balance correction, gamma correction, contrast limited adaptive histogram equalization (CLAHE), bilateral filtering, and image normalization. Three object detection models—YOLOv8, YOLOv9, and RT-DETR—were trained and evaluated using standard detection metrics such as precision, recall, and mean Average Precision (mAP).
References
1. K. Samanth, R. Ramyashree, B. N. Anoop, and S. Raghavendra, “A Comprehensive Study on Underwater Object Detection Using Deep Neural Networks,” IEEE Access, vol. 13, pp. 99446–99464, 2025, doi: 10.1109/ACCESS.2025.3577239.
2. S. Wu, P. Luo, Y. Song, and G. Jiang, “Underwater Image Enhancement and Trash Detection Using Deep Learning,” in Proc. IEEE, 2025.
3. Y. Liu, X. Fu, X. Ding, Y. Huang, and J. Paisley, “A Benchmark Dataset and Learning Pipeline for Underwater Image Enhancement,” IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 801–1005, 2021.
4. J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You Only Look Once: Unified, Real-Time Object Detection,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2016, pp. 779–788.
5. A. Bochkovskiy, C.-Y. Wang, and H.-Y. M. Liao, “YOLOv4: Optimal Speed and Accuracy of Object Detection,” arXiv preprint arXiv:2004.10934, 2020.
6. C.-Y. Wang, I.-H. Yeh, and H.-Y. M. Liao, “YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information,” arXiv preprint arXiv:2402.13616, 2024.
7. G. Jocher et al., “Ultralytics YOLOv8,” GitHub Repository, 2023. [Online]. Available: https://github.com/ultralytics/ultralytics
8. N. Carion et al., “End-to-End Object Detection with Transformers,” in Proc. European Conf. Computer Vision (ECCV), 2020, pp. 213–229.
9. W. Liu et al., “SSD: Single Shot MultiBox Detector,” in Proc. European Conf. Computer Vision (ECCV), 2016, pp. 21–37.
10. S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 39, no. 6, pp. 1137–1149, Jun. 2017.
11. K. He, G. Gkioxari, P. Dollár, and R. Girshick, “Mask R-CNN,” in Proc. IEEE Int. Conf. Computer Vision (ICCV), 2017, pp. 2961–2969.
12. T.-Y. Lin et al., “Feature Pyramid Networks for Object Detection,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2017, pp. 2117–2125.
13. Z. Wang et al., “DETR: End-to-End Object Detection with Transformers,” in Proc. European Conf. Computer Vision (ECCV), 2020, pp. 213–229.
14. J. Dai, Y. Li, K. He, and J. Sun, “R-FCN: Object Detection via Region-Based Fully Convolutional Networks,” in Proc. Advances in Neural Information Processing Systems, 2016, pp. 379–387.
15. [15] M. Islam, M. R. Islam, and J. Sattar, “Fast Underwater Image Enhancement for Improved Visual Perception,” IEEE Robotics and Automation Letters, vol. 5, no. 2, pp. 3227–3234, 2020.
16. C.Nagarajan and M.Madheswaran - ‘Stability Analysis of Series Parallel Resonant Converter with Fuzzy Logic Controller Using State Space Techniques’- Taylor &Francis, Electric Power Components and Systems, Vol.39 (8), pp.780-793, May 2011. DOI: 10.1080/15325008.2010.541746
17. C.Nagarajan and M.Madheswaran - ‘Experimental verification and stability state space analysis of CLL-T Series Parallel Resonant Converter’ - Journal of Electrical Engineering, Vol.63 (6), pp.365-372, Dec.2012. DOI: 10.2478/v10187-012-0054-2
18. C.Nagarajan and M.Madheswaran - ‘Performance Analysis of LCL-T Resonant Converter with Fuzzy/PID Using State Space Analysis’- Springer, Electrical Engineering, Vol.93 (3), pp.167-178, September 2011. DOI 10.1007/s00202-011-0203-9
19. S.Tamilselvi, R.Prakash, C.Nagarajan,“Solar System Integrated Smart Grid Utilizing Hybrid Coot-Genetic Algorithm Optimized ANN Controller” Iranian Journal Of Science And Technology-Transactions Of Electrical Engineering, DOI10.1007/s40998-025-00917-z,2025
20. S.Tamilselvi, R.Prakash, C.Nagarajan,“ Adaptive sliding mode control of multilevel grid-connected inverters using reinforcement learning for enhanced LVRT performance” Electric Power Systems Research 253 (2026) 112428, doi.org/10.1016/j.epsr.2025.112428
21. S.Thirunavukkarasu, C. Nagarajan, 2024, “Performance Investigation on OCF and SCF study in BLDC machine using FTANN Controller," Journal of Electrical Engineering And Technology, Volume 20, pages 2675–2688, (2025), doi.org/10.1007/s42835-024-02126-w
22. Vimal, V. R., John Justin Thangaraj, S., Narayanan, L. K., Alagu Thangam, S., Loganayagi, S., & Balakrishnan, S. (2025, April). Enhanced Phishing Detection and Classification Using an Ensemble Machine Learning Approach for URL Analysis. In International Conference on Information and Communication Technology for Intelligent Systems (pp. 229-239). Springer Nature Singapore.
23. Mathew, A. (2021). Obfuscation Techniques for Magecart Detection and Prevention. International Journal of Computer Science and Mobile Computing, 10(2), 39-44.
24. Soundappan, S. J. (2026). Building Trustworthy AI: Explainability and Security in Modern Cloud-Native Data-Driven Ecosystem Platforms. International Journal of Engineering & Extended Technologies Research (IJEETR), 8(2), 570-579.
25. C. Nagarajan, M.Madheswaran and D.Ramasubramanian- ‘Development of DSP based Robust Control Method for General Resonant Converter Topologies using Transfer Function Model’- Acta Electrotechnica et Informatica Journal , Vol.13 (2), pp.18-31,April-June.2013, DOI: 10.2478/aeei-2013-0025.
26. C.Nagarajan and M.Madheswaran - ‘DSP Based Fuzzy Controller for Series Parallel Resonant converter’- Springer, Frontiers of Electrical and Electronic Engineering, Vol. 7(4), pp. 438-446, Dec.12. DOI 10.1007/s11460-012-0212-0.
27. C.Nagarajan and M.Madheswaran - ‘Experimental Study and steady state stability analysis of CLL-T Series Parallel Resonant Converter with Fuzzy controller using State Space Analysis’- Iranian Journal of Electrical & Electronic Engineering, Vol.8 (3), pp.259-267, September 2012.
28. C.Nagarajan and M.Madheswaran, “Analysis and Simulation of LCL Series Resonant Full Bridge Converter Using PWM Technique with Load Independent Operation” has been presented in ICTES’08, a IEEE / IET International Conference organized by M.G.R.University, Chennai.Vol.no.1, pp.190-195, Dec.2007
29. Suganthi Mullainathan, Ramesh Natarajan, “An SPSS and CNN modelling based quality assessment using ceramic materials and membrane filtration techniques”, Revista Materia (Rio J.) Vol. 30, 2025, DOI: https://doi.org/10.1590/1517-7076-RMAT-2024-0721
30. M Suganthi, N Ramesh, “Treatment of water using natural zeolite as membrane filter”, Journal of Environmental Protection and Ecology, Volume 23, Issue 2, pp: 520-530,2022
31. D. Akkaynak and T. Treibitz, “Sea-thru: A Method for Removing Water From Underwater Images,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2019, pp. 1682–1691.
32. C. Li, J. Guo, and C. Guo, “Emerging From Water: Underwater Image Color Correction Based on Weakly Supervised Learning,” IEEE Signal Processing Letters, vol. 25, no. 3, pp. 323–327, 2018.
33. X. Fu, P. Zhuang, Y. Huang, X. Ding, and J. Paisley, “A Retinex-Based Enhancing Approach for Single Underwater Image,” in Proc. IEEE Int. Conf. Image Processing (ICIP), 2014, pp. 4572–4576.
34. P. Panetta, C. Gao, and S. Agaian, “Human Visual System-Based Image Enhancement and Logarithmic Contrast Measure,” IEEE Trans. Systems, Man, and Cybernetics, vol. 38, no. 1, pp. 174–188, 2008.
35. Z. Zhang et al., “Underwater Object Detection Based on Improved YOLO Network,” IEEE Access, vol. 9, pp. 123456–123468, 2021.
36. S. Mandal, S. Gupta, and A. Banerjee, “Deep Learning-Based Marine Debris Detection in Underwater Images,” IEEE Access, vol. 10, pp. 45678–45689, 2022.
37. M. Pedersen et al., “Detection of Marine Litter Using Deep Neural Networks,” IEEE Access, vol. 8, pp. 102933–102945, 2020.
38. R. Li, X. Zhang, and Y. Wang, “Deep Learning-Based Marine Debris Detection Using Underwater Imagery,” IEEE Journal of Oceanic Engineering, vol. 46, no. 4, pp. 1234–1245, 2021.
39. H. Zhang, Y. Liu, and X. Ding, “Underwater Object Detection via Multi-Scale Deep Neural Networks,” IEEE Geoscience and Remote Sensing Letters, vol. 18, no. 9, pp. 1543–1547, 2021.
40. J. Long, E. Shelhamer, and T. Darrell, “Fully Convolutional Networks for Semantic Segmentation,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2015, pp. 3431–3440.





