Optimized VGG Network with Dilated Residual Convolution and Path Enhancement for Crack Image Segmentation

Authors

  • Xiaofang WANG Geely University of China, Chengdu, Sichuan Province, China
  • JiaLing WU Chengdu College of Electronic Science and Technology of China, Chengdu, Sichuan Province, China
  • Xin CHEN Chengdu College of Electronic Science and Technology of China, Chengdu, Sichuan Province, China
  • JunNiang HOU Geely University of China, Chengdu, Sichuan Province, China
  • Peichun CHEN Research Servicesr, Coventry University, Coventry, UK

DOI:

https://doi.org/10.15837/ijccc.2025.2.6872

Keywords:

Crack Segmentation, Dilated Residual Convolution, Path Enhancement Module, Neural Network

Abstract

Image crack segmentation is a critical task in infrastructure maintenance, as accurate crack detection is essential for structural health monitoring and preventing potential risks. Although traditional Convolutional Neural Networks (CNN) have achieved certain successes in image crack detection, they still exhibit limitations in handling noisy images and detecting fine cracks. This study proposes a VGG network based on dilated residual convolution and path-enhanced optimization to improve the accuracy and efficiency of image crack segmentation. The study utilizes fuzzy morphological filtering for preprocessing, introduces dilated convolution to expand the receptive field, employs residual structures to enhance feature transmission, and incorporates path enhancement modules to boost network performance. The experimental evaluation was conducted using the SDNET2018, METU Dataset, and CFD datasets. The results show that the proposed method achieves a mean intersection over union (MIoU) of 94.60% and a mean accuracy of 96.18%. The improved algorithm demonstrates significant advantages in noise interference handling and detail processing.

References

Black & Veatch. The risks of aging infrastructure and the value of asset management, 2021.

J. Long, E. Shelhamer, and T. Darrell. Fully convolutional networks for semantic segmentation. arXiv preprint arXiv:1411.4038, 2015. https://doi.org/10.1109/CVPR.2015.7298965

H. Li, W. Wang, M. Wang, L. Li, and V. Vimlund. A review of deep learning methods for pixel-level crack detection. Journal of Traffic and Transportation Engineering (English Edition), 9(6):945-968, 2022. https://doi.org/10.1016/j.jtte.2022.11.003

Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel. Backpropagation applied to handwritten zip code recognition. Neural Computation, 1(4):541-551, 1989. https://doi.org/10.1162/neco.1989.1.4.541

K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. arxiv preprint arxiv:1409.1556. arXiv preprint arXiv:1409.1556, 2014.

J. Wu and X. Zhang. Tunnel crack detection method and crack image processing algorithm based on improved retinex and deep learning. Sensors, 23(22):9140, 2023. https://doi.org/10.3390/s23229140

S. Hao, L. Shao, and S. Wang. A faster rcnn airport pavement crack detection method based on attention mechanism. Academic Journal of Science and Technology, 2023. https://doi.org/10.54097/ajst.v4i2.4122

V. Mandal, L. Uong, and Y. Adu-Gyamfi. Automated road crack detection using deep convolutional neural networks. Journal of the American Society for Information Science and Technology, 69(10):2145-2157, 2018. https://doi.org/10.1109/BigData.2018.8622327

S. Anand, S. Gupta, V. Darbari, and S. Kohli. Crack-pot: Autonomous road crack and pothole detection. In Digital Image Computing: Techniques and Applications (DICTA), pages 1-6, Canberra, ACT, Australia, 2018. https://doi.org/10.1109/DICTA.2018.8615819

M. Zeeshan, M. Adnan, W. Ahmad, and F. Z. Khan. Structural crack detection and classification using deep convolutional neural network. Pakistan Journal of Engineering and Technology, 2021. https://doi.org/10.51846/vol4iss4pp50-56

O. Ronneberger, P. Fischer, and T. Brox. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015, pages 234-241. Springer, 2015. https://doi.org/10.1007/978-3-319-24574-4_28

V. Polovnikov, D. Alekseev, I. Vinogradov, and V. G. Lashkia. Daunet: Deep augmented neural network for pavement crack segmentation. IEEE Access, 2021. https://doi.org/10.1109/ACCESS.2021.3111223

Runze Fan, Yuhong Liu, Rongfen Zhang, and Jingyu Li. A road scene semantic segmentation model based on multi-scale attention mechanism. Computer Engineering, 49(2):288-295, 2023.

Zhiqiang Zhou, Muhammad Rafiqul Siddiquee, Nasir Tajbakhsh, and Jun Liang. Unet++: A nested u-net architecture for medical image segmentation. IEEE Transactions on Medical Imaging, 2018.

Hao Luo, Jun Li, Li Cai, and Ming Wu. Strans-yolox: Fusing swin transformer and yolox for automatic pavement crack detection. MDPI, 2023. https://doi.org/10.3390/app13031999

Zhihua Zhang, Yanan Wen, Haowei Mu, and Xiaoping Du. Road crack detection combined with dual attention mechanism. Chinese Journal of Image and Graphics, 07:2240-2250, 2022.

Arash Saberironaghi and Jun Ren. Depthcracknet: A deep learning model for automatic pavement crack detection. Journal of imaging, 10(5):100, 2024. https://doi.org/10.3390/jimaging10050100

Ioannis Katsamenis, Evangelos Protopapadakis, Nikos Bakalos, Anastasios Varvarigos, and Athanasios Doulamis. A few-shot attention recurrent residual u-net for crack segmentation. arXiv, 2023. https://doi.org/10.1007/978-3-031-47969-4_16

Yi Fan, Tian Wu, Guangyu Cao, Yue Zhao, Yifeng Yang, and Yukun Li. Feature pyramid and hierarchical boosting network for pavement crack detection. IEEE Transactions on Intelligent Transportation Systems, 2019.

Sa Xie and Zhuowen Tu. Holistically-nested edge detection. arXiv preprint arXiv:1504.06375, 2015. https://doi.org/10.1109/ICCV.2015.164

Fisher Yu and Vlad Koltun. Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122, 2016.

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (pp. 770-778). IEEE, 2016.

Yuan Chen, Guo-Qing Zhou, Zhi-Hua Wang, and Hong Zhang. Fuzzy mathematical morphology and its application to image processing. Pattern Recognition Letters, 62:92-102, 2015.

Additional Files

Published

2025-03-01

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.