Optimized VGG Network with Dilated Residual Convolution and Path Enhancement for Crack Image Segmentation
DOI:
https://doi.org/10.15837/ijccc.2025.2.6872Keywords:
Crack Segmentation, Dilated Residual Convolution, Path Enhancement Module, Neural NetworkAbstract
Image crack segmentation is a critical task in infrastructure maintenance, as accurate crack detection is essential for structural health monitoring and preventing potential risks. Although traditional Convolutional Neural Networks (CNN) have achieved certain successes in image crack detection, they still exhibit limitations in handling noisy images and detecting fine cracks. This study proposes a VGG network based on dilated residual convolution and path-enhanced optimization to improve the accuracy and efficiency of image crack segmentation. The study utilizes fuzzy morphological filtering for preprocessing, introduces dilated convolution to expand the receptive field, employs residual structures to enhance feature transmission, and incorporates path enhancement modules to boost network performance. The experimental evaluation was conducted using the SDNET2018, METU Dataset, and CFD datasets. The results show that the proposed method achieves a mean intersection over union (MIoU) of 94.60% and a mean accuracy of 96.18%. The improved algorithm demonstrates significant advantages in noise interference handling and detail processing.
References
Black & Veatch. The risks of aging infrastructure and the value of asset management, 2021.
J. Long, E. Shelhamer, and T. Darrell. Fully convolutional networks for semantic segmentation. arXiv preprint arXiv:1411.4038, 2015. https://doi.org/10.1109/CVPR.2015.7298965
H. Li, W. Wang, M. Wang, L. Li, and V. Vimlund. A review of deep learning methods for pixel-level crack detection. Journal of Traffic and Transportation Engineering (English Edition), 9(6):945-968, 2022. https://doi.org/10.1016/j.jtte.2022.11.003
Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel. Backpropagation applied to handwritten zip code recognition. Neural Computation, 1(4):541-551, 1989. https://doi.org/10.1162/neco.1989.1.4.541
K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. arxiv preprint arxiv:1409.1556. arXiv preprint arXiv:1409.1556, 2014.
J. Wu and X. Zhang. Tunnel crack detection method and crack image processing algorithm based on improved retinex and deep learning. Sensors, 23(22):9140, 2023. https://doi.org/10.3390/s23229140
S. Hao, L. Shao, and S. Wang. A faster rcnn airport pavement crack detection method based on attention mechanism. Academic Journal of Science and Technology, 2023. https://doi.org/10.54097/ajst.v4i2.4122
V. Mandal, L. Uong, and Y. Adu-Gyamfi. Automated road crack detection using deep convolutional neural networks. Journal of the American Society for Information Science and Technology, 69(10):2145-2157, 2018. https://doi.org/10.1109/BigData.2018.8622327
S. Anand, S. Gupta, V. Darbari, and S. Kohli. Crack-pot: Autonomous road crack and pothole detection. In Digital Image Computing: Techniques and Applications (DICTA), pages 1-6, Canberra, ACT, Australia, 2018. https://doi.org/10.1109/DICTA.2018.8615819
M. Zeeshan, M. Adnan, W. Ahmad, and F. Z. Khan. Structural crack detection and classification using deep convolutional neural network. Pakistan Journal of Engineering and Technology, 2021. https://doi.org/10.51846/vol4iss4pp50-56
O. Ronneberger, P. Fischer, and T. Brox. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention - MICCAI 2015, pages 234-241. Springer, 2015. https://doi.org/10.1007/978-3-319-24574-4_28
V. Polovnikov, D. Alekseev, I. Vinogradov, and V. G. Lashkia. Daunet: Deep augmented neural network for pavement crack segmentation. IEEE Access, 2021. https://doi.org/10.1109/ACCESS.2021.3111223
Runze Fan, Yuhong Liu, Rongfen Zhang, and Jingyu Li. A road scene semantic segmentation model based on multi-scale attention mechanism. Computer Engineering, 49(2):288-295, 2023.
Zhiqiang Zhou, Muhammad Rafiqul Siddiquee, Nasir Tajbakhsh, and Jun Liang. Unet++: A nested u-net architecture for medical image segmentation. IEEE Transactions on Medical Imaging, 2018.
Hao Luo, Jun Li, Li Cai, and Ming Wu. Strans-yolox: Fusing swin transformer and yolox for automatic pavement crack detection. MDPI, 2023. https://doi.org/10.3390/app13031999
Zhihua Zhang, Yanan Wen, Haowei Mu, and Xiaoping Du. Road crack detection combined with dual attention mechanism. Chinese Journal of Image and Graphics, 07:2240-2250, 2022.
Arash Saberironaghi and Jun Ren. Depthcracknet: A deep learning model for automatic pavement crack detection. Journal of imaging, 10(5):100, 2024. https://doi.org/10.3390/jimaging10050100
Ioannis Katsamenis, Evangelos Protopapadakis, Nikos Bakalos, Anastasios Varvarigos, and Athanasios Doulamis. A few-shot attention recurrent residual u-net for crack segmentation. arXiv, 2023. https://doi.org/10.1007/978-3-031-47969-4_16
Yi Fan, Tian Wu, Guangyu Cao, Yue Zhao, Yifeng Yang, and Yukun Li. Feature pyramid and hierarchical boosting network for pavement crack detection. IEEE Transactions on Intelligent Transportation Systems, 2019.
Sa Xie and Zhuowen Tu. Holistically-nested edge detection. arXiv preprint arXiv:1504.06375, 2015. https://doi.org/10.1109/ICCV.2015.164
Fisher Yu and Vlad Koltun. Multi-scale context aggregation by dilated convolutions. arXiv preprint arXiv:1511.07122, 2016.
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (pp. 770-778). IEEE, 2016.
Yuan Chen, Guo-Qing Zhou, Zhi-Hua Wang, and Hong Zhang. Fuzzy mathematical morphology and its application to image processing. Pattern Recognition Letters, 62:92-102, 2015.
Additional Files
Published
Issue
Section
License
Copyright (c) 2025 Xiaofang WANG, JiaLing WU, Xin CHEN, JunNiang HOU, Peichun CHEN

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
ONLINE OPEN ACCES: Acces to full text of each article and each issue are allowed for free in respect of Attribution-NonCommercial 4.0 International (CC BY-NC 4.0.
You are free to:
-Share: copy and redistribute the material in any medium or format;
-Adapt: remix, transform, and build upon the material.
The licensor cannot revoke these freedoms as long as you follow the license terms.
DISCLAIMER: The author(s) of each article appearing in International Journal of Computers Communications & Control is/are solely responsible for the content thereof; the publication of an article shall not constitute or be deemed to constitute any representation by the Editors or Agora University Press that the data presented therein are original, correct or sufficient to support the conclusions reached or that the experiment design or methodology is adequate.