PGA-Net: pyramid feature fusion and global context attention network for automated surface defect detection Hongwen Dong Kechen Song Yu He Jing Xu Yunhui Yan Qinggang Meng 2134/12249758.v1 https://repository.lboro.ac.uk/articles/journal_contribution/PGA-Net_pyramid_feature_fusion_and_global_context_attention_network_for_automated_surface_defect_detection/12249758 Surface defect detection is a critical task in industrial production process. Nowadays, there are lots of detection methods based on computer vision and have been successfully applied in industry, they also achieved good results. However, achieving full automation of surface defect detection remains a challenge, due to the complexity of surface defect, in intra-class, while the defects between inter-class contain similar parts, there are large differences in appearance of the defects. To address these issues, this paper proposes a pyramid feature fusion and global context attention network for pixel-wise detection of surface defect, called PGA-Net. In the framework, the multi-scale features are extracted at first from backbone network. Then the pyramid feature fusion module is used to fuse these features into five resolutions through some efficient dense skip connections. Finally, the global context attention module is applied to the fusion feature maps of adjacent resolution, which allows effective information propagate from low-resolution fusion feature maps to high-resolution fusion ones. In addition, the boundary refinement block is added to the framework to refine the boundary of defect and improve the result of predict. The final prediction is the fusion of the five resolutions fusion feature maps. The results of evaluation on four real-world defect datasets demonstrate that the proposed method outperforms the state-of-the-art methods on mean Intersection of Union and mean Pixel Accuracy (NEU-Seg: 82.15%, DAGM 2007: 74.78%, MT_defect: 71.31%, Road_defect: 79.54%). 2020-05-06 10:34:09 Surface defect detection Deep learning Deeply-supervised Pyramid feature fusion Global context attention Boundary refinement