ViTALnet: Anomaly on Industrial Textured Surfaces With Hybrid Transformer

Publisher:
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Publication Type:
Journal Article
Citation:
IEEE Transactions on Instrumentation and Measurement, 2023, 72
Issue Date:
2023-01-01
Filename:
ViTALnet_Anomaly_on_Industrial_Textured_Surfaces_With_Hybrid_Transformer.pdf
Description:
Published version
Size:
19.99 MB
Format:
Adobe PDF
The coexistence of subtle and long-range anomalies in real-world industrial applications poses significant challenges for anomaly localization. Existing methods typically train deep models with multilevel patches or layer-fusion approaches to learn the global-local distribution; however, they do not learn local and global features simultaneously and therefore suffer from inaccurate localization. To this end, a hybrid transformer model, ViTALnet, is proposed here, which is based on fine-grained feature reconstruction. ViTALnet first uses a vision transformer (ViT) as the feature extractor, obtaining locally discriminative features while leveraging the ViT's ability to capture global semantics. Then, an anomaly estimation module is proposed that integrates global attention with a pyramidal architecture to enhance contextual information for fine-grained anomaly localization. Extensive experiments were conducted on the industrial anomaly localization datasets MVTec Anomaly Detection (MVTec AD)-Texture and NanoTWICE and on the general textured datasets KolektorSDD2, MT Defect, and Dot-patterned Fabric, where the proposed ViTALnet outperformed major state-of-the-art (SOTA) methods.
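To make the idea of localization by feature reconstruction more concrete, the following is a minimal sketch in plain Python. It is not the ViTALnet implementation: the abstract does not say how anomaly scores are computed, so the sketch assumes that each patch is scored by the Euclidean distance between its extracted feature vector and its reconstructed one, and that a fixed threshold turns scores into a mask. The function names `anomaly_map` and `localize`, the toy feature grids, and the threshold value are all hypothetical.

```python
import math


def anomaly_map(features, reconstructed):
    """Score each patch by the L2 distance between its feature and its reconstruction.

    Both arguments are H x W grids (lists of lists) of equal-length feature vectors.
    Assumption: reconstruction error serves as the anomaly score.
    """
    return [
        [math.dist(f, r) for f, r in zip(feat_row, rec_row)]
        for feat_row, rec_row in zip(features, reconstructed)
    ]


def localize(scores, threshold):
    """Mark a patch as anomalous when its score exceeds the (hypothetical) threshold."""
    return [[s > threshold for s in row] for row in scores]


# Toy 2 x 2 grid of 2-dimensional patch features (made-up values).
features = [[[1.0, 0.0], [0.0, 1.0]],
            [[1.0, 1.0], [3.0, 4.0]]]
# A reconstruction that matches every patch except the bottom-right one.
reconstructed = [[[1.0, 0.0], [0.0, 1.0]],
                 [[1.0, 1.0], [0.0, 0.0]]]

scores = anomaly_map(features, reconstructed)
mask = localize(scores, threshold=1.0)
```

In this toy case only the bottom-right patch is poorly reconstructed, so it is the only entry of `mask` set to `True`. In a real pipeline the feature grids would come from a trained extractor and reconstruction network rather than hand-written lists.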