A Contrastive Learning Approach for Anomaly Detection in Multi-View Scenarios
##plugins.themes.bootstrap3.article.main##
##plugins.themes.bootstrap3.article.sidebar##
Anibal Bregon
Carlos J. Alonso-Gonzalez
Miguel A. Martinez-Prieto
Belarmino Pulido
Abstract
Quality control is a key task in smart manufacturing, since it ensures that processes consistently meet rigorous performance standards. The effective implementation of these mechanisms is crucial to ensuring both reliability and efficiency in modern manufacturing environments, where automation is increasingly integrated. Traditional anomaly detection algorithms typically rely on single-view data for each manufacturing product, overlooking relevant and complementary information available from multiple perspectives. Furthermore, cross-entropy-based loss functions are frequently adopted in the literature to train detection models; however, these approaches often struggle with imbalanced datasets or when detecting rare and subtle anomalies. In this work, a contrastive learning architecture for multi-view anomaly detection in industrial settings is proposed. The method performs a mid-level fusion to generate a structured representation of the input instances, thereby enhancing detection capabilities. The architecture was evaluated on the Real-IAD dataset, where it demonstrated better performance than traditional techniques. These findings highlight the potential of contrastive learning to improve anomaly detection performance, thus contributing to the construction of more reliable quality control systems in smart manufacturing environments.
How to Cite
##plugins.themes.bootstrap3.article.details##
Deep Learning, Contrastive Learning, Multi-view, Quality Control
Chen, T., Kornblith, S., Norouzi, M., & Hinton, G. (2020). A simple framework for contrastive learning of visual representations. In International conference on machine learning (pp. 1597–1607).
Defard, T., Setkov, A., Loesch, A., & Audigier, R. (2021). Padim: a patch distribution modeling framework for anomaly detection and localization. In International conference on pattern recognition (pp. 475–489).
Hadsell, R., Chopra, S., & LeCun, Y. (2006). Dimensionality reduction by learning an invariant mapping. In 2006 IEEE computer society conference on computer vision and pattern recognition (CVPR’06) (Vol. 2, pp. 1735–1742).
He, H., Zhang, J., Tian, G., Wang, C., & Xie, L. (2024). Learning multi-view anomaly detection. arXiv preprint arXiv:2407.11935.
Jakob, P., Madan, M., Schmid-Schirling, T., & Valada, A. (2021). Multi-perspective anomaly detection. Sensors, 21(16), 5311.
Khosla, P., Teterwak, P., Wang, C., Sarna, A., Tian, Y., Isola, P., . . . Krishnan, D. (2020). Supervised contrastive learning. Advances in neural information processing systems, 33, 18661–18673.
Kopuklu, O., Zheng, J., Xu, H., & Rigoll, G. (2021). Driver anomaly detection: A dataset and contrastive learning approach. In Proceedings of the IEEE/CVF winter conference on applications of computer vision (pp. 91–100).
Maaten, L. v. d., & Hinton, G. (2008). Visualizing data using t-sne. Journal of machine learning research, 9(Nov), 2579–2605.
Park, S., Lee, K. H., Ko, B., & Kim, N. (2023). Unsupervised anomaly detection with generative adversarial networks in mammography. Scientific Reports, 13(1), 2925.
Schroff, F., Kalenichenko, D., & Philbin, J. (2015). Facenet: A unified embedding for face recognition and clustering. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 815–823).
Su, H., Maji, S., Kalogerakis, E., & Learned-Miller, E. (2015). Multi-view convolutional neural networks for 3D shape recognition. In Proceedings of the IEEE international conference on computer vision (pp. 945–953).
Wang, C., Zhu, W., Gao, B.-B., Gan, Z., Zhang, J., Gu, Z., . . . Ma, L. (2024). Real-iad: A real-world multi-view dataset for benchmarking versatile industrial anomaly detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp.
22883–22892).
Wang, S., Liu, J., Yu, G., Liu, X., Zhou, S., Zhu, E., . . . Yang, W. (2022). Multiview deep anomaly detection: A systematic exploration. IEEE Transactions on Neural Networks and Learning Systems, 35(2), 1651–1665.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.
The Prognostic and Health Management Society advocates open-access to scientific data and uses a Creative Commons license for publishing and distributing any papers. A Creative Commons license does not relinquish the author’s copyright; rather it allows them to share some of their rights with any member of the public under certain conditions whilst enjoying full legal protection. By submitting an article to the International Conference of the Prognostics and Health Management Society, the authors agree to be bound by the associated terms and conditions including the following:
As the author, you retain the copyright to your Work. By submitting your Work, you are granting anybody the right to copy, distribute and transmit your Work and to adapt your Work with proper attribution under the terms of the Creative Commons Attribution 3.0 United States license. You assign rights to the Prognostics and Health Management Society to publish and disseminate your Work through electronic and print media if it is accepted for publication. A license note citing the Creative Commons Attribution 3.0 United States License as shown below needs to be placed in the footnote on the first page of the article.
First Author et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 United States License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
https://orcid.org/0009-0000-5908-4112