A Novel Approach for Evaluating Datasets Similarities Based on Analytical Hierarchy Process in the Industrial PHM Context

##plugins.themes.bootstrap3.article.main##

##plugins.themes.bootstrap3.article.sidebar##

Published Jun 27, 2024
Mohamed Aziz Zaghdoudi Christophe Varnier Sonia Hajri-Gabouj Noureddine Zerhouni

Abstract

In prognostics and health management (PHM), data-driven approaches are crucial for performing prognostics based on historical data, relying on the analysis of extensive datasets to identify patterns and relationships that contribute to predicting or optimizing variables. However, their efficiency is contingent upon the availability of large, high-quality datasets tailored to the specific task at hand.
Yet, real-world applications frequently face challenges as data may not always be readily available due to limitations in data acquisition systems or confidentiality concerns. Paradoxically, the contemporary era witnesses an unprecedented surge in the availability of online databases across various fields. These databases offer a plethora of data that can be harnessed to develop, prototype, and test PHM solutions.
This study endeavors to introduce an innovative approach for assessing the similarity between datasets, specifically tailored for prognostic and health management applications. The objective is to empower the development of PHM solutions for predefined systems without relying on data generated from the system itself, but rather by leveraging analogous datasets.
To quantify the similarity between different datasets, we propose a set of criteria and sub-criteria based on the characteristics of datasets. Subsequently, the analytic hierarchy process (AHP), a well-established multi-criteria decision-making approach, is employed to systematically compare the importance of criteria and sub-criteria for each elementary process within the PHM cycle. This dynamic process considers the varying importance of criteria across different phases, acknowledging that a criterion may not be uniformly significant for all elementary processes. The evaluation of dataset similarity incorporates the proposed criteria and sub-criteria, utilizing a fundamental scale of importance intensity and weights assigned through AHP. This holistic approach yields a comprehensive similarity score, enabling a nuanced understanding of dataset compatibility.
To exemplify the efficiency of our proposed approach, we applied it to a practical case study. The study involves assessing the similarity between a run-to-stop database of mechanical bearings and a set of online databases dedicated to the same application. Our solution facilitated the identification of criteria pertinent to the case study, the determination of criterion weights, and ultimately, the calculation of a similarity score for each database. This process proved instrumental in selecting the most similar database, showcasing the practical utility of our proposed approach in real-world PHM scenarios.

How to Cite

Zaghdoudi, M. A., Varnier, C. ., Hajri-Gabouj, S., & Zerhouni, N. (2024). A Novel Approach for Evaluating Datasets Similarities Based on Analytical Hierarchy Process in the Industrial PHM Context. PHM Society European Conference, 8(1), 10. https://doi.org/10.36001/phme.2024.v8i1.4036
Abstract 243 | PDF Downloads 139

##plugins.themes.bootstrap3.article.details##

Keywords

Data Similarity, AHP, PHM, Data-driven, Data Characteristics

References
Ahmadi, A., Arasteh Khouy, I., Kumar, U., & Schunnesson, H. (2009). Selection of maintenance strategy, using analytical hierarchy process. Communications in Dependability and Quality Management, 12(1), 121–132. Alelyani, S., Liu, H., & Wang, L. (2011). The effect of the characteristics of the dataset on the selection stability. In 2011 ieee 23rd international conference on tools with artificial intelligence (pp. 970–977). Benjelloun, O., Chen, S., & Noy, N. (2020). Google dataset search by the numbers. In International semantic web conference (pp. 667–682). Bhatt, N., Thakkar, A., & Ganatra, A. (2012). A survey & current research challenges in meta learning

approaches based on dataset characteristics. International Journal of soft computing and Engineering, 2(10), 234–247.

Bougacha, O., Varnier, C., & Zerhouni, N. (2022). Impact of decision horizon on post-prognostics maintenance and missions scheduling: a railways case study. International Journal of Rail Transportation, 10(4), 516–546. Brunelli, M. (2014). Introduction to the analytic hierarchy process. Springer.

Cabrita, M. D. R., & Frade, R. (2016). Supplier selection approach: integrating analytic hierarchy process and supplier risk analysis. International Journal of Business and Systems Research, 10(2-4), 238–261.

CWRU. (.). Case western reserve university bearing data center dataset. https://engineering.case .edu/bearingdatacenter. (Accessed on March 12, 2024) Franek, J., & Kresta, A. (2014). Judgment scales and consistency measure in ahp. Procedia economics and finance, 12, 164–173.

Guo, L., Lei, Y., Xing, S., Yan, T., & Li, N. (2018). Deep convolutional transfer learning network: A new method for intelligent fault diagnosis of machines with unlabeled data. IEEE Transactions on Industrial Electronics, 66(9), 7316–7325.

Ishizaka, A. (2004). The advantages of clusters in ahp. In The 15th mini-euro conference, mudsm.

Ishizaka, A., & Labib, A. (2011). Review of the main developments in the analytic hierarchy process. Expert systems with applications, 38(11), 14336–14345. Kaggle. (2023). Bearing classification dataset.

www.kaggle.com/datasets/isaienkov/ bearing-classification. (Accessed on March 12, 2024) Kilic, H. S., Zaim, S., & Delen, D. (2014). Development of a hybrid methodology for erp system selection: The case of turkish airlines. Decision Support Systems, 66, 82–92.

Li, X., Zhang, W., Ding, Q., & Sun, J.-Q. (2020). Intelligent rotating machinery fault diagnosis based on deep learning using data augmentation. Journal of Intelligent Manufacturing, 31(2), 433–452.

Nectoux, P., Gouriveau, R., Medjaher, K., Ramasso, E., Chebel-Morello, B., Zerhouni, N., & Varnier, C. (2012). Pronostia: An experimental platform for bearings accelerated degradation tests. In Ieee international conference on prognostics and health management, phm’12. (pp. 1–8).
Nydick, R. L., & Hill, R. P. (1992). Using the analytic hierarchy process to structure the supplier selection procedure. International Journal of purchasing and materials management, 28(2), 31–36. Omri, N., Al Masry, Z., Mairot, N., Giampiccolo, S., & Zerhouni, N. (2020). Industrial data management strategy towards an sme-oriented phm. Journal of Manufacturing Systems, 56, 23–36. Omri, N., Al Masry, Z., Mairot, N., Giampiccolo, S., & Zerhouni, N. (2021). Towards an adapted phm approach: Data quality requirements methodology for fault detection applications. Computers in industry, 127, 103414. Oreski, D., Oreski, S., & Klicek, B. (2017). Effects of dataset characteristics on the performance of feature selection techniques. Applied Soft Computing, 52, 109–119. Pant, S., Kumar, A., Ram, M., Klochkov, Y., & Sharma, H. K. (2022). Consistency indices in analytic hierarchy process: a review. Mathematics, 10(8), 1206. Perez, L., & Wang, J. (2017). The effectiveness of data augmentation in image classification using deep learning. arXiv preprint arXiv:1712.04621. Qiu, H., Lee, J., Lin, J., & Yu, G. (2006). Wavelet filter-based weak signature detection method and its application on rolling element bearing prognostics. Journal of sound and vibration, 289(4-5), 1066–1090. Redman, T. C. (1997). Data quality for the information age. Artech House, Inc. Ren, J., & L¨utzen, M. (2015). Fuzzy multi-criteria decisionmaking method for technology selection for emissions reduction from shipping under uncertainties. Transportation Research Part D: Transport and Environment, 40, 43–60. Saaty, T. L. (1980). The analytic hierarchy process (ahp). The

Journal of the Operational Research Society, 41(11), 1073–1076. Saaty, T. L. (2005). Theory and applications of the analytic network process: decision making with benefits, opportunities, costs, and risks. RWS publications. Saxena, A., Goebel, K., Simon, D., & Eklund, N. (2008). Damage propagation modeling for aircraft engine runto-failure simulation. In 2008 international conference on prognostics and health management (pp. 1–9). Shao, S., McAleer, S., Yan, R., & Baldi, P. (2018). Highly accurate machine fault diagnosis using deep transfer learning. IEEE Transactions on Industrial Informatics, 15(4), 2446–2455. Smith, W. A., & Randall, R. B. (2015). Rolling element bearing diagnostics using the case western reserve university data: A benchmark study. Mechanical systems and signal processing, 64, 100–131. Strong, D. M., Lee, Y. W., & Wang, R. Y. (1997). Data quality in context. Communications of the ACM, 40(5), 103–110. Tobon-Mejia, D. A., Medjaher, K., Zerhouni, N., & Tripot, G. (2012). A data-driven failure prognostics method based on mixture of gaussians hidden markov models. IEEE Transactions on reliability, 61(2), 491–503. Wang, Z., Yang, J., Jiang, H., & Fan, X. (2020). Cnn training with twenty samples for crack detection via data augmentation. Sensors, 20(17), 4849. Weiss, K., Khoshgoftaar, T. M., & Wang, D. (2016). A survey of transfer learning. Journal of Big data, 3, 1–40. Wen, L., Gao, L., & Li, X. (2017). A new deep transfer learning based on sparse auto-encoder for fault diagnosis. IEEE Transactions on systems, man, and cybernetics: systems, 49(1), 136–144.
Section
Technical Papers