A Novel Approach for Evaluating Datasets Similarities Based on Analytical Hierarchy Process in the Industrial PHM Context
##plugins.themes.bootstrap3.article.main##
##plugins.themes.bootstrap3.article.sidebar##
Abstract
In prognostics and health management (PHM), data-driven approaches are crucial for performing prognostics based on historical data, relying on the analysis of extensive datasets to identify patterns and relationships that contribute to predicting or optimizing variables. However, their efficiency is contingent upon the availability of large, high-quality datasets tailored to the specific task at hand.
Yet, real-world applications frequently face challenges as data may not always be readily available due to limitations in data acquisition systems or confidentiality concerns. Paradoxically, the contemporary era witnesses an unprecedented surge in the availability of online databases across various fields. These databases offer a plethora of data that can be harnessed to develop, prototype, and test PHM solutions.
This study endeavors to introduce an innovative approach for assessing the similarity between datasets, specifically tailored for prognostic and health management applications. The objective is to empower the development of PHM solutions for predefined systems without relying on data generated from the system itself, but rather by leveraging analogous datasets.
To quantify the similarity between different datasets, we propose a set of criteria and sub-criteria based on the characteristics of datasets. Subsequently, the analytic hierarchy process (AHP), a well-established multi-criteria decision-making approach, is employed to systematically compare the importance of criteria and sub-criteria for each elementary process within the PHM cycle. This dynamic process considers the varying importance of criteria across different phases, acknowledging that a criterion may not be uniformly significant for all elementary processes. The evaluation of dataset similarity incorporates the proposed criteria and sub-criteria, utilizing a fundamental scale of importance intensity and weights assigned through AHP. This holistic approach yields a comprehensive similarity score, enabling a nuanced understanding of dataset compatibility.
To exemplify the efficiency of our proposed approach, we applied it to a practical case study. The study involves assessing the similarity between a run-to-stop database of mechanical bearings and a set of online databases dedicated to the same application. Our solution facilitated the identification of criteria pertinent to the case study, the determination of criterion weights, and ultimately, the calculation of a similarity score for each database. This process proved instrumental in selecting the most similar database, showcasing the practical utility of our proposed approach in real-world PHM scenarios.
How to Cite
##plugins.themes.bootstrap3.article.details##
Data Similarity, AHP, PHM, Data-driven, Data Characteristics
approaches based on dataset characteristics. International Journal of soft computing and Engineering, 2(10), 234–247.
Bougacha, O., Varnier, C., & Zerhouni, N. (2022). Impact of decision horizon on post-prognostics maintenance and missions scheduling: a railways case study. International Journal of Rail Transportation, 10(4), 516–546. Brunelli, M. (2014). Introduction to the analytic hierarchy process. Springer.
Cabrita, M. D. R., & Frade, R. (2016). Supplier selection approach: integrating analytic hierarchy process and supplier risk analysis. International Journal of Business and Systems Research, 10(2-4), 238–261.
CWRU. (.). Case western reserve university bearing data center dataset. https://engineering.case .edu/bearingdatacenter. (Accessed on March 12, 2024) Franek, J., & Kresta, A. (2014). Judgment scales and consistency measure in ahp. Procedia economics and finance, 12, 164–173.
Guo, L., Lei, Y., Xing, S., Yan, T., & Li, N. (2018). Deep convolutional transfer learning network: A new method for intelligent fault diagnosis of machines with unlabeled data. IEEE Transactions on Industrial Electronics, 66(9), 7316–7325.
Ishizaka, A. (2004). The advantages of clusters in ahp. In The 15th mini-euro conference, mudsm.
Ishizaka, A., & Labib, A. (2011). Review of the main developments in the analytic hierarchy process. Expert systems with applications, 38(11), 14336–14345. Kaggle. (2023). Bearing classification dataset.
www.kaggle.com/datasets/isaienkov/ bearing-classification. (Accessed on March 12, 2024) Kilic, H. S., Zaim, S., & Delen, D. (2014). Development of a hybrid methodology for erp system selection: The case of turkish airlines. Decision Support Systems, 66, 82–92.
Li, X., Zhang, W., Ding, Q., & Sun, J.-Q. (2020). Intelligent rotating machinery fault diagnosis based on deep learning using data augmentation. Journal of Intelligent Manufacturing, 31(2), 433–452.
Nectoux, P., Gouriveau, R., Medjaher, K., Ramasso, E., Chebel-Morello, B., Zerhouni, N., & Varnier, C. (2012). Pronostia: An experimental platform for bearings accelerated degradation tests. In Ieee international conference on prognostics and health management, phm’12. (pp. 1–8).
Nydick, R. L., & Hill, R. P. (1992). Using the analytic hierarchy process to structure the supplier selection procedure. International Journal of purchasing and materials management, 28(2), 31–36. Omri, N., Al Masry, Z., Mairot, N., Giampiccolo, S., & Zerhouni, N. (2020). Industrial data management strategy towards an sme-oriented phm. Journal of Manufacturing Systems, 56, 23–36. Omri, N., Al Masry, Z., Mairot, N., Giampiccolo, S., & Zerhouni, N. (2021). Towards an adapted phm approach: Data quality requirements methodology for fault detection applications. Computers in industry, 127, 103414. Oreski, D., Oreski, S., & Klicek, B. (2017). Effects of dataset characteristics on the performance of feature selection techniques. Applied Soft Computing, 52, 109–119. Pant, S., Kumar, A., Ram, M., Klochkov, Y., & Sharma, H. K. (2022). Consistency indices in analytic hierarchy process: a review. Mathematics, 10(8), 1206. Perez, L., & Wang, J. (2017). The effectiveness of data augmentation in image classification using deep learning. arXiv preprint arXiv:1712.04621. Qiu, H., Lee, J., Lin, J., & Yu, G. (2006). Wavelet filter-based weak signature detection method and its application on rolling element bearing prognostics. Journal of sound and vibration, 289(4-5), 1066–1090. Redman, T. C. (1997). Data quality for the information age. Artech House, Inc. Ren, J., & L¨utzen, M. (2015). Fuzzy multi-criteria decisionmaking method for technology selection for emissions reduction from shipping under uncertainties. Transportation Research Part D: Transport and Environment, 40, 43–60. Saaty, T. L. (1980). The analytic hierarchy process (ahp). The
Journal of the Operational Research Society, 41(11), 1073–1076. Saaty, T. L. (2005). Theory and applications of the analytic network process: decision making with benefits, opportunities, costs, and risks. RWS publications. Saxena, A., Goebel, K., Simon, D., & Eklund, N. (2008). Damage propagation modeling for aircraft engine runto-failure simulation. In 2008 international conference on prognostics and health management (pp. 1–9). Shao, S., McAleer, S., Yan, R., & Baldi, P. (2018). Highly accurate machine fault diagnosis using deep transfer learning. IEEE Transactions on Industrial Informatics, 15(4), 2446–2455. Smith, W. A., & Randall, R. B. (2015). Rolling element bearing diagnostics using the case western reserve university data: A benchmark study. Mechanical systems and signal processing, 64, 100–131. Strong, D. M., Lee, Y. W., & Wang, R. Y. (1997). Data quality in context. Communications of the ACM, 40(5), 103–110. Tobon-Mejia, D. A., Medjaher, K., Zerhouni, N., & Tripot, G. (2012). A data-driven failure prognostics method based on mixture of gaussians hidden markov models. IEEE Transactions on reliability, 61(2), 491–503. Wang, Z., Yang, J., Jiang, H., & Fan, X. (2020). Cnn training with twenty samples for crack detection via data augmentation. Sensors, 20(17), 4849. Weiss, K., Khoshgoftaar, T. M., & Wang, D. (2016). A survey of transfer learning. Journal of Big data, 3, 1–40. Wen, L., Gao, L., & Li, X. (2017). A new deep transfer learning based on sparse auto-encoder for fault diagnosis. IEEE Transactions on systems, man, and cybernetics: systems, 49(1), 136–144.
This work is licensed under a Creative Commons Attribution 3.0 Unported License.
The Prognostic and Health Management Society advocates open-access to scientific data and uses a Creative Commons license for publishing and distributing any papers. A Creative Commons license does not relinquish the author’s copyright; rather it allows them to share some of their rights with any member of the public under certain conditions whilst enjoying full legal protection. By submitting an article to the International Conference of the Prognostics and Health Management Society, the authors agree to be bound by the associated terms and conditions including the following:
As the author, you retain the copyright to your Work. By submitting your Work, you are granting anybody the right to copy, distribute and transmit your Work and to adapt your Work with proper attribution under the terms of the Creative Commons Attribution 3.0 United States license. You assign rights to the Prognostics and Health Management Society to publish and disseminate your Work through electronic and print media if it is accepted for publication. A license note citing the Creative Commons Attribution 3.0 United States License as shown below needs to be placed in the footnote on the first page of the article.
First Author et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 United States License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.