Timeseries Feature Extraction for Dataset Creation in Prognostic Health Management A Case Study in Steel Manufacturing

##plugins.themes.bootstrap3.article.main##

##plugins.themes.bootstrap3.article.sidebar##

Published Jun 27, 2024
Thanos Kontogiannis
Wanda Melfo
Nick Eleftheroglou
Dimitrios Zarouchas

Abstract

This study focuses on a critical aspect of implementing prognostics and health management (PHM) for assets: the creation of a descriptive dataset. In real-world applications, dealing with sparse and unlabelled big data is common, particularly in industries like production lines where complex subprocesses are monitored by multiple sensors. Moreover, selective application of quality control means that much of the data lacks information about end properties, making datasets provided by manufacturers unsuitable for PHM frameworks. This work aims to bridge the gap between raw production data and PHM frameworks, focusing on steel manufacturing management. In the context of steel manufacturing, compromised surface quality, characterized by thicker oxide layers chipping during milling, has been observed. We propose inferring compromised coils by analyzing temperature profiles directly before the coiling station to address this. Deviations from the goal temperature profile can indicate compromised surface quality, eliminating the need for tedious oxide layer thickness measurements, which are not feasible for continuous hot strip milling processes. The available dataset comprised multiple years of production, with no direct indication of the surface quality. Exploratory clustering analysis was the first step in the lack of labels. Even though indicative of the underlying pattern of the healthy/damaged coils distinction, three shortcomings were identified. Clustering was solely based on the similarity between the temperature profiles of the coils, so no domain knowledge was included regarding the goal temperature profile. Additionally, since different steel grades have different goal profiles, the model needs to be specifically trained for each grade. Also, a soft classification between healthy and damaged can provide more detailed information about the surface quality. Coils with low-confidence classifications can be identified and treated accordingly, thereby improving PHM framework performance by providing a dataset with only high-confidence samples. To tackle these issues, an expert-knowledge-based normalization technique and feature engineering, paired with synthetic labelling, contributed to the creation of a soft neural network classifier. This study presents the reality of handling real-world data for PHM applications and highlights the need for careful and informed feature extraction. This ensures the seamless integration of PHM frameworks into real-world systems, ultimately enhancing production yield by improving end-product quality.

How to Cite

Kontogiannis, T., Melfo, W., Eleftheroglou, N., & Zarouchas, D. (2024). Timeseries Feature Extraction for Dataset Creation in Prognostic Health Management: A Case Study in Steel Manufacturing. PHM Society European Conference, 8(1), 13. https://doi.org/10.36001/phme.2024.v8i1.3968
Abstract 77 | PDF Downloads 51

##plugins.themes.bootstrap3.article.details##

Keywords

prognostics, manufacturing, feature extraction, timeseries, hot strip mill

References
Amari, S.-I. (1972). Learning patterns and pattern sequences by self-organizing nets of threshold elements. IEEE Transactions on computers, 100(11), 1197–1206. Bandt, C., & Pompe, B. (2002). Permutation entropy: a natural complexity measure for time series. Physical review letters, 88(17), 174102.

Batista, G. E., Keogh, E. J., Tataw, O. M., & De Souza, V. M.

(2014). Cid: an efficient complexity-invariant distance for time series. Data Mining and Knowledge Discovery, 28, 634–669.

Benjamini, Y., & Yekutieli, D. (2001). The control of the false discovery rate in multiple testing under dependency. Annals of statistics, 1165–1188.

Box, G. E., Jenkins, G. M., Reinsel, G. C., & Ljung, G. M.

(2015). Time series analysis: forecasting and control.John Wiley & Sons.

Caesarendra, W., & Tjahjowidodo, T. (2017). A review of feature extraction methods in vibration-based condition monitoring and its application for degradation trend estimation of low-speed slew bearing. Machines, 5(4). Chen, R., & Yuen, W. (2001). Oxide-scale structures formed on commercial hot-rolled steel strip and their formation mechanisms. Oxidation of metals, 56(1), 89–118. Choi, S. W., Lee, C., & Lee, e. a. (2005). Fault detection and identification of nonlinear processes based on kernel pca. Chemometrics and intelligent laboratory systems, 75(1), 55–67.

Christ, M., Kempa-Liehr, A. W., & Feindt, M. (2016).

Distributed and parallel time series feature extraction for industrial big data applications. arXiv preprint arXiv:1610.07717.

Cuturi, M., & Blondel, M. (2018). Soft-dtw: a differentiable loss function for time-series.

Deng, G., Zhu, H., Tieu, A. K., Su, L., Reid, M., Zhang, L., . .. others (2017). Theoretical and experimental investigation of thermal and oxidation behaviours of a high speed steel work roll during hot rolling. International Journal of Mechanical Sciences, 131, 811–826. Friedrich, R., Siegert, S., Peinke, J., Siefert, M., Lindemann, M., Raethjen, J., ... others (2000). Extracting model equations from experimental data. Physics Letters A, 271(3), 217–222.

Fulcher, B. D., & Jones, N. S. (2014). Highly comparative feature-based time-series classification. IEEE Transactions on Knowledge and Data Engineering, 26(12), 3026–3037.

Gush, T., Bukhari, S. B. A., Haider, R., Admasie, S., Oh, Y.-S., Cho, G.-J., & Kim, C.-H. (2018). Fault detection and location in a microgrid using mathematical morphology and recursive least square methods. International Journal of Electrical Power & Energy Systems, 102, 324–331.

Jardine, A. K., Lin, D., & Banjevic, D. (2006). A review on machinery diagnostics and prognostics implementing condition-based maintenance. Mechanical systems and signal processing, 20(7), 1483–1510.

Jiang, W., Hong, Y., Zhou, B., He, X., & Cheng, C. (2019). A gan-based anomaly detection approach for imbalanced industrial time series. IEEE Access, 7, 143608–143619. Kankar, P. K., Sharma, S. C., & Harsha, S. P. (2011). Fault diagnosis of ball bearings using continuous wavelet transform. Applied Soft Computing, 11(2), 2300–2312. Liu, H., Zhou, J., Xu, Y., Zheng, Y., Peng, X., & Jiang, W.

(2018). Unsupervised fault diagnosis of rolling bearings using a deep neural network based on generative adversarial networks. Neurocomputing, 315, 412–424. Lloyd, S. (1982). Least squares quantization in pcm. IEEE transactions on information theory, 28(2), 129–137.

Mallat, S. (1999). A wavelet tour of signal processing. Elsevier. Massey Jr, F. J. (1951). The kolmogorov-smirnov test for goodness of fit. Journal of the American statistical Association, 46(253), 68–78. Min, K., Kim, K., Kim, S. K., & Lee, D.-J. (2012). Effects of oxide layers on surface defects during hot rolling processes. Metals and Materials International, 18, 341–

348.

Petitjean, F., Ketterlin, A., & Ganc¸arski, P. (2011). A global averaging method for dynamic time warping, with applications to clustering. Pattern recognition, 44(3), 678–

693.

Richman, J. S., & Moorman, J. R. (2000). Physiological time-series analysis using approximate entropy and sample entropy. American journal of physiology-heart and circulatory physiology, 278(6), H2039–H2049. Rousseeuw, P. J. (1987). Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of computational and applied mathematics, 20, 53–65. Sakoe, H. (1971). Dynamic-programming approach to continuous speech recognition. In 1971 proc. the international congress of acoustics, budapest.

Schreiber, T., & Schmitz, A. (1997). Discrimination power of measures for nonlinearity in a time series. Physical Review E, 55(5), 5443.

Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., & Salakhutdinov, R. (2014). Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research, 15(1), 1929–1958. Wang, Y., Pan, Z., Yuan, X., Yang, C., & Gui, W. (2020).

A novel deep learning based fault diagnosis approach for chemical process with extended deep belief network. ISA transactions, 96, 457–467.

Wang, Y., Yang, H., Yuan, X., Shardt, Y. A., Yang, C., & Gui,

W. (2020). Deep learning for fault-relevant feature extraction and fault classification with stacked supervised auto-encoder. Journal of Process Control, 92, 79–89. Wang, Z., McConnell, S., Balog, R. S., & Johnson, J. (2014). Arc fault signal detection - fourier transformation vs. wavelet decomposition techniques using synthesized data. In 2014 ieee 40th photovoltaic specialist conference (pvsc) (p. 3239-3244). Wang, Z., Wang, J., & Chen, S. (2020). Fault location of strip steel surface quality defects on hot-rolling production line based on information fusion of historical cases and process data. IEEE Access, 8, 171240-171251. Welch, P. (1967). The use of fast fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms. IEEE Transactions on audio and electroacoustics, 15(2), 70–73.
Section
Technical Papers