Data Augmentation of Multivariate Sensor Time Series using Autoregressive Models and Application to Failure Prognostics
##plugins.themes.bootstrap3.article.main##
##plugins.themes.bootstrap3.article.sidebar##
Abstract
This work presents a novel data augmentation solution for non-stationary multivariate time series and its application to failure prognostics. The method extends previous work from the authors which is based on time-varying autoregressive processes. It can be employed to extract key information from a limited number of samples and generate new synthetic samples in a way that potentially improves the performance of PHM solutions. This is especially valuable in situations of data scarcity which are very usual in PHM, especially for failure prognostics. The proposed approach is tested based on the CMAPSS dataset, commonly employed for prognostics experiments and benchmarks. An AutoML approach from PHM literature is employed for automating the design of the prognostics solution. The empirical evaluation provides evidence that the proposed method can substantially improve the performance of PHM solutions.
How to Cite
##plugins.themes.bootstrap3.article.details##
data augmentation, autoregressive models, failure prognostics, data-centric PHM
AsSadhan, B., Zeb, K., Al-Muhtadi, J., & Alshebeili, S. (2017). Anomaly detection based on lrd behavior analysis of decomposed control and data planes network traffic using soss and farima models. IEEE Access, 5, 13501-13519.
Biggio, L., & Kastanis, I. (2020). Prognostics and health management of industrial assets: Current progress and road ahead. Front. Artif. Intell., 3, 1-24.
Christ, M., Braun, N., Neuffer, J., & Kempa-Liehr, A. W. (2018). Time series feature extraction on basis of scalable hypothesis tests (tsfresh – a Python package). Neurocomputing,
307, 72-77.
Das, A., Kong, W., Sen, R., & Zhou, Y. (2024). A decoder-only foundation model for time-series forecasting. In International Conference on Machine Learning (PMLR 2024) (Vol. 235, p. 10148-10167). Vienna, Austria.
Deistler, M., & Scherrer, W. (2022). Time series models (1st ed.).
Vienna, Austria: Springer. de Oliveira, F. A. C., Niemi, A., GarcÅLıa-Ortiz, A., & Torres, F. S. (2023). Partial camera obstruction detection using single value image metrics and data augmentation. In International Conference on System Reliability and Safety (ICSRS 2023) (p. 292-299). Venice, Italy.
de Souza, D. B., Chanussot, J., & Favre, A.-C. (2014). On selecting relevant intrinsic mode functions in empirical mode decomposition: An energy-based approach. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014) (p. 325-329). Florence, Italy.
de Souza, D. B., Chanussot, J., Favre, A.-C., & Borgnat, P. (2012). A modified time-frequency method for testing wide-sense stationarity. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012) (p. 3409-3412). Kyoto, Japan.
de Souza, D. B., Chanussot, J., Favre, A.-C., & Borgnat, P. (2014). A new nonparametric method for testing stationarity based on trend analysis in the time marginal distribution. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2014) (p. 320-324). Florence, Italy.
de Souza, D. B., Chanussot, J., Favre, A.-C., & Borgnat, P. (2018). A nonparametric test for slowly-varying nonstationarities. Signal Process., 143, 241-252.
de Souza, D. B., Chanussot, J., Favre, A.-C., & Borgnat, P. (2019). An improved stationarity test based on surrogates. IEEE Signal Process. Lett., 26(10), 1431-1435.
de Souza, D. B., Kuhn, E. V., & Seara, R. (2019). A time varying autoregressive model for characterizing nonstationary processes. IEEE Signal Process. Lett., 26(1), 134-138.
de Souza, D. B., & Leao, B. P. (2023). Data augmentation of sensor time series using time-varying autoregressive processes. In Annual Conference of the Prognostics and Health Management Society (PHM 2023) (Vol. 15, p. 1-10). Salt Lake City, UT.
Garan,M., Tidriri, K., & Kovalenko, I. (2022). A data-centric machine learning methodology: Application on predictive maintenance of wind turbines. Energies, 15(3), 1- 21.
Jiang, X., & Ge, Z. (2021). Data Augmentation Classifier for Imbalanced Fault Classification. IEEE Trans. Autom. Sci., 18(3), 1206-1217.
Kay, S. (2008). A new nonstationarity detector. IEEE Trans. Signal Process., 56(4), 1440-1451.
Kefalas, M., Baratchi, M., Apostolidis, A., van den Herik, D., & BÅNack, T. (2021). Automated machine learning for remaining useful life estimation of aircraft engines. In 2021 ieee international conference on prognostics and health management (icphm) (pp. 1–9).
Kim, S., Choi, J.-H., & Kim, N. H. (2021). Challenges and opportunities of system-level prognostics. Sensors, 21(22), 1-25.
Kim, S., Kim, N. H., & Choi, J.-H. (2020). Prediction of remaining useful life by data augmentation technique based on dynamic time warping. Mech. Syst. Signal Process., 136, 106486.
Kwak, M., & Lee, J. (2023). Diagnosis-based domain adaptive design using designable data augmentation and bayesian transfer learning: Target design estimation and validation. Appl. Soft Comput., 143, 110459.
Le, T. T., Fu, W., & Moore, J. H. (2020). Scaling tree based automated machine learning to biomedical big data with a feature set selector. Bioinformatics, 36(1), 250–256.
Leao, B. P., Fradkin, D., Lan, T., &Wang, J. (2021). Unleashing the power of industrial big data through scalable manual labeling. In NeurIPS Data-Centric AI Workshop (p. 1-5).
Li, H., Zhang, Z., & Zhang, C. (2023). Data augmentation via variational mode reconstruction and its application in few-shot fault diagnosis of rolling bearings. Measurement, 217, 113062.
Li, X. Y., Cheng, D. J., Fang, X. F., Zhang, C. Y., & Wang, Y. F. (2024). A novel data augmentation strategy for aeroengine multitask prognosis based on degradation behavior extrapolation and diversity-usability trade-off. Reliability Engineering & System Safety, 249, 110238. doi: https://doi.org/10.1016/j.ress.2024 .110238
Matei, I., Zhenirovskyy, M., de Kleer, J., & Feldman, A. (2018). Classification-based diagnosis using synthetic data from uncertain models. In Annual Conference of the Prognostics and Health Management Society (PHM 2018) (p. 1-8). Philadelphia, PA.
Pachori, R. B., & Sircar, P. (2008). EEG signal analysis using FB expansion and second-order linear TVAR process. Signal Process., 88(2), 415-420.
Ramasso, E., & Saxena, A. (2014). Review and analysis of algorithmic approaches developed for prognostics on cmapss dataset. In Annual conference of the prognostics and health management society 2014.
Rao, T. S. (1970). The fitting of non-stationary time series models with time-dependent parameters. J. R. Statist. Soc., 32, 312-322.
Saxena, A., & Goebel, K. (2008). Turbofan engine degradation simulation data set. NASA Ames Prognostics Data repository, NASA Ames Research Center, Moffett Field.
Shen, B., Yao, L., Jiang, X., Yang, Z., & Zeng, J. (2023). Time series data augmentation classifier for industrial process imbalanced fault diagnosis. In IEEE Data Driven Control and Learning Systems Conference (DDCLS 2023) (p. 1392-1397). Xiangtan, China.
Sodsri, C. (2003). Time-varying autoregressive modelling for nonstationary acoustic signal and its frequency analysis (Unpublished doctoral dissertation). Pennsylvania State University.
Taghiyarrenani, Z., & Berenji, A. (2022). Noise-robust representation for fault identification with limited data via data augmentation. In European Conference of the Prognostics and Health Management Society (PHME 2022) (p. 473-479). Turin, Italy.
Wang, D., Dong, Y., Wang, H., & Tang, G. (2023). Limited Fault Data Augmentation With Compressed Sensing for Bearing Fault Diagnosis. IEEE Sens. J., 23(13), 14499-14511.
Yang, A., Lu, C., Yu, W., Hu, J., Nakanishi, Y., & Wu, M. (2023). Data Augmentation Considering Distribution Discrepancy for Fault Diagnosis of Drilling Process With Limited Samples. IEEE Trans. Ind. Electron., 70(11), 11774-11783.
Yang, Y., Zheng, H., & Zhang, R. (2017). Prediction and analysis of aircraft failure rate based on sarima model. In IEEE International Conference on Computational Intelligence and Applications (ICCIA 2017) (p. 567- 571). Beijing, China.
Zhang, X., Chowdhury, R. R., Gupta, R. K., & Shang, J. (2024). Large language models for time series: A survey. In International Joint Conference on Artificial Intelligence (IJCAI 2024) (p. 1-9). Jeju, South Korea.
This work is licensed under a Creative Commons Attribution 3.0 Unported License.
The Prognostic and Health Management Society advocates open-access to scientific data and uses a Creative Commons license for publishing and distributing any papers. A Creative Commons license does not relinquish the author’s copyright; rather it allows them to share some of their rights with any member of the public under certain conditions whilst enjoying full legal protection. By submitting an article to the International Conference of the Prognostics and Health Management Society, the authors agree to be bound by the associated terms and conditions including the following:
As the author, you retain the copyright to your Work. By submitting your Work, you are granting anybody the right to copy, distribute and transmit your Work and to adapt your Work with proper attribution under the terms of the Creative Commons Attribution 3.0 United States license. You assign rights to the Prognostics and Health Management Society to publish and disseminate your Work through electronic and print media if it is accepted for publication. A license note citing the Creative Commons Attribution 3.0 United States License as shown below needs to be placed in the footnote on the first page of the article.
First Author et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 United States License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.