Evaluation of Remaining Useful Life Prediction Algorithms in the Absence of Run-to-Failure Ground Truth Data

##plugins.themes.bootstrap3.article.main##

##plugins.themes.bootstrap3.article.sidebar##

Published Jul 3, 2026
Indranil Roychoudhury Prasham Sheth Taoufik Wassar

Abstract

Accurate evaluation of Remaining Useful Life (RUL) prediction algorithms is fundamental to the deployment of Prognostics and Health Management solutions. However, for critical industrial assets with extended operational lifespans, run-to-failure ground truth data is typically not available. Preventive maintenance intentionally precludes failure events, creating a fundamental challenge of assessing prognostic accuracy without observing actual end-of-life. This paper presents an algorithm-agnostic framework for the continuous online evaluation of RUL predictions in the absence of run-to-failure data. The innovation is a retrospective methodology that treats the asset’s current sensor state as a pseudo ground truth, enabling the evaluation of whether past predictions correctly anticipated the trajectory leading to the present condition. The framework includes two evaluation modes: (1) Measurement-based evaluation that assesses past sensor forecast accuracy against current observations, and (2) RUL-based evaluation that treats the current sensor value as a virtual degradation threshold and evaluates whether past RUL estimates correctly predicted the time to reach the present condition. The RUL-based evaluation adapts the well-established α–λ accuracy framework (Saxena, Celaya, et al., 2008) by replacing the unknown end-of-life with the current time as a pseudo ground truth reference, enabling continuous online assessment without failure observations. Individual prediction verdicts are aggregated using configurable weighting schemes into a single Service-Level Indicator suitable for performance monitoring. Experimental results across several industrial systems demonstrate the framework’s generalizability across diverse degradation mechanisms, sensor modalities, and prediction algorithms. The framework requires only historical sensor measurements and RUL predictions at different times.

How to Cite

Roychoudhury, I., Sheth, P., & Wassar, T. (2026). Evaluation of Remaining Useful Life Prediction Algorithms in the Absence of Run-to-Failure Ground Truth Data. PHM Society European Conference, 9(1), 1–17. https://doi.org/10.36001/phme.2026.v9i1.5055
Abstract 0 | PDF Downloads 0

##plugins.themes.bootstrap3.article.details##

Keywords

Prognostics evaluation, Online performance monitoring

References
Beyer, B., Jones, C., Petoff, J., & Murphy, N. R. (2016). Site reliability engineering: How Google runs production systems. O’Reilly Media.

Chao, M. A., Kulkarni, C. S., Goebel, K. F., & Fink, O. (2020). Fusing physics-based and deep learning models for prognostics. Reliability Engineering & System Safety, 217, 107961.

Chen, Z., Wu, M., Zhao, R., Guretno, F., Yan, R., & Li, X. (2021). Machine remaining useful life prediction via an attention-based deep learning approach. IEEE Transactions on Industrial Electronics, 68(3), 2521–2531.

Cheng, H., Kong, X., Wang, Q., Ma, H., Yang, S., & Chen, G. (2023). Deep transfer learning based on dynamic domain adaptation for remaining useful life prediction under different working conditions. Journal of Intelligent Manufacturing, 34(2), 587–613.

da Costa, P. R. d. O., Akçay, A., Zhang, Y., & Kaymak, U. (2020). Remaining useful lifetime prediction via deep domain adaptation. Reliability Engineering & System Safety, 195, 106682.

Ding, Y., Ding, P., Zhao, X., Cao, Y., & Jia, M. (2022). Transfer learning for remaining useful life prediction across operating conditions based on multisource domain adaptation. IEEE/ASME Transactions on Mechatronics, 27(5), 4143–4152.

Forgione, M., Muni, A., Piga, D., & Gallieri, M. (2023). On the adaptation of recurrent neural networks for system identification. Automatica, 155, 111092.

Gneiting, T., Balabdaoui, F., & Raftery, A. E. (2007). Probabilistic forecasts, calibration and sharpness. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 69(2), 243–268.

Goebel, K., Daigle, M. J., Saxena, A., Roychoudhury, I., Sankararaman, S., & Celaya, J. R. (2017). Prognostics: The science of making predictions.

Goebel, K., Saxena, A., Saha, S., Saha, B., & Celaya, J. (2012). Prognostic performance metrics. In M. Pecht & M. Kang (Eds.). CRC Press.

Huang, Z., Xu, Z., Wang, W., & Sun, Y. (2015). Remaining useful life prediction for a nonlinear heterogeneous Wiener process model with an adaptive drift. IEEE Transactions on Reliability, 64(2), 687–700.

Jardine, A. K., Lin, D., & Banjevic, D. (2006). A review on machinery diagnostics and prognostics implementing condition-based maintenance. Mechanical Systems and Signal Processing, 20(7), 1483–1510.

Lee, J., Wu, F., Zhao, W., Ghaffari, M., Liao, L., & Siegel, D. (2014). Prognostics and health management design for rotary machinery systems: Reviews, methodology and applications. Mechanical Systems and Signal Processing, 42(1–2), 314–334.

Lei, Y., Li, N., Gontarz, S., Lin, J., Radkowski, S., & Dybala, J. (2016). A model-based method for remaining useful life prediction of machinery. IEEE Transactions on Reliability, 65(3), 1314–1326.

Lei, Y., Li, N., Guo, L., Li, N., Yan, T., & Lin, J. (2018). Machinery health prognostics: A systematic review from data acquisition to RUL prediction. Mechanical Systems and Signal Processing, 104, 799–834.

Ma, M., & Mao, Z. (2021). Deep-convolution-based LSTM network for remaining useful life prediction. IEEE Transactions on Industrial Informatics, 17(3), 1658–1667.

Ramasso, E., & Saxena, A. (2014). Performance benchmarking and analysis of prognostic methods for CMAPSS datasets. International Journal of Prognostics and Health Management, 5, 1–15.

Roychoudhury, I., Hafiychuk, V., & Goebel, K. (2013). Model-based diagnosis and prognosis of a water recycling system. In 2013 IEEE Aerospace Conference (pp. 1–9).

Sankararaman, S., & Goebel, K. (2015). Significance, interpretation, and quantification of uncertainty in prognostics and remaining useful life prediction. Mechanical Systems and Signal Processing, 52–53, 228–247.

Saxena, A., Celaya, J., Balaban, E., Goebel, K., Saha, B., Saha, S., & Schwabacher, M. (2008). Metrics for evaluating performance of prognostic techniques. In 2008 International Conference on Prognostics and Health Management (pp. 1–17). IEEE.

Saxena, A., Celaya, J., Saha, B., Saha, S., & Goebel, K. (2010). Metrics for offline evaluation of prognostic performance (Vol. 1, pp. 1–20).

Saxena, A., & Goebel, K. (2008). Turbofan engine degradation simulation data set [Data set]. NASA Ames Prognostics Data Repository. NASA Ames Research Center, Moffett Field, CA.

Saxena, A., Goebel, K., Simon, D., & Eklund, N. (2008). Damage propagation modeling for aircraft engine run-to-failure simulation. In 2008 International Conference on Prognostics and Health Management (pp. 1–9). IEEE.

Sheth, P., & Roychoudhury, I. (2024). Robust remaining useful life prediction using Jacobian feature regression-based model adaptation. PHM Society European Conference, 8, 11.

Si, X.-S., Hu, C.-H., Chen, M.-Y., & Wang, W. (2011). An adaptive and nonlinear drift-based Wiener process for remaining useful life estimation. In 2011 Prognostics and System Health Management Conference (pp. 1–5).

Sikorska, J. Z., Hodkiewicz, M., & Ma, L. (2011). Prognostic modelling options for remaining useful life estimation by industry. Mechanical Systems and Signal Processing, 25(5), 1803–1836.

Taylor, S. J., & Letham, B. (2018). Forecasting at scale. The American Statistician, 72(1), 37–45.

Vachtsevanos, G., Lewis, F. L., Roemer, M., Hess, A., & Wu, B. (2006). Intelligent fault diagnosis and prognosis for engineering systems. John Wiley & Sons.

Wang, Y., Zhao, Y., & Addepalli, S. (2020). Remaining useful life prediction using deep learning approaches: A review. Procedia Manufacturing, 49, 81–88.

Zhang, Y., Xiong, R., He, H., & Liu, Z. (2017). A LSTM-RNN method for the lithium-ion battery remaining useful life prediction. In 2017 Prognostics and System Health Management Conference (PHM-Harbin) (pp. 1–4).

Zio, E. (2022). Prognostics and health management (PHM): Where are we and where do we need to go in theory and practice? Reliability Engineering & System Safety, 218, 108119.
Section
Technical Papers