A Collective Learning Workflow for Remaining Useful Life Estimation of Building Assets Under Sparse Degradation Data
##plugins.themes.bootstrap3.article.main##
##plugins.themes.bootstrap3.article.sidebar##
Abstract
Building maintenance in the commercial sector is still dominated by reactive and schedule-based strategies, yet the shift to predictive maintenance is held back by a chronic shortage of degradation data: individual assets such as fan coil units, air handling units and pumps rarely accumulate enough failure or degradation history to train standalone prognostic models, and the records that do exist are short and heavily skewed towards normal operation. Existing approaches do not resolve this. Deep sequence models require continuous multi-year recordings that newly instrumented buildings cannot provide and overfit on small, imbalanced datasets; semantic ontologies organise building data but carry no prognostic capability; and baseline forecasting methods generate operating features without estimating degradation. This paper presents a collective-learning workflow that groups functionally identical assets by their Brick semantic class, pools their operational records into a shared dataset, and trains a single Random Forest model to estimate each asset's daily degradation increment, which is then accumulated into a remaining-useful-life projection and updated as maintenance is recorded. Applied to operational data from a pilot building, the workflow produced per-unit daily degradation estimates and remaining-useful-life projections with quantified uncertainty for fan coil units that individually lacked sufficient history to be modelled in isolation. Under leave-one-out cross-validation, operational features alone did not recover the per-unit degradation level (cross-validated R² below zero), whereas fusing a static condition indicator with the operational dynamics was required to do so (R² = 0.67, CVRMSE 20 percent), reported as a preliminary finding rather than a validated prognostic capability. The results show that pooling sparse degradation labels across semantically aligned assets makes data-driven prognostics feasible for building portfolios under data scarcity, providing a transferable route to predictive maintenance that does not depend on multi-year failure records.
How to Cite
##plugins.themes.bootstrap3.article.details##
predictive maintenance, remaining useful life, collective learning, Random Forest, data scarcity, fan coil units, building asset management
ASHRAE. (2014). Guideline 14-2014: Measurement of energy, demand, and water savings. Atlanta, GA: American Society of Heating, Refrigerating and Air-Conditioning Engineers.
Balaji, B., Bhattacharya, A., Fierro, G., Gao, J., Gluck, J., Hong, D., Johansen, A., Koh, J., Ploennigs, J., Acer, Y., Berges, M., Culler, D., Gupta, R., Kjaergaard, M. B., Srivastava, M., & Whitehouse, K. (2018). Brick: Metadata schema for portable smart building applications. Applied Energy, 226, 1273–1292.
Asare, K. A. B., Liu, R., & Anumba, C. J. (2025). Generative AI for predictive maintenance in buildings. In Proceedings of SASBE 2024 (pp. 928–935). Springer.
Guzmán-Torres, J. A., Domínguez-Mota, F. J., Alonso-Guzmán, E. M., Martínez-Molina, W., & Tinoco-Guerrero, G. (2025). A digital twin approach-based method for classification of salt damage in building evaluation. Mathematics and Computers in Simulation, 233, 433–447.
Hakimi, O., Liu, H., & Abudayyeh, O. (2025). Deep learning-driven multi-level data fusion for predictive maintenance of concrete bridge decks. Automation in Construction, 175, 106180.
Hosamo, H. H., Svennevig, P. R., Svidt, K., Han, D., & Nielsen, H. K. (2022). A digital twin predictive maintenance framework of AHUs based on automatic fault detection and diagnostics. Energy and Buildings, 261, 111988.
Hosamo, H. H., Nielsen, H. K., Kraniotis, D., Svennevig, P. R., & Svidt, K. (2023). Digital twin framework for automated fault source detection and prediction for comfort performance evaluation of existing non-residential Norwegian buildings. Energy and Buildings, 281, 112732.
Hosseini Gourabpasi, A., & Nik-Bakht, M. (2024). BIM-based automated fault detection and diagnostics of HVAC systems in commercial buildings. Journal of Building Engineering, 87, 109022.
Hu, W., & Cai, Y. (2024). A semi-supervised method for digital twin-enabled predictive maintenance. Neural Computing and Applications.
Hu, W., Wang, X., Tan, K., & Cai, Y. (2023). Digital twin-enhanced predictive maintenance for indoor climate: A parallel LSTM-autoencoder failure prediction approach. Energy and Buildings, 301, 113738.
Hyndman, R. J., & Koehler, A. B. (2006). Another look at measures of forecast accuracy. International Journal of Forecasting, 22(4), 679–688.
Kang, J. S., Chung, K., & Hong, E. J. (2021). Multimedia knowledge-based bridge health monitoring using digital twin. Multimedia Tools and Applications, 80, 34609–34624.
Ma, X., Shi, Z., Guo, F., Chen, Z., & Wei, J. (2023). Digital twin model for chiller fault diagnosis based on SSAE and transfer learning. Building and Environment, 243, 110718.
Shafer, G., & Vovk, V. (2008). A tutorial on conformal prediction. Journal of Machine Learning Research, 9, 371–421.
Wager, S., Hastie, T., & Efron, B. (2014). Confidence intervals for random forests: The jackknife and the infinitesimal jackknife. Journal of Machine Learning Research, 15, 1625–1651.

This work is licensed under a Creative Commons Attribution 3.0 Unported License.
The Prognostic and Health Management Society advocates open-access to scientific data and uses a Creative Commons license for publishing and distributing any papers. A Creative Commons license does not relinquish the author’s copyright; rather it allows them to share some of their rights with any member of the public under certain conditions whilst enjoying full legal protection. By submitting an article to the International Conference of the Prognostics and Health Management Society, the authors agree to be bound by the associated terms and conditions including the following:
As the author, you retain the copyright to your Work. By submitting your Work, you are granting anybody the right to copy, distribute and transmit your Work and to adapt your Work with proper attribution under the terms of the Creative Commons Attribution 3.0 United States license. You assign rights to the Prognostics and Health Management Society to publish and disseminate your Work through electronic and print media if it is accepted for publication. A license note citing the Creative Commons Attribution 3.0 United States License as shown below needs to be placed in the footnote on the first page of the article.
First Author et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 United States License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.