Evaluating Vector Representations of Short Text Data for Automating Recommendations of Maintenance Cases
##plugins.themes.bootstrap3.article.main##
##plugins.themes.bootstrap3.article.sidebar##
Abstract
Nuclear power is a carbon-free source of energy, and features as a key component in the mix of energy towards meeting ambitious decarbonization goals. However, as it currently stands, nuclear power generation is orders of magnitude more expensive when compared to fossil energy sources. Recently, there has been a significant push, by both the US government and the power industry, towards identifying and addressing opportunities for cost reductions in nuclear power generation. While capital costs are being addressed through innovative Smart Modular Reactor (SMR) designs, reductions in Operations and Maintenance (O&M) costs is an equally important opportunity.
Enhanced methods for remote monitoring and health management-based maintenance optimization has been recognized as a key element for reducing preventive and corrective maintenance costs. Beyond sensor data and signal processing, there is also significant potential for reducing manual efforts and improving productivity in maintenance scheduling and planning activities. By utilizing state-of-the-art Technical Language Processing (TLP) methods to analyze large volumes of historical maintenance case data, there is the potential to automate the generation of maintenance recommendations to reduce human effort and accelerate maintenance processes.
This paper presents our efforts towards developing a prescriptive maintenance system that integrates with and enhances state-of-the-art asset performance management software available in the industry. The goal of prescriptive maintenance is to analyze the behavior of an asset, assess its condition, and recommend specific actions to maximize the utility of that asset. Specifically, this work evaluates three approaches of different complexities for vectorization of short-text maintenance case titles for kNN-based recommendation of cases relevant to a new input case title. Industrial text must first be vectorized to build automated and/or machine learning-based prediction and recommendation models. The choice of vectorization methods heavily dictates how the language gets modeled and consequently impacts the performance of downstream prediction and recommendation models.
The objective of the nearest neighbor case recommendations is to reduce manual Subject Matter Expert (SME) effort and increase consistency of recommended maintenance actions on industrial assets by reusing actions performed on the identified nearest neighbor cases from past maintenance work. Four models based on three text vectorization approaches are evaluated, quantitatively and qualitatively, using real data from a large variety of utility customers from the energy domain. A single tier (WVEC-1tier) and a three-tier (WVEC-3tier) approach that represent case titles in word-based vector spaces each significantly outperform a more complex bag-of-phrases topic vector space-based approach (TVEC-K-topics). We present our findings and challenges identified so far in building such a recommendation system.
How to Cite
##plugins.themes.bootstrap3.article.details##
technical language processing, prescriptive maintenance, text mining, advanced nuclear power, Recommendation System
This work is licensed under a Creative Commons Attribution 3.0 Unported License.
The Prognostic and Health Management Society advocates open-access to scientific data and uses a Creative Commons license for publishing and distributing any papers. A Creative Commons license does not relinquish the author’s copyright; rather it allows them to share some of their rights with any member of the public under certain conditions whilst enjoying full legal protection. By submitting an article to the International Conference of the Prognostics and Health Management Society, the authors agree to be bound by the associated terms and conditions including the following:
As the author, you retain the copyright to your Work. By submitting your Work, you are granting anybody the right to copy, distribute and transmit your Work and to adapt your Work with proper attribution under the terms of the Creative Commons Attribution 3.0 United States license. You assign rights to the Prognostics and Health Management Society to publish and disseminate your Work through electronic and print media if it is accepted for publication. A license note citing the Creative Commons Attribution 3.0 United States License as shown below needs to be placed in the footnote on the first page of the article.
First Author et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 United States License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.