A Comparison of Feature Selection and Feature Extraction Techniques for Condition Monitoring of a Hydraulic Actuator
##plugins.themes.bootstrap3.article.main##
##plugins.themes.bootstrap3.article.sidebar##
Abstract
In many applications, there are a number of data sources that can be collected and numerous features that can be calculated from these data sources. The error of big data has lead many to believe that the larger the data, the better the results. However, as the dimensionality of the data increases, the effects of the curse of dimensionality become more prevalent. Further, a large feature set also increases the computational cost
of data collection and feature calculation. In this study, we evaluated four dimensionality reduction techniques as part of a system for condition monitoring of a hydraulic actuator. Two feature selection techniques, ReliefF and variable importance, and two feature extraction techniques, principal component analysis and autoencoders, are used to reduce the input into three classification algorithms. We conclude that variable importance in conjunction with the random forest algorithm outperforms the other dimensionality reduction techniques. Feature selection has the added advantage of being able to remove data sources and features from the data collection and feature calculation process that are not present in
the relevant feature subset.
How to Cite
##plugins.themes.bootstrap3.article.details##
condition monitoring, feature extraction, feature selection
Adams, S., Beling, P. A., Farinholt, K., Brown, N., Polter, S., & Dong, Q. (2016). Condition based monitoring for a hydraulic actuator. In Proceedings of the annual conference of the prognostics and health management society 2016.
Almuallim, H., & Dietterich, T. G. (1991). Learning with many irrelevant features. In Aaai (Vol. 91, pp. 547– 552).
Benkedjouh, T., Medjaher, K., Zerhouni, N., & Rechak, S. (2015). Health assessment and life prediction of cutting tools based on support vector regression. Journal of Intelligent Manufacturing, 26(2), 213–223.
Bishop, C. (2006). Pattern recognition and machine learning. Springer-Verlag New York.
Blum, A. L., & Langley, P. (1997). Selection of relevant features and examples in machine learning. Artificial intelligence, 97(1), 245–271.
Breiman, L. (2001). Random forests. Machine learning, 45(1), 5–32.
Dash, M., & Liu, H. (1997). Feature selection for classification. Intelligent data analysis, 1(1-4), 131–156.
Duda, R. O., Hart, P. E., & Stork, D. G. (2001). Pattern classification. 2nd. Edition. New York.
Guyon, I., & Elisseeff, A. (2003). An introduction to variable and feature selection. Journal of machine learning research, 3(Mar), 1157–1182. Harmouche, J., Delpha, C., & Diallo, D. (2014). Incipient fault detection and diagnosis based on kullback–leibler divergence using principal component analysis: Part i. Signal Processing, 94, 278–287.
Hinton, G. E., & Salakhutdinov, R. R. (2006). Reducing the dimensionality of data with neural networks. science, 313(5786), 504–507.
Hu, Y., Palmé, T., & Fink, O. (2016). Deep health indicator extraction: A method based on auto-encoders and extreme learning machines. In Annual conference of the prognostics and health management society 2016.
Jain, A. K., Duin, R. P. W., & Mao, J. (2000). Statistical pattern recognition: A review. IEEE Transactions on pattern analysis and machine intelligence, 22(1), 4–37.
The Prognostic and Health Management Society advocates open-access to scientific data and uses a Creative Commons license for publishing and distributing any papers. A Creative Commons license does not relinquish the author’s copyright; rather it allows them to share some of their rights with any member of the public under certain conditions whilst enjoying full legal protection. By submitting an article to the International Conference of the Prognostics and Health Management Society, the authors agree to be bound by the associated terms and conditions including the following:
As the author, you retain the copyright to your Work. By submitting your Work, you are granting anybody the right to copy, distribute and transmit your Work and to adapt your Work with proper attribution under the terms of the Creative Commons Attribution 3.0 United States license. You assign rights to the Prognostics and Health Management Society to publish and disseminate your Work through electronic and print media if it is accepted for publication. A license note citing the Creative Commons Attribution 3.0 United States License as shown below needs to be placed in the footnote on the first page of the article.
First Author et al. This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 United States License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.