Post-clustering Prioritization Framework for Autonomous Decision Making in the Absence of Ground Truth via Hypothetical Probing
##plugins.themes.bootstrap3.article.main##
##plugins.themes.bootstrap3.article.sidebar##
Karm Al Hajhog
Abstract
A generic prioritization framework is introduced for addressing the problem of automated prioritization of region of interest or target selection. The framework is based on the assumption that clustering of preliminary data for preidentified regions or targets of interest within an operational area has already occurred, i.e., post-classification, and that the clustering quality can be expressed as an energy/objective function. Region or target of interest prioritization then means to rank regions or targets of interest according to their probability of changing the energy/objective function value upon subsequent hypothetical probing as opposed to actually conducted reexamination, i.e., thorough follow-up or in-situ measurements. The mathematical formalism for calculating these probabilities to contribute to this change of the energy/objective function value is introduced and validated through numerical simulations. Moreover, these probabilities can also be understood as a confidence-check of the classification, i.e., the pre-clustering of the preliminary data. The operation of the prioritization framework is independent of the algorithm used to pre-cluster the preliminary data, and supports autonomous decision-making. It is widely applicable across many scientific disciplines and areas, ranging from the microscopic to the macroscopic scale. Due to its ability to help maximize scientific return while optimizing resource utilization, it is particularly relevant in the context of resource-constrained autonomous robotic planetary exploration as it may extend the Remaining Useful Lifetime (RUL) – a key aspect in Prognostics and Health Management (PHM) – of space missions. On a more general, PHM-relevant level, the prioritization framework may provide an additional mechanism of identifying and correcting the maintenance status of system components to assist predictive maintenance or condition-based maintenance.
##plugins.themes.bootstrap3.article.details##
Generic scientific region of interest or target prioritization, Autonomous decision making, Resource-constrained scientific return optimization, Information processing, Clustering quality analysis, Feature-vector based data mining, Hypothetical probing in the absence of ground truth
Baker, V.R. (2017) Debates – Hypothesis testing in hydrology: pursuing certainty versus pursuing uberty; Water Resources Research, 53, 1770-1778.
Banerjee, P., Udpa, L., Udpa, S., & Benson, J. (2017) Confidence metric for signal classification in Non Destructive Evaluation; NDT & E International, 91, 88-96. https://doi.org/10.1016/j.ndteint.2017.06.003.
Bezdek, J.C. (1981) Pattern Recognition with Fuzzy Objective Function Algorithms; ISBN 0-306-40671-3.
Bishop, C.M. (1995) Neural Networks for Pattern Recognition; Clarendon Press, Oxford, 1995.
Borlea, I.-D., Precup, R.-E., & Borlea, A.-B. (2022) Improvement of K-means Cluster Quality by Post Processing Improvement of K-means Cluster Quality by Post Processing Resulted Clusters; Procedia Computer Science, 199, 63–70. https://doi.org/10.1016/j.procs.2022.01.009.
Chamberlin, T.C. (1890) The method of multiple working hypotheses; Science 15:92–96. (reprinted in Science 148: 754–759 [1965]).
Duda, R.O., Hart, P.E., & Stork, D.G. (2000) Pattern Classification and Scene Analysis; John Wiley & Sons, 2nd edition, 2000.
Dunn, J.C. (1973) A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters; Journal of Cybernetics, 3 (3): 32–57; https://doi.org/10.1080/01969727308546046.
Everitt, B.S. (1980) Cluster Analysis; John Wiley, Hoboken, N.J.
Fink, W. (2006) Generic Prioritization Framework for Target Selection and Instrument Usage for Reconnaissance Mission Autonomy; IEEE WCCI Conf. Proc., 11116-11119; https://doi.org/10.1109/IJCNN.2006.247316.
Fink, W., Brooks, A.J.-W., & Tarbell, M.A. (2018) Anomaly Detection and Target Prioritization in Planetary Imagery via the Automated Global Feature AnalyzerTM (AGFATM): Progress Towards a Driver for Autonomous C4ISR Missions; SPIE DCS Conf. Proc., 106391Z; https://doi.org/10.1117/12.2303795.
Fink, W., Castano, R., Davies, A., & Mjolsness, E. (2001) Clustering Algorithm for Mutually Constraining Heterogeneous Features; JPL Technical Report; JPL Open Repository, V1; https://hdl.handle.net/2014/1306201-1569.pdf [fileName] [last accessed: 07/23/2023].
Fink, W., Datta, A., Dohm, J.M., Tarbell, M.A., Jobling, F.M., Furfaro, R., Kargel, J.S., Schulze-Makuch, D., & Baker, V.R. (2008) Automated Global Feature Analyzer (AGFA) – A Driver for Tier-Scalable Reconnaissance; IEEE Aerospace Conf. Proc., paper #1273; https://doi.org/10.1109/AERO.2008.4526422.
Fink, W., Dohm, J.M., Tarbell, M.A., Hare, T.M., & Baker, V.R. (2005) Next-Generation Robotic Planetary Reconnaissance Missions: A Paradigm Shift; Planetary and Space Science, 53, 1419-1426; https://doi.org/10.1016/j.pss.2005.07.013.
Fink, W., Dohm, J.M., Tarbell, M.A., Hare, T.M., & Baker, V.R. (2022) 20 Years of Tier-Scalable Reconnaissance: Adoption of a Game-changing Mission Paradigm; IEEE Aerospace Conf. Proc., paper #2886; https://doi.org/10.1109/AERO53065.2022.9843542.
Fink, W., Tarbell, M.A., & Jobling, F.M. (2008) Tier-Scalable Reconnaissance – A Paradigm Shift in Autonomous Remote Planetary Exploration of Mars and Beyond; Chapter 1 in “Planet Mars Research Focus”, Ed. L. A. Costas. Nova Science Publishers, Hauppauge, NY. ISBN: 1-60021-826-1.
Gilbert, G.K. (1886) The inculcation of scientific method by example; American Journal of Science, 31, 284–299. Gilbert, G.K. (1896) The origin of hypotheses; Science, 3, 1-12.
Han, J., Kamber, M., & Pei, J. (2012a) 8 - Classification: Basic Concepts and Methods; Chapter 8 in “The Morgan Kaufmann Series in Data Management Systems, Data Mining: Concepts and Techniques” (Third Edition), Editor(s): Jiawei Han, Micheline Kamber, Jian Pei, Morgan Kaufmann, 2012, Pages 327-391, ISBN 9780123814791, https://doi.org/10.1016/B978-0-12-381479-1.00008-3.
Han, J., Kamber, M., & Pei, J. (2012b) 10 - Cluster Analysis: Basic Concepts and Methods; Chapter 10 in “The Morgan Kaufmann Series in Data Management Systems, Data Mining: Concepts and Techniques” (Third Edition), Editor(s): Jiawei Han, Micheline Kamber, Jian Pei, Morgan Kaufmann, 2012, Pages 443-495, ISBN 9780123814791, https://doi.org/10.1016/B978-0-12-381479-1.00010-1.
Hastie, T., Tibshirani, R., & Friedman, J. (2009) The elements of statistical learning: data mining, inference, and prediction; Second Edition. Publisher: Springer New York, NY. eBook ISBN 978-0-387-84858-7; https://doi.org/10.1007/978-0-387-84858-7.
Johnson, W.R., Wilson, D.W., Fink, W., Humayun, M.S., & Bearman, G.H. (2007) Snapshot hyperspectral imaging in ophthalmology; Journal for Biomedical Optics, 12(1):014036; https://doi.org/10.1117/1.2434950.
Karim, Md.R., Beyan, O., Zappa, A., Costa, I.G., Rebholz-Schuhmann, D., Cochez, M., & Decker, S. (2021) Deep learning-based clustering approaches for bioinformatics; Briefings in Bioinformatics, 22(1), 2021, 393–415; https://doi.org/10.1093/bib/bbz170.
MacQueen, J. (1967) Some methods for classification and analysis of multivariate observations; in 5th Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, edited by L. M. Le Cam and J. Neyman, pp. 281– 296, Univ. of Calif. Press, Berkeley.
Marzo, G.A., Roush, T.L., Blanco, A., Fonti, S., & Orofino, V. (2006) Cluster analysis of planetary remote sensing spectral data; J. Geophys. Res., 111, E03002; https://doi:10.1029/2005JE002532.
Sathishkumar, E. (2015) Fuzzy C Means Manual Work; Slideshare; https://www.slideshare.net/ENSathishkumar/fuzzy-cmeans-manual-work [last accessed: 07/23/2023].
Williams, C.K.I. (2000) An MCMC Approach to Hierarchical Mixture Modelling; Advances in Neural Information Processing Systems 12, Solla, S.A., Leen, T.K., Mueller, K.-R., eds., MIT Press, 2000.