Data Visualization, Regression, Applicability Domains and Inverse Analysis Based on Generative Topographic Mapping
This paper introduces two generative topographic mapping (GTM) methods that can be used for data visualization, regression analysis, inverse analysis, and the determination of applicability domains (ADs). In GTM-multiple linear regression (GTM-MLR), the prior probability distribution of the descriptors or explanatory variables (X) is calculated with GTM, and the posterior probability distribution of the property/activity or objective variable (y) given X is calculated with MLR; inverse analysis is then performed using the product rule and Bayes’ theorem. In GTM-regression (GTMR), X and y are combined and GTM is applied to the combined data to obtain the joint probability distribution of X and y; this yields the posterior probability distributions of y given X and of X given y, which are used for regression and inverse analysis, respectively. Simulations using linear and nonlinear datasets, together with quantitative structure-activity relationship (QSAR) and quantitative structure-property relationship (QSPR) datasets, confirm that GTM-MLR and GTMR enable data visualization, regression analysis, and inverse analysis while taking appropriate ADs into account.
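The GTMR idea of conditioning a joint distribution of X and y in either direction can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the joint density is already available as a GTM-style mixture of Gaussians with equal mixing weights and a shared isotropic precision `beta`, and it uses the posterior mean as the point estimate. The component centers, `beta`, and the function names `responsibilities`, `predict_y`, and `inverse_x` are hypothetical, chosen only for illustration; in practice the centers would come from a fitted GTM.

```python
import numpy as np

def responsibilities(value, centers, beta):
    """Posterior weight of each mixture component given the observed coordinates.

    Assumes equal mixing weights and a shared isotropic precision beta,
    so only the squared distance to each center matters.
    """
    value = np.atleast_1d(value).astype(float)
    centers = np.asarray(centers, dtype=float).reshape(len(centers), -1)
    sq_dist = np.sum((centers - value) ** 2, axis=1)
    log_w = -0.5 * beta * sq_dist
    log_w -= log_w.max()          # stabilize the exponentials
    w = np.exp(log_w)
    return w / w.sum()

def predict_y(x, centers_x, centers_y, beta):
    """Regression: mean of p(y | x) under the mixture."""
    r = responsibilities(x, centers_x, beta)
    return float(r @ centers_y)

def inverse_x(y, centers_x, centers_y, beta):
    """Inverse analysis: mean of p(x | y) under the same mixture."""
    r = responsibilities(y, centers_y, beta)
    return r @ centers_x

# Hypothetical component centers in the joint (X, y) space, split into parts.
centers_x = np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 0.0]])
centers_y = np.array([0.0, 5.0, 10.0])
beta = 4.0  # assumed shared precision (inverse variance)

y_hat = predict_y([1.0, 1.0], centers_x, centers_y, beta)
x_hat = inverse_x(10.0, centers_x, centers_y, beta)
```

The same responsibilities drive both directions, which is why a single joint model can serve regression and inverse analysis. A query that lies far from every center yields a very low mixture density, which is one plausible way to flag a point as outside the AD; the criterion actually used in the paper is not stated in this abstract.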