Recursive Partitioning for the Prediction of Cytochromes P450 2D6 and 1A2 Inhibition:? Importance of the Quality of the Dataset

publication · 8 years ago
by Olivier Barberan, Ismail Ijjaali, François Petitet, André Michel, Julien Burton, Daniel P. Vercauteren (Facultés Universitaires Notre-Dame de la Paix)
The purpose of this study was to explore the use of detailed biological data in combination with a statistical learning method for predicting the CYP1A2 and CYP2D6 inhibition. Data were extracted from the Aureus−Pharma highly structured databases which contain precise measures and detailed experimental protocol concerning the inhibition of the two cytochromes. The methodology used was Recursive Partitioning, an easy and quick method to implement. The building of models was preceded by the evaluation of the chemical space covered by the datasets. The descriptors used are available in the MOE software suite. The models reached at least 80% of Accuracy and often exceeded this percentage for the Sensitivity (Recall), Specificity, and Precision parameters. CYP2D6 datasets provided 11 models with Accuracy over 80%, while CYP1A2 datasets counted 5 high-accuracy models. Our models can be useful to predict the ADME properties during the drug discovery process and are indicated for high-throughput screening.
Visit publication