Designing and optimizing novel drugs require both creativity and knowledge. Using the Matched Molecular Pairs method is one way of supporting this process. Commonly, MMP is used to connect structural changes of drug molecules to corresponding changes in assay readouts (Figure 1).
The MMP method was used to extract all synthetically available transformations described in the patent database SureCHEMBL. Accordingly, it is possible to get an overview of how often medicinal chemists have used certain transformations, irrespective of their optimization parameters (Table 1).
Numbers |
Data points used to create MMP set |
61 |
Years of deposited patent applications |
600 MB |
of text |
1.35 M |
patent applications |
20 M |
exemplified compounds |
1.4 M |
Unique transformations |
1-20 |
Transformation size (#atoms) |
> 1000 |
Transformations with >1000 examples |
>50.000 |
Transformations with > 50 examples |
Table 1. Data behind extracted transformations
The MMP transformations from SureCHEMBL can be used in different ways to create analogues to a seed compound:
These analogues can then be filtered through any additional virtual screening cascade prior to selection for synthesis (Figure 3).
Figure 3. Example of workflows applying SureCHEMBL MMP transformations for creation of Design Sets
Download the 500 most common transformations from SureCHEMBL
—
[1] Hussain and Rea, Journal of Chemical Information and Modeling 2010 50 (3), 339-348