Introduction: The drug development cost is increasing and developing a successful drug is estimated to be in the range of billions of dollars. The leading causes for this are due to a general increase in R&D cost as well as a decrease in clinical success rates. In silico methods, such as virtual screenings, have become more prominent within drug discovery as they can early in development identify potential chemical starting points. However, with the massive increase in available compounds and molecular targets in the last decades, these datasets need to be properly filtered to generate good results. Recently, a tool called macHine leArning booSTEd dockiNg (HASTEN) has been implemented in virtual screening campaigns to enhance their effectiveness. By only docking a fraction of dataset, HASTEN can accurately predict docking scores for the remaining compounds, speeding up the screening process. Although HASTEN in its current state can generate good results, the tool is still being improved.
Aim: The aim of this study is to find optimizations in the HASTEN protocol to better predict top scoring compounds by investigating HASTEN’s performance on three antibiotic targets.
Methods: Baseline performance was generated by training models using the default HASTEN settings, where recall was used as the primary evaluation metric. New models were trained by systematically changing settings to improve HASTEN’s performance. Various settings were investigated, including the number of iterations, different datasets variants, training set size, additional feature generators, hyperparameter optimization, activation methods and optimizers.
Results: Many investigated settings failed to improve performance, while some even decreased performance. The biggest increase in model performance was seen in models trained with additional Morgan fingerprints, which improved recall by 20% across all targets. The addition of Morgan fingerprints made the models less stable but could be stabilized with the addition of hyperparameters.
Conclusions: In this study we found optimizations that could enhance the performance of HASTEN screenings as well as settings to avoid. Mainly, the addition of Morgan fingerprint features improved recall values the most, in conjunction with hyperparameters it created a stable and reliable model. Potential areas that could further improve HASTEN performance include investigation of larger datasets, systematic selection of initial training compounds, cross-fold validation and ensembles.