Journal of Chemical Information and Modeling 2017-11-21

Identification of Protein–Ligand Binding Sites by Sequence Information and Ensemble Classifier

Yijie Ding, Jijun Tang, Fei Guo

Index: 10.1021/acs.jcim.7b00307

Full Text: HTML

Abstract

Identifying protein–ligand binding sites is an important process in drug discovery and structure-based drug design. Detecting protein–ligand binding sites is expensive and time-consuming by traditional experimental methods. Hence, computational approaches provide many effective strategies to deal with this issue. Recently, lots of computational methods are based on structure information on proteins. However, these methods are limited in the common scenario, where both the sequence of protein target is known and sufficient 3D structure information is available. Studies indicate that sequence-based computational approaches for predicting protein–ligand binding sites are more practical. In this paper, we employ a novel computational model of protein–ligand binding sites prediction, using protein sequence. We apply the Discrete Cosine Transform (DCT) to extract feature from Position-Specific Score Matrix (PSSM). In order to improve the accuracy, Predicted Relative Solvent Accessibility (PRSA) information is also utilized. The predictor of protein–ligand binding sites is built by employing the ensemble weighted sparse representation model with random under-sampling. To evaluate our method, we conduct several comprehensive tests (12 types of ligands testing sets) for predicting protein–ligand binding sites. Results show that our method achieves better Matthew’s correlation coefficient (MCC) than other outstanding methods on independent test sets of ATP (0.506), ADP (0.511), AMP (0.393), GDP (0.579), GTP (0.641), Mg2+ (0.317), Fe3+ (0.490) and HEME (0.640). Our proposed method outperforms earlier predictors (the performance of MCC) in 8 of the 12 ligands types.

Latest Articles:

Holistic Approach to Partial Covalent Interactions in Protein Structure Prediction and Design with Rosetta

2018-04-19

[10.1021/acs.jcim.7b00398]

Force Field Benchmark of Amino Acids: I. Hydration and Diffusion in Different Water Models

2018-04-18

[10.1021/acs.jcim.8b00026]

Role of Molecular Interactions and Protein Rearrangement in the Dissociation Kinetics of p38α MAP Kinase Type-I/II/III Inhibitors

2018-04-16

[10.1021/acs.jcim.7b00640]

Peptidic Macrocycles - Conformational Sampling and Thermodynamic Characterization

2018-04-13

[10.1021/acs.jcim.8b00097]

ReFlex3D: Refined Flexible Alignment of Molecules Using Shape and Electrostatics

2018-04-13

[10.1021/acs.jcim.7b00618]

More Articles...