A benchmark database for variations


Home | Instructions | Datasets | Citing | Disclaimer |


Protein solubility

Dataset used for PON-SOL

DATASET 1

444 experimentally verified solubility-affecting amino acid substitutions.

    Protein solubility dataset (Excel file)

Reference: Yang Y, Niroula A, Shen B, Vihinen M, PON-Sol: prediction of effects of amino acid substitutions on protein solubility, Bioinformatics, Volume 32, Issue 13, 1 July 2016, Pages 2032–2034, https://doi.org/10.1093/bioinformatics/btw066.  PUBMED  

Dataset used for PON-SOL2

DATASET 2

The dataset contains all the original PON-Sol cases of 443 single amino acid substitutions in 71 proteins. In addition, 10,758 variants in six additional protein collected based on an literature search.

    train     test1     test2

Reference: Yang Y, Zeng L, Vihinen M, PON-Sol2: Prediction of Effects of Variants on Protein Solubility, Int J Mol Sci;22(15):8027. doi: 10.3390/ijms22158027.  PUBMED  



Last updated: 2022-02-28 by Niloofar Shirvanizadeh.