To start using SolAcc, provide one or more protein sequences in FASTA format. There are two ways to submit them: type or paste the sequences into the box on the input form, or upload a FASTA file.
The header line of each sequence must be unique.
To see an input example, click the "Example" text on the input page.
>example0
GSAGEDVGAPPDHLWVHQEGIYRDEYQRTWVAVVEEETSFLRARVQQIQVPLGDAARPSHLLTSQLPLMWQLYPEERYMDNNSRLWQIQHHLMVRGVQELLLKLLPDD
>example1
ALTQERKREIIEQFKVHENDTGSPEVQIAILTEQINNLNEHLRVHKKDHHSRRGLLKMVGKRRRLLAYLRNKDVARYREIVEKLGLRR
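Before submitting, it can help to check locally that an input follows the rules above. The following is a minimal sketch, not part of SolAcc itself: the function name `parse_fasta` is hypothetical, and the checks (a `>` header before any sequence data, non-empty sequences, unique headers) are assumptions based on the description on this page rather than on the server's actual validation.

```python
from typing import Dict, Optional


def parse_fasta(text: str) -> Dict[str, str]:
    """Parse FASTA text into {header: sequence}.

    Hypothetical helper: raises ValueError on duplicate headers,
    empty headers, sequence data before a header, or empty sequences.
    """
    records: Dict[str, str] = {}
    header: Optional[str] = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            header = line[1:].strip()
            if not header:
                raise ValueError("empty FASTA header")
            if header in records:
                raise ValueError(f"duplicate FASTA header: {header}")
            records[header] = ""
        else:
            if header is None:
                raise ValueError("sequence data found before any '>' header")
            # A sequence may span several lines; join them.
            records[header] += line.upper()
    empty = [h for h, seq in records.items() if not seq]
    if empty:
        raise ValueError(f"no sequence given for: {', '.join(empty)}")
    return records
```

Calling `parse_fasta` on the two example records above would return a dictionary with the keys `example0` and `example1`; reusing the same header twice would raise a `ValueError`.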
Extensive data mining was performed to obtain cases for training and testing.
Blind Test data set was originally used to test SolAcc.
download
Dataset             Sequences  Residues
Training dataset    6000       1402211
Blind Test          500        118180
Validation dataset  500        121276
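As a quick sanity check on these counts, the average sequence length in each data set can be derived from the numbers listed here. The snippet below is only an illustrative sketch; the dictionary layout and the helper name `mean_length` are assumptions, not part of SolAcc.

```python
# (sequences, residues) per data set, copied from the counts above.
datasets = {
    "Training dataset": (6000, 1402211),
    "Blind Test": (500, 118180),
    "Validation dataset": (500, 121276),
}


def mean_length(sequences: int, residues: int) -> float:
    """Average number of residues per sequence (hypothetical helper)."""
    return residues / sequences


for name, (n_seq, n_res) in datasets.items():
    print(f"{name}: {mean_length(n_seq, n_res):.1f} residues per sequence")
```

All three sets average roughly 230 to 245 residues per sequence, so the test and validation sets are comparable in sequence length to the training set.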
The Blind Test data set is available for download.
If you have any problems, please contact Mengqi Chen (20215227105@stu.suda.edu.cn).