experimentally verified DNA-binding proteins. Redundancy was checked both within the dataset itself and against the training set, using the same parameters as in the dataset construction described above, and redundant sequences were removed. Thus, no sequence in the validation set was redundant with any other sequence in the validation set or with any sequence in the training set. This resulted in 557 protein chains