Skip to main content

Table 1 Dataset composition

From: An automatic method for assessing structural importance of amino acid positions

Superfamily

CATH code

Min ID

Median

Max

N

Amylase

3.20.20.80

6%

11%

74%

40

Cupredoxin

2.60.40.420

9%

20%

90%

35

Globins

1.10.490.10

4%

19%

89%

71

Jellyroll/Capsid

2.60.120.20

4%

11%

89%

53

Lysozyme C

1.10.530.10

5%

34%

87%

17

PLA 2

1.20.90.10

5%

40%

90%

44

  1. Superfamily name, CATH code, %ID ranges and dataset sizes are shown