Skip to main content

Advertisement

Table 2 Frequency of residues in Ala-X-Ala conformations, their extended state ASA (ESA) values and highest observed ASA (HOA) obtained from the entire data set of proteins (8.9 million residues, overall including residues with different sequence neighbors).

From: Context dependent reference states of solvent accessibility derived from native protein structures and assessed by predictability analysis

Res Freq ESA HOA CC Res Freq ESA HOA CC
C 998 140.4 120 0.9200 L 7428 183.1 173 0.9885
W 889 240.5 217 0.9633 S 3694 117.2 142 0.9885
M 1554 200.1 181 0.9740 T 3049 138.7 149 0.9891
Y 1857 213.7 223 0.9813 Q 2810 178.6 204 0.9893
F 2453 200.7 202 0.9838 N 2910 146.4 176 0.9895
I 4114 185.0 167 0.9843 R 3297 229.0 280 0.9895
H 1849 181.9 200 0.9845 P 2528 141.9 138 0.9901
G 5328 78.7 96 0.9857 K 4192 205.7 219 0.9916
A 8032 110.2 120 0.9873 E 4962 174.7 201 0.9917
V 5704 153.7 148 0.9875 D 4567 144.1 165 0.9934
  1. Correlation coefficient (CC) between relative ASA obtained after normalizing by the Ala-X-Ala extended state (ESA) and tripeptide highest observed ASA (context-dependent ASA) is also provided in the last column. CC values are based on all residues with different sequence neighbor. Residues are arranged in the order of CC values.
  2. Abbreviations: Res: residue ID, Freq: number of residues in Ala-X-Ala conformation, ESA: extended state ASA of Ala-X-Ala conformation, HOA: highest observed ASA in Y-X-Z conformation (any neighbor), CC: correlation coefficient between ESA-normalized and HOA-normalized values in the benchmark data set.