ID 1P57A STANDARD; PRT; 114 AA.
DT CONVERTED FROM PDB (SEQRES) 1P57
DE Serine protease hepsin
OS Homo sapiens
CC EXPDTA X-RAY DIFFRACTION
CC RESOLU 1.750
CC R-Factor 0.186
FT #SUB 5 5 PRO A 201 205 ILE B Protein B 2
FT #SUB 6 6 LEU A 201 205 ILE B Protein A 3
FT #SUB 6 6 LEU A 203 207 ARG B Protein S 4
FT #SUB 7 7 TYR A 118 125 ALA B Protein S 3
FT #SUB 7 7 TYR A 119 126 ALA B Protein S 7
FT #SUB 7 7 TYR A 201 205 ILE B Protein A 4
FT #SUB 10 10 GLN A 201 205 ILE B Protein S 4
FT #SUB 10 10 GLN A 202 206 SER B Protein S 4
FT #SUB 10 10 GLN A 203 207 ARG B Protein S 4
FT #SUB 39 39 ARG A 243 243 LYS B Protein S 1
FT #SUB 46 46 GLU A 235 235 ARG B Protein B 4
FT #SUB 46 46 GLU A 239 239 PHE B Protein A 7
FT #SUB 46 46 GLU A 243 243 LYS B Protein S 4
FT #SUB 47 47 GLU A 118 125 ALA B Protein B 3
FT #SUB 47 47 GLU A 119 126 ALA B Protein B 4
FT #SUB 47 47 GLU A 235 235 ARG B Protein B 1
FT #SUB 48 48 MET A 118 125 ALA B Protein B 3
FT #SUB 48 48 MET A 208 208D ARG B Protein B 2
FT #SUB 49 49 GLY A 115 122 CYS B Protein B 3
FT #SUB 49 49 GLY A 116 123 LEU B Protein B 6
FT #SUB 49 49 GLY A 117 124 PRO B Protein B 1
FT #SUB 49 49 GLY A 208 208D ARG B Protein B 1
FT #SUB 50 50 PHE A 206 208B ARG B Protein S 5
FT #SUB 50 50 PHE A 239 239 PHE B Protein B 1
FT #SUB 51 51 LEU A 31 47 LEU B Protein S 1
FT #SUB 51 51 LEU A 32 48 SER B Protein S 1
FT #SUB 51 51 LEU A 239 239 PHE B Protein B 2
FT #SUB 51 51 LEU A 242 242 ILE B Protein S 2
FT #SUB 52 52 ARG A 32 48 SER B Protein S 2
FT #SUB 52 52 ARG A 34 50 ASP B Protein S 1
FT #SUB 84 84 GLN A 236 236 GLU B Protein S 1
FT #SUB 85 85 ARG A 119 126 ALA B Protein S 3
FT #SUB 106 106 GLN A 31 47 LEU B Protein S 3
FT #SUB 106 106 GLN A 32 48 SER B Protein S 4
FT #SUB 106 106 GLN A 113 120 PRO B Protein S 1
FT #SUB 107 107 ASP A 113 120 PRO B Protein B 1
FT #SUB 107 107 ASP A 206 208B ARG B Protein S 2
FT #SUB 108 108 CYS A 113 120 PRO B Protein A 3
FT #SUB 108 108 CYS A 114 121 VAL B Protein S 2
FT #SUB 108 108 CYS A 115 122 CYS B Protein A 10
FT #SUB 108 108 CYS A 206 208B ARG B Protein B 4
FT #SUB 109 109 GLY A 14 29 TRP B Protein B 1
FT #SUB 109 109 GLY A 113 120 PRO B Protein B 2
FT #SUB 109 109 GLY A 114 121 VAL B Protein B 1
FT #SUB 109 109 GLY A 115 122 CYS B Protein B 3
FT #SUB 109 109 GLY A 206 208B ARG B Protein B 3
FT #SUB 109 109 GLY A 207 208C TRP B Protein B 7
FT #SUB 110 110 ARG A 112 119 GLN B Protein B 1
FT #SUB 110 110 ARG A 204 208 THR B Protein S 1
FT #SUB 110 110 ARG A 205 208A PRO B Protein S 1
FT #SUB 110 110 ARG A 206 208B ARG B Protein S 2
FT #SUB 111 111 ARG A 10 25 GLY B Protein S 3
FT #SUB 111 111 ARG A 11 26 ARG B Protein S 13
FT #SUB 111 111 ARG A 12 27 TRP B Protein S 5
FT #SUB 112 112 LYS A 109 116 GLU B Protein A 2
FT #SUB 112 112 LYS A 112 119 GLN B Protein S 7
FT #SUB 113 113 LEU A 9 24 LEU B Protein S 1
FT #SUB 113 113 LEU A 10 25 GLY B Protein S 2
FT #SUB 113 113 LEU A 110 117 TYR B Protein S 1
FT DISORDER 1 4
CC SEQUENCE 110 AA (ATOM);
CC PLYPVQVSSA DARLMVFDKT EGTWRLLCSS RSNARVAGLS CEEMGFLRAL THSELDVRTA
CC GAAGTSGFFC VDEGRLPHTQ RLLEVISVCD CPRGRFLAAI CQDCGRRKLP
CC !----
CC ALIGNMENT OF SEQRES AND ATOMRES
CC SEQRES SDQEPLYPVQVSSADARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGF
CC ATOM   ----PLYPVQVSSADARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGF
CC            **********************************************
CC SEQRES LRALTHSELDVRTAGAAGTSGFFCVDEGRLPHTQRLLEVISVCDCPRGRF
CC ATOM   LRALTHSELDVRTAGAAGTSGFFCVDEGRLPHTQRLLEVISVCDCPRGRF
CC        **************************************************
CC SEQRES LAAICQDCGRRKLP
CC ATOM   LAAICQDCGRRKLP
CC        **************
SQ SEQUENCE 114 AA; MW; CN;
     SDQEPLYPVQ VSSADARLMV FDKTEGTWRL LCSSRSNARV AGLSCEEMGF LRALTHSELD
     VRTAGAAGTS GFFCVDEGRL PHTQRLLEVI SVCDCPRGRF LAAICQDCGR RKLP
//
ID 1P57B STANDARD; PRT; 255 AA.
DT CONVERTED FROM PDB (SEQRES) 1P57
DE Serine protease hepsin
OS Homo sapiens
CC EXPDTA X-RAY DIFFRACTION
CC RESOLU 1.750
CC R-Factor 0.186
FT #SUB 9 24 LEU B 113 113 LEU A Protein B 1
FT #SUB 10 25 GLY B 111 111 ARG A Protein B 3
FT #SUB 10 25 GLY B 113 113 LEU A Protein B 2
FT #SUB 11 26 ARG B 111 111 ARG A Protein A 13
FT #SUB 12 27 TRP B 111 111 ARG A Protein S 5
FT #SUB 14 29 TRP B 109 109 GLY A Protein S 1
FT #SUB 31 47 LEU B 51 51 LEU A Protein B 1
FT #SUB 31 47 LEU B 106 106 GLN A Protein B 3
FT #SUB 32 48 SER B 51 51 LEU A Protein S 1
FT #SUB 32 48 SER B 52 52 ARG A Protein S 2
FT #SUB 32 48 SER B 106 106 GLN A Protein A 4
FT #SUB 34 50 ASP B 52 52 ARG A Protein S 1
FT #SUB 109 116 GLU B 112 112 LYS A Protein A 2
FT #SUB 110 117 TYR B 113 113 LEU A Protein S 1
FT #SUB 112 119 GLN B 110 110 ARG A Protein S 1
FT #SUB 112 119 GLN B 112 112 LYS A Protein S 7
FT #SUB 113 120 PRO B 106 106 GLN A Protein S 1
FT #SUB 113 120 PRO B 107 107 ASP A Protein S 1
FT #SUB 113 120 PRO B 108 108 CYS A Protein B 3
FT #SUB 113 120 PRO B 109 109 GLY A Protein B 2
FT #SUB 114 121 VAL B 108 108 CYS A Protein B 2
FT #SUB 114 121 VAL B 109 109 GLY A Protein B 1
FT #SUB 115 122 CYS B 49 49 GLY A Protein A 3
FT #SUB 115 122 CYS B 108 108 CYS A Protein A 10
FT #SUB 115 122 CYS B 109 109 GLY A Protein A 3
FT #SUB 116 123 LEU B 49 49 GLY A Protein A 6
FT #SUB 117 124 PRO B 49 49 GLY A Protein B 1
FT #SUB 118 125 ALA B 7 7 TYR A Protein S 3
FT #SUB 118 125 ALA B 47 47 GLU A Protein A 3
FT #SUB 118 125 ALA B 48 48 MET A Protein A 3
FT #SUB 119 126 ALA B 7 7 TYR A Protein A 7
FT #SUB 119 126 ALA B 47 47 GLU A Protein A 4
FT #SUB 119 126 ALA B 85 85 ARG A Protein S 3
FT #SUB 201 205 ILE B 5 5 PRO A Protein S 2
FT #SUB 201 205 ILE B 6 6 LEU A Protein B 3
FT #SUB 201 205 ILE B 7 7 TYR A Protein A 4
FT #SUB 201 205 ILE B 10 10 GLN A Protein B 4
FT #SUB 202 206 SER B 10 10 GLN A Protein B 4
FT #SUB 203 207 ARG B 6 6 LEU A Protein S 4
FT #SUB 203 207 ARG B 10 10 GLN A Protein S 4
FT #SUB 204 208 THR B 110 110 ARG A Protein S 1
FT #SUB 205 208A PRO B 110 110 ARG A Protein B 1
FT #SUB 206 208B ARG B 50 50 PHE A Protein S 5
FT #SUB 206 208B ARG B 107 107 ASP A Protein S 2
FT #SUB 206 208B ARG B 108 108 CYS A Protein S 4
FT #SUB 206 208B ARG B 109 109 GLY A Protein A 3
FT #SUB 206 208B ARG B 110 110 ARG A Protein S 2
FT #SUB 207 208C TRP B 109 109 GLY A Protein A 7
FT #SUB 208 208D ARG B 48 48 MET A Protein S 2
FT #SUB 208 208D ARG B 49 49 GLY A Protein S 1
FT #SUB 235 235 ARG B 46 46 GLU A Protein S 4
FT #SUB 235 235 ARG B 47 47 GLU A Protein S 1
FT #SUB 236 236 GLU B 84 84 GLN A Protein S 1
FT #SUB 239 239 PHE B 46 46 GLU A Protein S 7
FT #SUB 239 239 PHE B 50 50 PHE A Protein S 1
FT #SUB 239 239 PHE B 51 51 LEU A Protein S 2
FT #SUB 242 242 ILE B 51 51 LEU A Protein S 2
FT #SUB 243 243 LYS B 39 39 ARG A Protein S 1
FT #SUB 243 243 LYS B 46 46 GLU A Protein S 4
FT #HET 41 57 HIS B 1 346 CR4 B S 8
FT #HET 185 189 ASP B 1 346 CR4 B S 14
FT #HET 186 190 ALA B 1 346 CR4 B A 17
FT #HET 187 191 CYS B 1 346 CR4 B B 12
FT #HET 188 192 GLN B 1 346 CR4 B A 11
FT #HET 190 194 ASP B 1 346 CR4 B B 1
FT #HET 191 195 SER B 1 346 CR4 B A 16
FT #HET 213 213 VAL B 1 346 CR4 B S 4
FT #HET 215 215 TRP B 1 346 CR4 B B 6
FT #HET 216 216 GLY B 1 346 CR4 B B 12
FT #HET 218 219 GLY B 1 346 CR4 B B 7
FT #HET 219 220 CYS B 1 346 CR4 B A 8
FT #HET 226 226 GLY B 1 346 CR4 B B 4
FT DISORDER 85 91
FT DISORDER 255 255
CC SEQUENCE 247 AA (ATOM);
CC IVGGRDTSLG RWPWQVSLRY DGAHLCGGSL LSGDWVLTAA HCFPERNRVL SRWRVFAGAV
CC AQASPHGLQL GVQAVVYHGG YLPFNSNDIA LVHLSSPLPL TEYIQPVCLP AAGQALVDGK
CC ICTVTGWGNT QYYGQQAGVL QEARVPIISN DVCNGADFYG NQIKPKMFCA GYPEGGIDAC
CC QGDSGGPFVC EDSISRTPRW RLCGIVSWGT GCALAQKPGV YTKVSDFREW IFQAIKTHSE
CC ASGMVTQ
CC !----
CC ALIGNMENT OF SEQRES AND ATOMRES
CC SEQRES IVGGRDTSLGRWPWQVSLRYDGAHLCGGSLLSGDWVLTAAHCFPERNRVL
CC ATOM   IVGGRDTSLGRWPWQVSLRYDGAHLCGGSLLSGDWVLTAAHCFPERNRVL
CC        **************************************************
CC SEQRES SRWRVFAGAVAQASPHGLQLGVQAVVYHGGYLPFRDPNSEENSNDIALVH
CC ATOM   SRWRVFAGAVAQASPHGLQLGVQAVVYHGGYLPF-------NSNDIALVH
CC        **********************************       *********
CC SEQRES LSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQYYGQQAGVLQEA
CC ATOM   LSSPLPLTEYIQPVCLPAAGQALVDGKICTVTGWGNTQYYGQQAGVLQEA
CC        **************************************************
CC SEQRES RVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCEDS
CC ATOM   RVPIISNDVCNGADFYGNQIKPKMFCAGYPEGGIDACQGDSGGPFVCEDS
CC        **************************************************
CC SEQRES ISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASG
CC ATOM   ISRTPRWRLCGIVSWGTGCALAQKPGVYTKVSDFREWIFQAIKTHSEASG
CC        **************************************************
CC SEQRES MVTQL
CC ATOM   MVTQ-
CC        ****
SQ SEQUENCE 255 AA; MW; CN;
     IVGGRDTSLG RWPWQVSLRY DGAHLCGGSL LSGDWVLTAA HCFPERNRVL SRWRVFAGAV
     AQASPHGLQL GVQAVVYHGG YLPFRDPNSE ENSNDIALVH LSSPLPLTEY IQPVCLPAAG
     QALVDGKICT VTGWGNTQYY GQQAGVLQEA RVPIISNDVC NGADFYGNQI KPKMFCAGYP
     EGGIDACQGD SGGPFVCEDS ISRTPRWRLC GIVSWGTGCA LAQKPGVYTK VSDFREWIFQ
     AIKTHSEASG MVTQL
//