ID 3HHIA STANDARD; PRT; 325 AA. DT CONVERTED FROM PDB (SEQRES) 3HHI DE Cathepsin B-like cysteine protease OS Trypanosoma brucei CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.600 CC R-Factor 0.148 FT #SUB 64 86 THR A 66 88 GLU B Protein S 4 FT #SUB 66 88 GLU A 64 86 THR B Protein S 4 FT #SUB 66 88 GLU A 67 89 GLU B Protein S 4 FT #SUB 66 88 GLU A 227 249 PHE B Protein B 2 FT #SUB 67 89 GLU A 66 88 GLU B Protein S 4 FT #SUB 69 91 ARG A 115 137 THR B Protein S 4 FT #SUB 69 91 ARG A 116 138 MET B Protein S 3 FT #SUB 69 91 ARG A 227 249 PHE B Protein S 5 FT #SUB 70 92 ALA A 226 248 PHE B Protein S 1 FT #SUB 71 93 PRO A 81 103 ALA B Protein S 2 FT #SUB 71 93 PRO A 82 104 TRP B Protein S 1 FT #SUB 71 93 PRO A 226 248 PHE B Protein A 4 FT #SUB 81 103 ALA A 71 93 PRO B Protein A 2 FT #SUB 82 104 TRP A 69 91 ARG B Protein S 1 FT #SUB 82 104 TRP A 71 93 PRO B Protein S 1 FT #SUB 115 137 THR A 69 91 ARG B Protein B 5 FT #SUB 116 138 MET A 69 91 ARG B Protein A 4 FT #SUB 226 248 PHE A 70 92 ALA B Protein S 1 FT #SUB 226 248 PHE A 71 93 PRO B Protein A 4 FT #SUB 226 248 PHE A 226 248 PHE B Protein S 4 FT #SUB 227 249 PHE A 66 88 GLU B Protein S 2 FT #SUB 227 249 PHE A 69 91 ARG B Protein B 6 FT #SUB 227 249 PHE A 70 92 ALA B Protein S 1 FT #SUB 227 249 PHE A 227 249 PHE B Protein S 1 FT #HET 66 88 GLU A 5 349 MG A A 5 FT #HET 69 91 ARG A 5 349 MG A S 1 FT #HET 71 93 PRO A 10 4 GOL B B 3 FT #HET 72 94 LEU A 10 4 GOL B B 1 FT #HET 73 95 PRO A 10 4 GOL B A 7 FT #HET 76 98 PHE A 9 3 GOL B A 7 FT #HET 77 99 ASP A 9 3 GOL B B 5 FT #HET 80 102 GLU A 9 3 GOL B S 7 FT #HET 81 103 ALA A 9 3 GOL B A 3 FT #HET 92 114 ALA A 3 2 GOL A A 3 FT #HET 93 115 ASP A 3 2 GOL A B 6 FT #HET 94 116 GLN A 1 348 074 A S 9 FT #HET 95 117 SER A 1 348 074 A B 1 FT #HET 97 119 CYS A 1 348 074 A B 1 FT #HET 98 120 GLY A 1 348 074 A B 4 FT #HET 99 121 SER A 1 348 074 A B 1 FT #HET 100 122 CYS A 1 348 074 A A 15 FT #HET 101 123 TRP A 1 348 074 A A 4 FT #HET 103 125 VAL A 3 2 GOL A S 2 FT #HET 126 148 ALA A 3 2 GOL A S 1 FT #HET 128 150 ASP A 2 1 TRS A S 10 FT #HET 132 154 CYS A 2 1 TRS A S 4 FT #HET 142 164 GLY A 1 348 074 A B 3 FT #HET 143 165 GLY A 1 348 074 A B 7 FT #HET 145 167 PRO A 1 348 074 A S 1 FT #HET 151 173 TYR A 2 1 TRS A S 4 FT #HET 155 177 THR A 2 1 TRS A B 1 FT #HET 156 178 GLY A 2 1 TRS A B 5 FT #HET 157 179 LEU A 2 1 TRS A B 2 FT #HET 158 180 VAL A 2 1 TRS A A 4 FT #HET 165 187 TYR A 3 2 GOL A S 5 FT #HET 172 194 HIS A 1 348 074 A S 7 FT #HET 173 195 HIS A 1 348 074 A S 6 FT #HET 204 226 VAL A 2 1 TRS A S 1 FT #HET 206 228 ASN A 2 1 TRS A S 6 FT #HET 226 248 PHE A 9 3 GOL B S 1 FT #HET 234 256 ALA A 1 348 074 A S 1 FT #HET 234 256 ALA A 4 5 GOL A A 5 FT #HET 235 257 PHE A 4 5 GOL A B 5 FT #HET 236 258 ASP A 4 5 GOL A S 2 FT #HET 259 281 GLY A 1 348 074 A B 6 FT #HET 260 282 HIS A 1 348 074 A S 5 FT #HET 282 304 TRP A 1 348 074 A S 10 FT #HET 302 324 GLY A 4 5 GOL A B 5 FT #HET 305 327 ASP A 4 5 GOL A S 1 FT DISORDER 1 55 FT DISORDER 314 325 CC SEQUENCE 258 AA (ATOM); CC SILPKRRFTE EEARAPLPSS FDSAEAWPNC PTIPQIADQS ACGSCWAVAA ASAMSDRFCT CC MGGVQDVHIS AGDLLACCSD CGDGCNGGDP DRAWAYFSST GLVSDYCQPY PFPHCSHHSK CC SKNGYPPCSQ FNFDTPKCDY TCDDPTIPVV NYRSWTSYAL QGEDDYMREL FFRGPFEVAF CC DVYEDFIAYN SGVYHHVSGQ YLGGHAVRLV GWGTSNGVPY WKIANSWNTE WGMDGYFLIR CC RGSSECGIED GGSAGIPL CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES ALVAEDAPVLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIK CC ATOM -------------------------------------------------- CC CC SEQRES KNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSC CC ATOM -----SILPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSC CC ********************************************* CC SEQRES WAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWA CC ATOM WAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWA CC ************************************************** CC SEQRES YFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDP CC ATOM YFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDP CC ************************************************** CC SEQRES TIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYH CC ATOM TIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYH CC ************************************************** CC SEQRES HVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSE CC ATOM HVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSE CC ************************************************** CC SEQRES CGIEDGGSAGIPLAPNTAHHHHHHH CC ATOM CGIEDGGSAGIPL------------ CC ************* SQ SEQUENCE 325 AA; MW; CN; ALVAEDAPVL SKAFVDRVNR LNRGIWKAKY DGVMQNITLR EAKRLNGVIK KNNNASILPK RRFTEEEARA PLPSSFDSAE AWPNCPTIPQ IADQSACGSC WAVAAASAMS DRFCTMGGVQ DVHISAGDLL ACCSDCGDGC NGGDPDRAWA YFSSTGLVSD YCQPYPFPHC SHHSKSKNGY PPCSQFNFDT PKCDYTCDDP TIPVVNYRSW TSYALQGEDD YMRELFFRGP FEVAFDVYED FIAYNSGVYH HVSGQYLGGH AVRLVGWGTS NGVPYWKIAN SWNTEWGMDG YFLIRRGSSE CGIEDGGSAG IPLAPNTAHH HHHHH // ID 3HHIB STANDARD; PRT; 325 AA. DT CONVERTED FROM PDB (SEQRES) 3HHI DE Cathepsin B-like cysteine protease OS Trypanosoma brucei CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.600 CC R-Factor 0.148 FT #SUB 64 86 THR B 66 88 GLU A Protein S 4 FT #SUB 66 88 GLU B 64 86 THR A Protein S 4 FT #SUB 66 88 GLU B 67 89 GLU A Protein S 4 FT #SUB 66 88 GLU B 227 249 PHE A Protein B 2 FT #SUB 67 89 GLU B 66 88 GLU A Protein S 4 FT #SUB 69 91 ARG B 82 104 TRP A Protein S 1 FT #SUB 69 91 ARG B 115 137 THR A Protein S 5 FT #SUB 69 91 ARG B 116 138 MET A Protein S 4 FT #SUB 69 91 ARG B 227 249 PHE A Protein S 6 FT #SUB 70 92 ALA B 226 248 PHE A Protein S 1 FT #SUB 70 92 ALA B 227 249 PHE A Protein S 1 FT #SUB 71 93 PRO B 81 103 ALA A Protein S 2 FT #SUB 71 93 PRO B 82 104 TRP A Protein S 1 FT #SUB 71 93 PRO B 226 248 PHE A Protein A 4 FT #SUB 81 103 ALA B 71 93 PRO A Protein A 2 FT #SUB 82 104 TRP B 71 93 PRO A Protein S 1 FT #SUB 115 137 THR B 69 91 ARG A Protein B 4 FT #SUB 116 138 MET B 69 91 ARG A Protein S 3 FT #SUB 226 248 PHE B 70 92 ALA A Protein S 1 FT #SUB 226 248 PHE B 71 93 PRO A Protein A 4 FT #SUB 226 248 PHE B 226 248 PHE A Protein S 4 FT #SUB 227 249 PHE B 66 88 GLU A Protein S 2 FT #SUB 227 249 PHE B 69 91 ARG A Protein B 5 FT #SUB 227 249 PHE B 227 249 PHE A Protein S 1 FT #HET 71 93 PRO B 9 3 GOL B B 3 FT #HET 72 94 LEU B 9 3 GOL B B 1 FT #HET 73 95 PRO B 9 3 GOL B A 8 FT #HET 76 98 PHE B 10 4 GOL B A 8 FT #HET 77 99 ASP B 10 4 GOL B B 5 FT #HET 80 102 GLU B 10 4 GOL B S 4 FT #HET 81 103 ALA B 10 4 GOL B A 3 FT #HET 92 114 ALA B 7 1 GOL B A 3 FT #HET 93 115 ASP B 7 1 GOL B B 7 FT #HET 94 116 GLN B 6 348 074 B S 9 FT #HET 95 117 SER B 6 348 074 B B 1 FT #HET 97 119 CYS B 6 348 074 B B 1 FT #HET 98 120 GLY B 6 348 074 B B 4 FT #HET 99 121 SER B 6 348 074 B B 1 FT #HET 100 122 CYS B 6 348 074 B A 15 FT #HET 101 123 TRP B 6 348 074 B A 4 FT #HET 103 125 VAL B 7 1 GOL B S 1 FT #HET 126 148 ALA B 7 1 GOL B S 1 FT #HET 142 164 GLY B 6 348 074 B B 3 FT #HET 143 165 GLY B 6 348 074 B B 7 FT #HET 145 167 PRO B 6 348 074 B S 2 FT #HET 164 186 PRO B 11 349 LI B S 1 FT #HET 165 187 TYR B 7 1 GOL B S 5 FT #HET 172 194 HIS B 6 348 074 B S 7 FT #HET 173 195 HIS B 6 348 074 B S 6 FT #HET 198 220 ASP B 11 349 LI B S 3 FT #HET 226 248 PHE B 10 4 GOL B S 1 FT #HET 228 250 ARG B 5 349 MG A S 2 FT #HET 234 256 ALA B 6 348 074 B S 1 FT #HET 234 256 ALA B 8 350 GOL B A 5 FT #HET 235 257 PHE B 8 350 GOL B B 4 FT #HET 236 258 ASP B 8 350 GOL B S 6 FT #HET 259 281 GLY B 6 348 074 B B 7 FT #HET 260 282 HIS B 6 348 074 B S 6 FT #HET 282 304 TRP B 6 348 074 B S 13 FT #HET 302 324 GLY B 8 350 GOL B B 5 FT #HET 305 327 ASP B 8 350 GOL B S 4 FT DISORDER 1 55 FT DISORDER 316 325 CC SEQUENCE 260 AA (ATOM); CC SILPKRRFTE EEARAPLPSS FDSAEAWPNC PTIPQIADQS ACGSCWAVAA ASAMSDRFCT CC MGGVQDVHIS AGDLLACCSD CGDGCNGGDP DRAWAYFSST GLVSDYCQPY PFPHCSHHSK CC SKNGYPPCSQ FNFDTPKCDY TCDDPTIPVV NYRSWTSYAL QGEDDYMREL FFRGPFEVAF CC DVYEDFIAYN SGVYHHVSGQ YLGGHAVRLV GWGTSNGVPY WKIANSWNTE WGMDGYFLIR CC RGSSECGIED GGSAGIPLAP CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES ALVAEDAPVLSKAFVDRVNRLNRGIWKAKYDGVMQNITLREAKRLNGVIK CC ATOM -------------------------------------------------- CC CC SEQRES KNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSC CC ATOM -----SILPKRRFTEEEARAPLPSSFDSAEAWPNCPTIPQIADQSACGSC CC ********************************************* CC SEQRES WAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWA CC ATOM WAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWA CC ************************************************** CC SEQRES YFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDP CC ATOM YFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDP CC ************************************************** CC SEQRES TIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYH CC ATOM TIPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDVYEDFIAYNSGVYH CC ************************************************** CC SEQRES HVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSE CC ATOM HVSGQYLGGHAVRLVGWGTSNGVPYWKIANSWNTEWGMDGYFLIRRGSSE CC ************************************************** CC SEQRES CGIEDGGSAGIPLAPNTAHHHHHHH CC ATOM CGIEDGGSAGIPLAP---------- CC *************** SQ SEQUENCE 325 AA; MW; CN; ALVAEDAPVL SKAFVDRVNR LNRGIWKAKY DGVMQNITLR EAKRLNGVIK KNNNASILPK RRFTEEEARA PLPSSFDSAE AWPNCPTIPQ IADQSACGSC WAVAAASAMS DRFCTMGGVQ DVHISAGDLL ACCSDCGDGC NGGDPDRAWA YFSSTGLVSD YCQPYPFPHC SHHSKSKNGY PPCSQFNFDT PKCDYTCDDP TIPVVNYRSW TSYALQGEDD YMRELFFRGP FEVAFDVYED FIAYNSGVYH HVSGQYLGGH AVRLVGWGTS NGVPYWKIAN SWNTEWGMDG YFLIRRGSSE CGIEDGGSAG IPLAPNTAHH HHHHH //