ID 4ZFMA STANDARD; PRT; 485 AA. DT CONVERTED FROM PDB (SEQRES) 4ZFM DE Putative 6-phospho-beta-galactobiosidase OS Geobacillus stearothermophilus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.400 CC R-Factor 0.132 FT #SUB 163 156 GLN A 356 349 ASN B Protein S 7 FT #SUB 163 156 GLN A 357 350 TRP B Protein A 10 FT #SUB 164 157 ARG A 357 350 TRP B Protein S 19 FT #SUB 221 214 HIS A 391 384 PHE B Protein A 15 FT #SUB 221 214 HIS A 445 438 GLN B Protein S 1 FT #SUB 222 215 TYR A 356 349 ASN B Protein S 4 FT #SUB 224 217 PRO A 56 49 PHE B Protein S 1 FT #SUB 98 91 ASP A 200 193 ARG C Protein S 1 FT #SUB 140 133 ASP A 54 47 ARG C Protein B 4 FT #SUB 140 133 ASP A 194 187 GLY C Protein A 6 FT #SUB 140 133 ASP A 196 189 LYS C Protein S 5 FT #SUB 141 134 ALA A 51 44 GLN C Protein B 3 FT #SUB 141 134 ALA A 54 47 ARG C Protein B 2 FT #SUB 141 134 ALA A 193 186 PRO C Protein A 2 FT #SUB 141 134 ALA A 194 187 GLY C Protein B 2 FT #SUB 142 135 TYR A 51 44 GLN C Protein A 5 FT #SUB 146 139 GLU A 52 45 PRO C Protein S 2 FT #SUB 200 193 ARG A 52 45 PRO C Protein S 3 FT #HET 30 23 GLN A 3 501 BG6 A S 7 FT #HET 56 49 PHE A 7 505 GOL A S 1 FT #HET 66 59 ASP A 9 507 IMD A S 5 FT #HET 131 124 HIS A 3 501 BG6 A S 2 FT #HET 177 170 GLN A 3 501 BG6 A S 6 FT #HET 180 173 ILE A 3 501 BG6 A S 1 FT #HET 221 214 HIS A 11 502 GOL B B 1 FT #HET 252 245 ASN A 8 506 GOL A S 7 FT #HET 267 260 TRP A 5 503 GOL A S 14 FT #HET 267 260 TRP A 6 504 GOL A S 5 FT #HET 269 262 MET A 5 503 GOL A S 1 FT #HET 270 263 TYR A 4 502 GOL A B 3 FT #HET 271 264 PRO A 4 502 GOL A B 2 FT #HET 272 265 GLN A 4 502 GOL A A 10 FT #HET 273 266 ALA A 8 506 GOL A A 3 FT #HET 275 268 TRP A 4 502 GOL A S 12 FT #HET 292 285 TRP A 4 502 GOL A S 1 FT #HET 343 336 LEU A 8 506 GOL A S 4 FT #HET 357 350 TRP A 7 505 GOL A S 2 FT #HET 359 352 TRP A 3 501 BG6 A S 2 FT #HET 376 369 ARG A 6 504 GOL A B 2 FT #HET 377 370 TYR A 6 504 GOL A B 3 FT #HET 378 371 GLN A 6 504 GOL A S 5 FT #HET 385 378 GLU A 3 501 BG6 A S 7 FT #HET 391 384 PHE A 7 505 GOL A S 1 FT #HET 432 425 TRP A 3 501 BG6 A S 8 FT #HET 439 432 SER A 3 501 BG6 A S 1 FT #HET 440 433 TRP A 3 501 BG6 A S 4 FT #HET 442 435 ASN A 3 501 BG6 A S 1 FT #HET 442 435 ASN A 7 505 GOL A A 3 FT #HET 445 438 GLN A 7 505 GOL A S 3 FT #HET 446 439 LYS A 3 501 BG6 A S 2 FT #HET 446 439 LYS A 7 505 GOL A S 1 FT #HET 448 441 TYR A 3 501 BG6 A S 7 FT #HET 460 453 GLU A 9 507 IMD A S 9 FT #HET 463 456 LEU A 9 507 IMD A S 1 FT DISORDER 1 11 FT DISORDER 323 335 CC SEQUENCE 461 AA (ATOM); CC HLKPFPPEFL WGAASAAYQV EGAWNEDGKG LSVWDVFAKQ PGRTFKGTNG DVAVDHYHRY CC QEDVALMAEM GLKAYRFSVS WSRVFPDGNG AVNEKGLDFY DRLIEELRNH GIEPIVTLYH CC WDVPQALMDA YGAWESRRII DDFDRYAVTL FQRFGDRVKY WVTLNQQNIF ISFGYRLGLH CC PPGVKDMKRM YEANHIANLA NAKVIQSFRH YVPDGKIGPS FAYSPMYPYD SRPENVLAFE CC NAEEFQNHWW MDVYAWGMYP QAAWNYLESQ GLEPTVAPGD WELLQAAKPD FMGVNYYQTT CC TVEHNPPDGV GTSSGIPGLF KTVRNPHVDT TNWDWAIDPV GLRIGLRRIA NRYQLPILIT CC ENGLGEFDTL EPGDIVNDDY RIDYLRRHVQ EIQRAITDGV DVLGYCAWSF TDLLSWLNGY CC QKRYGFVYVN RDDESEKDLR RIKKKSFYWY QRVIETNGAE L CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MIHHHHHHEHRHLKPFPPEFLWGAASAAYQVEGAWNEDGKGLSVWDVFAK CC ATOM -----------HLKPFPPEFLWGAASAAYQVEGAWNEDGKGLSVWDVFAK CC *************************************** CC SEQRES QPGRTFKGTNGDVAVDHYHRYQEDVALMAEMGLKAYRFSVSWSRVFPDGN CC ATOM QPGRTFKGTNGDVAVDHYHRYQEDVALMAEMGLKAYRFSVSWSRVFPDGN CC ************************************************** CC SEQRES GAVNEKGLDFYDRLIEELRNHGIEPIVTLYHWDVPQALMDAYGAWESRRI CC ATOM GAVNEKGLDFYDRLIEELRNHGIEPIVTLYHWDVPQALMDAYGAWESRRI CC ************************************************** CC SEQRES IDDFDRYAVTLFQRFGDRVKYWVTLNQQNIFISFGYRLGLHPPGVKDMKR CC ATOM IDDFDRYAVTLFQRFGDRVKYWVTLNQQNIFISFGYRLGLHPPGVKDMKR CC ************************************************** CC SEQRES MYEANHIANLANAKVIQSFRHYVPDGKIGPSFAYSPMYPYDSRPENVLAF CC ATOM MYEANHIANLANAKVIQSFRHYVPDGKIGPSFAYSPMYPYDSRPENVLAF CC ************************************************** CC SEQRES ENAEEFQNHWWMDVYAWGMYPQAAWNYLESQGLEPTVAPGDWELLQAAKP CC ATOM ENAEEFQNHWWMDVYAWGMYPQAAWNYLESQGLEPTVAPGDWELLQAAKP CC ************************************************** CC SEQRES DFMGVNYYQTTTVEHNPPDGVGEGVMNTTGKKGTSTSSGIPGLFKTVRNP CC ATOM DFMGVNYYQTTTVEHNPPDGVG-------------TSSGIPGLFKTVRNP CC ********************** *************** CC SEQRES HVDTTNWDWAIDPVGLRIGLRRIANRYQLPILITENGLGEFDTLEPGDIV CC ATOM HVDTTNWDWAIDPVGLRIGLRRIANRYQLPILITENGLGEFDTLEPGDIV CC ************************************************** CC SEQRES NDDYRIDYLRRHVQEIQRAITDGVDVLGYCAWSFTDLLSWLNGYQKRYGF CC ATOM NDDYRIDYLRRHVQEIQRAITDGVDVLGYCAWSFTDLLSWLNGYQKRYGF CC ************************************************** CC SEQRES VYVNRDDESEKDLRRIKKKSFYWYQRVIETNGAEL CC ATOM VYVNRDDESEKDLRRIKKKSFYWYQRVIETNGAEL CC *********************************** SQ SEQUENCE 485 AA; MW; CN; MIHHHHHHEH RHLKPFPPEF LWGAASAAYQ VEGAWNEDGK GLSVWDVFAK QPGRTFKGTN GDVAVDHYHR YQEDVALMAE MGLKAYRFSV SWSRVFPDGN GAVNEKGLDF YDRLIEELRN HGIEPIVTLY HWDVPQALMD AYGAWESRRI IDDFDRYAVT LFQRFGDRVK YWVTLNQQNI FISFGYRLGL HPPGVKDMKR MYEANHIANL ANAKVIQSFR HYVPDGKIGP SFAYSPMYPY DSRPENVLAF ENAEEFQNHW WMDVYAWGMY PQAAWNYLES QGLEPTVAPG DWELLQAAKP DFMGVNYYQT TTVEHNPPDG VGEGVMNTTG KKGTSTSSGI PGLFKTVRNP HVDTTNWDWA IDPVGLRIGL RRIANRYQLP ILITENGLGE FDTLEPGDIV NDDYRIDYLR RHVQEIQRAI TDGVDVLGYC AWSFTDLLSW LNGYQKRYGF VYVNRDDESE KDLRRIKKKS FYWYQRVIET NGAEL // ID 4ZFMB STANDARD; PRT; 485 AA. DT CONVERTED FROM PDB (SEQRES) 4ZFM DE Putative 6-phospho-beta-galactobiosidase OS Geobacillus stearothermophilus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.400 CC R-Factor 0.132 FT #SUB 56 49 PHE B 224 217 PRO A Protein S 1 FT #SUB 356 349 ASN B 163 156 GLN A Protein A 7 FT #SUB 356 349 ASN B 222 215 TYR A Protein A 4 FT #SUB 357 350 TRP B 163 156 GLN A Protein A 10 FT #SUB 357 350 TRP B 164 157 ARG A Protein S 19 FT #SUB 391 384 PHE B 221 214 HIS A Protein S 15 FT #SUB 445 438 GLN B 221 214 HIS A Protein S 1 FT #SUB 476 469 ARG B 12 5 HIS D Protein S 1 FT #SUB 476 469 ARG B 13 6 LEU D Protein S 4 FT #SUB 476 469 ARG B 14 7 LYS D Protein S 1 FT #SUB 476 469 ARG B 15 8 PRO D Protein S 6 FT #SUB 479 472 GLU B 15 8 PRO D Protein S 6 FT #SUB 480 473 THR B 15 8 PRO D Protein S 1 FT #SUB 484 477 GLU B 483 476 ALA D Protein S 3 FT #HET 30 23 GLN B 10 501 BG6 B S 8 FT #HET 56 49 PHE B 11 502 GOL B S 2 FT #HET 131 124 HIS B 10 501 BG6 B S 2 FT #HET 177 170 GLN B 10 501 BG6 B S 5 FT #HET 180 173 ILE B 10 501 BG6 B S 1 FT #HET 270 263 TYR B 12 503 GOL B A 4 FT #HET 271 264 PRO B 12 503 GOL B B 2 FT #HET 272 265 GLN B 12 503 GOL B A 10 FT #HET 275 268 TRP B 12 503 GOL B S 12 FT #HET 292 285 TRP B 12 503 GOL B S 2 FT #HET 359 352 TRP B 10 501 BG6 B S 2 FT #HET 385 378 GLU B 10 501 BG6 B S 6 FT #HET 391 384 PHE B 11 502 GOL B S 2 FT #HET 432 425 TRP B 10 501 BG6 B S 8 FT #HET 439 432 SER B 10 501 BG6 B S 2 FT #HET 440 433 TRP B 10 501 BG6 B S 6 FT #HET 442 435 ASN B 10 501 BG6 B S 1 FT #HET 442 435 ASN B 11 502 GOL B A 3 FT #HET 445 438 GLN B 11 502 GOL B S 2 FT #HET 446 439 LYS B 10 501 BG6 B S 1 FT #HET 446 439 LYS B 11 502 GOL B S 1 FT #HET 448 441 TYR B 10 501 BG6 B S 6 FT DISORDER 1 11 FT DISORDER 323 335 CC SEQUENCE 461 AA (ATOM); CC HLKPFPPEFL WGAASAAYQV EGAWNEDGKG LSVWDVFAKQ PGRTFKGTNG DVAVDHYHRY CC QEDVALMAEM GLKAYRFSVS WSRVFPDGNG AVNEKGLDFY DRLIEELRNH GIEPIVTLYH CC WDVPQALMDA YGAWESRRII DDFDRYAVTL FQRFGDRVKY WVTLNQQNIF ISFGYRLGLH CC PPGVKDMKRM YEANHIANLA NAKVIQSFRH YVPDGKIGPS FAYSPMYPYD SRPENVLAFE CC NAEEFQNHWW MDVYAWGMYP QAAWNYLESQ GLEPTVAPGD WELLQAAKPD FMGVNYYQTT CC TVEHNPPDGV GTSSGIPGLF KTVRNPHVDT TNWDWAIDPV GLRIGLRRIA NRYQLPILIT CC ENGLGEFDTL EPGDIVNDDY RIDYLRRHVQ EIQRAITDGV DVLGYCAWSF TDLLSWLNGY CC QKRYGFVYVN RDDESEKDLR RIKKKSFYWY QRVIETNGAE L CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MIHHHHHHEHRHLKPFPPEFLWGAASAAYQVEGAWNEDGKGLSVWDVFAK CC ATOM -----------HLKPFPPEFLWGAASAAYQVEGAWNEDGKGLSVWDVFAK CC *************************************** CC SEQRES QPGRTFKGTNGDVAVDHYHRYQEDVALMAEMGLKAYRFSVSWSRVFPDGN CC ATOM QPGRTFKGTNGDVAVDHYHRYQEDVALMAEMGLKAYRFSVSWSRVFPDGN CC ************************************************** CC SEQRES GAVNEKGLDFYDRLIEELRNHGIEPIVTLYHWDVPQALMDAYGAWESRRI CC ATOM GAVNEKGLDFYDRLIEELRNHGIEPIVTLYHWDVPQALMDAYGAWESRRI CC ************************************************** CC SEQRES IDDFDRYAVTLFQRFGDRVKYWVTLNQQNIFISFGYRLGLHPPGVKDMKR CC ATOM IDDFDRYAVTLFQRFGDRVKYWVTLNQQNIFISFGYRLGLHPPGVKDMKR CC ************************************************** CC SEQRES MYEANHIANLANAKVIQSFRHYVPDGKIGPSFAYSPMYPYDSRPENVLAF CC ATOM MYEANHIANLANAKVIQSFRHYVPDGKIGPSFAYSPMYPYDSRPENVLAF CC ************************************************** CC SEQRES ENAEEFQNHWWMDVYAWGMYPQAAWNYLESQGLEPTVAPGDWELLQAAKP CC ATOM ENAEEFQNHWWMDVYAWGMYPQAAWNYLESQGLEPTVAPGDWELLQAAKP CC ************************************************** CC SEQRES DFMGVNYYQTTTVEHNPPDGVGEGVMNTTGKKGTSTSSGIPGLFKTVRNP CC ATOM DFMGVNYYQTTTVEHNPPDGVG-------------TSSGIPGLFKTVRNP CC ********************** *************** CC SEQRES HVDTTNWDWAIDPVGLRIGLRRIANRYQLPILITENGLGEFDTLEPGDIV CC ATOM HVDTTNWDWAIDPVGLRIGLRRIANRYQLPILITENGLGEFDTLEPGDIV CC ************************************************** CC SEQRES NDDYRIDYLRRHVQEIQRAITDGVDVLGYCAWSFTDLLSWLNGYQKRYGF CC ATOM NDDYRIDYLRRHVQEIQRAITDGVDVLGYCAWSFTDLLSWLNGYQKRYGF CC ************************************************** CC SEQRES VYVNRDDESEKDLRRIKKKSFYWYQRVIETNGAEL CC ATOM VYVNRDDESEKDLRRIKKKSFYWYQRVIETNGAEL CC *********************************** SQ SEQUENCE 485 AA; MW; CN; MIHHHHHHEH RHLKPFPPEF LWGAASAAYQ VEGAWNEDGK GLSVWDVFAK QPGRTFKGTN GDVAVDHYHR YQEDVALMAE MGLKAYRFSV SWSRVFPDGN GAVNEKGLDF YDRLIEELRN HGIEPIVTLY HWDVPQALMD AYGAWESRRI IDDFDRYAVT LFQRFGDRVK YWVTLNQQNI FISFGYRLGL HPPGVKDMKR MYEANHIANL ANAKVIQSFR HYVPDGKIGP SFAYSPMYPY DSRPENVLAF ENAEEFQNHW WMDVYAWGMY PQAAWNYLES QGLEPTVAPG DWELLQAAKP DFMGVNYYQT TTVEHNPPDG VGEGVMNTTG KKGTSTSSGI PGLFKTVRNP HVDTTNWDWA IDPVGLRIGL RRIANRYQLP ILITENGLGE FDTLEPGDIV NDDYRIDYLR RHVQEIQRAI TDGVDVLGYC AWSFTDLLSW LNGYQKRYGF VYVNRDDESE KDLRRIKKKS FYWYQRVIET NGAEL // ID 4ZFMC STANDARD; PRT; 485 AA. DT CONVERTED FROM PDB (SEQRES) 4ZFM DE Putative 6-phospho-beta-galactobiosidase OS Geobacillus stearothermophilus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.400 CC R-Factor 0.132 FT #SUB 51 44 GLN C 141 134 ALA A Protein S 3 FT #SUB 51 44 GLN C 142 135 TYR A Protein S 5 FT #SUB 52 45 PRO C 146 139 GLU A Protein S 2 FT #SUB 52 45 PRO C 200 193 ARG A Protein A 3 FT #SUB 54 47 ARG C 140 133 ASP A Protein S 4 FT #SUB 54 47 ARG C 141 134 ALA A Protein S 2 FT #SUB 193 186 PRO C 141 134 ALA A Protein B 2 FT #SUB 194 187 GLY C 140 133 ASP A Protein B 6 FT #SUB 194 187 GLY C 141 134 ALA A Protein B 2 FT #SUB 196 189 LYS C 140 133 ASP A Protein S 5 FT #SUB 200 193 ARG C 98 91 ASP A Protein S 1 FT #SUB 163 156 GLN C 353 346 ASP D Protein B 1 FT #SUB 163 156 GLN C 407 400 ASP D Protein S 3 FT #SUB 163 156 GLN C 411 404 ARG D Protein S 3 FT #SUB 221 214 HIS C 356 349 ASN D Protein A 4 FT #SUB 221 214 HIS C 390 383 GLU D Protein S 4 FT #SUB 221 214 HIS C 404 397 TYR D Protein S 16 FT #SUB 222 215 TYR C 356 349 ASN D Protein B 1 FT #SUB 222 215 TYR C 404 397 TYR D Protein S 3 FT #SUB 222 215 TYR C 407 400 ASP D Protein S 1 FT #SUB 224 217 PRO C 356 349 ASN D Protein S 1 FT #HET 30 23 GLN C 18 506 0WK C S 6 FT #HET 131 124 HIS C 18 506 0WK C S 2 FT #HET 177 170 GLN C 18 506 0WK C S 4 FT #HET 177 170 GLN C 19 507 BGC C S 5 FT #HET 180 173 ILE C 19 507 BGC C S 2 FT #HET 184 177 PHE C 19 507 BGC C S 3 FT #HET 196 189 LYS C 17 505 IMD C B 2 FT #HET 197 190 ASP C 17 505 IMD C A 11 FT #HET 233 226 ALA C 19 507 BGC C S 1 FT #HET 267 260 TRP C 13 501 GOL C S 8 FT #HET 267 260 TRP C 14 502 GOL C S 15 FT #HET 270 263 TYR C 15 503 GOL C A 4 FT #HET 271 264 PRO C 15 503 GOL C B 2 FT #HET 272 265 GLN C 15 503 GOL C A 10 FT #HET 275 268 TRP C 15 503 GOL C S 16 FT #HET 292 285 TRP C 15 503 GOL C S 1 FT #HET 306 299 ASN C 19 507 BGC C S 1 FT #HET 308 301 TYR C 19 507 BGC C S 4 FT #HET 376 369 ARG C 13 501 GOL C B 2 FT #HET 377 370 TYR C 13 501 GOL C B 1 FT #HET 378 371 GLN C 13 501 GOL C S 5 FT #HET 385 378 GLU C 18 506 0WK C S 6 FT #HET 390 383 GLU C 16 504 GOL C S 2 FT #HET 391 384 PHE C 16 504 GOL C A 6 FT #HET 393 386 THR C 16 504 GOL C A 4 FT #HET 404 397 TYR C 16 504 GOL C S 6 FT #HET 432 425 TRP C 18 506 0WK C S 8 FT #HET 439 432 SER C 18 506 0WK C S 5 FT #HET 440 433 TRP C 18 506 0WK C A 8 FT #HET 448 441 TYR C 18 506 0WK C S 4 FT DISORDER 1 11 FT DISORDER 323 335 CC SEQUENCE 461 AA (ATOM); CC HLKPFPPEFL WGAASAAYQV EGAWNEDGKG LSVWDVFAKQ PGRTFKGTNG DVAVDHYHRY CC QEDVALMAEM GLKAYRFSVS WSRVFPDGNG AVNEKGLDFY DRLIEELRNH GIEPIVTLYH CC WDVPQALMDA YGAWESRRII DDFDRYAVTL FQRFGDRVKY WVTLNQQNIF ISFGYRLGLH CC PPGVKDMKRM YEANHIANLA NAKVIQSFRH YVPDGKIGPS FAYSPMYPYD SRPENVLAFE CC NAEEFQNHWW MDVYAWGMYP QAAWNYLESQ GLEPTVAPGD WELLQAAKPD FMGVNYYQTT CC TVEHNPPDGV GTSSGIPGLF KTVRNPHVDT TNWDWAIDPV GLRIGLRRIA NRYQLPILIT CC ENGLGEFDTL EPGDIVNDDY RIDYLRRHVQ EIQRAITDGV DVLGYCAWSF TDLLSWLNGY CC QKRYGFVYVN RDDESEKDLR RIKKKSFYWY QRVIETNGAE L CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MIHHHHHHEHRHLKPFPPEFLWGAASAAYQVEGAWNEDGKGLSVWDVFAK CC ATOM -----------HLKPFPPEFLWGAASAAYQVEGAWNEDGKGLSVWDVFAK CC *************************************** CC SEQRES QPGRTFKGTNGDVAVDHYHRYQEDVALMAEMGLKAYRFSVSWSRVFPDGN CC ATOM QPGRTFKGTNGDVAVDHYHRYQEDVALMAEMGLKAYRFSVSWSRVFPDGN CC ************************************************** CC SEQRES GAVNEKGLDFYDRLIEELRNHGIEPIVTLYHWDVPQALMDAYGAWESRRI CC ATOM GAVNEKGLDFYDRLIEELRNHGIEPIVTLYHWDVPQALMDAYGAWESRRI CC ************************************************** CC SEQRES IDDFDRYAVTLFQRFGDRVKYWVTLNQQNIFISFGYRLGLHPPGVKDMKR CC ATOM IDDFDRYAVTLFQRFGDRVKYWVTLNQQNIFISFGYRLGLHPPGVKDMKR CC ************************************************** CC SEQRES MYEANHIANLANAKVIQSFRHYVPDGKIGPSFAYSPMYPYDSRPENVLAF CC ATOM MYEANHIANLANAKVIQSFRHYVPDGKIGPSFAYSPMYPYDSRPENVLAF CC ************************************************** CC SEQRES ENAEEFQNHWWMDVYAWGMYPQAAWNYLESQGLEPTVAPGDWELLQAAKP CC ATOM ENAEEFQNHWWMDVYAWGMYPQAAWNYLESQGLEPTVAPGDWELLQAAKP CC ************************************************** CC SEQRES DFMGVNYYQTTTVEHNPPDGVGEGVMNTTGKKGTSTSSGIPGLFKTVRNP CC ATOM DFMGVNYYQTTTVEHNPPDGVG-------------TSSGIPGLFKTVRNP CC ********************** *************** CC SEQRES HVDTTNWDWAIDPVGLRIGLRRIANRYQLPILITENGLGEFDTLEPGDIV CC ATOM HVDTTNWDWAIDPVGLRIGLRRIANRYQLPILITENGLGEFDTLEPGDIV CC ************************************************** CC SEQRES NDDYRIDYLRRHVQEIQRAITDGVDVLGYCAWSFTDLLSWLNGYQKRYGF CC ATOM NDDYRIDYLRRHVQEIQRAITDGVDVLGYCAWSFTDLLSWLNGYQKRYGF CC ************************************************** CC SEQRES VYVNRDDESEKDLRRIKKKSFYWYQRVIETNGAEL CC ATOM VYVNRDDESEKDLRRIKKKSFYWYQRVIETNGAEL CC *********************************** SQ SEQUENCE 485 AA; MW; CN; MIHHHHHHEH RHLKPFPPEF LWGAASAAYQ VEGAWNEDGK GLSVWDVFAK QPGRTFKGTN GDVAVDHYHR YQEDVALMAE MGLKAYRFSV SWSRVFPDGN GAVNEKGLDF YDRLIEELRN HGIEPIVTLY HWDVPQALMD AYGAWESRRI IDDFDRYAVT LFQRFGDRVK YWVTLNQQNI FISFGYRLGL HPPGVKDMKR MYEANHIANL ANAKVIQSFR HYVPDGKIGP SFAYSPMYPY DSRPENVLAF ENAEEFQNHW WMDVYAWGMY PQAAWNYLES QGLEPTVAPG DWELLQAAKP DFMGVNYYQT TTVEHNPPDG VGEGVMNTTG KKGTSTSSGI PGLFKTVRNP HVDTTNWDWA IDPVGLRIGL RRIANRYQLP ILITENGLGE FDTLEPGDIV NDDYRIDYLR RHVQEIQRAI TDGVDVLGYC AWSFTDLLSW LNGYQKRYGF VYVNRDDESE KDLRRIKKKS FYWYQRVIET NGAEL // ID 4ZFMD STANDARD; PRT; 485 AA. DT CONVERTED FROM PDB (SEQRES) 4ZFM DE Putative 6-phospho-beta-galactobiosidase OS Geobacillus stearothermophilus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.400 CC R-Factor 0.132 FT #SUB 12 5 HIS D 476 469 ARG B Protein S 1 FT #SUB 13 6 LEU D 476 469 ARG B Protein B 4 FT #SUB 14 7 LYS D 476 469 ARG B Protein B 1 FT #SUB 15 8 PRO D 476 469 ARG B Protein A 6 FT #SUB 15 8 PRO D 479 472 GLU B Protein S 6 FT #SUB 15 8 PRO D 480 473 THR B Protein S 1 FT #SUB 483 476 ALA D 484 477 GLU B Protein S 3 FT #SUB 353 346 ASP D 163 156 GLN C Protein S 1 FT #SUB 356 349 ASN D 221 214 HIS C Protein S 4 FT #SUB 356 349 ASN D 222 215 TYR C Protein S 1 FT #SUB 356 349 ASN D 224 217 PRO C Protein S 1 FT #SUB 390 383 GLU D 221 214 HIS C Protein S 4 FT #SUB 404 397 TYR D 221 214 HIS C Protein S 16 FT #SUB 404 397 TYR D 222 215 TYR C Protein A 3 FT #SUB 407 400 ASP D 163 156 GLN C Protein S 3 FT #SUB 407 400 ASP D 222 215 TYR C Protein S 1 FT #SUB 411 404 ARG D 163 156 GLN C Protein S 3 FT #HET 30 23 GLN D 2 2 0WK E S 8 FT #HET 131 124 HIS D 2 2 0WK E S 2 FT #HET 163 156 GLN D 21 502 IMD D B 2 FT #HET 164 157 ARG D 21 502 IMD D A 7 FT #HET 177 170 GLN D 1 1 BGC E S 5 FT #HET 177 170 GLN D 2 2 0WK E S 4 FT #HET 184 177 PHE D 1 1 BGC E S 4 FT #HET 233 226 ALA D 1 1 BGC E S 1 FT #HET 270 263 TYR D 20 501 GOL D B 3 FT #HET 271 264 PRO D 20 501 GOL D B 2 FT #HET 272 265 GLN D 20 501 GOL D A 11 FT #HET 275 268 TRP D 20 501 GOL D S 11 FT #HET 292 285 TRP D 20 501 GOL D S 1 FT #HET 306 299 ASN D 1 1 BGC E S 1 FT #HET 308 301 TYR D 1 1 BGC E S 4 FT #HET 359 352 TRP D 2 2 0WK E S 2 FT #HET 385 378 GLU D 2 2 0WK E S 6 FT #HET 432 425 TRP D 2 2 0WK E S 8 FT #HET 439 432 SER D 2 2 0WK E S 2 FT #HET 440 433 TRP D 2 2 0WK E S 5 FT #HET 442 435 ASN D 2 2 0WK E S 1 FT #HET 446 439 LYS D 2 2 0WK E S 3 FT #HET 448 441 TYR D 2 2 0WK E S 8 FT DISORDER 1 11 FT DISORDER 323 335 CC SEQUENCE 461 AA (ATOM); CC HLKPFPPEFL WGAASAAYQV EGAWNEDGKG LSVWDVFAKQ PGRTFKGTNG DVAVDHYHRY CC QEDVALMAEM GLKAYRFSVS WSRVFPDGNG AVNEKGLDFY DRLIEELRNH GIEPIVTLYH CC WDVPQALMDA YGAWESRRII DDFDRYAVTL FQRFGDRVKY WVTLNQQNIF ISFGYRLGLH CC PPGVKDMKRM YEANHIANLA NAKVIQSFRH YVPDGKIGPS FAYSPMYPYD SRPENVLAFE CC NAEEFQNHWW MDVYAWGMYP QAAWNYLESQ GLEPTVAPGD WELLQAAKPD FMGVNYYQTT CC TVEHNPPDGV GTSSGIPGLF KTVRNPHVDT TNWDWAIDPV GLRIGLRRIA NRYQLPILIT CC ENGLGEFDTL EPGDIVNDDY RIDYLRRHVQ EIQRAITDGV DVLGYCAWSF TDLLSWLNGY CC QKRYGFVYVN RDDESEKDLR RIKKKSFYWY QRVIETNGAE L CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MIHHHHHHEHRHLKPFPPEFLWGAASAAYQVEGAWNEDGKGLSVWDVFAK CC ATOM -----------HLKPFPPEFLWGAASAAYQVEGAWNEDGKGLSVWDVFAK CC *************************************** CC SEQRES QPGRTFKGTNGDVAVDHYHRYQEDVALMAEMGLKAYRFSVSWSRVFPDGN CC ATOM QPGRTFKGTNGDVAVDHYHRYQEDVALMAEMGLKAYRFSVSWSRVFPDGN CC ************************************************** CC SEQRES GAVNEKGLDFYDRLIEELRNHGIEPIVTLYHWDVPQALMDAYGAWESRRI CC ATOM GAVNEKGLDFYDRLIEELRNHGIEPIVTLYHWDVPQALMDAYGAWESRRI CC ************************************************** CC SEQRES IDDFDRYAVTLFQRFGDRVKYWVTLNQQNIFISFGYRLGLHPPGVKDMKR CC ATOM IDDFDRYAVTLFQRFGDRVKYWVTLNQQNIFISFGYRLGLHPPGVKDMKR CC ************************************************** CC SEQRES MYEANHIANLANAKVIQSFRHYVPDGKIGPSFAYSPMYPYDSRPENVLAF CC ATOM MYEANHIANLANAKVIQSFRHYVPDGKIGPSFAYSPMYPYDSRPENVLAF CC ************************************************** CC SEQRES ENAEEFQNHWWMDVYAWGMYPQAAWNYLESQGLEPTVAPGDWELLQAAKP CC ATOM ENAEEFQNHWWMDVYAWGMYPQAAWNYLESQGLEPTVAPGDWELLQAAKP CC ************************************************** CC SEQRES DFMGVNYYQTTTVEHNPPDGVGEGVMNTTGKKGTSTSSGIPGLFKTVRNP CC ATOM DFMGVNYYQTTTVEHNPPDGVG-------------TSSGIPGLFKTVRNP CC ********************** *************** CC SEQRES HVDTTNWDWAIDPVGLRIGLRRIANRYQLPILITENGLGEFDTLEPGDIV CC ATOM HVDTTNWDWAIDPVGLRIGLRRIANRYQLPILITENGLGEFDTLEPGDIV CC ************************************************** CC SEQRES NDDYRIDYLRRHVQEIQRAITDGVDVLGYCAWSFTDLLSWLNGYQKRYGF CC ATOM NDDYRIDYLRRHVQEIQRAITDGVDVLGYCAWSFTDLLSWLNGYQKRYGF CC ************************************************** CC SEQRES VYVNRDDESEKDLRRIKKKSFYWYQRVIETNGAEL CC ATOM VYVNRDDESEKDLRRIKKKSFYWYQRVIETNGAEL CC *********************************** SQ SEQUENCE 485 AA; MW; CN; MIHHHHHHEH RHLKPFPPEF LWGAASAAYQ VEGAWNEDGK GLSVWDVFAK QPGRTFKGTN GDVAVDHYHR YQEDVALMAE MGLKAYRFSV SWSRVFPDGN GAVNEKGLDF YDRLIEELRN HGIEPIVTLY HWDVPQALMD AYGAWESRRI IDDFDRYAVT LFQRFGDRVK YWVTLNQQNI FISFGYRLGL HPPGVKDMKR MYEANHIANL ANAKVIQSFR HYVPDGKIGP SFAYSPMYPY DSRPENVLAF ENAEEFQNHW WMDVYAWGMY PQAAWNYLES QGLEPTVAPG DWELLQAAKP DFMGVNYYQT TTVEHNPPDG VGEGVMNTTG KKGTSTSSGI PGLFKTVRNP HVDTTNWDWA IDPVGLRIGL RRIANRYQLP ILITENGLGE FDTLEPGDIV NDDYRIDYLR RHVQEIQRAI TDGVDVLGYC AWSFTDLLSW LNGYQKRYGF VYVNRDDESE KDLRRIKKKS FYWYQRVIET NGAEL //