ID 1JYXA STANDARD; PRT; 1023 AA. DT CONVERTED FROM PDB (SEQRES) 1JYX DE Beta-Galactosidase OS Escherichia coli CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.750 CC R-Factor 0.168 FT #SUB 339 339 ASN A 527 527 PRO B Protein A 7 FT #SUB 339 339 ASN A 528 528 GLY B Protein A 7 FT #SUB 341 341 LEU A 527 527 PRO B Protein S 2 FT #SUB 507 507 ASP A 558 558 GLN B Protein B 3 FT #SUB 509 509 ASP A 558 558 GLN B Protein S 5 FT #SUB 519 519 SER A 558 558 GLN B Protein S 1 FT #SUB 521 521 LYS A 559 559 TYR B Protein A 4 FT #SUB 522 522 LYS A 558 558 GLN B Protein S 6 FT #SUB 522 522 LYS A 559 559 TYR B Protein A 6 FT #SUB 524 524 LEU A 525 525 SER B Protein S 2 FT #SUB 525 525 SER A 524 524 LEU B Protein S 3 FT #SUB 525 525 SER A 559 559 TYR B Protein S 2 FT #SUB 525 525 SER A 561 561 ARG B Protein A 5 FT #SUB 527 527 PRO A 339 339 ASN B Protein A 6 FT #SUB 527 527 PRO A 341 341 LEU B Protein S 1 FT #SUB 528 528 GLY A 339 339 ASN B Protein B 7 FT #SUB 558 558 GLN A 507 507 ASP B Protein S 3 FT #SUB 558 558 GLN A 509 509 ASP B Protein S 4 FT #SUB 558 558 GLN A 519 519 SER B Protein S 1 FT #SUB 558 558 GLN A 522 522 LYS B Protein A 4 FT #SUB 559 559 TYR A 521 521 LYS B Protein S 5 FT #SUB 559 559 TYR A 522 522 LYS B Protein S 7 FT #SUB 559 559 TYR A 525 525 SER B Protein S 2 FT #SUB 560 560 PRO A 527 527 PRO B Protein S 1 FT #SUB 561 561 ARG A 525 525 SER B Protein S 6 FT #SUB 693 693 GLN A 874 874 SER B Protein S 4 FT #SUB 721 721 ARG A 874 874 SER B Protein S 2 FT #SUB 723 723 ALA A 875 875 ASP B Protein A 4 FT #SUB 724 724 GLU A 847 847 LYS B Protein B 4 FT #SUB 724 724 GLU A 872 872 VAL B Protein S 1 FT #SUB 724 724 GLU A 873 873 ALA B Protein S 2 FT #SUB 724 724 GLU A 874 874 SER B Protein S 8 FT #SUB 724 724 GLU A 875 875 ASP B Protein A 6 FT #SUB 726 726 LEU A 851 851 ILE B Protein S 1 FT #SUB 726 726 LEU A 871 871 GLU B Protein S 1 FT #SUB 726 726 LEU A 873 873 ALA B Protein S 2 FT #SUB 727 727 SER A 851 851 ILE B Protein B 1 FT #SUB 728 728 VAL A 848 848 THR B Protein S 2 FT #SUB 728 728 VAL A 851 851 ILE B Protein S 1 FT #SUB 730 730 LEU A 823 823 LEU B Protein S 5 FT #SUB 823 823 LEU A 730 730 LEU B Protein B 2 FT #SUB 828 828 ASP A 830 830 LEU B Protein S 7 FT #SUB 828 828 ASP A 831 831 ALA B Protein S 4 FT #SUB 829 829 THR A 829 829 THR B Protein B 1 FT #SUB 830 830 LEU A 828 828 ASP B Protein S 2 FT #SUB 830 830 LEU A 830 830 LEU B Protein S 1 FT #SUB 831 831 ALA A 828 828 ASP B Protein A 2 FT #SUB 841 841 ALA A 728 728 VAL B Protein S 1 FT #SUB 847 847 LYS A 724 724 GLU B Protein S 3 FT #SUB 848 848 THR A 726 726 LEU B Protein B 1 FT #SUB 848 848 THR A 728 728 VAL B Protein S 2 FT #SUB 849 849 LEU A 726 726 LEU B Protein B 1 FT #SUB 851 851 ILE A 726 726 LEU B Protein S 1 FT #SUB 851 851 ILE A 727 727 SER B Protein S 1 FT #SUB 857 857 ARG A 828 828 ASP B Protein S 1 FT #SUB 869 869 ASP A 1015 1015 HIS B Protein S 1 FT #SUB 871 871 GLU A 726 726 LEU B Protein S 1 FT #SUB 873 873 ALA A 724 724 GLU B Protein B 2 FT #SUB 873 873 ALA A 726 726 LEU B Protein A 4 FT #SUB 874 874 SER A 693 693 GLN B Protein S 4 FT #SUB 874 874 SER A 722 722 LEU B Protein S 1 FT #SUB 874 874 SER A 724 724 GLU B Protein A 6 FT #SUB 875 875 ASP A 723 723 ALA B Protein S 4 FT #SUB 875 875 ASP A 724 724 GLU B Protein S 5 FT #SUB 942 942 ARG A 1013 1013 ARG B Protein S 7 FT #SUB 954 954 ASP A 1013 1013 ARG B Protein S 3 FT #SUB 1013 1013 ARG A 942 942 ARG B Protein S 9 FT #SUB 1013 1013 ARG A 954 954 ASP B Protein S 4 FT #SUB 1015 1015 HIS A 869 869 ASP B Protein S 2 FT #SUB 1015 1015 HIS A 1015 1015 HIS B Protein S 17 FT #SUB 1017 1017 GLN A 869 869 ASP B Protein S 2 FT #SUB 232 232 ASN A 233 233 ASP C Protein A 3 FT #SUB 233 233 ASP A 232 232 ASN C Protein S 3 FT #SUB 233 233 ASP A 233 233 ASP C Protein A 19 FT #SUB 13 13 ARG A 13 13 ARG D Protein S 14 FT #SUB 13 13 ARG A 15 15 ASP D Protein S 9 FT #SUB 13 13 ARG A 24 24 LEU D Protein S 2 FT #SUB 15 15 ASP A 13 13 ARG D Protein S 8 FT #SUB 20 20 GLY A 20 20 GLY D Protein B 1 FT #SUB 24 24 LEU A 13 13 ARG D Protein S 2 FT #SUB 26 26 ARG A 431 431 ARG D Protein B 3 FT #SUB 27 27 LEU A 431 431 ARG D Protein B 1 FT #SUB 28 28 ALA A 431 431 ARG D Protein A 3 FT #SUB 103 103 VAL A 282 282 ARG D Protein S 2 FT #SUB 278 278 ILE A 513 513 PRO D Protein S 1 FT #SUB 278 278 ILE A 514 514 ALA D Protein B 1 FT #SUB 279 279 ILE A 422 422 PRO D Protein S 4 FT #SUB 279 279 ILE A 424 424 ASN D Protein S 3 FT #SUB 279 279 ILE A 514 514 ALA D Protein B 2 FT #SUB 279 279 ILE A 515 515 VAL D Protein B 1 FT #SUB 280 280 ASP A 422 422 PRO D Protein S 5 FT #SUB 280 280 ASP A 423 423 MET D Protein S 10 FT #SUB 280 280 ASP A 424 424 ASN D Protein S 1 FT #SUB 280 280 ASP A 463 463 GLY D Protein S 1 FT #SUB 280 280 ASP A 515 515 VAL D Protein B 1 FT #SUB 281 281 GLU A 423 423 MET D Protein S 3 FT #SUB 281 281 GLU A 515 515 VAL D Protein A 6 FT #SUB 282 282 ARG A 103 103 VAL D Protein S 2 FT #SUB 282 282 ARG A 418 418 HIS D Protein S 3 FT #SUB 282 282 ARG A 419 419 GLY D Protein S 6 FT #SUB 282 282 ARG A 420 420 MET D Protein S 4 FT #SUB 282 282 ARG A 421 421 VAL D Protein A 4 FT #SUB 282 282 ARG A 422 422 PRO D Protein S 2 FT #SUB 282 282 ARG A 423 423 MET D Protein S 3 FT #SUB 283 283 GLY A 422 422 PRO D Protein B 2 FT #SUB 284 284 GLY A 422 422 PRO D Protein B 6 FT #SUB 285 285 TYR A 422 422 PRO D Protein S 8 FT #SUB 285 285 TYR A 424 424 ASN D Protein S 7 FT #SUB 285 285 TYR A 425 425 ARG D Protein S 8 FT #SUB 287 287 ASP A 425 425 ARG D Protein S 9 FT #SUB 418 418 HIS A 282 282 ARG D Protein B 3 FT #SUB 419 419 GLY A 282 282 ARG D Protein B 6 FT #SUB 420 420 MET A 282 282 ARG D Protein B 6 FT #SUB 421 421 VAL A 282 282 ARG D Protein S 5 FT #SUB 422 422 PRO A 280 280 ASP D Protein A 6 FT #SUB 422 422 PRO A 282 282 ARG D Protein B 2 FT #SUB 422 422 PRO A 283 283 GLY D Protein S 2 FT #SUB 422 422 PRO A 284 284 GLY D Protein A 7 FT #SUB 422 422 PRO A 285 285 TYR D Protein S 6 FT #SUB 423 423 MET A 280 280 ASP D Protein A 10 FT #SUB 423 423 MET A 281 281 GLU D Protein S 2 FT #SUB 423 423 MET A 282 282 ARG D Protein A 4 FT #SUB 424 424 ASN A 279 279 ILE D Protein S 1 FT #SUB 424 424 ASN A 280 280 ASP D Protein B 1 FT #SUB 424 424 ASN A 285 285 TYR D Protein A 6 FT #SUB 425 425 ARG A 285 285 TYR D Protein A 8 FT #SUB 425 425 ARG A 287 287 ASP D Protein S 8 FT #SUB 430 430 PRO A 445 445 GLN D Protein S 1 FT #SUB 433 433 LEU A 437 437 SER D Protein S 1 FT #SUB 434 434 PRO A 434 434 PRO D Protein S 3 FT #SUB 437 437 SER A 433 433 LEU D Protein S 2 FT #SUB 441 441 THR A 430 430 PRO D Protein S 1 FT #SUB 445 445 GLN A 430 430 PRO D Protein S 1 FT #SUB 463 463 GLY A 280 280 ASP D Protein B 1 FT #SUB 466 466 ALA A 474 474 TRP D Protein A 3 FT #SUB 466 466 ALA A 478 478 VAL D Protein S 1 FT #SUB 469 469 ASP A 473 473 ARG D Protein B 3 FT #SUB 469 469 ASP A 477 477 SER D Protein S 4 FT #SUB 470 470 ALA A 470 470 ALA D Protein A 4 FT #SUB 473 473 ARG A 469 469 ASP D Protein S 4 FT #SUB 473 473 ARG A 473 473 ARG D Protein S 1 FT #SUB 474 474 TRP A 466 466 ALA D Protein S 3 FT #SUB 477 477 SER A 469 469 ASP D Protein S 4 FT #SUB 478 478 VAL A 466 466 ALA D Protein S 1 FT #SUB 513 513 PRO A 278 278 ILE D Protein S 2 FT #SUB 514 514 ALA A 278 278 ILE D Protein S 1 FT #SUB 514 514 ALA A 279 279 ILE D Protein S 2 FT #SUB 515 515 VAL A 279 279 ILE D Protein S 1 FT #SUB 515 515 VAL A 280 280 ASP D Protein S 1 FT #SUB 515 515 VAL A 281 281 GLU D Protein S 8 FT #HET 15 15 ASP A 2 3002 MG A B 2 FT #HET 18 18 ASN A 2 3002 MG A B 4 FT #HET 21 21 VAL A 2 3002 MG A B 2 FT #HET 32 32 PRO A 11 8404 DMS A A 3 FT #HET 33 33 PHE A 11 8404 DMS A B 4 FT #HET 34 34 ALA A 11 8404 DMS A A 2 FT #HET 34 34 ALA A 23 8501 DMS A S 1 FT #HET 36 36 TRP A 11 8404 DMS A S 2 FT #HET 36 36 TRP A 23 8501 DMS A S 1 FT #HET 45 45 ASP A 11 8404 DMS A S 3 FT #HET 45 45 ASP A 23 8501 DMS A A 4 FT #HET 46 46 ARG A 23 8501 DMS A B 3 FT #HET 47 47 PRO A 23 8501 DMS A B 2 FT #HET 48 48 SER A 23 8501 DMS A B 1 FT #HET 54 54 LEU A 14 8408 DMS A B 5 FT #HET 55 55 ASN A 14 8408 DMS A B 2 FT #HET 57 57 GLU A 24 8502 DMS A B 3 FT #HET 58 58 TRP A 24 8502 DMS A B 2 FT #HET 59 59 ARG A 24 8502 DMS A S 1 FT #HET 93 93 HIS A 22 8421 DMS A B 1 FT #HET 94 94 GLY A 22 8421 DMS A B 4 FT #HET 95 95 TYR A 22 8421 DMS A S 4 FT #HET 100 100 TYR A 3 3101 NA A S 1 FT #HET 102 102 ASN A 6 2001 IPT A S 5 FT #HET 106 106 PRO A 16 8410 DMS A B 3 FT #HET 115 115 PRO A 16 8410 DMS A S 2 FT #HET 125 125 LEU A 14 8408 DMS A S 1 FT #HET 125 125 LEU A 24 8502 DMS A A 4 FT #HET 126 126 THR A 24 8502 DMS A A 7 FT #HET 127 127 PHE A 14 8408 DMS A S 1 FT #HET 163 163 GLN A 2 3002 MG A S 3 FT #HET 191 191 TRP A 16 8410 DMS A S 3 FT #HET 193 193 ASP A 2 3002 MG A S 3 FT #HET 201 201 ASP A 3 3101 NA A S 3 FT #HET 201 201 ASP A 6 2001 IPT A S 7 FT #HET 229 229 THR A 8 8401 DMS A S 2 FT #HET 266 266 GLN A 25 8602 DMS A A 4 FT #HET 267 267 VAL A 25 8602 DMS A B 4 FT #HET 268 268 ALA A 25 8602 DMS A B 6 FT #HET 269 269 SER A 25 8602 DMS A B 1 FT #HET 271 271 THR A 12 8405 DMS A B 1 FT #HET 275 275 GLY A 18 8412 DMS A B 2 FT #HET 276 276 GLY A 18 8412 DMS A B 1 FT #HET 277 277 GLU A 18 8412 DMS A A 5 FT #HET 282 282 ARG A 103 8704 DMS D B 1 FT #HET 284 284 GLY A 103 8704 DMS D B 6 FT #HET 286 286 ALA A 103 8704 DMS D S 1 FT #HET 289 289 VAL A 18 8412 DMS A A 3 FT #HET 290 290 THR A 18 8412 DMS A A 9 FT #HET 291 291 LEU A 12 8405 DMS A A 3 FT #HET 292 292 ARG A 12 8405 DMS A B 6 FT #HET 292 292 ARG A 18 8412 DMS A S 1 FT #HET 304 304 GLU A 7 2002 IPT A B 5 FT #HET 306 306 PRO A 7 2002 IPT A S 2 FT #HET 310 310 ARG A 11 8404 DMS A S 1 FT #HET 327 327 ALA A 11 8404 DMS A B 1 FT #HET 330 330 VAL A 8 8401 DMS A A 3 FT #HET 331 331 GLY A 8 8401 DMS A B 7 FT #HET 333 333 ARG A 8 8401 DMS A S 3 FT #HET 334 334 GLU A 15 8409 DMS A S 2 FT #HET 335 335 VAL A 15 8409 DMS A B 1 FT #HET 336 336 ARG A 15 8409 DMS A S 1 FT #HET 380 380 LYS A 10 8403 DMS A B 1 FT #HET 383 383 ASN A 10 8403 DMS A S 2 FT #HET 416 416 GLU A 1 3001 MG A S 3 FT #HET 418 418 HIS A 1 3001 MG A S 4 FT #HET 428 428 ASP A 21 8420 DMS A A 6 FT #HET 430 430 PRO A 21 8420 DMS A S 1 FT #HET 448 448 ARG A 8 8401 DMS A B 1 FT #HET 449 449 ASN A 8 8401 DMS A B 4 FT #HET 450 450 HIS A 8 8401 DMS A B 1 FT #HET 451 451 PRO A 8 8401 DMS A A 10 FT #HET 461 461 GLU A 1 3001 MG A S 2 FT #HET 461 461 GLU A 6 2001 IPT A S 9 FT #HET 480 480 PRO A 15 8409 DMS A B 1 FT #HET 481 481 SER A 15 8409 DMS A B 1 FT #HET 482 482 ARG A 8 8401 DMS A S 1 FT #HET 502 502 MET A 6 2001 IPT A S 1 FT #HET 503 503 TYR A 6 2001 IPT A S 3 FT #HET 505 505 ARG A 13 8407 DMS A S 1 FT #HET 508 508 GLU A 13 8407 DMS A S 6 FT #HET 537 537 GLU A 6 2001 IPT A S 4 FT #HET 540 540 HIS A 6 2001 IPT A S 5 FT #HET 556 556 PHE A 4 3102 NA A B 2 FT #HET 557 557 ARG A 9 8402 DMS A S 5 FT #HET 559 559 TYR A 4 3102 NA A B 2 FT #HET 560 560 PRO A 4 3102 NA A B 4 FT #HET 562 562 LEU A 4 3102 NA A B 2 FT #HET 568 568 TRP A 6 2001 IPT A S 1 FT #HET 576 576 ILE A 17 8411 DMS A S 3 FT #HET 584 584 PRO A 17 8411 DMS A B 2 FT #HET 585 585 TRP A 17 8411 DMS A B 3 FT #HET 586 586 SER A 17 8411 DMS A A 5 FT #HET 593 593 GLY A 19 8413 DMS A B 4 FT #HET 594 594 ASP A 19 8413 DMS A B 2 FT #HET 595 595 THR A 19 8413 DMS A A 6 FT #HET 601 601 PHE A 3 3101 NA A B 2 FT #HET 601 601 PHE A 6 2001 IPT A B 1 FT #HET 604 604 ASN A 3 3101 NA A S 3 FT #HET 604 604 ASN A 6 2001 IPT A S 3 FT #HET 621 621 LYS A 20 8415 DMS A S 4 FT #HET 622 622 HIS A 9 8402 DMS A A 9 FT #HET 623 623 GLN A 9 8402 DMS A A 4 FT #HET 625 625 GLN A 9 8402 DMS A S 3 FT #HET 626 626 PHE A 10 8403 DMS A S 6 FT #HET 628 628 GLN A 9 8402 DMS A S 5 FT #HET 642 642 TYR A 7 2002 IPT A S 2 FT #HET 642 642 TYR A 10 8403 DMS A S 5 FT #HET 645 645 ARG A 7 2002 IPT A S 17 FT #HET 648 648 ASP A 7 2002 IPT A S 5 FT #HET 650 650 GLU A 7 2002 IPT A S 1 FT #HET 702 702 GLN A 7 2002 IPT A S 2 FT #HET 702 702 GLN A 10 8403 DMS A S 1 FT #HET 706 706 THR A 7 2002 IPT A S 1 FT #HET 708 708 TRP A 7 2002 IPT A S 6 FT #HET 708 708 TRP A 10 8403 DMS A S 5 FT #HET 714 714 ILE A 20 8415 DMS A S 2 FT #HET 717 717 TRP A 20 8415 DMS A S 11 FT #HET 931 931 PHE A 5 3103 NA A S 3 FT #HET 932 932 PRO A 5 3103 NA A B 2 FT #HET 967 967 LEU A 5 3103 NA A B 2 FT #HET 968 968 MET A 5 3103 NA A B 3 FT #HET 970 970 THR A 5 3103 NA A B 2 FT #HET 973 973 ARG A 17 8411 DMS A S 5 FT #HET 999 999 TRP A 6 2001 IPT A S 13 FT #HET 1001 1001 PRO A 13 8407 DMS A A 3 FT #HET 1003 1003 VAL A 13 8407 DMS A B 1 FT DISORDER 1 12 CC SEQUENCE 1011 AA (ATOM); CC RRDWENPGVT QLNRLAAHPP FASWRNSEEA RTDRPSQQLR SLNGEWRFAW FPAPEAVPES CC WLECDLPEAD TVVVPSNWQM HGYDAPIYTN VTYPITVNPP FVPTENPTGC YSLTFNVDES CC WLQEGQTRII FDGVNSAFHL WCNGRWVGYG QDSRLPSEFD LSAFLRAGEN RLAVMVLRWS CC DGSYLEDQDM WRMSGIFRDV SLLHKPTTQI SDFHVATRFN DDFSRAVLEA EVQMCGELRD CC YLRVTVSLWQ GETQVASGTA PFGGEIIDER GGYADRVTLR LNVENPKLWS AEIPNLYRAV CC VELHTADGTL IEAEACDVGF REVRIENGLL LLNGKPLLIR GVNRHEHHPL HGQVMDEQTM CC VQDILLMKQN NFNAVRCSHY PNHPLWYTLC DRYGLYVVDE ANIETHGMVP MNRLTDDPRW CC LPAMSERVTR MVQRDRNHPS VIIWSLGNES GHGANHDALY RWIKSVDPSR PVQYEGGGAD CC TTATDIICPM YARVDEDQPF PAVPKWSIKK WLSLPGETRP LILCEYAHAM GNSLGGFAKY CC WQAFRQYPRL QGGFVWDWVD QSLIKYDENG NPWSAYGGDF GDTPNDRQFC MNGLVFADRT CC PHPALTEAKH QQQFFQFRLS GQTIEVTSEY LFRHSDNELL HWMVALDGKP LASGEVPLDV CC APQGKQLIEL PELPQPESAG QLWLTVRVVQ PNATAWSEAG HISAWQQWRL AENLSVTLPA CC ASHAIPHLTT SEMDFCIELG NKRWQFNRQS GFLSQMWIGD KKQLLTPLRD QFTRAPLDND CC IGVSEATRID PNAWVERWKA AGHYQAEAAL LQCTADTLAD AVLITTAHAW QHQGKTLFIS CC RKTYRIDGSG QMAITVDVEV ASDTPHPARI GLNCQLAQVA ERVNWLGLGP QENYPDRLTA CC ACFDRWDLPL SDMYTPYVFP SENGLRCGTR ELNYGPHQWR GDFQFNISRY SQQQLMETSH CC RHLLHAEEGT WLNIDGFHMG IGGDDSWSPS VSAEFQLSAG RYHYQLVWCQ K CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES GSHMLEDPVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQ CC ATOM ------------RRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQ CC ************************************** CC SEQRES LRSLNGEWRFAWFPAPEAVPESWLECDLPEADTVVVPSNWQMHGYDAPIY CC ATOM LRSLNGEWRFAWFPAPEAVPESWLECDLPEADTVVVPSNWQMHGYDAPIY CC ************************************************** CC SEQRES TNVTYPITVNPPFVPTENPTGCYSLTFNVDESWLQEGQTRIIFDGVNSAF CC ATOM TNVTYPITVNPPFVPTENPTGCYSLTFNVDESWLQEGQTRIIFDGVNSAF CC ************************************************** CC SEQRES HLWCNGRWVGYGQDSRLPSEFDLSAFLRAGENRLAVMVLRWSDGSYLEDQ CC ATOM HLWCNGRWVGYGQDSRLPSEFDLSAFLRAGENRLAVMVLRWSDGSYLEDQ CC ************************************************** CC SEQRES DMWRMSGIFRDVSLLHKPTTQISDFHVATRFNDDFSRAVLEAEVQMCGEL CC ATOM DMWRMSGIFRDVSLLHKPTTQISDFHVATRFNDDFSRAVLEAEVQMCGEL CC ************************************************** CC SEQRES RDYLRVTVSLWQGETQVASGTAPFGGEIIDERGGYADRVTLRLNVENPKL CC ATOM RDYLRVTVSLWQGETQVASGTAPFGGEIIDERGGYADRVTLRLNVENPKL CC ************************************************** CC SEQRES WSAEIPNLYRAVVELHTADGTLIEAEACDVGFREVRIENGLLLLNGKPLL CC ATOM WSAEIPNLYRAVVELHTADGTLIEAEACDVGFREVRIENGLLLLNGKPLL CC ************************************************** CC SEQRES IRGVNRHEHHPLHGQVMDEQTMVQDILLMKQNNFNAVRCSHYPNHPLWYT CC ATOM IRGVNRHEHHPLHGQVMDEQTMVQDILLMKQNNFNAVRCSHYPNHPLWYT CC ************************************************** CC SEQRES LCDRYGLYVVDEANIETHGMVPMNRLTDDPRWLPAMSERVTRMVQRDRNH CC ATOM LCDRYGLYVVDEANIETHGMVPMNRLTDDPRWLPAMSERVTRMVQRDRNH CC ************************************************** CC SEQRES PSVIIWSLGNESGHGANHDALYRWIKSVDPSRPVQYEGGGADTTATDIIC CC ATOM PSVIIWSLGNESGHGANHDALYRWIKSVDPSRPVQYEGGGADTTATDIIC CC ************************************************** CC SEQRES PMYARVDEDQPFPAVPKWSIKKWLSLPGETRPLILCEYAHAMGNSLGGFA CC ATOM PMYARVDEDQPFPAVPKWSIKKWLSLPGETRPLILCEYAHAMGNSLGGFA CC ************************************************** CC SEQRES KYWQAFRQYPRLQGGFVWDWVDQSLIKYDENGNPWSAYGGDFGDTPNDRQ CC ATOM KYWQAFRQYPRLQGGFVWDWVDQSLIKYDENGNPWSAYGGDFGDTPNDRQ CC ************************************************** CC SEQRES FCMNGLVFADRTPHPALTEAKHQQQFFQFRLSGQTIEVTSEYLFRHSDNE CC ATOM FCMNGLVFADRTPHPALTEAKHQQQFFQFRLSGQTIEVTSEYLFRHSDNE CC ************************************************** CC SEQRES LLHWMVALDGKPLASGEVPLDVAPQGKQLIELPELPQPESAGQLWLTVRV CC ATOM LLHWMVALDGKPLASGEVPLDVAPQGKQLIELPELPQPESAGQLWLTVRV CC ************************************************** CC SEQRES VQPNATAWSEAGHISAWQQWRLAENLSVTLPAASHAIPHLTTSEMDFCIE CC ATOM VQPNATAWSEAGHISAWQQWRLAENLSVTLPAASHAIPHLTTSEMDFCIE CC ************************************************** CC SEQRES LGNKRWQFNRQSGFLSQMWIGDKKQLLTPLRDQFTRAPLDNDIGVSEATR CC ATOM LGNKRWQFNRQSGFLSQMWIGDKKQLLTPLRDQFTRAPLDNDIGVSEATR CC ************************************************** CC SEQRES IDPNAWVERWKAAGHYQAEAALLQCTADTLADAVLITTAHAWQHQGKTLF CC ATOM IDPNAWVERWKAAGHYQAEAALLQCTADTLADAVLITTAHAWQHQGKTLF CC ************************************************** CC SEQRES ISRKTYRIDGSGQMAITVDVEVASDTPHPARIGLNCQLAQVAERVNWLGL CC ATOM ISRKTYRIDGSGQMAITVDVEVASDTPHPARIGLNCQLAQVAERVNWLGL CC ************************************************** CC SEQRES GPQENYPDRLTAACFDRWDLPLSDMYTPYVFPSENGLRCGTRELNYGPHQ CC ATOM GPQENYPDRLTAACFDRWDLPLSDMYTPYVFPSENGLRCGTRELNYGPHQ CC ************************************************** CC SEQRES WRGDFQFNISRYSQQQLMETSHRHLLHAEEGTWLNIDGFHMGIGGDDSWS CC ATOM WRGDFQFNISRYSQQQLMETSHRHLLHAEEGTWLNIDGFHMGIGGDDSWS CC ************************************************** CC SEQRES PSVSAEFQLSAGRYHYQLVWCQK CC ATOM PSVSAEFQLSAGRYHYQLVWCQK CC *********************** SQ SEQUENCE 1023 AA; MW; CN; GSHMLEDPVV LQRRDWENPG VTQLNRLAAH PPFASWRNSE EARTDRPSQQ LRSLNGEWRF AWFPAPEAVP ESWLECDLPE ADTVVVPSNW QMHGYDAPIY TNVTYPITVN PPFVPTENPT GCYSLTFNVD ESWLQEGQTR IIFDGVNSAF HLWCNGRWVG YGQDSRLPSE FDLSAFLRAG ENRLAVMVLR WSDGSYLEDQ DMWRMSGIFR DVSLLHKPTT QISDFHVATR FNDDFSRAVL EAEVQMCGEL RDYLRVTVSL WQGETQVASG TAPFGGEIID ERGGYADRVT LRLNVENPKL WSAEIPNLYR AVVELHTADG TLIEAEACDV GFREVRIENG LLLLNGKPLL IRGVNRHEHH PLHGQVMDEQ TMVQDILLMK QNNFNAVRCS HYPNHPLWYT LCDRYGLYVV DEANIETHGM VPMNRLTDDP RWLPAMSERV TRMVQRDRNH PSVIIWSLGN ESGHGANHDA LYRWIKSVDP SRPVQYEGGG ADTTATDIIC PMYARVDEDQ PFPAVPKWSI KKWLSLPGET RPLILCEYAH AMGNSLGGFA KYWQAFRQYP RLQGGFVWDW VDQSLIKYDE NGNPWSAYGG DFGDTPNDRQ FCMNGLVFAD RTPHPALTEA KHQQQFFQFR LSGQTIEVTS EYLFRHSDNE LLHWMVALDG KPLASGEVPL DVAPQGKQLI ELPELPQPES AGQLWLTVRV VQPNATAWSE AGHISAWQQW RLAENLSVTL PAASHAIPHL TTSEMDFCIE LGNKRWQFNR QSGFLSQMWI GDKKQLLTPL RDQFTRAPLD NDIGVSEATR IDPNAWVERW KAAGHYQAEA ALLQCTADTL ADAVLITTAH AWQHQGKTLF ISRKTYRIDG SGQMAITVDV EVASDTPHPA RIGLNCQLAQ VAERVNWLGL GPQENYPDRL TAACFDRWDL PLSDMYTPYV FPSENGLRCG TRELNYGPHQ WRGDFQFNIS RYSQQQLMET SHRHLLHAEE GTWLNIDGFH MGIGGDDSWS PSVSAEFQLS AGRYHYQLVW CQK // ID 1JYXB STANDARD; PRT; 1023 AA. DT CONVERTED FROM PDB (SEQRES) 1JYX DE Beta-Galactosidase OS Escherichia coli CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.750 CC R-Factor 0.168 FT #SUB 339 339 ASN B 527 527 PRO A Protein A 6 FT #SUB 339 339 ASN B 528 528 GLY A Protein A 7 FT #SUB 341 341 LEU B 527 527 PRO A Protein S 1 FT #SUB 507 507 ASP B 558 558 GLN A Protein B 3 FT #SUB 509 509 ASP B 558 558 GLN A Protein S 4 FT #SUB 519 519 SER B 558 558 GLN A Protein S 1 FT #SUB 521 521 LYS B 559 559 TYR A Protein A 5 FT #SUB 522 522 LYS B 558 558 GLN A Protein S 4 FT #SUB 522 522 LYS B 559 559 TYR A Protein A 7 FT #SUB 524 524 LEU B 525 525 SER A Protein S 3 FT #SUB 525 525 SER B 524 524 LEU A Protein S 2 FT #SUB 525 525 SER B 559 559 TYR A Protein S 2 FT #SUB 525 525 SER B 561 561 ARG A Protein A 6 FT #SUB 527 527 PRO B 339 339 ASN A Protein A 7 FT #SUB 527 527 PRO B 341 341 LEU A Protein S 2 FT #SUB 527 527 PRO B 560 560 PRO A Protein S 1 FT #SUB 528 528 GLY B 339 339 ASN A Protein B 7 FT #SUB 558 558 GLN B 507 507 ASP A Protein S 3 FT #SUB 558 558 GLN B 509 509 ASP A Protein S 5 FT #SUB 558 558 GLN B 519 519 SER A Protein S 1 FT #SUB 558 558 GLN B 522 522 LYS A Protein A 6 FT #SUB 559 559 TYR B 521 521 LYS A Protein S 4 FT #SUB 559 559 TYR B 522 522 LYS A Protein S 6 FT #SUB 559 559 TYR B 525 525 SER A Protein S 2 FT #SUB 561 561 ARG B 525 525 SER A Protein S 5 FT #SUB 693 693 GLN B 874 874 SER A Protein S 4 FT #SUB 722 722 LEU B 874 874 SER A Protein B 1 FT #SUB 723 723 ALA B 875 875 ASP A Protein A 4 FT #SUB 724 724 GLU B 847 847 LYS A Protein B 3 FT #SUB 724 724 GLU B 873 873 ALA A Protein S 2 FT #SUB 724 724 GLU B 874 874 SER A Protein S 6 FT #SUB 724 724 GLU B 875 875 ASP A Protein B 5 FT #SUB 726 726 LEU B 848 848 THR A Protein S 1 FT #SUB 726 726 LEU B 849 849 LEU A Protein S 1 FT #SUB 726 726 LEU B 851 851 ILE A Protein S 1 FT #SUB 726 726 LEU B 871 871 GLU A Protein S 1 FT #SUB 726 726 LEU B 873 873 ALA A Protein S 4 FT #SUB 727 727 SER B 851 851 ILE A Protein B 1 FT #SUB 728 728 VAL B 841 841 ALA A Protein S 1 FT #SUB 728 728 VAL B 848 848 THR A Protein S 2 FT #SUB 730 730 LEU B 823 823 LEU A Protein S 2 FT #SUB 823 823 LEU B 730 730 LEU A Protein A 5 FT #SUB 828 828 ASP B 830 830 LEU A Protein S 2 FT #SUB 828 828 ASP B 831 831 ALA A Protein S 2 FT #SUB 828 828 ASP B 857 857 ARG A Protein S 1 FT #SUB 829 829 THR B 829 829 THR A Protein B 1 FT #SUB 830 830 LEU B 828 828 ASP A Protein A 7 FT #SUB 830 830 LEU B 830 830 LEU A Protein S 1 FT #SUB 831 831 ALA B 828 828 ASP A Protein A 4 FT #SUB 847 847 LYS B 724 724 GLU A Protein S 4 FT #SUB 848 848 THR B 728 728 VAL A Protein S 2 FT #SUB 851 851 ILE B 726 726 LEU A Protein S 1 FT #SUB 851 851 ILE B 727 727 SER A Protein S 1 FT #SUB 851 851 ILE B 728 728 VAL A Protein S 1 FT #SUB 869 869 ASP B 1015 1015 HIS A Protein S 2 FT #SUB 869 869 ASP B 1017 1017 GLN A Protein S 2 FT #SUB 871 871 GLU B 726 726 LEU A Protein S 1 FT #SUB 872 872 VAL B 724 724 GLU A Protein B 1 FT #SUB 873 873 ALA B 724 724 GLU A Protein B 2 FT #SUB 873 873 ALA B 726 726 LEU A Protein B 2 FT #SUB 874 874 SER B 693 693 GLN A Protein S 4 FT #SUB 874 874 SER B 721 721 ARG A Protein S 2 FT #SUB 874 874 SER B 724 724 GLU A Protein A 8 FT #SUB 875 875 ASP B 723 723 ALA A Protein S 4 FT #SUB 875 875 ASP B 724 724 GLU A Protein S 6 FT #SUB 942 942 ARG B 1013 1013 ARG A Protein S 9 FT #SUB 954 954 ASP B 1013 1013 ARG A Protein S 4 FT #SUB 1013 1013 ARG B 942 942 ARG A Protein S 7 FT #SUB 1013 1013 ARG B 954 954 ASP A Protein S 3 FT #SUB 1015 1015 HIS B 869 869 ASP A Protein S 1 FT #SUB 1015 1015 HIS B 1015 1015 HIS A Protein S 17 FT #SUB 13 13 ARG B 13 13 ARG C Protein S 18 FT #SUB 13 13 ARG B 15 15 ASP C Protein S 8 FT #SUB 13 13 ARG B 24 24 LEU C Protein S 1 FT #SUB 15 15 ASP B 13 13 ARG C Protein S 9 FT #SUB 20 20 GLY B 20 20 GLY C Protein B 1 FT #SUB 21 21 VAL B 21 21 VAL C Protein S 1 FT #SUB 24 24 LEU B 13 13 ARG C Protein S 3 FT #SUB 24 24 LEU B 18 18 ASN C Protein S 1 FT #SUB 26 26 ARG B 431 431 ARG C Protein B 3 FT #SUB 28 28 ALA B 431 431 ARG C Protein B 1 FT #SUB 103 103 VAL B 282 282 ARG C Protein S 2 FT #SUB 278 278 ILE B 514 514 ALA C Protein B 1 FT #SUB 279 279 ILE B 424 424 ASN C Protein S 2 FT #SUB 279 279 ILE B 514 514 ALA C Protein B 2 FT #SUB 280 280 ASP B 422 422 PRO C Protein S 6 FT #SUB 280 280 ASP B 423 423 MET C Protein S 10 FT #SUB 280 280 ASP B 424 424 ASN C Protein S 1 FT #SUB 280 280 ASP B 463 463 GLY C Protein S 1 FT #SUB 281 281 GLU B 423 423 MET C Protein S 2 FT #SUB 281 281 GLU B 515 515 VAL C Protein A 8 FT #SUB 282 282 ARG B 103 103 VAL C Protein S 2 FT #SUB 282 282 ARG B 418 418 HIS C Protein S 3 FT #SUB 282 282 ARG B 419 419 GLY C Protein S 6 FT #SUB 282 282 ARG B 420 420 MET C Protein S 5 FT #SUB 282 282 ARG B 421 421 VAL C Protein A 4 FT #SUB 282 282 ARG B 422 422 PRO C Protein S 2 FT #SUB 282 282 ARG B 423 423 MET C Protein S 5 FT #SUB 283 283 GLY B 422 422 PRO C Protein B 1 FT #SUB 284 284 GLY B 422 422 PRO C Protein B 4 FT #SUB 285 285 TYR B 422 422 PRO C Protein S 8 FT #SUB 285 285 TYR B 424 424 ASN C Protein S 7 FT #SUB 285 285 TYR B 425 425 ARG C Protein S 8 FT #SUB 287 287 ASP B 425 425 ARG C Protein S 8 FT #SUB 418 418 HIS B 282 282 ARG C Protein B 4 FT #SUB 419 419 GLY B 282 282 ARG C Protein B 6 FT #SUB 420 420 MET B 282 282 ARG C Protein B 4 FT #SUB 421 421 VAL B 282 282 ARG C Protein S 2 FT #SUB 422 422 PRO B 280 280 ASP C Protein A 5 FT #SUB 422 422 PRO B 282 282 ARG C Protein B 2 FT #SUB 422 422 PRO B 283 283 GLY C Protein S 2 FT #SUB 422 422 PRO B 284 284 GLY C Protein A 4 FT #SUB 422 422 PRO B 285 285 TYR C Protein S 8 FT #SUB 423 423 MET B 280 280 ASP C Protein A 10 FT #SUB 423 423 MET B 281 281 GLU C Protein S 2 FT #SUB 423 423 MET B 282 282 ARG C Protein A 5 FT #SUB 424 424 ASN B 279 279 ILE C Protein S 1 FT #SUB 424 424 ASN B 280 280 ASP C Protein B 1 FT #SUB 424 424 ASN B 285 285 TYR C Protein A 7 FT #SUB 425 425 ARG B 285 285 TYR C Protein A 6 FT #SUB 425 425 ARG B 287 287 ASP C Protein S 8 FT #SUB 430 430 PRO B 441 441 THR C Protein S 1 FT #SUB 430 430 PRO B 445 445 GLN C Protein S 1 FT #SUB 431 431 ARG B 26 26 ARG C Protein S 4 FT #SUB 431 431 ARG B 28 28 ALA C Protein S 1 FT #SUB 434 434 PRO B 434 434 PRO C Protein S 3 FT #SUB 437 437 SER B 433 433 LEU C Protein S 2 FT #SUB 441 441 THR B 430 430 PRO C Protein S 1 FT #SUB 445 445 GLN B 430 430 PRO C Protein S 1 FT #SUB 463 463 GLY B 280 280 ASP C Protein B 1 FT #SUB 466 466 ALA B 474 474 TRP C Protein A 3 FT #SUB 466 466 ALA B 477 477 SER C Protein S 1 FT #SUB 466 466 ALA B 478 478 VAL C Protein S 1 FT #SUB 469 469 ASP B 473 473 ARG C Protein A 3 FT #SUB 469 469 ASP B 477 477 SER C Protein S 2 FT #SUB 470 470 ALA B 470 470 ALA C Protein A 4 FT #SUB 473 473 ARG B 469 469 ASP C Protein S 3 FT #SUB 473 473 ARG B 473 473 ARG C Protein S 2 FT #SUB 474 474 TRP B 466 466 ALA C Protein S 3 FT #SUB 477 477 SER B 466 466 ALA C Protein S 2 FT #SUB 477 477 SER B 469 469 ASP C Protein S 3 FT #SUB 478 478 VAL B 466 466 ALA C Protein S 1 FT #SUB 514 514 ALA B 278 278 ILE C Protein S 1 FT #SUB 514 514 ALA B 279 279 ILE C Protein S 2 FT #SUB 515 515 VAL B 280 280 ASP C Protein S 1 FT #SUB 515 515 VAL B 281 281 GLU C Protein S 8 FT #SUB 232 232 ASN B 233 233 ASP D Protein A 3 FT #SUB 233 233 ASP B 232 232 ASN D Protein S 3 FT #SUB 233 233 ASP B 233 233 ASP D Protein A 19 FT #HET 15 15 ASP B 27 3002 MG B B 2 FT #HET 18 18 ASN B 27 3002 MG B B 4 FT #HET 21 21 VAL B 27 3002 MG B B 2 FT #HET 32 32 PRO B 37 8404 DMS B A 3 FT #HET 33 33 PHE B 37 8404 DMS B B 4 FT #HET 34 34 ALA B 37 8404 DMS B A 2 FT #HET 36 36 TRP B 37 8404 DMS B S 2 FT #HET 36 36 TRP B 50 8501 DMS B S 1 FT #HET 45 45 ASP B 37 8404 DMS B S 3 FT #HET 45 45 ASP B 50 8501 DMS B A 5 FT #HET 46 46 ARG B 50 8501 DMS B B 1 FT #HET 47 47 PRO B 50 8501 DMS B A 5 FT #HET 48 48 SER B 50 8501 DMS B B 1 FT #HET 53 53 SER B 41 8408 DMS B B 1 FT #HET 54 54 LEU B 41 8408 DMS B B 4 FT #HET 55 55 ASN B 41 8408 DMS B B 3 FT #HET 57 57 GLU B 41 8408 DMS B B 2 FT #HET 57 57 GLU B 51 8502 DMS B B 3 FT #HET 58 58 TRP B 51 8502 DMS B B 3 FT #HET 93 93 HIS B 48 8421 DMS B B 3 FT #HET 94 94 GLY B 48 8421 DMS B B 2 FT #HET 95 95 TYR B 48 8421 DMS B S 4 FT #HET 100 100 TYR B 29 3101 NA B S 1 FT #HET 102 102 ASN B 33 2001 IPT B S 5 FT #HET 106 106 PRO B 43 8410 DMS B B 2 FT #HET 115 115 PRO B 43 8410 DMS B S 1 FT #HET 125 125 LEU B 41 8408 DMS B S 1 FT #HET 125 125 LEU B 51 8502 DMS B A 4 FT #HET 126 126 THR B 51 8502 DMS B A 5 FT #HET 127 127 PHE B 41 8408 DMS B S 1 FT #HET 163 163 GLN B 27 3002 MG B S 3 FT #HET 191 191 TRP B 43 8410 DMS B S 3 FT #HET 193 193 ASP B 27 3002 MG B S 3 FT #HET 201 201 ASP B 29 3101 NA B S 3 FT #HET 201 201 ASP B 33 2001 IPT B S 7 FT #HET 229 229 THR B 34 8401 DMS B S 2 FT #HET 271 271 THR B 38 8405 DMS B B 1 FT #HET 275 275 GLY B 45 8412 DMS B B 2 FT #HET 276 276 GLY B 45 8412 DMS B B 1 FT #HET 277 277 GLU B 45 8412 DMS B B 1 FT #HET 289 289 VAL B 45 8412 DMS B A 4 FT #HET 290 290 THR B 45 8412 DMS B A 7 FT #HET 291 291 LEU B 38 8405 DMS B A 4 FT #HET 292 292 ARG B 38 8405 DMS B B 7 FT #HET 292 292 ARG B 45 8412 DMS B S 2 FT #HET 314 314 GLU B 39 8406 DMS B S 4 FT #HET 316 316 HIS B 39 8406 DMS B S 4 FT #HET 320 320 GLY B 39 8406 DMS B B 5 FT #HET 327 327 ALA B 37 8404 DMS B B 1 FT #HET 330 330 VAL B 34 8401 DMS B A 3 FT #HET 331 331 GLY B 34 8401 DMS B B 6 FT #HET 333 333 ARG B 34 8401 DMS B S 2 FT #HET 334 334 GLU B 42 8409 DMS B S 1 FT #HET 335 335 VAL B 42 8409 DMS B B 2 FT #HET 336 336 ARG B 42 8409 DMS B S 1 FT #HET 369 369 GLU B 28 3007 MG B S 1 FT #HET 380 380 LYS B 36 8403 DMS B B 1 FT #HET 383 383 ASN B 36 8403 DMS B S 1 FT #HET 416 416 GLU B 26 3001 MG B S 3 FT #HET 418 418 HIS B 26 3001 MG B S 4 FT #HET 428 428 ASP B 47 8420 DMS B A 5 FT #HET 448 448 ARG B 34 8401 DMS B B 1 FT #HET 449 449 ASN B 34 8401 DMS B B 3 FT #HET 450 450 HIS B 34 8401 DMS B B 2 FT #HET 451 451 PRO B 34 8401 DMS B A 8 FT #HET 461 461 GLU B 26 3001 MG B S 3 FT #HET 461 461 GLU B 33 2001 IPT B S 7 FT #HET 480 480 PRO B 42 8409 DMS B B 1 FT #HET 481 481 SER B 42 8409 DMS B B 1 FT #HET 482 482 ARG B 34 8401 DMS B S 1 FT #HET 502 502 MET B 33 2001 IPT B S 1 FT #HET 503 503 TYR B 33 2001 IPT B S 3 FT #HET 505 505 ARG B 40 8407 DMS B S 1 FT #HET 508 508 GLU B 40 8407 DMS B S 8 FT #HET 537 537 GLU B 33 2001 IPT B S 4 FT #HET 540 540 HIS B 33 2001 IPT B S 5 FT #HET 556 556 PHE B 30 3102 NA B B 2 FT #HET 557 557 ARG B 35 8402 DMS B S 5 FT #HET 559 559 TYR B 30 3102 NA B B 2 FT #HET 560 560 PRO B 30 3102 NA B B 4 FT #HET 562 562 LEU B 30 3102 NA B B 3 FT #HET 568 568 TRP B 33 2001 IPT B S 1 FT #HET 576 576 ILE B 44 8411 DMS B S 1 FT #HET 584 584 PRO B 44 8411 DMS B B 2 FT #HET 585 585 TRP B 44 8411 DMS B B 2 FT #HET 586 586 SER B 44 8411 DMS B A 6 FT #HET 593 593 GLY B 46 8413 DMS B B 4 FT #HET 594 594 ASP B 46 8413 DMS B B 2 FT #HET 595 595 THR B 46 8413 DMS B A 8 FT #HET 601 601 PHE B 29 3101 NA B B 2 FT #HET 601 601 PHE B 33 2001 IPT B A 2 FT #HET 604 604 ASN B 29 3101 NA B S 3 FT #HET 604 604 ASN B 33 2001 IPT B S 3 FT #HET 622 622 HIS B 35 8402 DMS B A 8 FT #HET 623 623 GLN B 35 8402 DMS B A 5 FT #HET 625 625 GLN B 35 8402 DMS B S 3 FT #HET 626 626 PHE B 36 8403 DMS B S 6 FT #HET 628 628 GLN B 35 8402 DMS B S 4 FT #HET 642 642 TYR B 36 8403 DMS B S 5 FT #HET 647 647 SER B 32 3104 NA B B 2 FT #HET 647 647 SER B 49 8425 DMS B B 1 FT #HET 648 648 ASP B 32 3104 NA B B 2 FT #HET 648 648 ASP B 49 8425 DMS B B 5 FT #HET 649 649 ASN B 32 3104 NA B B 1 FT #HET 649 649 ASN B 49 8425 DMS B A 5 FT #HET 650 650 GLU B 32 3104 NA B A 5 FT #HET 650 650 GLU B 49 8425 DMS B B 5 FT #HET 670 670 LEU B 32 3104 NA B B 3 FT #HET 703 703 PRO B 49 8425 DMS B S 1 FT #HET 704 704 ASN B 49 8425 DMS B S 1 FT #HET 708 708 TRP B 36 8403 DMS B S 5 FT #HET 931 931 PHE B 31 3103 NA B S 3 FT #HET 932 932 PRO B 31 3103 NA B B 2 FT #HET 967 967 LEU B 31 3103 NA B B 2 FT #HET 968 968 MET B 31 3103 NA B B 1 FT #HET 970 970 THR B 31 3103 NA B B 2 FT #HET 973 973 ARG B 44 8411 DMS B S 4 FT #HET 999 999 TRP B 33 2001 IPT B S 12 FT #HET 1001 1001 PRO B 40 8407 DMS B A 4 FT #HET 1003 1003 VAL B 40 8407 DMS B B 1 FT DISORDER 1 12 CC SEQUENCE 1011 AA (ATOM); CC RRDWENPGVT QLNRLAAHPP FASWRNSEEA RTDRPSQQLR SLNGEWRFAW FPAPEAVPES CC WLECDLPEAD TVVVPSNWQM HGYDAPIYTN VTYPITVNPP FVPTENPTGC YSLTFNVDES CC WLQEGQTRII FDGVNSAFHL WCNGRWVGYG QDSRLPSEFD LSAFLRAGEN RLAVMVLRWS CC DGSYLEDQDM WRMSGIFRDV SLLHKPTTQI SDFHVATRFN DDFSRAVLEA EVQMCGELRD CC YLRVTVSLWQ GETQVASGTA PFGGEIIDER GGYADRVTLR LNVENPKLWS AEIPNLYRAV CC VELHTADGTL IEAEACDVGF REVRIENGLL LLNGKPLLIR GVNRHEHHPL HGQVMDEQTM CC VQDILLMKQN NFNAVRCSHY PNHPLWYTLC DRYGLYVVDE ANIETHGMVP MNRLTDDPRW CC LPAMSERVTR MVQRDRNHPS VIIWSLGNES GHGANHDALY RWIKSVDPSR PVQYEGGGAD CC TTATDIICPM YARVDEDQPF PAVPKWSIKK WLSLPGETRP LILCEYAHAM GNSLGGFAKY CC WQAFRQYPRL QGGFVWDWVD QSLIKYDENG NPWSAYGGDF GDTPNDRQFC MNGLVFADRT CC PHPALTEAKH QQQFFQFRLS GQTIEVTSEY LFRHSDNELL HWMVALDGKP LASGEVPLDV CC APQGKQLIEL PELPQPESAG QLWLTVRVVQ PNATAWSEAG HISAWQQWRL AENLSVTLPA CC ASHAIPHLTT SEMDFCIELG NKRWQFNRQS GFLSQMWIGD KKQLLTPLRD QFTRAPLDND CC IGVSEATRID PNAWVERWKA AGHYQAEAAL LQCTADTLAD AVLITTAHAW QHQGKTLFIS CC RKTYRIDGSG QMAITVDVEV ASDTPHPARI GLNCQLAQVA ERVNWLGLGP QENYPDRLTA CC ACFDRWDLPL SDMYTPYVFP SENGLRCGTR ELNYGPHQWR GDFQFNISRY SQQQLMETSH CC RHLLHAEEGT WLNIDGFHMG IGGDDSWSPS VSAEFQLSAG RYHYQLVWCQ K CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES GSHMLEDPVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQ CC ATOM ------------RRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQ CC ************************************** CC SEQRES LRSLNGEWRFAWFPAPEAVPESWLECDLPEADTVVVPSNWQMHGYDAPIY CC ATOM LRSLNGEWRFAWFPAPEAVPESWLECDLPEADTVVVPSNWQMHGYDAPIY CC ************************************************** CC SEQRES TNVTYPITVNPPFVPTENPTGCYSLTFNVDESWLQEGQTRIIFDGVNSAF CC ATOM TNVTYPITVNPPFVPTENPTGCYSLTFNVDESWLQEGQTRIIFDGVNSAF CC ************************************************** CC SEQRES HLWCNGRWVGYGQDSRLPSEFDLSAFLRAGENRLAVMVLRWSDGSYLEDQ CC ATOM HLWCNGRWVGYGQDSRLPSEFDLSAFLRAGENRLAVMVLRWSDGSYLEDQ CC ************************************************** CC SEQRES DMWRMSGIFRDVSLLHKPTTQISDFHVATRFNDDFSRAVLEAEVQMCGEL CC ATOM DMWRMSGIFRDVSLLHKPTTQISDFHVATRFNDDFSRAVLEAEVQMCGEL CC ************************************************** CC SEQRES RDYLRVTVSLWQGETQVASGTAPFGGEIIDERGGYADRVTLRLNVENPKL CC ATOM RDYLRVTVSLWQGETQVASGTAPFGGEIIDERGGYADRVTLRLNVENPKL CC ************************************************** CC SEQRES WSAEIPNLYRAVVELHTADGTLIEAEACDVGFREVRIENGLLLLNGKPLL CC ATOM WSAEIPNLYRAVVELHTADGTLIEAEACDVGFREVRIENGLLLLNGKPLL CC ************************************************** CC SEQRES IRGVNRHEHHPLHGQVMDEQTMVQDILLMKQNNFNAVRCSHYPNHPLWYT CC ATOM IRGVNRHEHHPLHGQVMDEQTMVQDILLMKQNNFNAVRCSHYPNHPLWYT CC ************************************************** CC SEQRES LCDRYGLYVVDEANIETHGMVPMNRLTDDPRWLPAMSERVTRMVQRDRNH CC ATOM LCDRYGLYVVDEANIETHGMVPMNRLTDDPRWLPAMSERVTRMVQRDRNH CC ************************************************** CC SEQRES PSVIIWSLGNESGHGANHDALYRWIKSVDPSRPVQYEGGGADTTATDIIC CC ATOM PSVIIWSLGNESGHGANHDALYRWIKSVDPSRPVQYEGGGADTTATDIIC CC ************************************************** CC SEQRES PMYARVDEDQPFPAVPKWSIKKWLSLPGETRPLILCEYAHAMGNSLGGFA CC ATOM PMYARVDEDQPFPAVPKWSIKKWLSLPGETRPLILCEYAHAMGNSLGGFA CC ************************************************** CC SEQRES KYWQAFRQYPRLQGGFVWDWVDQSLIKYDENGNPWSAYGGDFGDTPNDRQ CC ATOM KYWQAFRQYPRLQGGFVWDWVDQSLIKYDENGNPWSAYGGDFGDTPNDRQ CC ************************************************** CC SEQRES FCMNGLVFADRTPHPALTEAKHQQQFFQFRLSGQTIEVTSEYLFRHSDNE CC ATOM FCMNGLVFADRTPHPALTEAKHQQQFFQFRLSGQTIEVTSEYLFRHSDNE CC ************************************************** CC SEQRES LLHWMVALDGKPLASGEVPLDVAPQGKQLIELPELPQPESAGQLWLTVRV CC ATOM LLHWMVALDGKPLASGEVPLDVAPQGKQLIELPELPQPESAGQLWLTVRV CC ************************************************** CC SEQRES VQPNATAWSEAGHISAWQQWRLAENLSVTLPAASHAIPHLTTSEMDFCIE CC ATOM VQPNATAWSEAGHISAWQQWRLAENLSVTLPAASHAIPHLTTSEMDFCIE CC ************************************************** CC SEQRES LGNKRWQFNRQSGFLSQMWIGDKKQLLTPLRDQFTRAPLDNDIGVSEATR CC ATOM LGNKRWQFNRQSGFLSQMWIGDKKQLLTPLRDQFTRAPLDNDIGVSEATR CC ************************************************** CC SEQRES IDPNAWVERWKAAGHYQAEAALLQCTADTLADAVLITTAHAWQHQGKTLF CC ATOM IDPNAWVERWKAAGHYQAEAALLQCTADTLADAVLITTAHAWQHQGKTLF CC ************************************************** CC SEQRES ISRKTYRIDGSGQMAITVDVEVASDTPHPARIGLNCQLAQVAERVNWLGL CC ATOM ISRKTYRIDGSGQMAITVDVEVASDTPHPARIGLNCQLAQVAERVNWLGL CC ************************************************** CC SEQRES GPQENYPDRLTAACFDRWDLPLSDMYTPYVFPSENGLRCGTRELNYGPHQ CC ATOM GPQENYPDRLTAACFDRWDLPLSDMYTPYVFPSENGLRCGTRELNYGPHQ CC ************************************************** CC SEQRES WRGDFQFNISRYSQQQLMETSHRHLLHAEEGTWLNIDGFHMGIGGDDSWS CC ATOM WRGDFQFNISRYSQQQLMETSHRHLLHAEEGTWLNIDGFHMGIGGDDSWS CC ************************************************** CC SEQRES PSVSAEFQLSAGRYHYQLVWCQK CC ATOM PSVSAEFQLSAGRYHYQLVWCQK CC *********************** SQ SEQUENCE 1023 AA; MW; CN; GSHMLEDPVV LQRRDWENPG VTQLNRLAAH PPFASWRNSE EARTDRPSQQ LRSLNGEWRF AWFPAPEAVP ESWLECDLPE ADTVVVPSNW QMHGYDAPIY TNVTYPITVN PPFVPTENPT GCYSLTFNVD ESWLQEGQTR IIFDGVNSAF HLWCNGRWVG YGQDSRLPSE FDLSAFLRAG ENRLAVMVLR WSDGSYLEDQ DMWRMSGIFR DVSLLHKPTT QISDFHVATR FNDDFSRAVL EAEVQMCGEL RDYLRVTVSL WQGETQVASG TAPFGGEIID ERGGYADRVT LRLNVENPKL WSAEIPNLYR AVVELHTADG TLIEAEACDV GFREVRIENG LLLLNGKPLL IRGVNRHEHH PLHGQVMDEQ TMVQDILLMK QNNFNAVRCS HYPNHPLWYT LCDRYGLYVV DEANIETHGM VPMNRLTDDP RWLPAMSERV TRMVQRDRNH PSVIIWSLGN ESGHGANHDA LYRWIKSVDP SRPVQYEGGG ADTTATDIIC PMYARVDEDQ PFPAVPKWSI KKWLSLPGET RPLILCEYAH AMGNSLGGFA KYWQAFRQYP RLQGGFVWDW VDQSLIKYDE NGNPWSAYGG DFGDTPNDRQ FCMNGLVFAD RTPHPALTEA KHQQQFFQFR LSGQTIEVTS EYLFRHSDNE LLHWMVALDG KPLASGEVPL DVAPQGKQLI ELPELPQPES AGQLWLTVRV VQPNATAWSE AGHISAWQQW RLAENLSVTL PAASHAIPHL TTSEMDFCIE LGNKRWQFNR QSGFLSQMWI GDKKQLLTPL RDQFTRAPLD NDIGVSEATR IDPNAWVERW KAAGHYQAEA ALLQCTADTL ADAVLITTAH AWQHQGKTLF ISRKTYRIDG SGQMAITVDV EVASDTPHPA RIGLNCQLAQ VAERVNWLGL GPQENYPDRL TAACFDRWDL PLSDMYTPYV FPSENGLRCG TRELNYGPHQ WRGDFQFNIS RYSQQQLMET SHRHLLHAEE GTWLNIDGFH MGIGGDDSWS PSVSAEFQLS AGRYHYQLVW CQK // ID 1JYXC STANDARD; PRT; 1023 AA. DT CONVERTED FROM PDB (SEQRES) 1JYX DE Beta-Galactosidase OS Escherichia coli CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.750 CC R-Factor 0.168 FT #SUB 232 232 ASN C 233 233 ASP A Protein A 3 FT #SUB 233 233 ASP C 232 232 ASN A Protein S 3 FT #SUB 233 233 ASP C 233 233 ASP A Protein A 19 FT #SUB 13 13 ARG C 13 13 ARG B Protein S 18 FT #SUB 13 13 ARG C 15 15 ASP B Protein S 9 FT #SUB 13 13 ARG C 24 24 LEU B Protein S 3 FT #SUB 15 15 ASP C 13 13 ARG B Protein S 8 FT #SUB 18 18 ASN C 24 24 LEU B Protein S 1 FT #SUB 20 20 GLY C 20 20 GLY B Protein B 1 FT #SUB 21 21 VAL C 21 21 VAL B Protein S 1 FT #SUB 24 24 LEU C 13 13 ARG B Protein S 1 FT #SUB 26 26 ARG C 431 431 ARG B Protein B 4 FT #SUB 28 28 ALA C 431 431 ARG B Protein B 1 FT #SUB 103 103 VAL C 282 282 ARG B Protein S 2 FT #SUB 278 278 ILE C 514 514 ALA B Protein B 1 FT #SUB 279 279 ILE C 424 424 ASN B Protein S 1 FT #SUB 279 279 ILE C 514 514 ALA B Protein B 2 FT #SUB 280 280 ASP C 422 422 PRO B Protein S 5 FT #SUB 280 280 ASP C 423 423 MET B Protein S 10 FT #SUB 280 280 ASP C 424 424 ASN B Protein S 1 FT #SUB 280 280 ASP C 463 463 GLY B Protein S 1 FT #SUB 280 280 ASP C 515 515 VAL B Protein B 1 FT #SUB 281 281 GLU C 423 423 MET B Protein S 2 FT #SUB 281 281 GLU C 515 515 VAL B Protein A 8 FT #SUB 282 282 ARG C 103 103 VAL B Protein S 2 FT #SUB 282 282 ARG C 418 418 HIS B Protein S 4 FT #SUB 282 282 ARG C 419 419 GLY B Protein S 6 FT #SUB 282 282 ARG C 420 420 MET B Protein S 4 FT #SUB 282 282 ARG C 421 421 VAL B Protein A 2 FT #SUB 282 282 ARG C 422 422 PRO B Protein S 2 FT #SUB 282 282 ARG C 423 423 MET B Protein S 5 FT #SUB 283 283 GLY C 422 422 PRO B Protein B 2 FT #SUB 284 284 GLY C 422 422 PRO B Protein B 4 FT #SUB 285 285 TYR C 422 422 PRO B Protein S 8 FT #SUB 285 285 TYR C 424 424 ASN B Protein S 7 FT #SUB 285 285 TYR C 425 425 ARG B Protein S 6 FT #SUB 287 287 ASP C 425 425 ARG B Protein S 8 FT #SUB 418 418 HIS C 282 282 ARG B Protein B 3 FT #SUB 419 419 GLY C 282 282 ARG B Protein B 6 FT #SUB 420 420 MET C 282 282 ARG B Protein B 5 FT #SUB 421 421 VAL C 282 282 ARG B Protein S 4 FT #SUB 422 422 PRO C 280 280 ASP B Protein A 6 FT #SUB 422 422 PRO C 282 282 ARG B Protein B 2 FT #SUB 422 422 PRO C 283 283 GLY B Protein S 1 FT #SUB 422 422 PRO C 284 284 GLY B Protein A 4 FT #SUB 422 422 PRO C 285 285 TYR B Protein S 8 FT #SUB 423 423 MET C 280 280 ASP B Protein A 10 FT #SUB 423 423 MET C 281 281 GLU B Protein S 2 FT #SUB 423 423 MET C 282 282 ARG B Protein A 5 FT #SUB 424 424 ASN C 279 279 ILE B Protein S 2 FT #SUB 424 424 ASN C 280 280 ASP B Protein B 1 FT #SUB 424 424 ASN C 285 285 TYR B Protein A 7 FT #SUB 425 425 ARG C 285 285 TYR B Protein A 8 FT #SUB 425 425 ARG C 287 287 ASP B Protein S 8 FT #SUB 430 430 PRO C 441 441 THR B Protein S 1 FT #SUB 430 430 PRO C 445 445 GLN B Protein S 1 FT #SUB 431 431 ARG C 26 26 ARG B Protein S 3 FT #SUB 431 431 ARG C 28 28 ALA B Protein S 1 FT #SUB 433 433 LEU C 437 437 SER B Protein S 2 FT #SUB 434 434 PRO C 434 434 PRO B Protein S 3 FT #SUB 441 441 THR C 430 430 PRO B Protein S 1 FT #SUB 445 445 GLN C 430 430 PRO B Protein S 1 FT #SUB 463 463 GLY C 280 280 ASP B Protein B 1 FT #SUB 466 466 ALA C 474 474 TRP B Protein A 3 FT #SUB 466 466 ALA C 477 477 SER B Protein A 2 FT #SUB 466 466 ALA C 478 478 VAL B Protein S 1 FT #SUB 469 469 ASP C 473 473 ARG B Protein B 3 FT #SUB 469 469 ASP C 477 477 SER B Protein S 3 FT #SUB 470 470 ALA C 470 470 ALA B Protein A 4 FT #SUB 473 473 ARG C 469 469 ASP B Protein S 3 FT #SUB 473 473 ARG C 473 473 ARG B Protein S 2 FT #SUB 474 474 TRP C 466 466 ALA B Protein S 3 FT #SUB 477 477 SER C 466 466 ALA B Protein S 1 FT #SUB 477 477 SER C 469 469 ASP B Protein S 2 FT #SUB 478 478 VAL C 466 466 ALA B Protein S 1 FT #SUB 514 514 ALA C 278 278 ILE B Protein S 1 FT #SUB 514 514 ALA C 279 279 ILE B Protein S 2 FT #SUB 515 515 VAL C 281 281 GLU B Protein S 8 FT #SUB 339 339 ASN C 527 527 PRO D Protein A 8 FT #SUB 339 339 ASN C 528 528 GLY D Protein A 7 FT #SUB 341 341 LEU C 527 527 PRO D Protein S 1 FT #SUB 507 507 ASP C 558 558 GLN D Protein B 3 FT #SUB 509 509 ASP C 558 558 GLN D Protein S 5 FT #SUB 519 519 SER C 558 558 GLN D Protein S 1 FT #SUB 521 521 LYS C 559 559 TYR D Protein A 5 FT #SUB 522 522 LYS C 558 558 GLN D Protein S 6 FT #SUB 522 522 LYS C 559 559 TYR D Protein A 7 FT #SUB 522 522 LYS C 560 560 PRO D Protein S 1 FT #SUB 524 524 LEU C 525 525 SER D Protein S 3 FT #SUB 525 525 SER C 524 524 LEU D Protein S 2 FT #SUB 525 525 SER C 559 559 TYR D Protein S 2 FT #SUB 525 525 SER C 561 561 ARG D Protein A 6 FT #SUB 527 527 PRO C 339 339 ASN D Protein A 7 FT #SUB 527 527 PRO C 341 341 LEU D Protein S 2 FT #SUB 527 527 PRO C 560 560 PRO D Protein S 2 FT #SUB 528 528 GLY C 339 339 ASN D Protein B 7 FT #SUB 558 558 GLN C 507 507 ASP D Protein S 1 FT #SUB 558 558 GLN C 509 509 ASP D Protein S 4 FT #SUB 558 558 GLN C 519 519 SER D Protein S 1 FT #SUB 558 558 GLN C 522 522 LYS D Protein A 7 FT #SUB 559 559 TYR C 521 521 LYS D Protein S 6 FT #SUB 559 559 TYR C 522 522 LYS D Protein S 9 FT #SUB 559 559 TYR C 525 525 SER D Protein S 2 FT #SUB 561 561 ARG C 525 525 SER D Protein S 6 FT #SUB 693 693 GLN C 874 874 SER D Protein S 4 FT #SUB 721 721 ARG C 874 874 SER D Protein S 2 FT #SUB 723 723 ALA C 875 875 ASP D Protein A 3 FT #SUB 724 724 GLU C 847 847 LYS D Protein B 1 FT #SUB 724 724 GLU C 873 873 ALA D Protein S 2 FT #SUB 724 724 GLU C 874 874 SER D Protein S 6 FT #SUB 724 724 GLU C 875 875 ASP D Protein B 5 FT #SUB 726 726 LEU C 849 849 LEU D Protein S 1 FT #SUB 726 726 LEU C 851 851 ILE D Protein S 1 FT #SUB 726 726 LEU C 871 871 GLU D Protein S 1 FT #SUB 726 726 LEU C 873 873 ALA D Protein S 1 FT #SUB 727 727 SER C 851 851 ILE D Protein B 1 FT #SUB 728 728 VAL C 823 823 LEU D Protein B 1 FT #SUB 728 728 VAL C 848 848 THR D Protein S 2 FT #SUB 728 728 VAL C 851 851 ILE D Protein S 1 FT #SUB 730 730 LEU C 823 823 LEU D Protein S 1 FT #SUB 823 823 LEU C 730 730 LEU D Protein B 1 FT #SUB 828 828 ASP C 830 830 LEU D Protein S 8 FT #SUB 828 828 ASP C 831 831 ALA D Protein S 2 FT #SUB 830 830 LEU C 828 828 ASP D Protein A 3 FT #SUB 830 830 LEU C 830 830 LEU D Protein S 1 FT #SUB 831 831 ALA C 828 828 ASP D Protein A 2 FT #SUB 847 847 LYS C 724 724 GLU D Protein S 3 FT #SUB 848 848 THR C 726 726 LEU D Protein B 1 FT #SUB 848 848 THR C 728 728 VAL D Protein S 2 FT #SUB 851 851 ILE C 726 726 LEU D Protein S 1 FT #SUB 851 851 ILE C 727 727 SER D Protein S 1 FT #SUB 851 851 ILE C 728 728 VAL D Protein S 1 FT #SUB 869 869 ASP C 1015 1015 HIS D Protein S 1 FT #SUB 871 871 GLU C 726 726 LEU D Protein S 1 FT #SUB 873 873 ALA C 724 724 GLU D Protein B 2 FT #SUB 873 873 ALA C 726 726 LEU D Protein B 2 FT #SUB 874 874 SER C 693 693 GLN D Protein S 4 FT #SUB 874 874 SER C 721 721 ARG D Protein S 2 FT #SUB 874 874 SER C 724 724 GLU D Protein A 6 FT #SUB 875 875 ASP C 723 723 ALA D Protein S 4 FT #SUB 875 875 ASP C 724 724 GLU D Protein S 5 FT #SUB 942 942 ARG C 1013 1013 ARG D Protein S 9 FT #SUB 954 954 ASP C 1013 1013 ARG D Protein S 4 FT #SUB 1013 1013 ARG C 942 942 ARG D Protein S 8 FT #SUB 1013 1013 ARG C 954 954 ASP D Protein S 4 FT #SUB 1015 1015 HIS C 869 869 ASP D Protein S 2 FT #SUB 1015 1015 HIS C 1015 1015 HIS D Protein S 11 FT #SUB 1017 1017 GLN C 869 869 ASP D Protein S 1 FT #HET 15 15 ASP C 53 3002 MG C B 2 FT #HET 16 16 TRP C 53 3002 MG C B 1 FT #HET 18 18 ASN C 53 3002 MG C B 3 FT #HET 21 21 VAL C 53 3002 MG C B 2 FT #HET 32 32 PRO C 62 8404 DMS C A 3 FT #HET 33 33 PHE C 62 8404 DMS C B 4 FT #HET 34 34 ALA C 62 8404 DMS C A 2 FT #HET 34 34 ALA C 77 8501 DMS C S 1 FT #HET 36 36 TRP C 62 8404 DMS C S 2 FT #HET 36 36 TRP C 77 8501 DMS C S 1 FT #HET 45 45 ASP C 62 8404 DMS C S 2 FT #HET 45 45 ASP C 77 8501 DMS C A 5 FT #HET 46 46 ARG C 77 8501 DMS C B 3 FT #HET 47 47 PRO C 77 8501 DMS C A 5 FT #HET 48 48 SER C 77 8501 DMS C B 1 FT #HET 53 53 SER C 66 8408 DMS C B 1 FT #HET 54 54 LEU C 66 8408 DMS C B 6 FT #HET 55 55 ASN C 66 8408 DMS C B 2 FT #HET 57 57 GLU C 66 8408 DMS C B 2 FT #HET 82 82 ASP C 74 8421 DMS C S 1 FT #HET 83 83 THR C 71 8414 DMS C B 1 FT #HET 84 84 VAL C 71 8414 DMS C A 3 FT #HET 85 85 VAL C 71 8414 DMS C A 4 FT #HET 93 93 HIS C 71 8414 DMS C S 7 FT #HET 93 93 HIS C 74 8421 DMS C B 1 FT #HET 94 94 GLY C 74 8421 DMS C B 2 FT #HET 95 95 TYR C 74 8421 DMS C S 9 FT #HET 99 99 ILE C 68 8410 DMS C S 1 FT #HET 100 100 TYR C 54 3101 NA C S 1 FT #HET 102 102 ASN C 58 2001 IPT C S 4 FT #HET 102 102 ASN C 79 8506 DMS C S 3 FT #HET 106 106 PRO C 68 8410 DMS C B 3 FT #HET 115 115 PRO C 68 8410 DMS C S 1 FT #HET 125 125 LEU C 66 8408 DMS C S 1 FT #HET 127 127 PHE C 66 8408 DMS C S 1 FT #HET 161 161 TYR C 53 3002 MG C S 1 FT #HET 163 163 GLN C 53 3002 MG C S 3 FT #HET 191 191 TRP C 68 8410 DMS C S 1 FT #HET 193 193 ASP C 53 3002 MG C S 3 FT #HET 201 201 ASP C 54 3101 NA C S 3 FT #HET 201 201 ASP C 58 2001 IPT C S 7 FT #HET 229 229 THR C 59 8401 DMS C S 2 FT #HET 271 271 THR C 63 8405 DMS C B 1 FT #HET 275 275 GLY C 70 8412 DMS C B 2 FT #HET 276 276 GLY C 70 8412 DMS C B 1 FT #HET 289 289 VAL C 70 8412 DMS C A 3 FT #HET 290 290 THR C 70 8412 DMS C A 7 FT #HET 291 291 LEU C 63 8405 DMS C A 6 FT #HET 292 292 ARG C 63 8405 DMS C B 6 FT #HET 292 292 ARG C 70 8412 DMS C S 1 FT #HET 310 310 ARG C 62 8404 DMS C S 1 FT #HET 314 314 GLU C 64 8406 DMS C S 3 FT #HET 316 316 HIS C 64 8406 DMS C S 5 FT #HET 320 320 GLY C 64 8406 DMS C B 6 FT #HET 321 321 THR C 64 8406 DMS C B 1 FT #HET 327 327 ALA C 62 8404 DMS C B 1 FT #HET 330 330 VAL C 59 8401 DMS C A 3 FT #HET 331 331 GLY C 59 8401 DMS C B 6 FT #HET 333 333 ARG C 59 8401 DMS C S 1 FT #HET 334 334 GLU C 67 8409 DMS C S 1 FT #HET 335 335 VAL C 67 8409 DMS C B 2 FT #HET 336 336 ARG C 67 8409 DMS C S 1 FT #HET 380 380 LYS C 61 8403 DMS C A 4 FT #HET 383 383 ASN C 61 8403 DMS C S 1 FT #HET 416 416 GLU C 52 3001 MG C S 3 FT #HET 418 418 HIS C 52 3001 MG C S 4 FT #HET 428 428 ASP C 73 8420 DMS C A 6 FT #HET 448 448 ARG C 59 8401 DMS C B 1 FT #HET 449 449 ASN C 59 8401 DMS C B 4 FT #HET 450 450 HIS C 59 8401 DMS C B 1 FT #HET 451 451 PRO C 59 8401 DMS C A 9 FT #HET 461 461 GLU C 52 3001 MG C S 4 FT #HET 461 461 GLU C 58 2001 IPT C S 7 FT #HET 474 474 TRP C 47 8420 DMS B S 1 FT #HET 478 478 VAL C 47 8420 DMS B S 1 FT #HET 480 480 PRO C 67 8409 DMS C B 1 FT #HET 481 481 SER C 67 8409 DMS C A 2 FT #HET 482 482 ARG C 59 8401 DMS C S 1 FT #HET 502 502 MET C 58 2001 IPT C S 1 FT #HET 503 503 TYR C 58 2001 IPT C S 3 FT #HET 505 505 ARG C 65 8407 DMS C S 1 FT #HET 508 508 GLU C 65 8407 DMS C S 4 FT #HET 537 537 GLU C 58 2001 IPT C S 4 FT #HET 540 540 HIS C 58 2001 IPT C S 4 FT #HET 556 556 PHE C 55 3102 NA C B 2 FT #HET 557 557 ARG C 60 8402 DMS C S 4 FT #HET 559 559 TYR C 55 3102 NA C B 2 FT #HET 560 560 PRO C 55 3102 NA C B 4 FT #HET 562 562 LEU C 55 3102 NA C B 3 FT #HET 568 568 TRP C 58 2001 IPT C S 1 FT #HET 576 576 ILE C 69 8411 DMS C S 1 FT #HET 584 584 PRO C 69 8411 DMS C A 5 FT #HET 585 585 TRP C 69 8411 DMS C B 3 FT #HET 586 586 SER C 69 8411 DMS C A 6 FT #HET 598 598 ASP C 79 8506 DMS C S 7 FT #HET 601 601 PHE C 54 3101 NA C B 2 FT #HET 601 601 PHE C 58 2001 IPT C A 2 FT #HET 601 601 PHE C 79 8506 DMS C S 11 FT #HET 604 604 ASN C 54 3101 NA C S 3 FT #HET 604 604 ASN C 58 2001 IPT C S 3 FT #HET 621 621 LYS C 72 8415 DMS C S 3 FT #HET 622 622 HIS C 60 8402 DMS C A 8 FT #HET 623 623 GLN C 60 8402 DMS C A 3 FT #HET 625 625 GLN C 60 8402 DMS C S 3 FT #HET 626 626 PHE C 61 8403 DMS C S 7 FT #HET 628 628 GLN C 60 8402 DMS C S 5 FT #HET 629 629 PHE C 78 8503 DMS C B 3 FT #HET 630 630 ARG C 78 8503 DMS C B 1 FT #HET 642 642 TYR C 61 8403 DMS C S 5 FT #HET 647 647 SER C 57 3104 NA C B 2 FT #HET 647 647 SER C 76 8425 DMS C B 1 FT #HET 648 648 ASP C 57 3104 NA C B 2 FT #HET 648 648 ASP C 76 8425 DMS C B 5 FT #HET 649 649 ASN C 57 3104 NA C B 1 FT #HET 649 649 ASN C 76 8425 DMS C B 3 FT #HET 650 650 GLU C 57 3104 NA C B 4 FT #HET 650 650 GLU C 76 8425 DMS C B 6 FT #HET 670 670 LEU C 57 3104 NA C B 2 FT #HET 704 704 ASN C 76 8425 DMS C S 1 FT #HET 708 708 TRP C 61 8403 DMS C S 6 FT #HET 714 714 ILE C 72 8415 DMS C S 1 FT #HET 717 717 TRP C 72 8415 DMS C S 8 FT #HET 718 718 GLN C 78 8503 DMS C S 3 FT #HET 720 720 TRP C 78 8503 DMS C S 3 FT #HET 795 795 VAL C 79 8506 DMS C S 1 FT #HET 796 796 SER C 75 8422 DMS C B 2 FT #HET 797 797 GLU C 75 8422 DMS C B 3 FT #HET 798 798 ALA C 75 8422 DMS C B 1 FT #HET 801 801 ILE C 75 8422 DMS C S 1 FT #HET 808 808 GLU C 75 8422 DMS C S 3 FT #HET 811 811 LYS C 75 8422 DMS C S 3 FT #HET 931 931 PHE C 56 3103 NA C S 3 FT #HET 932 932 PRO C 56 3103 NA C B 2 FT #HET 967 967 LEU C 56 3103 NA C B 2 FT #HET 968 968 MET C 56 3103 NA C B 2 FT #HET 970 970 THR C 56 3103 NA C B 2 FT #HET 973 973 ARG C 69 8411 DMS C S 5 FT #HET 999 999 TRP C 58 2001 IPT C S 12 FT #HET 1001 1001 PRO C 65 8407 DMS C A 4 FT #HET 1003 1003 VAL C 65 8407 DMS C B 1 FT DISORDER 1 12 CC SEQUENCE 1011 AA (ATOM); CC RRDWENPGVT QLNRLAAHPP FASWRNSEEA RTDRPSQQLR SLNGEWRFAW FPAPEAVPES CC WLECDLPEAD TVVVPSNWQM HGYDAPIYTN VTYPITVNPP FVPTENPTGC YSLTFNVDES CC WLQEGQTRII FDGVNSAFHL WCNGRWVGYG QDSRLPSEFD LSAFLRAGEN RLAVMVLRWS CC DGSYLEDQDM WRMSGIFRDV SLLHKPTTQI SDFHVATRFN DDFSRAVLEA EVQMCGELRD CC YLRVTVSLWQ GETQVASGTA PFGGEIIDER GGYADRVTLR LNVENPKLWS AEIPNLYRAV CC VELHTADGTL IEAEACDVGF REVRIENGLL LLNGKPLLIR GVNRHEHHPL HGQVMDEQTM CC VQDILLMKQN NFNAVRCSHY PNHPLWYTLC DRYGLYVVDE ANIETHGMVP MNRLTDDPRW CC LPAMSERVTR MVQRDRNHPS VIIWSLGNES GHGANHDALY RWIKSVDPSR PVQYEGGGAD CC TTATDIICPM YARVDEDQPF PAVPKWSIKK WLSLPGETRP LILCEYAHAM GNSLGGFAKY CC WQAFRQYPRL QGGFVWDWVD QSLIKYDENG NPWSAYGGDF GDTPNDRQFC MNGLVFADRT CC PHPALTEAKH QQQFFQFRLS GQTIEVTSEY LFRHSDNELL HWMVALDGKP LASGEVPLDV CC APQGKQLIEL PELPQPESAG QLWLTVRVVQ PNATAWSEAG HISAWQQWRL AENLSVTLPA CC ASHAIPHLTT SEMDFCIELG NKRWQFNRQS GFLSQMWIGD KKQLLTPLRD QFTRAPLDND CC IGVSEATRID PNAWVERWKA AGHYQAEAAL LQCTADTLAD AVLITTAHAW QHQGKTLFIS CC RKTYRIDGSG QMAITVDVEV ASDTPHPARI GLNCQLAQVA ERVNWLGLGP QENYPDRLTA CC ACFDRWDLPL SDMYTPYVFP SENGLRCGTR ELNYGPHQWR GDFQFNISRY SQQQLMETSH CC RHLLHAEEGT WLNIDGFHMG IGGDDSWSPS VSAEFQLSAG RYHYQLVWCQ K CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES GSHMLEDPVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQ CC ATOM ------------RRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQ CC ************************************** CC SEQRES LRSLNGEWRFAWFPAPEAVPESWLECDLPEADTVVVPSNWQMHGYDAPIY CC ATOM LRSLNGEWRFAWFPAPEAVPESWLECDLPEADTVVVPSNWQMHGYDAPIY CC ************************************************** CC SEQRES TNVTYPITVNPPFVPTENPTGCYSLTFNVDESWLQEGQTRIIFDGVNSAF CC ATOM TNVTYPITVNPPFVPTENPTGCYSLTFNVDESWLQEGQTRIIFDGVNSAF CC ************************************************** CC SEQRES HLWCNGRWVGYGQDSRLPSEFDLSAFLRAGENRLAVMVLRWSDGSYLEDQ CC ATOM HLWCNGRWVGYGQDSRLPSEFDLSAFLRAGENRLAVMVLRWSDGSYLEDQ CC ************************************************** CC SEQRES DMWRMSGIFRDVSLLHKPTTQISDFHVATRFNDDFSRAVLEAEVQMCGEL CC ATOM DMWRMSGIFRDVSLLHKPTTQISDFHVATRFNDDFSRAVLEAEVQMCGEL CC ************************************************** CC SEQRES RDYLRVTVSLWQGETQVASGTAPFGGEIIDERGGYADRVTLRLNVENPKL CC ATOM RDYLRVTVSLWQGETQVASGTAPFGGEIIDERGGYADRVTLRLNVENPKL CC ************************************************** CC SEQRES WSAEIPNLYRAVVELHTADGTLIEAEACDVGFREVRIENGLLLLNGKPLL CC ATOM WSAEIPNLYRAVVELHTADGTLIEAEACDVGFREVRIENGLLLLNGKPLL CC ************************************************** CC SEQRES IRGVNRHEHHPLHGQVMDEQTMVQDILLMKQNNFNAVRCSHYPNHPLWYT CC ATOM IRGVNRHEHHPLHGQVMDEQTMVQDILLMKQNNFNAVRCSHYPNHPLWYT CC ************************************************** CC SEQRES LCDRYGLYVVDEANIETHGMVPMNRLTDDPRWLPAMSERVTRMVQRDRNH CC ATOM LCDRYGLYVVDEANIETHGMVPMNRLTDDPRWLPAMSERVTRMVQRDRNH CC ************************************************** CC SEQRES PSVIIWSLGNESGHGANHDALYRWIKSVDPSRPVQYEGGGADTTATDIIC CC ATOM PSVIIWSLGNESGHGANHDALYRWIKSVDPSRPVQYEGGGADTTATDIIC CC ************************************************** CC SEQRES PMYARVDEDQPFPAVPKWSIKKWLSLPGETRPLILCEYAHAMGNSLGGFA CC ATOM PMYARVDEDQPFPAVPKWSIKKWLSLPGETRPLILCEYAHAMGNSLGGFA CC ************************************************** CC SEQRES KYWQAFRQYPRLQGGFVWDWVDQSLIKYDENGNPWSAYGGDFGDTPNDRQ CC ATOM KYWQAFRQYPRLQGGFVWDWVDQSLIKYDENGNPWSAYGGDFGDTPNDRQ CC ************************************************** CC SEQRES FCMNGLVFADRTPHPALTEAKHQQQFFQFRLSGQTIEVTSEYLFRHSDNE CC ATOM FCMNGLVFADRTPHPALTEAKHQQQFFQFRLSGQTIEVTSEYLFRHSDNE CC ************************************************** CC SEQRES LLHWMVALDGKPLASGEVPLDVAPQGKQLIELPELPQPESAGQLWLTVRV CC ATOM LLHWMVALDGKPLASGEVPLDVAPQGKQLIELPELPQPESAGQLWLTVRV CC ************************************************** CC SEQRES VQPNATAWSEAGHISAWQQWRLAENLSVTLPAASHAIPHLTTSEMDFCIE CC ATOM VQPNATAWSEAGHISAWQQWRLAENLSVTLPAASHAIPHLTTSEMDFCIE CC ************************************************** CC SEQRES LGNKRWQFNRQSGFLSQMWIGDKKQLLTPLRDQFTRAPLDNDIGVSEATR CC ATOM LGNKRWQFNRQSGFLSQMWIGDKKQLLTPLRDQFTRAPLDNDIGVSEATR CC ************************************************** CC SEQRES IDPNAWVERWKAAGHYQAEAALLQCTADTLADAVLITTAHAWQHQGKTLF CC ATOM IDPNAWVERWKAAGHYQAEAALLQCTADTLADAVLITTAHAWQHQGKTLF CC ************************************************** CC SEQRES ISRKTYRIDGSGQMAITVDVEVASDTPHPARIGLNCQLAQVAERVNWLGL CC ATOM ISRKTYRIDGSGQMAITVDVEVASDTPHPARIGLNCQLAQVAERVNWLGL CC ************************************************** CC SEQRES GPQENYPDRLTAACFDRWDLPLSDMYTPYVFPSENGLRCGTRELNYGPHQ CC ATOM GPQENYPDRLTAACFDRWDLPLSDMYTPYVFPSENGLRCGTRELNYGPHQ CC ************************************************** CC SEQRES WRGDFQFNISRYSQQQLMETSHRHLLHAEEGTWLNIDGFHMGIGGDDSWS CC ATOM WRGDFQFNISRYSQQQLMETSHRHLLHAEEGTWLNIDGFHMGIGGDDSWS CC ************************************************** CC SEQRES PSVSAEFQLSAGRYHYQLVWCQK CC ATOM PSVSAEFQLSAGRYHYQLVWCQK CC *********************** SQ SEQUENCE 1023 AA; MW; CN; GSHMLEDPVV LQRRDWENPG VTQLNRLAAH PPFASWRNSE EARTDRPSQQ LRSLNGEWRF AWFPAPEAVP ESWLECDLPE ADTVVVPSNW QMHGYDAPIY TNVTYPITVN PPFVPTENPT GCYSLTFNVD ESWLQEGQTR IIFDGVNSAF HLWCNGRWVG YGQDSRLPSE FDLSAFLRAG ENRLAVMVLR WSDGSYLEDQ DMWRMSGIFR DVSLLHKPTT QISDFHVATR FNDDFSRAVL EAEVQMCGEL RDYLRVTVSL WQGETQVASG TAPFGGEIID ERGGYADRVT LRLNVENPKL WSAEIPNLYR AVVELHTADG TLIEAEACDV GFREVRIENG LLLLNGKPLL IRGVNRHEHH PLHGQVMDEQ TMVQDILLMK QNNFNAVRCS HYPNHPLWYT LCDRYGLYVV DEANIETHGM VPMNRLTDDP RWLPAMSERV TRMVQRDRNH PSVIIWSLGN ESGHGANHDA LYRWIKSVDP SRPVQYEGGG ADTTATDIIC PMYARVDEDQ PFPAVPKWSI KKWLSLPGET RPLILCEYAH AMGNSLGGFA KYWQAFRQYP RLQGGFVWDW VDQSLIKYDE NGNPWSAYGG DFGDTPNDRQ FCMNGLVFAD RTPHPALTEA KHQQQFFQFR LSGQTIEVTS EYLFRHSDNE LLHWMVALDG KPLASGEVPL DVAPQGKQLI ELPELPQPES AGQLWLTVRV VQPNATAWSE AGHISAWQQW RLAENLSVTL PAASHAIPHL TTSEMDFCIE LGNKRWQFNR QSGFLSQMWI GDKKQLLTPL RDQFTRAPLD NDIGVSEATR IDPNAWVERW KAAGHYQAEA ALLQCTADTL ADAVLITTAH AWQHQGKTLF ISRKTYRIDG SGQMAITVDV EVASDTPHPA RIGLNCQLAQ VAERVNWLGL GPQENYPDRL TAACFDRWDL PLSDMYTPYV FPSENGLRCG TRELNYGPHQ WRGDFQFNIS RYSQQQLMET SHRHLLHAEE GTWLNIDGFH MGIGGDDSWS PSVSAEFQLS AGRYHYQLVW CQK // ID 1JYXD STANDARD; PRT; 1023 AA. DT CONVERTED FROM PDB (SEQRES) 1JYX DE Beta-Galactosidase OS Escherichia coli CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.750 CC R-Factor 0.168 FT #SUB 13 13 ARG D 13 13 ARG A Protein S 14 FT #SUB 13 13 ARG D 15 15 ASP A Protein S 8 FT #SUB 13 13 ARG D 24 24 LEU A Protein S 2 FT #SUB 15 15 ASP D 13 13 ARG A Protein S 9 FT #SUB 20 20 GLY D 20 20 GLY A Protein B 1 FT #SUB 24 24 LEU D 13 13 ARG A Protein S 2 FT #SUB 103 103 VAL D 282 282 ARG A Protein S 2 FT #SUB 278 278 ILE D 513 513 PRO A Protein S 2 FT #SUB 278 278 ILE D 514 514 ALA A Protein B 1 FT #SUB 279 279 ILE D 424 424 ASN A Protein S 1 FT #SUB 279 279 ILE D 514 514 ALA A Protein B 2 FT #SUB 279 279 ILE D 515 515 VAL A Protein B 1 FT #SUB 280 280 ASP D 422 422 PRO A Protein S 6 FT #SUB 280 280 ASP D 423 423 MET A Protein S 10 FT #SUB 280 280 ASP D 424 424 ASN A Protein S 1 FT #SUB 280 280 ASP D 463 463 GLY A Protein S 1 FT #SUB 280 280 ASP D 515 515 VAL A Protein B 1 FT #SUB 281 281 GLU D 423 423 MET A Protein S 2 FT #SUB 281 281 GLU D 515 515 VAL A Protein A 8 FT #SUB 282 282 ARG D 103 103 VAL A Protein S 2 FT #SUB 282 282 ARG D 418 418 HIS A Protein S 3 FT #SUB 282 282 ARG D 419 419 GLY A Protein S 6 FT #SUB 282 282 ARG D 420 420 MET A Protein S 6 FT #SUB 282 282 ARG D 421 421 VAL A Protein A 5 FT #SUB 282 282 ARG D 422 422 PRO A Protein S 2 FT #SUB 282 282 ARG D 423 423 MET A Protein S 4 FT #SUB 283 283 GLY D 422 422 PRO A Protein B 2 FT #SUB 284 284 GLY D 422 422 PRO A Protein B 7 FT #SUB 285 285 TYR D 422 422 PRO A Protein S 6 FT #SUB 285 285 TYR D 424 424 ASN A Protein S 6 FT #SUB 285 285 TYR D 425 425 ARG A Protein S 8 FT #SUB 287 287 ASP D 425 425 ARG A Protein S 8 FT #SUB 418 418 HIS D 282 282 ARG A Protein B 3 FT #SUB 419 419 GLY D 282 282 ARG A Protein B 6 FT #SUB 420 420 MET D 282 282 ARG A Protein B 4 FT #SUB 421 421 VAL D 282 282 ARG A Protein S 4 FT #SUB 422 422 PRO D 279 279 ILE A Protein S 4 FT #SUB 422 422 PRO D 280 280 ASP A Protein A 5 FT #SUB 422 422 PRO D 282 282 ARG A Protein B 2 FT #SUB 422 422 PRO D 283 283 GLY A Protein S 2 FT #SUB 422 422 PRO D 284 284 GLY A Protein A 6 FT #SUB 422 422 PRO D 285 285 TYR A Protein S 8 FT #SUB 423 423 MET D 280 280 ASP A Protein A 10 FT #SUB 423 423 MET D 281 281 GLU A Protein S 3 FT #SUB 423 423 MET D 282 282 ARG A Protein A 3 FT #SUB 424 424 ASN D 279 279 ILE A Protein S 3 FT #SUB 424 424 ASN D 280 280 ASP A Protein B 1 FT #SUB 424 424 ASN D 285 285 TYR A Protein A 7 FT #SUB 425 425 ARG D 285 285 TYR A Protein A 8 FT #SUB 425 425 ARG D 287 287 ASP A Protein S 9 FT #SUB 430 430 PRO D 441 441 THR A Protein S 1 FT #SUB 430 430 PRO D 445 445 GLN A Protein S 1 FT #SUB 431 431 ARG D 26 26 ARG A Protein S 3 FT #SUB 431 431 ARG D 27 27 LEU A Protein S 1 FT #SUB 431 431 ARG D 28 28 ALA A Protein S 3 FT #SUB 433 433 LEU D 437 437 SER A Protein S 2 FT #SUB 434 434 PRO D 434 434 PRO A Protein S 3 FT #SUB 437 437 SER D 433 433 LEU A Protein S 1 FT #SUB 445 445 GLN D 430 430 PRO A Protein S 1 FT #SUB 463 463 GLY D 280 280 ASP A Protein B 1 FT #SUB 466 466 ALA D 474 474 TRP A Protein A 3 FT #SUB 466 466 ALA D 478 478 VAL A Protein S 1 FT #SUB 469 469 ASP D 473 473 ARG A Protein A 4 FT #SUB 469 469 ASP D 477 477 SER A Protein S 4 FT #SUB 470 470 ALA D 470 470 ALA A Protein A 4 FT #SUB 473 473 ARG D 469 469 ASP A Protein S 3 FT #SUB 473 473 ARG D 473 473 ARG A Protein S 1 FT #SUB 474 474 TRP D 466 466 ALA A Protein S 3 FT #SUB 477 477 SER D 469 469 ASP A Protein S 4 FT #SUB 478 478 VAL D 466 466 ALA A Protein S 1 FT #SUB 513 513 PRO D 278 278 ILE A Protein S 1 FT #SUB 514 514 ALA D 278 278 ILE A Protein S 1 FT #SUB 514 514 ALA D 279 279 ILE A Protein S 2 FT #SUB 515 515 VAL D 279 279 ILE A Protein S 1 FT #SUB 515 515 VAL D 280 280 ASP A Protein S 1 FT #SUB 515 515 VAL D 281 281 GLU A Protein S 6 FT #SUB 232 232 ASN D 233 233 ASP B Protein A 3 FT #SUB 233 233 ASP D 232 232 ASN B Protein S 3 FT #SUB 233 233 ASP D 233 233 ASP B Protein A 19 FT #SUB 339 339 ASN D 527 527 PRO C Protein A 7 FT #SUB 339 339 ASN D 528 528 GLY C Protein A 7 FT #SUB 341 341 LEU D 527 527 PRO C Protein S 2 FT #SUB 507 507 ASP D 558 558 GLN C Protein B 1 FT #SUB 509 509 ASP D 558 558 GLN C Protein S 4 FT #SUB 519 519 SER D 558 558 GLN C Protein S 1 FT #SUB 521 521 LYS D 559 559 TYR C Protein A 6 FT #SUB 522 522 LYS D 558 558 GLN C Protein S 7 FT #SUB 522 522 LYS D 559 559 TYR C Protein A 9 FT #SUB 524 524 LEU D 525 525 SER C Protein S 2 FT #SUB 525 525 SER D 524 524 LEU C Protein S 3 FT #SUB 525 525 SER D 559 559 TYR C Protein S 2 FT #SUB 525 525 SER D 561 561 ARG C Protein A 6 FT #SUB 527 527 PRO D 339 339 ASN C Protein A 8 FT #SUB 527 527 PRO D 341 341 LEU C Protein S 1 FT #SUB 528 528 GLY D 339 339 ASN C Protein B 7 FT #SUB 558 558 GLN D 507 507 ASP C Protein S 3 FT #SUB 558 558 GLN D 509 509 ASP C Protein S 5 FT #SUB 558 558 GLN D 519 519 SER C Protein S 1 FT #SUB 558 558 GLN D 522 522 LYS C Protein A 6 FT #SUB 559 559 TYR D 521 521 LYS C Protein S 5 FT #SUB 559 559 TYR D 522 522 LYS C Protein S 7 FT #SUB 559 559 TYR D 525 525 SER C Protein S 2 FT #SUB 560 560 PRO D 522 522 LYS C Protein S 1 FT #SUB 560 560 PRO D 527 527 PRO C Protein S 2 FT #SUB 561 561 ARG D 525 525 SER C Protein S 6 FT #SUB 693 693 GLN D 874 874 SER C Protein S 4 FT #SUB 721 721 ARG D 874 874 SER C Protein S 2 FT #SUB 723 723 ALA D 875 875 ASP C Protein A 4 FT #SUB 724 724 GLU D 847 847 LYS C Protein B 3 FT #SUB 724 724 GLU D 873 873 ALA C Protein S 2 FT #SUB 724 724 GLU D 874 874 SER C Protein S 6 FT #SUB 724 724 GLU D 875 875 ASP C Protein B 5 FT #SUB 726 726 LEU D 848 848 THR C Protein S 1 FT #SUB 726 726 LEU D 851 851 ILE C Protein S 1 FT #SUB 726 726 LEU D 871 871 GLU C Protein S 1 FT #SUB 726 726 LEU D 873 873 ALA C Protein S 2 FT #SUB 727 727 SER D 851 851 ILE C Protein B 1 FT #SUB 728 728 VAL D 848 848 THR C Protein S 2 FT #SUB 728 728 VAL D 851 851 ILE C Protein S 1 FT #SUB 730 730 LEU D 823 823 LEU C Protein S 1 FT #SUB 823 823 LEU D 728 728 VAL C Protein S 1 FT #SUB 823 823 LEU D 730 730 LEU C Protein B 1 FT #SUB 828 828 ASP D 830 830 LEU C Protein S 3 FT #SUB 828 828 ASP D 831 831 ALA C Protein S 2 FT #SUB 830 830 LEU D 828 828 ASP C Protein A 8 FT #SUB 830 830 LEU D 830 830 LEU C Protein S 1 FT #SUB 831 831 ALA D 828 828 ASP C Protein A 2 FT #SUB 847 847 LYS D 724 724 GLU C Protein S 1 FT #SUB 848 848 THR D 728 728 VAL C Protein S 2 FT #SUB 849 849 LEU D 726 726 LEU C Protein B 1 FT #SUB 851 851 ILE D 726 726 LEU C Protein S 1 FT #SUB 851 851 ILE D 727 727 SER C Protein S 1 FT #SUB 851 851 ILE D 728 728 VAL C Protein S 1 FT #SUB 869 869 ASP D 1015 1015 HIS C Protein S 2 FT #SUB 869 869 ASP D 1017 1017 GLN C Protein S 1 FT #SUB 871 871 GLU D 726 726 LEU C Protein S 1 FT #SUB 873 873 ALA D 724 724 GLU C Protein B 2 FT #SUB 873 873 ALA D 726 726 LEU C Protein B 1 FT #SUB 874 874 SER D 693 693 GLN C Protein S 4 FT #SUB 874 874 SER D 721 721 ARG C Protein S 2 FT #SUB 874 874 SER D 724 724 GLU C Protein A 6 FT #SUB 875 875 ASP D 723 723 ALA C Protein S 3 FT #SUB 875 875 ASP D 724 724 GLU C Protein S 5 FT #SUB 942 942 ARG D 1013 1013 ARG C Protein S 8 FT #SUB 954 954 ASP D 1013 1013 ARG C Protein S 4 FT #SUB 1013 1013 ARG D 942 942 ARG C Protein S 9 FT #SUB 1013 1013 ARG D 954 954 ASP C Protein S 4 FT #SUB 1015 1015 HIS D 869 869 ASP C Protein S 1 FT #SUB 1015 1015 HIS D 1015 1015 HIS C Protein S 11 FT #HET 15 15 ASP D 81 3002 MG D B 2 FT #HET 18 18 ASN D 81 3002 MG D B 4 FT #HET 21 21 VAL D 81 3002 MG D B 2 FT #HET 32 32 PRO D 90 8404 DMS D B 2 FT #HET 33 33 PHE D 90 8404 DMS D B 4 FT #HET 34 34 ALA D 90 8404 DMS D A 2 FT #HET 34 34 ALA D 102 8501 DMS D S 1 FT #HET 36 36 TRP D 90 8404 DMS D S 2 FT #HET 36 36 TRP D 102 8501 DMS D S 1 FT #HET 45 45 ASP D 90 8404 DMS D S 3 FT #HET 45 45 ASP D 102 8501 DMS D A 5 FT #HET 46 46 ARG D 102 8501 DMS D B 3 FT #HET 47 47 PRO D 102 8501 DMS D A 5 FT #HET 48 48 SER D 102 8501 DMS D B 1 FT #HET 54 54 LEU D 93 8408 DMS D B 3 FT #HET 55 55 ASN D 93 8408 DMS D B 3 FT #HET 57 57 GLU D 93 8408 DMS D B 1 FT #HET 84 84 VAL D 99 8414 DMS D A 4 FT #HET 85 85 VAL D 99 8414 DMS D A 3 FT #HET 93 93 HIS D 99 8414 DMS D S 10 FT #HET 99 99 ILE D 95 8410 DMS D S 1 FT #HET 100 100 TYR D 82 3101 NA D S 1 FT #HET 102 102 ASN D 85 2001 IPT D S 4 FT #HET 106 106 PRO D 95 8410 DMS D B 3 FT #HET 107 107 ILE D 95 8410 DMS D S 1 FT #HET 115 115 PRO D 95 8410 DMS D S 2 FT #HET 125 125 LEU D 93 8408 DMS D S 1 FT #HET 163 163 GLN D 81 3002 MG D S 3 FT #HET 191 191 TRP D 95 8410 DMS D S 2 FT #HET 193 193 ASP D 81 3002 MG D S 3 FT #HET 201 201 ASP D 82 3101 NA D S 3 FT #HET 201 201 ASP D 85 2001 IPT D S 7 FT #HET 229 229 THR D 87 8401 DMS D S 2 FT #HET 231 231 PHE D 101 8417 DMS D B 2 FT #HET 232 232 ASN D 101 8417 DMS D B 5 FT #HET 233 233 ASP D 101 8417 DMS D B 4 FT #HET 235 235 PHE D 101 8417 DMS D S 2 FT #HET 271 271 THR D 91 8405 DMS D B 1 FT #HET 275 275 GLY D 97 8412 DMS D B 2 FT #HET 276 276 GLY D 97 8412 DMS D B 1 FT #HET 289 289 VAL D 97 8412 DMS D A 3 FT #HET 290 290 THR D 97 8412 DMS D A 7 FT #HET 291 291 LEU D 91 8405 DMS D A 4 FT #HET 292 292 ARG D 91 8405 DMS D B 5 FT #HET 292 292 ARG D 97 8412 DMS D S 1 FT #HET 304 304 GLU D 86 2002 IPT D B 5 FT #HET 306 306 PRO D 86 2002 IPT D S 2 FT #HET 310 310 ARG D 90 8404 DMS D S 1 FT #HET 327 327 ALA D 90 8404 DMS D B 1 FT #HET 330 330 VAL D 87 8401 DMS D A 3 FT #HET 331 331 GLY D 87 8401 DMS D B 7 FT #HET 333 333 ARG D 87 8401 DMS D S 1 FT #HET 334 334 GLU D 94 8409 DMS D S 1 FT #HET 334 334 GLU D 101 8417 DMS D S 3 FT #HET 335 335 VAL D 94 8409 DMS D B 2 FT #HET 336 336 ARG D 94 8409 DMS D S 1 FT #HET 380 380 LYS D 89 8403 DMS D A 2 FT #HET 383 383 ASN D 89 8403 DMS D S 2 FT #HET 416 416 GLU D 80 3001 MG D S 3 FT #HET 418 418 HIS D 80 3001 MG D S 4 FT #HET 418 418 HIS D 85 2001 IPT D S 1 FT #HET 421 421 VAL D 103 8704 DMS D S 1 FT #HET 448 448 ARG D 87 8401 DMS D B 1 FT #HET 449 449 ASN D 87 8401 DMS D B 4 FT #HET 451 451 PRO D 87 8401 DMS D A 8 FT #HET 461 461 GLU D 80 3001 MG D S 3 FT #HET 461 461 GLU D 85 2001 IPT D S 7 FT #HET 478 478 VAL D 21 8420 DMS A S 1 FT #HET 480 480 PRO D 94 8409 DMS D B 1 FT #HET 481 481 SER D 94 8409 DMS D B 1 FT #HET 482 482 ARG D 87 8401 DMS D S 1 FT #HET 502 502 MET D 85 2001 IPT D S 1 FT #HET 503 503 TYR D 85 2001 IPT D S 3 FT #HET 505 505 ARG D 92 8407 DMS D S 1 FT #HET 508 508 GLU D 92 8407 DMS D S 5 FT #HET 537 537 GLU D 85 2001 IPT D S 4 FT #HET 540 540 HIS D 85 2001 IPT D S 5 FT #HET 556 556 PHE D 83 3102 NA D B 2 FT #HET 557 557 ARG D 88 8402 DMS D S 5 FT #HET 559 559 TYR D 83 3102 NA D B 2 FT #HET 560 560 PRO D 83 3102 NA D B 4 FT #HET 562 562 LEU D 83 3102 NA D B 3 FT #HET 568 568 TRP D 85 2001 IPT D S 1 FT #HET 576 576 ILE D 96 8411 DMS D S 1 FT #HET 584 584 PRO D 96 8411 DMS D B 2 FT #HET 585 585 TRP D 96 8411 DMS D B 3 FT #HET 586 586 SER D 96 8411 DMS D A 5 FT #HET 593 593 GLY D 98 8413 DMS D B 3 FT #HET 594 594 ASP D 98 8413 DMS D B 2 FT #HET 595 595 THR D 98 8413 DMS D A 8 FT #HET 601 601 PHE D 82 3101 NA D B 2 FT #HET 601 601 PHE D 85 2001 IPT D A 2 FT #HET 604 604 ASN D 82 3101 NA D S 3 FT #HET 604 604 ASN D 85 2001 IPT D S 3 FT #HET 621 621 LYS D 100 8415 DMS D S 3 FT #HET 622 622 HIS D 88 8402 DMS D A 9 FT #HET 623 623 GLN D 88 8402 DMS D A 4 FT #HET 625 625 GLN D 88 8402 DMS D S 3 FT #HET 626 626 PHE D 89 8403 DMS D S 6 FT #HET 628 628 GLN D 88 8402 DMS D S 4 FT #HET 642 642 TYR D 86 2002 IPT D S 1 FT #HET 642 642 TYR D 89 8403 DMS D S 5 FT #HET 645 645 ARG D 86 2002 IPT D S 18 FT #HET 648 648 ASP D 86 2002 IPT D S 5 FT #HET 650 650 GLU D 86 2002 IPT D S 1 FT #HET 699 699 ARG D 100 8415 DMS D S 2 FT #HET 702 702 GLN D 86 2002 IPT D S 2 FT #HET 706 706 THR D 86 2002 IPT D S 1 FT #HET 708 708 TRP D 86 2002 IPT D S 7 FT #HET 708 708 TRP D 89 8403 DMS D S 5 FT #HET 714 714 ILE D 100 8415 DMS D S 1 FT #HET 717 717 TRP D 100 8415 DMS D S 6 FT #HET 931 931 PHE D 84 3103 NA D S 2 FT #HET 932 932 PRO D 84 3103 NA D B 2 FT #HET 967 967 LEU D 84 3103 NA D B 2 FT #HET 968 968 MET D 84 3103 NA D B 3 FT #HET 970 970 THR D 84 3103 NA D B 2 FT #HET 973 973 ARG D 96 8411 DMS D S 5 FT #HET 999 999 TRP D 85 2001 IPT D S 12 FT #HET 1001 1001 PRO D 92 8407 DMS D A 4 FT #HET 1003 1003 VAL D 92 8407 DMS D B 1 FT DISORDER 1 12 CC SEQUENCE 1011 AA (ATOM); CC RRDWENPGVT QLNRLAAHPP FASWRNSEEA RTDRPSQQLR SLNGEWRFAW FPAPEAVPES CC WLECDLPEAD TVVVPSNWQM HGYDAPIYTN VTYPITVNPP FVPTENPTGC YSLTFNVDES CC WLQEGQTRII FDGVNSAFHL WCNGRWVGYG QDSRLPSEFD LSAFLRAGEN RLAVMVLRWS CC DGSYLEDQDM WRMSGIFRDV SLLHKPTTQI SDFHVATRFN DDFSRAVLEA EVQMCGELRD CC YLRVTVSLWQ GETQVASGTA PFGGEIIDER GGYADRVTLR LNVENPKLWS AEIPNLYRAV CC VELHTADGTL IEAEACDVGF REVRIENGLL LLNGKPLLIR GVNRHEHHPL HGQVMDEQTM CC VQDILLMKQN NFNAVRCSHY PNHPLWYTLC DRYGLYVVDE ANIETHGMVP MNRLTDDPRW CC LPAMSERVTR MVQRDRNHPS VIIWSLGNES GHGANHDALY RWIKSVDPSR PVQYEGGGAD CC TTATDIICPM YARVDEDQPF PAVPKWSIKK WLSLPGETRP LILCEYAHAM GNSLGGFAKY CC WQAFRQYPRL QGGFVWDWVD QSLIKYDENG NPWSAYGGDF GDTPNDRQFC MNGLVFADRT CC PHPALTEAKH QQQFFQFRLS GQTIEVTSEY LFRHSDNELL HWMVALDGKP LASGEVPLDV CC APQGKQLIEL PELPQPESAG QLWLTVRVVQ PNATAWSEAG HISAWQQWRL AENLSVTLPA CC ASHAIPHLTT SEMDFCIELG NKRWQFNRQS GFLSQMWIGD KKQLLTPLRD QFTRAPLDND CC IGVSEATRID PNAWVERWKA AGHYQAEAAL LQCTADTLAD AVLITTAHAW QHQGKTLFIS CC RKTYRIDGSG QMAITVDVEV ASDTPHPARI GLNCQLAQVA ERVNWLGLGP QENYPDRLTA CC ACFDRWDLPL SDMYTPYVFP SENGLRCGTR ELNYGPHQWR GDFQFNISRY SQQQLMETSH CC RHLLHAEEGT WLNIDGFHMG IGGDDSWSPS VSAEFQLSAG RYHYQLVWCQ K CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES GSHMLEDPVVLQRRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQ CC ATOM ------------RRDWENPGVTQLNRLAAHPPFASWRNSEEARTDRPSQQ CC ************************************** CC SEQRES LRSLNGEWRFAWFPAPEAVPESWLECDLPEADTVVVPSNWQMHGYDAPIY CC ATOM LRSLNGEWRFAWFPAPEAVPESWLECDLPEADTVVVPSNWQMHGYDAPIY CC ************************************************** CC SEQRES TNVTYPITVNPPFVPTENPTGCYSLTFNVDESWLQEGQTRIIFDGVNSAF CC ATOM TNVTYPITVNPPFVPTENPTGCYSLTFNVDESWLQEGQTRIIFDGVNSAF CC ************************************************** CC SEQRES HLWCNGRWVGYGQDSRLPSEFDLSAFLRAGENRLAVMVLRWSDGSYLEDQ CC ATOM HLWCNGRWVGYGQDSRLPSEFDLSAFLRAGENRLAVMVLRWSDGSYLEDQ CC ************************************************** CC SEQRES DMWRMSGIFRDVSLLHKPTTQISDFHVATRFNDDFSRAVLEAEVQMCGEL CC ATOM DMWRMSGIFRDVSLLHKPTTQISDFHVATRFNDDFSRAVLEAEVQMCGEL CC ************************************************** CC SEQRES RDYLRVTVSLWQGETQVASGTAPFGGEIIDERGGYADRVTLRLNVENPKL CC ATOM RDYLRVTVSLWQGETQVASGTAPFGGEIIDERGGYADRVTLRLNVENPKL CC ************************************************** CC SEQRES WSAEIPNLYRAVVELHTADGTLIEAEACDVGFREVRIENGLLLLNGKPLL CC ATOM WSAEIPNLYRAVVELHTADGTLIEAEACDVGFREVRIENGLLLLNGKPLL CC ************************************************** CC SEQRES IRGVNRHEHHPLHGQVMDEQTMVQDILLMKQNNFNAVRCSHYPNHPLWYT CC ATOM IRGVNRHEHHPLHGQVMDEQTMVQDILLMKQNNFNAVRCSHYPNHPLWYT CC ************************************************** CC SEQRES LCDRYGLYVVDEANIETHGMVPMNRLTDDPRWLPAMSERVTRMVQRDRNH CC ATOM LCDRYGLYVVDEANIETHGMVPMNRLTDDPRWLPAMSERVTRMVQRDRNH CC ************************************************** CC SEQRES PSVIIWSLGNESGHGANHDALYRWIKSVDPSRPVQYEGGGADTTATDIIC CC ATOM PSVIIWSLGNESGHGANHDALYRWIKSVDPSRPVQYEGGGADTTATDIIC CC ************************************************** CC SEQRES PMYARVDEDQPFPAVPKWSIKKWLSLPGETRPLILCEYAHAMGNSLGGFA CC ATOM PMYARVDEDQPFPAVPKWSIKKWLSLPGETRPLILCEYAHAMGNSLGGFA CC ************************************************** CC SEQRES KYWQAFRQYPRLQGGFVWDWVDQSLIKYDENGNPWSAYGGDFGDTPNDRQ CC ATOM KYWQAFRQYPRLQGGFVWDWVDQSLIKYDENGNPWSAYGGDFGDTPNDRQ CC ************************************************** CC SEQRES FCMNGLVFADRTPHPALTEAKHQQQFFQFRLSGQTIEVTSEYLFRHSDNE CC ATOM FCMNGLVFADRTPHPALTEAKHQQQFFQFRLSGQTIEVTSEYLFRHSDNE CC ************************************************** CC SEQRES LLHWMVALDGKPLASGEVPLDVAPQGKQLIELPELPQPESAGQLWLTVRV CC ATOM LLHWMVALDGKPLASGEVPLDVAPQGKQLIELPELPQPESAGQLWLTVRV CC ************************************************** CC SEQRES VQPNATAWSEAGHISAWQQWRLAENLSVTLPAASHAIPHLTTSEMDFCIE CC ATOM VQPNATAWSEAGHISAWQQWRLAENLSVTLPAASHAIPHLTTSEMDFCIE CC ************************************************** CC SEQRES LGNKRWQFNRQSGFLSQMWIGDKKQLLTPLRDQFTRAPLDNDIGVSEATR CC ATOM LGNKRWQFNRQSGFLSQMWIGDKKQLLTPLRDQFTRAPLDNDIGVSEATR CC ************************************************** CC SEQRES IDPNAWVERWKAAGHYQAEAALLQCTADTLADAVLITTAHAWQHQGKTLF CC ATOM IDPNAWVERWKAAGHYQAEAALLQCTADTLADAVLITTAHAWQHQGKTLF CC ************************************************** CC SEQRES ISRKTYRIDGSGQMAITVDVEVASDTPHPARIGLNCQLAQVAERVNWLGL CC ATOM ISRKTYRIDGSGQMAITVDVEVASDTPHPARIGLNCQLAQVAERVNWLGL CC ************************************************** CC SEQRES GPQENYPDRLTAACFDRWDLPLSDMYTPYVFPSENGLRCGTRELNYGPHQ CC ATOM GPQENYPDRLTAACFDRWDLPLSDMYTPYVFPSENGLRCGTRELNYGPHQ CC ************************************************** CC SEQRES WRGDFQFNISRYSQQQLMETSHRHLLHAEEGTWLNIDGFHMGIGGDDSWS CC ATOM WRGDFQFNISRYSQQQLMETSHRHLLHAEEGTWLNIDGFHMGIGGDDSWS CC ************************************************** CC SEQRES PSVSAEFQLSAGRYHYQLVWCQK CC ATOM PSVSAEFQLSAGRYHYQLVWCQK CC *********************** SQ SEQUENCE 1023 AA; MW; CN; GSHMLEDPVV LQRRDWENPG VTQLNRLAAH PPFASWRNSE EARTDRPSQQ LRSLNGEWRF AWFPAPEAVP ESWLECDLPE ADTVVVPSNW QMHGYDAPIY TNVTYPITVN PPFVPTENPT GCYSLTFNVD ESWLQEGQTR IIFDGVNSAF HLWCNGRWVG YGQDSRLPSE FDLSAFLRAG ENRLAVMVLR WSDGSYLEDQ DMWRMSGIFR DVSLLHKPTT QISDFHVATR FNDDFSRAVL EAEVQMCGEL RDYLRVTVSL WQGETQVASG TAPFGGEIID ERGGYADRVT LRLNVENPKL WSAEIPNLYR AVVELHTADG TLIEAEACDV GFREVRIENG LLLLNGKPLL IRGVNRHEHH PLHGQVMDEQ TMVQDILLMK QNNFNAVRCS HYPNHPLWYT LCDRYGLYVV DEANIETHGM VPMNRLTDDP RWLPAMSERV TRMVQRDRNH PSVIIWSLGN ESGHGANHDA LYRWIKSVDP SRPVQYEGGG ADTTATDIIC PMYARVDEDQ PFPAVPKWSI KKWLSLPGET RPLILCEYAH AMGNSLGGFA KYWQAFRQYP RLQGGFVWDW VDQSLIKYDE NGNPWSAYGG DFGDTPNDRQ FCMNGLVFAD RTPHPALTEA KHQQQFFQFR LSGQTIEVTS EYLFRHSDNE LLHWMVALDG KPLASGEVPL DVAPQGKQLI ELPELPQPES AGQLWLTVRV VQPNATAWSE AGHISAWQQW RLAENLSVTL PAASHAIPHL TTSEMDFCIE LGNKRWQFNR QSGFLSQMWI GDKKQLLTPL RDQFTRAPLD NDIGVSEATR IDPNAWVERW KAAGHYQAEA ALLQCTADTL ADAVLITTAH AWQHQGKTLF ISRKTYRIDG SGQMAITVDV EVASDTPHPA RIGLNCQLAQ VAERVNWLGL GPQENYPDRL TAACFDRWDL PLSDMYTPYV FPSENGLRCG TRELNYGPHQ WRGDFQFNIS RYSQQQLMET SHRHLLHAEE GTWLNIDGFH MGIGGDDSWS PSVSAEFQLS AGRYHYQLVW CQK //