ID 2W67A STANDARD; PRT; 716 AA. DT CONVERTED FROM PDB (SEQRES) 2W67 DE O-GLCNACASE BT_4395 OS BACTEROIDES THETAIOTAOMICRON VPI-5482 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.250 CC R-Factor 0.183 FT #SUB 473 473 GLU A 514 514 LYS B Protein S 4 FT #SUB 473 473 GLU A 523 523 TYR B Protein S 4 FT #SUB 473 473 GLU A 526 526 ARG B Protein S 1 FT #SUB 473 473 GLU A 527 527 LYS B Protein S 3 FT #SUB 474 474 ARG A 526 526 ARG B Protein S 1 FT #SUB 476 476 LYS A 476 476 LYS B Protein S 1 FT #SUB 476 476 LYS A 530 530 HIS B Protein B 1 FT #SUB 477 477 GLU A 526 526 ARG B Protein S 10 FT #SUB 477 477 GLU A 527 527 LYS B Protein S 5 FT #SUB 477 477 GLU A 530 530 HIS B Protein A 8 FT #SUB 480 480 ASP A 530 530 HIS B Protein S 6 FT #SUB 480 480 ASP A 533 533 ALA B Protein B 2 FT #SUB 480 480 ASP A 534 534 LEU B Protein S 1 FT #SUB 480 480 ASP A 537 537 GLN B Protein A 5 FT #SUB 481 481 ILE A 529 529 ASN B Protein S 2 FT #SUB 481 481 ILE A 533 533 ALA B Protein A 3 FT #SUB 483 483 LEU A 537 537 GLN B Protein S 1 FT #SUB 484 484 MET A 533 533 ALA B Protein S 1 FT #SUB 484 484 MET A 536 536 GLN B Protein S 2 FT #SUB 484 484 MET A 537 537 GLN B Protein S 5 FT #SUB 503 503 LYS A 507 507 GLU B Protein S 5 FT #SUB 507 507 GLU A 503 503 LYS B Protein S 4 FT #SUB 514 514 LYS A 473 473 GLU B Protein S 1 FT #SUB 523 523 TYR A 473 473 GLU B Protein S 4 FT #SUB 526 526 ARG A 470 470 TYR B Protein S 2 FT #SUB 526 526 ARG A 473 473 GLU B Protein S 2 FT #SUB 526 526 ARG A 474 474 ARG B Protein S 1 FT #SUB 526 526 ARG A 477 477 GLU B Protein S 11 FT #SUB 527 527 LYS A 473 473 GLU B Protein S 3 FT #SUB 527 527 LYS A 477 477 GLU B Protein S 4 FT #SUB 529 529 ASN A 481 481 ILE B Protein A 2 FT #SUB 530 530 HIS A 476 476 LYS B Protein S 4 FT #SUB 530 530 HIS A 477 477 GLU B Protein S 6 FT #SUB 530 530 HIS A 480 480 ASP B Protein S 6 FT #SUB 533 533 ALA A 480 480 ASP B Protein S 2 FT #SUB 533 533 ALA A 481 481 ILE B Protein S 3 FT #SUB 533 533 ALA A 484 484 MET B Protein B 1 FT #SUB 534 534 LEU A 480 480 ASP B Protein S 2 FT #SUB 536 536 GLN A 484 484 MET B Protein A 5 FT #SUB 537 537 GLN A 480 480 ASP B Protein S 7 FT #SUB 537 537 GLN A 483 483 LEU B Protein S 1 FT #SUB 537 537 GLN A 484 484 MET B Protein A 5 FT #HET 32 32 GLU A 3 1718 CA A B 2 FT #HET 33 33 ALA A 3 1718 CA A B 1 FT #HET 61 61 GLU A 3 1718 CA A S 3 FT #HET 64 64 ASP A 3 1718 CA A S 3 FT #HET 135 135 GLY A 1 1716 F34 A B 4 FT #HET 136 136 PHE A 1 1716 F34 A B 1 FT #HET 137 137 TYR A 1 1716 F34 A S 2 FT #HET 166 166 LYS A 1 1716 F34 A S 3 FT #HET 242 242 ASP A 1 1716 F34 A S 9 FT #HET 243 243 ASP A 1 1716 F34 A S 5 FT #HET 282 282 TYR A 1 1716 F34 A S 16 FT #HET 310 310 THR A 1 1716 F34 A S 1 FT #HET 314 314 VAL A 1 1716 F34 A S 2 FT #HET 337 337 TRP A 1 1716 F34 A S 7 FT #HET 339 339 ASN A 1 1716 F34 A S 7 FT #HET 342 342 VAL A 1 1716 F34 A S 3 FT #HET 344 344 ASP A 1 1716 F34 A S 10 FT #HET 354 354 PRO A 2 1717 GOL A S 2 FT #HET 356 356 TYR A 2 1717 GOL A S 5 FT #HET 372 372 ASN A 1 1716 F34 A S 4 FT #HET 399 399 THR A 2 1717 GOL A S 1 FT #HET 400 400 TRP A 2 1717 GOL A S 9 FT #HET 438 438 GLU A 2 1717 GOL A S 3 FT DISORDER 1 4 FT DISORDER 51 52 FT DISORDER 596 603 FT DISORDER 619 630 FT DISORDER 649 682 FT DISORDER 695 707 FT DISORDER 716 716 CC SEQUENCE 642 AA (ATOM); CC LQPPPQQLIV QNKTIDLPAV YQLNGGEEAN PHAVKVLKEL LSGKQSKGML ISIGEKGDKS CC VRKYSRQIPD HKEGYYLSVN EKEIVLAGND ERGTYYALQT FAQLLKDGKL PEVEIKDYPS CC VRYRGVVEGF YGTPWSHQAR LSQLKFYGKN KMNTYIYGPK DDPYHSAPNW RLPYPDKEAA CC QLQELVAVAN ENEVDFVWAI HPGQDIKWNK EDRDLLLAKF EKMYQLGVRS FAVFFDDISG CC EGTNPQKQAE LLNYIDEKFA QVKPDINQLV MCPTEYNKSW SNPNGNYLTT LGDKLNPSIQ CC IMWTGDRVIS DITRDGISWI NERIKRPAYI WWNFPVSDYV RDHLLLGPVY GNDTTIAKEM CC SGFVTNPMEH AESSKIAIYS VASYAWNPAK YDTWQTWKDA IRTILPSAAE ELECFAMHNS CC DLGPNGHGYR REESMDIQPA AERFLKAFKE GKNYDKADFE TLQYTFERMK ESADILLMNT CC ENKPLIVEIT PWVHQFKLTA EMGEEVLKMV EGRNESYFLR KYNHVKALQQ QMFYIDQTSN CC QNPYQPGVKT ATRVIKPLID RTFATVVKFF NQKFNAHLDA TTDYMPHKMN LPLQVKANRV CC LISPVEIELD AIYPGENIQI NFGLQKAPVK FVRFQFVLTI EK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES QNVSLQPPPQQLIVQNKTIDLPAVYQLNGGEEANPHAVKVLKELLSGKQS CC ATOM ----LQPPPQQLIVQNKTIDLPAVYQLNGGEEANPHAVKVLKELLSGKQS CC ********************************************** CC SEQRES SKKGMLISIGEKGDKSVRKYSRQIPDHKEGYYLSVNEKEIVLAGNDERGT CC ATOM --KGMLISIGEKGDKSVRKYSRQIPDHKEGYYLSVNEKEIVLAGNDERGT CC ************************************************ CC SEQRES YYALQTFAQLLKDGKLPEVEIKDYPSVRYRGVVEGFYGTPWSHQARLSQL CC ATOM YYALQTFAQLLKDGKLPEVEIKDYPSVRYRGVVEGFYGTPWSHQARLSQL CC ************************************************** CC SEQRES KFYGKNKMNTYIYGPKDDPYHSAPNWRLPYPDKEAAQLQELVAVANENEV CC ATOM KFYGKNKMNTYIYGPKDDPYHSAPNWRLPYPDKEAAQLQELVAVANENEV CC ************************************************** CC SEQRES DFVWAIHPGQDIKWNKEDRDLLLAKFEKMYQLGVRSFAVFFDDISGEGTN CC ATOM DFVWAIHPGQDIKWNKEDRDLLLAKFEKMYQLGVRSFAVFFDDISGEGTN CC ************************************************** CC SEQRES PQKQAELLNYIDEKFAQVKPDINQLVMCPTEYNKSWSNPNGNYLTTLGDK CC ATOM PQKQAELLNYIDEKFAQVKPDINQLVMCPTEYNKSWSNPNGNYLTTLGDK CC ************************************************** CC SEQRES LNPSIQIMWTGDRVISDITRDGISWINERIKRPAYIWWNFPVSDYVRDHL CC ATOM LNPSIQIMWTGDRVISDITRDGISWINERIKRPAYIWWNFPVSDYVRDHL CC ************************************************** CC SEQRES LLGPVYGNDTTIAKEMSGFVTNPMEHAESSKIAIYSVASYAWNPAKYDTW CC ATOM LLGPVYGNDTTIAKEMSGFVTNPMEHAESSKIAIYSVASYAWNPAKYDTW CC ************************************************** CC SEQRES QTWKDAIRTILPSAAEELECFAMHNSDLGPNGHGYRREESMDIQPAAERF CC ATOM QTWKDAIRTILPSAAEELECFAMHNSDLGPNGHGYRREESMDIQPAAERF CC ************************************************** CC SEQRES LKAFKEGKNYDKADFETLQYTFERMKESADILLMNTENKPLIVEITPWVH CC ATOM LKAFKEGKNYDKADFETLQYTFERMKESADILLMNTENKPLIVEITPWVH CC ************************************************** CC SEQRES QFKLTAEMGEEVLKMVEGRNESYFLRKYNHVKALQQQMFYIDQTSNQNPY CC ATOM QFKLTAEMGEEVLKMVEGRNESYFLRKYNHVKALQQQMFYIDQTSNQNPY CC ************************************************** CC SEQRES QPGVKTATRVIKPLIDRTFATVVKFFNQKFNAHLDATTDYMPHKMISNVE CC ATOM QPGVKTATRVIKPLIDRTFATVVKFFNQKFNAHLDATTDYMPHKM----- CC ********************************************* CC SEQRES QIKNLPLQVKANRVLISPANEVVKWAAGNSVEIELDAIYPGENIQINFGK CC ATOM ---NLPLQVKANRVLISP------------VEIELDAIYPGENIQINF-- CC *************** ****************** CC SEQRES DAPCTWGRLEISTDGKEWKTVDLKQKESRLSAGLQKAPVKFVRFTNVSDE CC ATOM --------------------------------GLQKAPVKFVRF------ CC ************ CC SEQRES EQQVYLRQFVLTIEKK CC ATOM -------QFVLTIEK- CC ******** SQ SEQUENCE 716 AA; MW; CN; QNVSLQPPPQ QLIVQNKTID LPAVYQLNGG EEANPHAVKV LKELLSGKQS SKKGMLISIG EKGDKSVRKY SRQIPDHKEG YYLSVNEKEI VLAGNDERGT YYALQTFAQL LKDGKLPEVE IKDYPSVRYR GVVEGFYGTP WSHQARLSQL KFYGKNKMNT YIYGPKDDPY HSAPNWRLPY PDKEAAQLQE LVAVANENEV DFVWAIHPGQ DIKWNKEDRD LLLAKFEKMY QLGVRSFAVF FDDISGEGTN PQKQAELLNY IDEKFAQVKP DINQLVMCPT EYNKSWSNPN GNYLTTLGDK LNPSIQIMWT GDRVISDITR DGISWINERI KRPAYIWWNF PVSDYVRDHL LLGPVYGNDT TIAKEMSGFV TNPMEHAESS KIAIYSVASY AWNPAKYDTW QTWKDAIRTI LPSAAEELEC FAMHNSDLGP NGHGYRREES MDIQPAAERF LKAFKEGKNY DKADFETLQY TFERMKESAD ILLMNTENKP LIVEITPWVH QFKLTAEMGE EVLKMVEGRN ESYFLRKYNH VKALQQQMFY IDQTSNQNPY QPGVKTATRV IKPLIDRTFA TVVKFFNQKF NAHLDATTDY MPHKMISNVE QIKNLPLQVK ANRVLISPAN EVVKWAAGNS VEIELDAIYP GENIQINFGK DAPCTWGRLE ISTDGKEWKT VDLKQKESRL SAGLQKAPVK FVRFTNVSDE EQQVYLRQFV LTIEKK // ID 2W67B STANDARD; PRT; 716 AA. DT CONVERTED FROM PDB (SEQRES) 2W67 DE O-GLCNACASE BT_4395 OS BACTEROIDES THETAIOTAOMICRON VPI-5482 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.250 CC R-Factor 0.183 FT #SUB 470 470 TYR B 526 526 ARG A Protein S 2 FT #SUB 473 473 GLU B 514 514 LYS A Protein S 1 FT #SUB 473 473 GLU B 523 523 TYR A Protein S 4 FT #SUB 473 473 GLU B 526 526 ARG A Protein S 2 FT #SUB 473 473 GLU B 527 527 LYS A Protein S 3 FT #SUB 474 474 ARG B 526 526 ARG A Protein S 1 FT #SUB 476 476 LYS B 476 476 LYS A Protein S 1 FT #SUB 476 476 LYS B 530 530 HIS A Protein B 4 FT #SUB 477 477 GLU B 526 526 ARG A Protein S 11 FT #SUB 477 477 GLU B 527 527 LYS A Protein S 4 FT #SUB 477 477 GLU B 530 530 HIS A Protein A 6 FT #SUB 480 480 ASP B 530 530 HIS A Protein S 6 FT #SUB 480 480 ASP B 533 533 ALA A Protein B 2 FT #SUB 480 480 ASP B 534 534 LEU A Protein S 2 FT #SUB 480 480 ASP B 537 537 GLN A Protein A 7 FT #SUB 481 481 ILE B 529 529 ASN A Protein S 2 FT #SUB 481 481 ILE B 533 533 ALA A Protein A 3 FT #SUB 483 483 LEU B 537 537 GLN A Protein S 1 FT #SUB 484 484 MET B 533 533 ALA A Protein S 1 FT #SUB 484 484 MET B 536 536 GLN A Protein A 5 FT #SUB 484 484 MET B 537 537 GLN A Protein S 5 FT #SUB 503 503 LYS B 507 507 GLU A Protein S 4 FT #SUB 507 507 GLU B 503 503 LYS A Protein S 5 FT #SUB 514 514 LYS B 473 473 GLU A Protein S 4 FT #SUB 523 523 TYR B 473 473 GLU A Protein S 4 FT #SUB 526 526 ARG B 473 473 GLU A Protein S 1 FT #SUB 526 526 ARG B 474 474 ARG A Protein S 1 FT #SUB 526 526 ARG B 477 477 GLU A Protein S 10 FT #SUB 527 527 LYS B 473 473 GLU A Protein S 3 FT #SUB 527 527 LYS B 477 477 GLU A Protein S 5 FT #SUB 529 529 ASN B 481 481 ILE A Protein A 2 FT #SUB 530 530 HIS B 476 476 LYS A Protein S 1 FT #SUB 530 530 HIS B 477 477 GLU A Protein S 8 FT #SUB 530 530 HIS B 480 480 ASP A Protein S 6 FT #SUB 533 533 ALA B 480 480 ASP A Protein S 2 FT #SUB 533 533 ALA B 481 481 ILE A Protein S 3 FT #SUB 533 533 ALA B 484 484 MET A Protein B 1 FT #SUB 534 534 LEU B 480 480 ASP A Protein S 1 FT #SUB 536 536 GLN B 484 484 MET A Protein B 2 FT #SUB 537 537 GLN B 480 480 ASP A Protein S 5 FT #SUB 537 537 GLN B 483 483 LEU A Protein S 1 FT #SUB 537 537 GLN B 484 484 MET A Protein A 5 FT #HET 32 32 GLU B 5 1590 CA B B 2 FT #HET 61 61 GLU B 5 1590 CA B S 2 FT #HET 64 64 ASP B 5 1590 CA B S 3 FT #HET 135 135 GLY B 4 1589 F34 B B 4 FT #HET 136 136 PHE B 4 1589 F34 B B 1 FT #HET 137 137 TYR B 4 1589 F34 B A 3 FT #HET 166 166 LYS B 4 1589 F34 B S 3 FT #HET 242 242 ASP B 4 1589 F34 B S 8 FT #HET 243 243 ASP B 4 1589 F34 B S 5 FT #HET 278 278 CYS B 4 1589 F34 B S 1 FT #HET 282 282 TYR B 4 1589 F34 B S 15 FT #HET 310 310 THR B 4 1589 F34 B S 2 FT #HET 314 314 VAL B 4 1589 F34 B S 2 FT #HET 337 337 TRP B 4 1589 F34 B S 6 FT #HET 339 339 ASN B 4 1589 F34 B S 7 FT #HET 342 342 VAL B 4 1589 F34 B S 3 FT #HET 344 344 ASP B 4 1589 F34 B S 11 FT #HET 372 372 ASN B 4 1589 F34 B S 4 FT DISORDER 1 4 FT DISORDER 46 53 FT DISORDER 589 716 CC SEQUENCE 576 AA (ATOM); CC LQPPPQQLIV QNKTIDLPAV YQLNGGEEAN PHAVKVLKEL LGMLISIGEK GDKSVRKYSR CC QIPDHKEGYY LSVNEKEIVL AGNDERGTYY ALQTFAQLLK DGKLPEVEIK DYPSVRYRGV CC VEGFYGTPWS HQARLSQLKF YGKNKMNTYI YGPKDDPYHS APNWRLPYPD KEAAQLQELV CC AVANENEVDF VWAIHPGQDI KWNKEDRDLL LAKFEKMYQL GVRSFAVFFD DISGEGTNPQ CC KQAELLNYID EKFAQVKPDI NQLVMCPTEY NKSWSNPNGN YLTTLGDKLN PSIQIMWTGD CC RVISDITRDG ISWINERIKR PAYIWWNFPV SDYVRDHLLL GPVYGNDTTI AKEMSGFVTN CC PMEHAESSKI AIYSVASYAW NPAKYDTWQT WKDAIRTILP SAAEELECFA MHNSDLGPNG CC HGYRREESMD IQPAAERFLK AFKEGKNYDK ADFETLQYTF ERMKESADIL LMNTENKPLI CC VEITPWVHQF KLTAEMGEEV LKMVEGRNES YFLRKYNHVK ALQQQMFYID QTSNQNPYQP CC GVKTATRVIK PLIDRTFATV VKFFNQKFNA HLDATT CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES QNVSLQPPPQQLIVQNKTIDLPAVYQLNGGEEANPHAVKVLKELLSGKQS CC ATOM ----LQPPPQQLIVQNKTIDLPAVYQLNGGEEANPHAVKVLKELL----- CC ***************************************** CC SEQRES SKKGMLISIGEKGDKSVRKYSRQIPDHKEGYYLSVNEKEIVLAGNDERGT CC ATOM ---GMLISIGEKGDKSVRKYSRQIPDHKEGYYLSVNEKEIVLAGNDERGT CC *********************************************** CC SEQRES YYALQTFAQLLKDGKLPEVEIKDYPSVRYRGVVEGFYGTPWSHQARLSQL CC ATOM YYALQTFAQLLKDGKLPEVEIKDYPSVRYRGVVEGFYGTPWSHQARLSQL CC ************************************************** CC SEQRES KFYGKNKMNTYIYGPKDDPYHSAPNWRLPYPDKEAAQLQELVAVANENEV CC ATOM KFYGKNKMNTYIYGPKDDPYHSAPNWRLPYPDKEAAQLQELVAVANENEV CC ************************************************** CC SEQRES DFVWAIHPGQDIKWNKEDRDLLLAKFEKMYQLGVRSFAVFFDDISGEGTN CC ATOM DFVWAIHPGQDIKWNKEDRDLLLAKFEKMYQLGVRSFAVFFDDISGEGTN CC ************************************************** CC SEQRES PQKQAELLNYIDEKFAQVKPDINQLVMCPTEYNKSWSNPNGNYLTTLGDK CC ATOM PQKQAELLNYIDEKFAQVKPDINQLVMCPTEYNKSWSNPNGNYLTTLGDK CC ************************************************** CC SEQRES LNPSIQIMWTGDRVISDITRDGISWINERIKRPAYIWWNFPVSDYVRDHL CC ATOM LNPSIQIMWTGDRVISDITRDGISWINERIKRPAYIWWNFPVSDYVRDHL CC ************************************************** CC SEQRES LLGPVYGNDTTIAKEMSGFVTNPMEHAESSKIAIYSVASYAWNPAKYDTW CC ATOM LLGPVYGNDTTIAKEMSGFVTNPMEHAESSKIAIYSVASYAWNPAKYDTW CC ************************************************** CC SEQRES QTWKDAIRTILPSAAEELECFAMHNSDLGPNGHGYRREESMDIQPAAERF CC ATOM QTWKDAIRTILPSAAEELECFAMHNSDLGPNGHGYRREESMDIQPAAERF CC ************************************************** CC SEQRES LKAFKEGKNYDKADFETLQYTFERMKESADILLMNTENKPLIVEITPWVH CC ATOM LKAFKEGKNYDKADFETLQYTFERMKESADILLMNTENKPLIVEITPWVH CC ************************************************** CC SEQRES QFKLTAEMGEEVLKMVEGRNESYFLRKYNHVKALQQQMFYIDQTSNQNPY CC ATOM QFKLTAEMGEEVLKMVEGRNESYFLRKYNHVKALQQQMFYIDQTSNQNPY CC ************************************************** CC SEQRES QPGVKTATRVIKPLIDRTFATVVKFFNQKFNAHLDATTDYMPHKMISNVE CC ATOM QPGVKTATRVIKPLIDRTFATVVKFFNQKFNAHLDATT------------ CC ************************************** CC SEQRES QIKNLPLQVKANRVLISPANEVVKWAAGNSVEIELDAIYPGENIQINFGK CC ATOM -------------------------------------------------- CC CC SEQRES DAPCTWGRLEISTDGKEWKTVDLKQKESRLSAGLQKAPVKFVRFTNVSDE CC ATOM -------------------------------------------------- CC CC SEQRES EQQVYLRQFVLTIEKK CC ATOM ---------------- CC SQ SEQUENCE 716 AA; MW; CN; QNVSLQPPPQ QLIVQNKTID LPAVYQLNGG EEANPHAVKV LKELLSGKQS SKKGMLISIG EKGDKSVRKY SRQIPDHKEG YYLSVNEKEI VLAGNDERGT YYALQTFAQL LKDGKLPEVE IKDYPSVRYR GVVEGFYGTP WSHQARLSQL KFYGKNKMNT YIYGPKDDPY HSAPNWRLPY PDKEAAQLQE LVAVANENEV DFVWAIHPGQ DIKWNKEDRD LLLAKFEKMY QLGVRSFAVF FDDISGEGTN PQKQAELLNY IDEKFAQVKP DINQLVMCPT EYNKSWSNPN GNYLTTLGDK LNPSIQIMWT GDRVISDITR DGISWINERI KRPAYIWWNF PVSDYVRDHL LLGPVYGNDT TIAKEMSGFV TNPMEHAESS KIAIYSVASY AWNPAKYDTW QTWKDAIRTI LPSAAEELEC FAMHNSDLGP NGHGYRREES MDIQPAAERF LKAFKEGKNY DKADFETLQY TFERMKESAD ILLMNTENKP LIVEITPWVH QFKLTAEMGE EVLKMVEGRN ESYFLRKYNH VKALQQQMFY IDQTSNQNPY QPGVKTATRV IKPLIDRTFA TVVKFFNQKF NAHLDATTDY MPHKMISNVE QIKNLPLQVK ANRVLISPAN EVVKWAAGNS VEIELDAIYP GENIQINFGK DAPCTWGRLE ISTDGKEWKT VDLKQKESRL SAGLQKAPVK FVRFTNVSDE EQQVYLRQFV LTIEKK //