ID 1K72A STANDARD; PRT; 614 AA. DT CONVERTED FROM PDB (SEQRES) 1K72 DE Endoglucanase 9G OS Clostridium cellulolyticum CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.800 CC R-Factor 0.166 FT #SUB 370 370 GLN A 294 294 ASN B Protein S 1 FT #SUB 386 386 THR A 336 336 SER B Protein A 3 FT #SUB 389 389 THR A 336 336 SER B Protein S 1 FT #SUB 389 389 THR A 337 337 LYS B Protein S 3 FT #SUB 390 390 TYR A 292 292 GLY B Protein S 2 FT #SUB 453 453 LYS A 295 295 GLY B Protein S 1 FT #SUB 455 455 THR A 295 295 GLY B Protein S 3 FT #HET 24 24 ASP A 6 780 MG A S 3 FT #HET 25 25 LEU A 6 780 MG A B 1 FT #HET 58 58 ASP A 2 2 BGC C S 2 FT #HET 125 125 HIS A 1 1 BGC C A 5 FT #HET 125 125 HIS A 2 2 BGC C S 10 FT #HET 126 126 SER A 3 615 GLC A B 1 FT #HET 128 128 TRP A 1 1 BGC C S 2 FT #HET 128 128 TRP A 3 615 GLC A A 4 FT #HET 195 195 ASP A 9 783 GOL A B 3 FT #HET 196 196 ALA A 9 783 GOL A B 1 FT #HET 198 198 TYR A 9 783 GOL A A 6 FT #HET 204 204 TYR A 2 2 BGC C S 2 FT #HET 206 206 SER A 9 783 GOL A A 3 FT #HET 207 207 SER A 9 783 GOL A B 7 FT #HET 209 209 SER A 5 779 CA A A 5 FT #HET 212 212 ASP A 5 779 CA A S 3 FT #HET 213 213 ASP A 5 779 CA A S 3 FT #HET 259 259 ASP A 5 779 CA A B 2 FT #HET 373 373 HIS A 1 1 BGC C S 4 FT #HET 373 373 HIS A 2 2 BGC C S 14 FT #HET 375 375 ARG A 1 1 BGC C S 3 FT #HET 375 375 ARG A 2 2 BGC C S 10 FT #HET 383 383 ASP A 3 615 GLC A A 10 FT #HET 384 384 GLN A 3 615 GLC A A 3 FT #HET 385 385 MET A 1 1 BGC C S 5 FT #HET 385 385 MET A 3 615 GLC A A 12 FT #HET 386 386 THR A 3 615 GLC A A 3 FT #HET 416 416 TYR A 2 2 BGC C S 12 FT #HET 420 420 GLU A 2 2 BGC C S 20 FT #HET 420 420 GLU A 8 782 MG A S 1 FT #HET 500 500 ASP A 4 778 CA A B 2 FT #HET 500 500 ASP A 7 781 MG A S 3 FT #HET 503 503 GLU A 4 778 CA A S 3 FT #HET 578 578 ASN A 4 778 CA A B 2 FT #HET 581 581 ASN A 4 778 CA A S 3 FT #HET 582 582 ASP A 4 778 CA A S 3 FT DISORDER 1 2 CC SEQUENCE 612 AA (ATOM); CC TYNYGEALQK SIMFYEFQRS GDLPADKRDN WRDDSGMKDG SDVGVDLTGG WYDAGDHVKF CC NLPMSYTSAM LAWSLYEDKD AYDKSGQTKY IMDGIKWAND YFIKCNPTPG VYYYQVGDGG CC KDHSWWGPAE VMQMERPSFK VDASKPGSAV CASTAASLAS AAVVFKSSDP TYAEKCISHA CC KNLFDMADKA KSDAGYTAAS GYYSSSSFYD DLSWAAVWLY LATNDSTYLD KAESYVPNWG CC KEQQTDIIAY KWGQXWDDVH YGAELLLAKL TNKQLYKDSI EMNLDFWTTG VNGTRVSYTP CC KGLAWLFQWG SLRHATTQAF LAGVYAEWEG CTPSKVSVYK DFLKSQIDYA LGSTGRSFVV CC GYGVNPPQHP HHRTAHGSWT DQMTSPTYHR HTIYGALVGG PDNADGYTDE INNYVNNEIA CC CDYNAGFTGA LAKMYKHSGG DPIPNFKAIE KITNDEVIIK AGLNSTGPNY TEIKAVVYNQ CC TGWPARVTDK ISFKYFMDLS EIVAAGIDPL SLVTSSNYSE GKNTKVSGVL PWDVSNNVYY CC VNVDLTGENI YPGGQSACRR EVQFRIAAPQ GTTYWNPKND FSYDGLPTTS TVNTVTNIPV CC YDNGVKVFGN EP CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES AGTYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLT CC ATOM --TYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLT CC ************************************************ CC SEQRES GGWYDAGDHVKFNLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWA CC ATOM GGWYDAGDHVKFNLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWA CC ************************************************** CC SEQRES NDYFIKCNPTPGVYYYQVGDGGKDHSWWGPAEVMQMERPSFKVDASKPGS CC ATOM NDYFIKCNPTPGVYYYQVGDGGKDHSWWGPAEVMQMERPSFKVDASKPGS CC ************************************************** CC SEQRES AVCASTAASLASAAVVFKSSDPTYAEKCISHAKNLFDMADKAKSDAGYTA CC ATOM AVCASTAASLASAAVVFKSSDPTYAEKCISHAKNLFDMADKAKSDAGYTA CC ************************************************** CC SEQRES ASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVPNWGKEQQTDII CC ATOM ASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVPNWGKEQQTDII CC ************************************************** CC SEQRES AYKWGQXWDDVHYGAELLLAKLTNKQLYKDSIEMNLDFWTTGVNGTRVSY CC ATOM AYKWGQXWDDVHYGAELLLAKLTNKQLYKDSIEMNLDFWTTGVNGTRVSY CC ************************************************** CC SEQRES TPKGLAWLFQWGSLRHATTQAFLAGVYAEWEGCTPSKVSVYKDFLKSQID CC ATOM TPKGLAWLFQWGSLRHATTQAFLAGVYAEWEGCTPSKVSVYKDFLKSQID CC ************************************************** CC SEQRES YALGSTGRSFVVGYGVNPPQHPHHRTAHGSWTDQMTSPTYHRHTIYGALV CC ATOM YALGSTGRSFVVGYGVNPPQHPHHRTAHGSWTDQMTSPTYHRHTIYGALV CC ************************************************** CC SEQRES GGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGGDPIPNFKA CC ATOM GGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGGDPIPNFKA CC ************************************************** CC SEQRES IEKITNDEVIIKAGLNSTGPNYTEIKAVVYNQTGWPARVTDKISFKYFMD CC ATOM IEKITNDEVIIKAGLNSTGPNYTEIKAVVYNQTGWPARVTDKISFKYFMD CC ************************************************** CC SEQRES LSEIVAAGIDPLSLVTSSNYSEGKNTKVSGVLPWDVSNNVYYVNVDLTGE CC ATOM LSEIVAAGIDPLSLVTSSNYSEGKNTKVSGVLPWDVSNNVYYVNVDLTGE CC ************************************************** CC SEQRES NIYPGGQSACRREVQFRIAAPQGTTYWNPKNDFSYDGLPTTSTVNTVTNI CC ATOM NIYPGGQSACRREVQFRIAAPQGTTYWNPKNDFSYDGLPTTSTVNTVTNI CC ************************************************** CC SEQRES PVYDNGVKVFGNEP CC ATOM PVYDNGVKVFGNEP CC ************** SQ SEQUENCE 614 AA; MW; CN; AGTYNYGEAL QKSIMFYEFQ RSGDLPADKR DNWRDDSGMK DGSDVGVDLT GGWYDAGDHV KFNLPMSYTS AMLAWSLYED KDAYDKSGQT KYIMDGIKWA NDYFIKCNPT PGVYYYQVGD GGKDHSWWGP AEVMQMERPS FKVDASKPGS AVCASTAASL ASAAVVFKSS DPTYAEKCIS HAKNLFDMAD KAKSDAGYTA ASGYYSSSSF YDDLSWAAVW LYLATNDSTY LDKAESYVPN WGKEQQTDII AYKWGQXWDD VHYGAELLLA KLTNKQLYKD SIEMNLDFWT TGVNGTRVSY TPKGLAWLFQ WGSLRHATTQ AFLAGVYAEW EGCTPSKVSV YKDFLKSQID YALGSTGRSF VVGYGVNPPQ HPHHRTAHGS WTDQMTSPTY HRHTIYGALV GGPDNADGYT DEINNYVNNE IACDYNAGFT GALAKMYKHS GGDPIPNFKA IEKITNDEVI IKAGLNSTGP NYTEIKAVVY NQTGWPARVT DKISFKYFMD LSEIVAAGID PLSLVTSSNY SEGKNTKVSG VLPWDVSNNV YYVNVDLTGE NIYPGGQSAC RREVQFRIAA PQGTTYWNPK NDFSYDGLPT TSTVNTVTNI PVYDNGVKVF GNEP // ID 1K72B STANDARD; PRT; 614 AA. DT CONVERTED FROM PDB (SEQRES) 1K72 DE Endoglucanase 9G OS Clostridium cellulolyticum CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.800 CC R-Factor 0.166 FT #SUB 292 292 GLY B 390 390 TYR A Protein B 2 FT #SUB 294 294 ASN B 370 370 GLN A Protein S 1 FT #SUB 295 295 GLY B 453 453 LYS A Protein B 1 FT #SUB 295 295 GLY B 455 455 THR A Protein B 3 FT #SUB 336 336 SER B 386 386 THR A Protein S 3 FT #SUB 336 336 SER B 389 389 THR A Protein S 1 FT #SUB 337 337 LYS B 389 389 THR A Protein S 3 FT #HET 23 23 GLY B 13 618 GOL B B 2 FT #HET 24 24 ASP B 12 617 MG B S 3 FT #HET 24 24 ASP B 13 618 GOL B A 4 FT #HET 25 25 LEU B 12 617 MG B B 1 FT #HET 50 50 THR B 13 618 GOL B S 4 FT #HET 51 51 GLY B 13 618 GOL B B 3 FT #HET 106 106 LYS B 13 618 GOL B S 5 FT #HET 209 209 SER B 11 616 CA B A 5 FT #HET 212 212 ASP B 11 616 CA B S 3 FT #HET 213 213 ASP B 11 616 CA B S 3 FT #HET 259 259 ASP B 11 616 CA B B 2 FT #HET 500 500 ASP B 10 615 CA B B 2 FT #HET 503 503 GLU B 10 615 CA B S 3 FT #HET 578 578 ASN B 10 615 CA B B 2 FT #HET 581 581 ASN B 10 615 CA B S 3 FT #HET 582 582 ASP B 10 615 CA B S 3 FT DISORDER 1 1 CC SEQUENCE 613 AA (ATOM); CC GTYNYGEALQ KSIMFYEFQR SGDLPADKRD NWRDDSGMKD GSDVGVDLTG GWYDAGDHVK CC FNLPMSYTSA MLAWSLYEDK DAYDKSGQTK YIMDGIKWAN DYFIKCNPTP GVYYYQVGDG CC GKDHSWWGPA EVMQMERPSF KVDASKPGSA VCASTAASLA SAAVVFKSSD PTYAEKCISH CC AKNLFDMADK AKSDAGYTAA SGYYSSSSFY DDLSWAAVWL YLATNDSTYL DKAESYVPNW CC GKEQQTDIIA YKWGQXWDDV HYGAELLLAK LTNKQLYKDS IEMNLDFWTT GVNGTRVSYT CC PKGLAWLFQW GSLRHATTQA FLAGVYAEWE GCTPSKVSVY KDFLKSQIDY ALGSTGRSFV CC VGYGVNPPQH PHHRTAHGSW TDQMTSPTYH RHTIYGALVG GPDNADGYTD EINNYVNNEI CC ACDYNAGFTG ALAKMYKHSG GDPIPNFKAI EKITNDEVII KAGLNSTGPN YTEIKAVVYN CC QTGWPARVTD KISFKYFMDL SEIVAAGIDP LSLVTSSNYS EGKNTKVSGV LPWDVSNNVY CC YVNVDLTGEN IYPGGQSACR REVQFRIAAP QGTTYWNPKN DFSYDGLPTT STVNTVTNIP CC VYDNGVKVFG NEP CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES AGTYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLT CC ATOM -GTYNYGEALQKSIMFYEFQRSGDLPADKRDNWRDDSGMKDGSDVGVDLT CC ************************************************* CC SEQRES GGWYDAGDHVKFNLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWA CC ATOM GGWYDAGDHVKFNLPMSYTSAMLAWSLYEDKDAYDKSGQTKYIMDGIKWA CC ************************************************** CC SEQRES NDYFIKCNPTPGVYYYQVGDGGKDHSWWGPAEVMQMERPSFKVDASKPGS CC ATOM NDYFIKCNPTPGVYYYQVGDGGKDHSWWGPAEVMQMERPSFKVDASKPGS CC ************************************************** CC SEQRES AVCASTAASLASAAVVFKSSDPTYAEKCISHAKNLFDMADKAKSDAGYTA CC ATOM AVCASTAASLASAAVVFKSSDPTYAEKCISHAKNLFDMADKAKSDAGYTA CC ************************************************** CC SEQRES ASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVPNWGKEQQTDII CC ATOM ASGYYSSSSFYDDLSWAAVWLYLATNDSTYLDKAESYVPNWGKEQQTDII CC ************************************************** CC SEQRES AYKWGQXWDDVHYGAELLLAKLTNKQLYKDSIEMNLDFWTTGVNGTRVSY CC ATOM AYKWGQXWDDVHYGAELLLAKLTNKQLYKDSIEMNLDFWTTGVNGTRVSY CC ************************************************** CC SEQRES TPKGLAWLFQWGSLRHATTQAFLAGVYAEWEGCTPSKVSVYKDFLKSQID CC ATOM TPKGLAWLFQWGSLRHATTQAFLAGVYAEWEGCTPSKVSVYKDFLKSQID CC ************************************************** CC SEQRES YALGSTGRSFVVGYGVNPPQHPHHRTAHGSWTDQMTSPTYHRHTIYGALV CC ATOM YALGSTGRSFVVGYGVNPPQHPHHRTAHGSWTDQMTSPTYHRHTIYGALV CC ************************************************** CC SEQRES GGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGGDPIPNFKA CC ATOM GGPDNADGYTDEINNYVNNEIACDYNAGFTGALAKMYKHSGGDPIPNFKA CC ************************************************** CC SEQRES IEKITNDEVIIKAGLNSTGPNYTEIKAVVYNQTGWPARVTDKISFKYFMD CC ATOM IEKITNDEVIIKAGLNSTGPNYTEIKAVVYNQTGWPARVTDKISFKYFMD CC ************************************************** CC SEQRES LSEIVAAGIDPLSLVTSSNYSEGKNTKVSGVLPWDVSNNVYYVNVDLTGE CC ATOM LSEIVAAGIDPLSLVTSSNYSEGKNTKVSGVLPWDVSNNVYYVNVDLTGE CC ************************************************** CC SEQRES NIYPGGQSACRREVQFRIAAPQGTTYWNPKNDFSYDGLPTTSTVNTVTNI CC ATOM NIYPGGQSACRREVQFRIAAPQGTTYWNPKNDFSYDGLPTTSTVNTVTNI CC ************************************************** CC SEQRES PVYDNGVKVFGNEP CC ATOM PVYDNGVKVFGNEP CC ************** SQ SEQUENCE 614 AA; MW; CN; AGTYNYGEAL QKSIMFYEFQ RSGDLPADKR DNWRDDSGMK DGSDVGVDLT GGWYDAGDHV KFNLPMSYTS AMLAWSLYED KDAYDKSGQT KYIMDGIKWA NDYFIKCNPT PGVYYYQVGD GGKDHSWWGP AEVMQMERPS FKVDASKPGS AVCASTAASL ASAAVVFKSS DPTYAEKCIS HAKNLFDMAD KAKSDAGYTA ASGYYSSSSF YDDLSWAAVW LYLATNDSTY LDKAESYVPN WGKEQQTDII AYKWGQXWDD VHYGAELLLA KLTNKQLYKD SIEMNLDFWT TGVNGTRVSY TPKGLAWLFQ WGSLRHATTQ AFLAGVYAEW EGCTPSKVSV YKDFLKSQID YALGSTGRSF VVGYGVNPPQ HPHHRTAHGS WTDQMTSPTY HRHTIYGALV GGPDNADGYT DEINNYVNNE IACDYNAGFT GALAKMYKHS GGDPIPNFKA IEKITNDEVI IKAGLNSTGP NYTEIKAVVY NQTGWPARVT DKISFKYFMD LSEIVAAGID PLSLVTSSNY SEGKNTKVSG VLPWDVSNNV YYVNVDLTGE NIYPGGQSAC RREVQFRIAA PQGTTYWNPK NDFSYDGLPT TSTVNTVTNI PVYDNGVKVF GNEP //