ID 2CBUA STANDARD; PRT; 468 AA. DT CONVERTED FROM PDB (SEQRES) 2CBU DE BETA-GLUCOSIDASE A OS THERMOTOGA MARITIMA CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.850 CC R-Factor 0.186 FT #SUB 68 46 LYS A 25 3 VAL B Protein S 1 FT #SUB 343 321 ALA A 24 2 ASN B Protein B 1 FT #SUB 344 322 MET A 24 2 ASN B Protein B 6 FT #SUB 345 323 GLY A 24 2 ASN B Protein B 3 FT #HET 42 20 GLN A 1 1447 CTS A S 7 FT #HET 75 53 VAL A 2 1448 ACT A S 2 FT #HET 76 54 ALA A 2 1448 ACT A B 6 FT #HET 143 121 HIS A 1 1447 CTS A S 7 FT #HET 144 122 TRP A 1 1447 CTS A S 2 FT #HET 187 165 ASN A 1 1447 CTS A S 4 FT #HET 188 166 GLU A 1 1447 CTS A S 7 FT #HET 317 295 TYR A 1 1447 CTS A S 10 FT #HET 346 324 TRP A 1 1447 CTS A S 3 FT #HET 373 351 GLU A 1 1447 CTS A S 11 FT #HET 386 364 GLY A 3 1449 ACT A B 2 FT #HET 387 365 ARG A 3 1449 ACT A S 4 FT #HET 420 398 TRP A 1 1447 CTS A S 14 FT #HET 427 405 GLU A 1 1447 CTS A S 12 FT #HET 428 406 TRP A 1 1447 CTS A S 9 FT #HET 432 410 TYR A 2 1448 ACT A S 3 FT #HET 436 414 PHE A 1 1447 CTS A S 4 FT #HET 443 421 TYR A 2 1448 ACT A S 7 FT #HET 449 427 ILE A 3 1449 ACT A S 2 FT #HET 450 428 VAL A 3 1449 ACT A B 1 FT #HET 455 433 TYR A 3 1449 ACT A S 5 FT DISORDER 1 24 FT DISORDER 255 256 FT DISORDER 328 329 CC Miss-SC 2 CC SEQUENCE 440 AA (ATOM); CC VKKFPEGFLW GVATASYQIE GSPLADGAGM SIWHTFSHTP GNVKNGDTGD VACDHYNRWK CC EDIEIIEKLG VKAYRFSISW PRILPEGTGR VNQKGLDFYN RIIDTLLEKG ITPFVTIYHW CC DLPFALQLKG GWANREIADW FAEYSRVLFE NFGDRVKNWI TLNEPWVVAI VGHLYGVHAP CC GMRDIYVAFR AVHNLLRAHA RAVKVFRETV KDGKIGIVFN NGYFEPASEK DIRAVRFMHQ CC FNNYPLFLNP IYRGDYPELV LEFAREYLPE NYKDDMSEIQ EKIDFVGLNY YSGHLVKFDP CC DAKVSFVERD LPKTAMGWEI VPEGIYWILK KVKEEYNPPE VYITENGAAF DDVVSEDGRV CC HDQNRIDYLK AHIGQAWKAI QEGVPLKGYF VWSLLDNFEW AEGYSKRFGI VYVDYSTQKR CC IVKDSGYWYS NVVKNNGLED CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MGSSHHHHHHSSGLVPRGSHMASNVKKFPEGFLWGVATASYQIEGSPLAD CC ATOM ------------------------VKKFPEGFLWGVATASYQIEGSPLAD CC ************************** CC SEQRES GAGMSIWHTFSHTPGNVKNGDTGDVACDHYNRWKEDIEIIEKLGVKAYRF CC ATOM GAGMSIWHTFSHTPGNVKNGDTGDVACDHYNRWKEDIEIIEKLGVKAYRF CC ************************************************** CC SEQRES SISWPRILPEGTGRVNQKGLDFYNRIIDTLLEKGITPFVTIYHWDLPFAL CC ATOM SISWPRILPEGTGRVNQKGLDFYNRIIDTLLEKGITPFVTIYHWDLPFAL CC ************************************************** CC SEQRES QLKGGWANREIADWFAEYSRVLFENFGDRVKNWITLNEPWVVAIVGHLYG CC ATOM QLKGGWANREIADWFAEYSRVLFENFGDRVKNWITLNEPWVVAIVGHLYG CC ************************************************** CC SEQRES VHAPGMRDIYVAFRAVHNLLRAHARAVKVFRETVKDGKIGIVFNNGYFEP CC ATOM VHAPGMRDIYVAFRAVHNLLRAHARAVKVFRETVKDGKIGIVFNNGYFEP CC ************************************************** CC SEQRES ASEKEEDIRAVRFMHQFNNYPLFLNPIYRGDYPELVLEFAREYLPENYKD CC ATOM ASEK--DIRAVRFMHQFNNYPLFLNPIYRGDYPELVLEFAREYLPENYKD CC **** ******************************************** CC SEQRES DMSEIQEKIDFVGLNYYSGHLVKFDPDAPAKVSFVERDLPKTAMGWEIVP CC ATOM DMSEIQEKIDFVGLNYYSGHLVKFDPD--AKVSFVERDLPKTAMGWEIVP CC *************************** ********************* CC SEQRES EGIYWILKKVKEEYNPPEVYITENGAAFDDVVSEDGRVHDQNRIDYLKAH CC ATOM EGIYWILKKVKEEYNPPEVYITENGAAFDDVVSEDGRVHDQNRIDYLKAH CC ************************************************** CC SEQRES IGQAWKAIQEGVPLKGYFVWSLLDNFEWAEGYSKRFGIVYVDYSTQKRIV CC ATOM IGQAWKAIQEGVPLKGYFVWSLLDNFEWAEGYSKRFGIVYVDYSTQKRIV CC ************************************************** CC SEQRES KDSGYWYSNVVKNNGLED CC ATOM KDSGYWYSNVVKNNGLED CC ****************** SQ SEQUENCE 468 AA; MW; CN; MGSSHHHHHH SSGLVPRGSH MASNVKKFPE GFLWGVATAS YQIEGSPLAD GAGMSIWHTF SHTPGNVKNG DTGDVACDHY NRWKEDIEII EKLGVKAYRF SISWPRILPE GTGRVNQKGL DFYNRIIDTL LEKGITPFVT IYHWDLPFAL QLKGGWANRE IADWFAEYSR VLFENFGDRV KNWITLNEPW VVAIVGHLYG VHAPGMRDIY VAFRAVHNLL RAHARAVKVF RETVKDGKIG IVFNNGYFEP ASEKEEDIRA VRFMHQFNNY PLFLNPIYRG DYPELVLEFA REYLPENYKD DMSEIQEKID FVGLNYYSGH LVKFDPDAPA KVSFVERDLP KTAMGWEIVP EGIYWILKKV KEEYNPPEVY ITENGAAFDD VVSEDGRVHD QNRIDYLKAH IGQAWKAIQE GVPLKGYFVW SLLDNFEWAE GYSKRFGIVY VDYSTQKRIV KDSGYWYSNV VKNNGLED // ID 2CBUB STANDARD; PRT; 468 AA. DT CONVERTED FROM PDB (SEQRES) 2CBU DE BETA-GLUCOSIDASE A OS THERMOTOGA MARITIMA CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.850 CC R-Factor 0.186 FT #SUB 24 2 ASN B 343 321 ALA A Protein S 1 FT #SUB 24 2 ASN B 344 322 MET A Protein S 6 FT #SUB 24 2 ASN B 345 323 GLY A Protein S 3 FT #SUB 25 3 VAL B 68 46 LYS A Protein S 1 FT #HET 42 20 GLN B 4 1445 CTS B S 7 FT #HET 143 121 HIS B 4 1445 CTS B S 7 FT #HET 144 122 TRP B 4 1445 CTS B S 2 FT #HET 187 165 ASN B 4 1445 CTS B S 4 FT #HET 188 166 GLU B 4 1445 CTS B S 7 FT #HET 300 278 ASP B 5 1446 CA B B 2 FT #HET 303 281 SER B 5 1446 CA B S 1 FT #HET 304 282 GLU B 5 1446 CA B S 3 FT #HET 317 295 TYR B 4 1445 CTS B S 11 FT #HET 346 324 TRP B 4 1445 CTS B S 3 FT #HET 373 351 GLU B 4 1445 CTS B S 11 FT #HET 420 398 TRP B 4 1445 CTS B S 14 FT #HET 427 405 GLU B 4 1445 CTS B S 10 FT #HET 428 406 TRP B 4 1445 CTS B S 8 FT #HET 436 414 PHE B 4 1445 CTS B S 3 FT DISORDER 1 23 FT DISORDER 329 330 FT DISORDER 467 468 CC Miss-SC 2 CC SEQUENCE 441 AA (ATOM); CC NVKKFPEGFL WGVATASYQI EGSPLADGAG MSIWHTFSHT PGNVKNGDTG DVACDHYNRW CC KEDIEIIEKL GVKAYRFSIS WPRILPEGTG RVNQKGLDFY NRIIDTLLEK GITPFVTIYH CC WDLPFALQLK GGWANREIAD WFAEYSRVLF ENFGDRVKNW ITLNEPWVVA IVGHLYGVHA CC PGMRDIYVAF RAVHNLLRAH ARAVKVFRET VKDGKIGIVF NNGYFEPASE KEEDIRAVRF CC MHQFNNYPLF LNPIYRGDYP ELVLEFAREY LPENYKDDMS EIQEKIDFVG LNYYSGHLVK CC FDPDAKVSFV ERDLPKTAMG WEIVPEGIYW ILKKVKEEYN PPEVYITENG AAFDDVVSED CC GRVHDQNRID YLKAHIGQAW KAIQEGVPLK GYFVWSLLDN FEWAEGYSKR FGIVYVDYST CC QKRIVKDSGY WYSNVVKNNG L CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MGSSHHHHHHSSGLVPRGSHMASNVKKFPEGFLWGVATASYQIEGSPLAD CC ATOM -----------------------NVKKFPEGFLWGVATASYQIEGSPLAD CC *************************** CC SEQRES GAGMSIWHTFSHTPGNVKNGDTGDVACDHYNRWKEDIEIIEKLGVKAYRF CC ATOM GAGMSIWHTFSHTPGNVKNGDTGDVACDHYNRWKEDIEIIEKLGVKAYRF CC ************************************************** CC SEQRES SISWPRILPEGTGRVNQKGLDFYNRIIDTLLEKGITPFVTIYHWDLPFAL CC ATOM SISWPRILPEGTGRVNQKGLDFYNRIIDTLLEKGITPFVTIYHWDLPFAL CC ************************************************** CC SEQRES QLKGGWANREIADWFAEYSRVLFENFGDRVKNWITLNEPWVVAIVGHLYG CC ATOM QLKGGWANREIADWFAEYSRVLFENFGDRVKNWITLNEPWVVAIVGHLYG CC ************************************************** CC SEQRES VHAPGMRDIYVAFRAVHNLLRAHARAVKVFRETVKDGKIGIVFNNGYFEP CC ATOM VHAPGMRDIYVAFRAVHNLLRAHARAVKVFRETVKDGKIGIVFNNGYFEP CC ************************************************** CC SEQRES ASEKEEDIRAVRFMHQFNNYPLFLNPIYRGDYPELVLEFAREYLPENYKD CC ATOM ASEKEEDIRAVRFMHQFNNYPLFLNPIYRGDYPELVLEFAREYLPENYKD CC ************************************************** CC SEQRES DMSEIQEKIDFVGLNYYSGHLVKFDPDAPAKVSFVERDLPKTAMGWEIVP CC ATOM DMSEIQEKIDFVGLNYYSGHLVKFDPDA--KVSFVERDLPKTAMGWEIVP CC **************************** ******************** CC SEQRES EGIYWILKKVKEEYNPPEVYITENGAAFDDVVSEDGRVHDQNRIDYLKAH CC ATOM EGIYWILKKVKEEYNPPEVYITENGAAFDDVVSEDGRVHDQNRIDYLKAH CC ************************************************** CC SEQRES IGQAWKAIQEGVPLKGYFVWSLLDNFEWAEGYSKRFGIVYVDYSTQKRIV CC ATOM IGQAWKAIQEGVPLKGYFVWSLLDNFEWAEGYSKRFGIVYVDYSTQKRIV CC ************************************************** CC SEQRES KDSGYWYSNVVKNNGLED CC ATOM KDSGYWYSNVVKNNGL-- CC **************** SQ SEQUENCE 468 AA; MW; CN; MGSSHHHHHH SSGLVPRGSH MASNVKKFPE GFLWGVATAS YQIEGSPLAD GAGMSIWHTF SHTPGNVKNG DTGDVACDHY NRWKEDIEII EKLGVKAYRF SISWPRILPE GTGRVNQKGL DFYNRIIDTL LEKGITPFVT IYHWDLPFAL QLKGGWANRE IADWFAEYSR VLFENFGDRV KNWITLNEPW VVAIVGHLYG VHAPGMRDIY VAFRAVHNLL RAHARAVKVF RETVKDGKIG IVFNNGYFEP ASEKEEDIRA VRFMHQFNNY PLFLNPIYRG DYPELVLEFA REYLPENYKD DMSEIQEKID FVGLNYYSGH LVKFDPDAPA KVSFVERDLP KTAMGWEIVP EGIYWILKKV KEEYNPPEVY ITENGAAFDD VVSEDGRVHD QNRIDYLKAH IGQAWKAIQE GVPLKGYFVW SLLDNFEWAE GYSKRFGIVY VDYSTQKRIV KDSGYWYSNV VKNNGLED //