ID 3F5KA STANDARD; PRT; 481 AA. DT CONVERTED FROM PDB (SEQRES) 3F5K DE Beta-glucosidase OS Oryza sativa Japonica Group CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.800 CC R-Factor 0.178 FT #SUB 38 33 MET A 38 33 MET B Protein S 3 FT #SUB 38 33 MET A 73 68 HIS B Protein S 1 FT #SUB 41 36 SER A 41 36 SER B Protein S 4 FT #SUB 70 65 ASP A 70 65 ASP B Protein S 1 FT #SUB 70 65 ASP A 73 68 HIS B Protein S 6 FT #SUB 73 68 HIS A 38 33 MET B Protein S 1 FT #SUB 73 68 HIS A 70 65 ASP B Protein S 6 FT #SUB 73 68 HIS A 73 68 HIS B Protein S 4 FT #SUB 73 68 HIS A 464 459 LEU B Protein B 1 FT #SUB 74 69 ARG A 74 69 ARG B Protein S 10 FT #SUB 76 71 LYS A 462 457 ASN B Protein S 5 FT #SUB 77 72 GLU A 463 458 THR B Protein B 2 FT #SUB 80 75 ASN A 463 458 THR B Protein S 1 FT #SUB 462 457 ASN A 76 71 LYS B Protein A 5 FT #SUB 463 458 THR A 77 72 GLU B Protein A 3 FT #SUB 463 458 THR A 80 75 ASN B Protein S 1 FT #SUB 464 459 LEU A 73 68 HIS B Protein S 1 FT #HET 34 29 GLN A 5 5 BGC C S 7 FT #HET 70 65 ASP A 11 1001 ZN A S 3 FT #HET 73 68 HIS A 11 1001 ZN A S 4 FT #HET 135 130 HIS A 5 5 BGC C S 6 FT #HET 136 131 TYR A 4 4 BGC C S 1 FT #HET 168 163 THR A 12 1002 SO4 A B 1 FT #HET 171 166 ASN A 12 1002 SO4 A S 7 FT #HET 172 167 ARG A 12 1002 SO4 A S 9 FT #HET 180 175 ASN A 5 5 BGC C S 4 FT #HET 181 176 GLN A 4 4 BGC C S 10 FT #HET 181 176 GLN A 5 5 BGC C S 9 FT #HET 187 182 LEU A 2 2 BGC C S 1 FT #HET 188 183 LEU A 3 3 BGC C S 2 FT #HET 188 183 LEU A 4 4 BGC C S 1 FT #HET 192 187 GLN A 2 2 BGC C S 1 FT #HET 248 243 ASP A 4 4 BGC C S 1 FT #HET 250 245 ASN A 3 3 BGC C S 4 FT #HET 250 245 ASN A 4 4 BGC C S 1 FT #HET 283 278 GLY A 13 1003 MES A B 2 FT #HET 284 279 HIS A 13 1003 MES A A 4 FT #HET 285 280 TYR A 13 1003 MES A A 3 FT #HET 290 285 GLN A 13 1003 MES A S 5 FT #HET 299 294 LYS A 13 1003 MES A B 2 FT #HET 300 295 PHE A 13 1003 MES A A 7 FT #HET 305 300 ALA A 13 1003 MES A S 1 FT #HET 318 313 ASN A 5 5 BGC C S 1 FT #HET 320 315 TYR A 4 4 BGC C S 2 FT #HET 320 315 TYR A 5 5 BGC C S 6 FT #HET 346 341 TYR A 2 2 BGC C S 2 FT #HET 348 343 PHE A 2 2 BGC C S 1 FT #HET 363 358 TRP A 3 3 BGC C S 9 FT #HET 363 358 TRP A 4 4 BGC C S 6 FT #HET 363 358 TRP A 5 5 BGC C S 1 FT #HET 391 386 GLU A 5 5 BGC C S 10 FT #HET 438 433 TRP A 5 5 BGC C S 8 FT #HET 445 440 GLU A 4 4 BGC C S 1 FT #HET 445 440 GLU A 5 5 BGC C S 13 FT #HET 446 441 TRP A 5 5 BGC C S 6 FT #HET 454 449 PHE A 5 5 BGC C S 5 FT DISORDER 1 9 CC SEQUENCE 472 AA (ATOM); CC NWLGGLSRAA FPKRFVFGTV TSAYQVEGMA ASGGRGPSIW DAFAHTPGNV AGNQNGDVAT CC DQYHRYKEDV NLMKSLNFDA YRFSISWSRI FPDGEGRVNQ EGVAYYNNLI NYLLQKGITP CC YVNLYHYDLP LALEKKYGGW LNAKMADLFT EYADFCFKTF GNRVKHWFTF NQPRIVALLG CC YDQGTNPPKR CTKCAAGGNS ATEPYIVAHN FLLSHAAAVA RYRTKYQAAQ QGKVGIVLDF CC NWYEALSNST EDQAAAQRAR DFHIGWYLDP LINGHYPQIM QDLVKDRLPK FTPEQARLVK CC GSADYIGINQ YTASYMKGQQ LMQQTPTSYS ADWQVTYVFA KNGKPIGPQA NSNWLYIVPW CC GMYGCVNYIK QKYGNPTVVI TENGMDQPAN LSRDQYLRDT TRVHFYRSYL TQLKKAIDEG CC ANVAGYFAWS LLDNFEWLSG YTSKFGIVYV DFNTLERHPK ASAYWFRDML KH CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES AMADVVPKPNWLGGLSRAAFPKRFVFGTVTSAYQVEGMAASGGRGPSIWD CC ATOM ---------NWLGGLSRAAFPKRFVFGTVTSAYQVEGMAASGGRGPSIWD CC ***************************************** CC SEQRES AFAHTPGNVAGNQNGDVATDQYHRYKEDVNLMKSLNFDAYRFSISWSRIF CC ATOM AFAHTPGNVAGNQNGDVATDQYHRYKEDVNLMKSLNFDAYRFSISWSRIF CC ************************************************** CC SEQRES PDGEGRVNQEGVAYYNNLINYLLQKGITPYVNLYHYDLPLALEKKYGGWL CC ATOM PDGEGRVNQEGVAYYNNLINYLLQKGITPYVNLYHYDLPLALEKKYGGWL CC ************************************************** CC SEQRES NAKMADLFTEYADFCFKTFGNRVKHWFTFNQPRIVALLGYDQGTNPPKRC CC ATOM NAKMADLFTEYADFCFKTFGNRVKHWFTFNQPRIVALLGYDQGTNPPKRC CC ************************************************** CC SEQRES TKCAAGGNSATEPYIVAHNFLLSHAAAVARYRTKYQAAQQGKVGIVLDFN CC ATOM TKCAAGGNSATEPYIVAHNFLLSHAAAVARYRTKYQAAQQGKVGIVLDFN CC ************************************************** CC SEQRES WYEALSNSTEDQAAAQRARDFHIGWYLDPLINGHYPQIMQDLVKDRLPKF CC ATOM WYEALSNSTEDQAAAQRARDFHIGWYLDPLINGHYPQIMQDLVKDRLPKF CC ************************************************** CC SEQRES TPEQARLVKGSADYIGINQYTASYMKGQQLMQQTPTSYSADWQVTYVFAK CC ATOM TPEQARLVKGSADYIGINQYTASYMKGQQLMQQTPTSYSADWQVTYVFAK CC ************************************************** CC SEQRES NGKPIGPQANSNWLYIVPWGMYGCVNYIKQKYGNPTVVITENGMDQPANL CC ATOM NGKPIGPQANSNWLYIVPWGMYGCVNYIKQKYGNPTVVITENGMDQPANL CC ************************************************** CC SEQRES SRDQYLRDTTRVHFYRSYLTQLKKAIDEGANVAGYFAWSLLDNFEWLSGY CC ATOM SRDQYLRDTTRVHFYRSYLTQLKKAIDEGANVAGYFAWSLLDNFEWLSGY CC ************************************************** CC SEQRES TSKFGIVYVDFNTLERHPKASAYWFRDMLKH CC ATOM TSKFGIVYVDFNTLERHPKASAYWFRDMLKH CC ******************************* SQ SEQUENCE 481 AA; MW; CN; AMADVVPKPN WLGGLSRAAF PKRFVFGTVT SAYQVEGMAA SGGRGPSIWD AFAHTPGNVA GNQNGDVATD QYHRYKEDVN LMKSLNFDAY RFSISWSRIF PDGEGRVNQE GVAYYNNLIN YLLQKGITPY VNLYHYDLPL ALEKKYGGWL NAKMADLFTE YADFCFKTFG NRVKHWFTFN QPRIVALLGY DQGTNPPKRC TKCAAGGNSA TEPYIVAHNF LLSHAAAVAR YRTKYQAAQQ GKVGIVLDFN WYEALSNSTE DQAAAQRARD FHIGWYLDPL INGHYPQIMQ DLVKDRLPKF TPEQARLVKG SADYIGINQY TASYMKGQQL MQQTPTSYSA DWQVTYVFAK NGKPIGPQAN SNWLYIVPWG MYGCVNYIKQ KYGNPTVVIT ENGMDQPANL SRDQYLRDTT RVHFYRSYLT QLKKAIDEGA NVAGYFAWSL LDNFEWLSGY TSKFGIVYVD FNTLERHPKA SAYWFRDMLK H // ID 3F5KB STANDARD; PRT; 481 AA. DT CONVERTED FROM PDB (SEQRES) 3F5K DE Beta-glucosidase OS Oryza sativa Japonica Group CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.800 CC R-Factor 0.178 FT #SUB 38 33 MET B 38 33 MET A Protein S 3 FT #SUB 38 33 MET B 73 68 HIS A Protein S 1 FT #SUB 41 36 SER B 41 36 SER A Protein S 4 FT #SUB 70 65 ASP B 70 65 ASP A Protein S 1 FT #SUB 70 65 ASP B 73 68 HIS A Protein S 6 FT #SUB 73 68 HIS B 38 33 MET A Protein S 1 FT #SUB 73 68 HIS B 70 65 ASP A Protein S 6 FT #SUB 73 68 HIS B 73 68 HIS A Protein S 4 FT #SUB 73 68 HIS B 464 459 LEU A Protein B 1 FT #SUB 74 69 ARG B 74 69 ARG A Protein S 10 FT #SUB 76 71 LYS B 462 457 ASN A Protein S 5 FT #SUB 77 72 GLU B 463 458 THR A Protein A 3 FT #SUB 80 75 ASN B 463 458 THR A Protein S 1 FT #SUB 462 457 ASN B 76 71 LYS A Protein A 5 FT #SUB 463 458 THR B 77 72 GLU A Protein S 2 FT #SUB 463 458 THR B 80 75 ASN A Protein S 1 FT #SUB 464 459 LEU B 73 68 HIS A Protein S 1 FT #HET 34 29 GLN B 10 5 BGC D S 7 FT #HET 70 65 ASP B 11 1001 ZN A S 3 FT #HET 73 68 HIS B 11 1001 ZN A S 4 FT #HET 135 130 HIS B 10 5 BGC D S 6 FT #HET 136 131 TYR B 9 4 BGC D S 1 FT #HET 168 163 THR B 15 1002 SO4 B B 1 FT #HET 171 166 ASN B 15 1002 SO4 B S 8 FT #HET 172 167 ARG B 15 1002 SO4 B S 10 FT #HET 180 175 ASN B 10 5 BGC D S 4 FT #HET 181 176 GLN B 9 4 BGC D S 10 FT #HET 181 176 GLN B 10 5 BGC D S 9 FT #HET 187 182 LEU B 7 2 BGC D S 1 FT #HET 188 183 LEU B 8 3 BGC D S 2 FT #HET 188 183 LEU B 9 4 BGC D S 1 FT #HET 192 187 GLN B 7 2 BGC D S 1 FT #HET 248 243 ASP B 9 4 BGC D S 1 FT #HET 250 245 ASN B 8 3 BGC D S 4 FT #HET 250 245 ASN B 9 4 BGC D S 1 FT #HET 283 278 GLY B 16 1003 MES B B 2 FT #HET 284 279 HIS B 16 1003 MES B A 4 FT #HET 285 280 TYR B 16 1003 MES B A 2 FT #HET 290 285 GLN B 16 1003 MES B S 5 FT #HET 299 294 LYS B 16 1003 MES B A 3 FT #HET 300 295 PHE B 16 1003 MES B A 9 FT #HET 302 297 PRO B 16 1003 MES B S 1 FT #HET 305 300 ALA B 16 1003 MES B S 1 FT #HET 318 313 ASN B 10 5 BGC D S 1 FT #HET 320 315 TYR B 9 4 BGC D S 2 FT #HET 320 315 TYR B 10 5 BGC D S 7 FT #HET 346 341 TYR B 7 2 BGC D S 2 FT #HET 348 343 PHE B 7 2 BGC D S 2 FT #HET 363 358 TRP B 8 3 BGC D S 9 FT #HET 363 358 TRP B 9 4 BGC D S 6 FT #HET 363 358 TRP B 10 5 BGC D S 1 FT #HET 391 386 GLU B 10 5 BGC D S 10 FT #HET 438 433 TRP B 10 5 BGC D S 8 FT #HET 445 440 GLU B 9 4 BGC D S 1 FT #HET 445 440 GLU B 10 5 BGC D S 13 FT #HET 446 441 TRP B 10 5 BGC D S 6 FT #HET 454 449 PHE B 10 5 BGC D S 5 FT DISORDER 1 9 CC SEQUENCE 472 AA (ATOM); CC NWLGGLSRAA FPKRFVFGTV TSAYQVEGMA ASGGRGPSIW DAFAHTPGNV AGNQNGDVAT CC DQYHRYKEDV NLMKSLNFDA YRFSISWSRI FPDGEGRVNQ EGVAYYNNLI NYLLQKGITP CC YVNLYHYDLP LALEKKYGGW LNAKMADLFT EYADFCFKTF GNRVKHWFTF NQPRIVALLG CC YDQGTNPPKR CTKCAAGGNS ATEPYIVAHN FLLSHAAAVA RYRTKYQAAQ QGKVGIVLDF CC NWYEALSNST EDQAAAQRAR DFHIGWYLDP LINGHYPQIM QDLVKDRLPK FTPEQARLVK CC GSADYIGINQ YTASYMKGQQ LMQQTPTSYS ADWQVTYVFA KNGKPIGPQA NSNWLYIVPW CC GMYGCVNYIK QKYGNPTVVI TENGMDQPAN LSRDQYLRDT TRVHFYRSYL TQLKKAIDEG CC ANVAGYFAWS LLDNFEWLSG YTSKFGIVYV DFNTLERHPK ASAYWFRDML KH CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES AMADVVPKPNWLGGLSRAAFPKRFVFGTVTSAYQVEGMAASGGRGPSIWD CC ATOM ---------NWLGGLSRAAFPKRFVFGTVTSAYQVEGMAASGGRGPSIWD CC ***************************************** CC SEQRES AFAHTPGNVAGNQNGDVATDQYHRYKEDVNLMKSLNFDAYRFSISWSRIF CC ATOM AFAHTPGNVAGNQNGDVATDQYHRYKEDVNLMKSLNFDAYRFSISWSRIF CC ************************************************** CC SEQRES PDGEGRVNQEGVAYYNNLINYLLQKGITPYVNLYHYDLPLALEKKYGGWL CC ATOM PDGEGRVNQEGVAYYNNLINYLLQKGITPYVNLYHYDLPLALEKKYGGWL CC ************************************************** CC SEQRES NAKMADLFTEYADFCFKTFGNRVKHWFTFNQPRIVALLGYDQGTNPPKRC CC ATOM NAKMADLFTEYADFCFKTFGNRVKHWFTFNQPRIVALLGYDQGTNPPKRC CC ************************************************** CC SEQRES TKCAAGGNSATEPYIVAHNFLLSHAAAVARYRTKYQAAQQGKVGIVLDFN CC ATOM TKCAAGGNSATEPYIVAHNFLLSHAAAVARYRTKYQAAQQGKVGIVLDFN CC ************************************************** CC SEQRES WYEALSNSTEDQAAAQRARDFHIGWYLDPLINGHYPQIMQDLVKDRLPKF CC ATOM WYEALSNSTEDQAAAQRARDFHIGWYLDPLINGHYPQIMQDLVKDRLPKF CC ************************************************** CC SEQRES TPEQARLVKGSADYIGINQYTASYMKGQQLMQQTPTSYSADWQVTYVFAK CC ATOM TPEQARLVKGSADYIGINQYTASYMKGQQLMQQTPTSYSADWQVTYVFAK CC ************************************************** CC SEQRES NGKPIGPQANSNWLYIVPWGMYGCVNYIKQKYGNPTVVITENGMDQPANL CC ATOM NGKPIGPQANSNWLYIVPWGMYGCVNYIKQKYGNPTVVITENGMDQPANL CC ************************************************** CC SEQRES SRDQYLRDTTRVHFYRSYLTQLKKAIDEGANVAGYFAWSLLDNFEWLSGY CC ATOM SRDQYLRDTTRVHFYRSYLTQLKKAIDEGANVAGYFAWSLLDNFEWLSGY CC ************************************************** CC SEQRES TSKFGIVYVDFNTLERHPKASAYWFRDMLKH CC ATOM TSKFGIVYVDFNTLERHPKASAYWFRDMLKH CC ******************************* SQ SEQUENCE 481 AA; MW; CN; AMADVVPKPN WLGGLSRAAF PKRFVFGTVT SAYQVEGMAA SGGRGPSIWD AFAHTPGNVA GNQNGDVATD QYHRYKEDVN LMKSLNFDAY RFSISWSRIF PDGEGRVNQE GVAYYNNLIN YLLQKGITPY VNLYHYDLPL ALEKKYGGWL NAKMADLFTE YADFCFKTFG NRVKHWFTFN QPRIVALLGY DQGTNPPKRC TKCAAGGNSA TEPYIVAHNF LLSHAAAVAR YRTKYQAAQQ GKVGIVLDFN WYEALSNSTE DQAAAQRARD FHIGWYLDPL INGHYPQIMQ DLVKDRLPKF TPEQARLVKG SADYIGINQY TASYMKGQQL MQQTPTSYSA DWQVTYVFAK NGKPIGPQAN SNWLYIVPWG MYGCVNYIKQ KYGNPTVVIT ENGMDQPANL SRDQYLRDTT RVHFYRSYLT QLKKAIDEGA NVAGYFAWSL LDNFEWLSGY TSKFGIVYVD FNTLERHPKA SAYWFRDMLK H //