ID 5OHTA STANDARD; PRT; 686 AA. DT CONVERTED FROM PDB (SEQRES) 5OHT DE Sulfoquinovosidase OS Escherichia coli (strain K12) CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.870 CC R-Factor 0.168 FT #SUB 40 40 TRP A 62 62 GLN B Protein S 3 FT #SUB 61 61 LEU A 61 61 LEU B Protein S 1 FT #SUB 61 61 LEU A 64 64 LYS B Protein B 3 FT #SUB 62 62 GLN A 40 40 TRP B Protein B 2 FT #SUB 62 62 GLN A 64 64 LYS B Protein B 3 FT #SUB 63 63 GLU A 64 64 LYS B Protein A 3 FT #SUB 63 63 GLU A 65 65 ILE B Protein S 2 FT #SUB 63 63 GLU A 66 66 ALA B Protein S 5 FT #SUB 63 63 GLU A 85 85 ARG B Protein S 4 FT #SUB 64 64 LYS A 61 61 LEU B Protein S 3 FT #SUB 64 64 LYS A 62 62 GLN B Protein S 4 FT #SUB 64 64 LYS A 63 63 GLU B Protein B 4 FT #SUB 64 64 LYS A 64 64 LYS B Protein B 9 FT #SUB 65 65 ILE A 63 63 GLU B Protein B 2 FT #SUB 65 65 ILE A 65 65 ILE B Protein S 1 FT #SUB 66 66 ALA A 63 63 GLU B Protein A 5 FT #SUB 85 85 ARG A 63 63 GLU B Protein S 3 FT #SUB 85 85 ARG A 89 89 ILE B Protein S 2 FT #SUB 88 88 ASP A 85 85 ARG B Protein S 3 FT #SUB 89 89 ILE A 85 85 ARG B Protein S 2 FT #SUB 120 120 ARG A 62 62 GLN B Protein S 2 FT #SUB 159 159 LYS A 165 165 TRP B Protein B 2 FT #SUB 160 160 GLN A 165 165 TRP B Protein B 4 FT #SUB 160 160 GLN A 168 168 ASP B Protein S 1 FT #SUB 160 160 GLN A 169 169 CYS B Protein S 3 FT #SUB 161 161 THR A 165 165 TRP B Protein B 3 FT #SUB 162 162 TYR A 162 162 TYR B Protein S 1 FT #SUB 162 162 TYR A 165 165 TRP B Protein A 4 FT #SUB 165 165 TRP A 159 159 LYS B Protein S 2 FT #SUB 165 165 TRP A 160 160 GLN B Protein S 3 FT #SUB 165 165 TRP A 161 161 THR B Protein S 2 FT #SUB 165 165 TRP A 162 162 TYR B Protein S 2 FT #SUB 165 165 TRP A 165 165 TRP B Protein S 8 FT #SUB 168 168 ASP A 160 160 GLN B Protein B 1 FT #SUB 169 169 CYS A 160 160 GLN B Protein B 3 FT #HET 153 153 GLN A 1 701 CA A S 2 FT #HET 154 154 GLY A 1 701 CA A B 2 FT #HET 288 288 GLN A 2 702 9VH A S 5 FT #HET 301 301 ARG A 2 702 9VH A A 13 FT #HET 302 302 VAL A 2 702 9VH A A 5 FT #HET 304 304 TRP A 2 702 9VH A S 6 FT #HET 333 333 TYR A 2 702 9VH A S 2 FT #HET 403 403 MET A 2 702 9VH A S 2 FT #HET 405 405 ASP A 2 702 9VH A S 6 FT #HET 406 406 PHE A 2 702 9VH A S 2 FT #HET 455 455 ARG A 2 702 9VH A S 2 FT #HET 472 472 ASP A 1 701 CA A B 2 FT #HET 472 472 ASP A 2 702 9VH A S 2 FT #HET 481 481 ASP A 1 701 CA A S 3 FT #HET 508 508 TYR A 2 702 9VH A S 6 FT #HET 537 537 HIS A 2 702 9VH A S 6 FT DISORDER 1 8 FT DISORDER 16 16 FT DISORDER 680 686 CC SEQUENCE 670 AA (ATOM); CC LDFQFHQNDS FTLHFQQRLI LTHSKDNPCL WIGSGIADID MFRGNFSIKD KLQEKIALTD CC AIVSQSPDGW LIHFSRGSDI SATLNISADD QGRLLLELQN DNLNHNRIWL RLAAQPEDHI CC YGCGEQFSYF DLRGKPFPLW TSEQGVGRNK QTYVTWQADC KENAGGDYYW TFFPQPTFVS CC TQKYYCHVDN SCYMNFDFSA PEYHELALWE DKATLRFECA DTYISLLEKL TALLGRQPEL CC PDWIYDGVTL GIQGGTEVCQ KKLDTMRNAG VKVNGIWAQD WSGIRMTSFG KRVMWNWKWN CC SENYPQLDSR IKQWNQEGVQ FLAYINPYVA SDKDLCEEAA QHGYLAKDAS GGDYLVEFGE CC FYGGVVDLTN PEAYAWFKEV IKKNMIELGC GGWMADFGEY LPTDTYLHNG VSAEIMHNAW CC PALWAKCNYE ALEETGKLGE ILFFMRAGST GSQKYSTMMW AGDQNVDWSL DDGLASVVPA CC ALSLAMTGHG LHHSDIGGYT TLFEMKRSKE LLLRWCDFSA FTPMMRTHEG NRPGDNWQFD CC GDAETIAHFA RMTTVFTTLK PYLKEAVALN AKSGLPVMRP LFLHYEDDAH TYTLKYQYLL CC GRDILVAPVH EEGRSDWTLY LPEDNWVHAW TGEAFRGGEV TVNAPIGKPP VFYRADSEWA CC ALFASLKSIL CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MDTPRPQLLDFQFHQNNDSFTLHFQQRLILTHSKDNPCLWIGSGIADIDM CC ATOM --------LDFQFHQ-NDSFTLHFQQRLILTHSKDNPCLWIGSGIADIDM CC ******* ********************************** CC SEQRES FRGNFSIKDKLQEKIALTDAIVSQSPDGWLIHFSRGSDISATLNISADDQ CC ATOM FRGNFSIKDKLQEKIALTDAIVSQSPDGWLIHFSRGSDISATLNISADDQ CC ************************************************** CC SEQRES GRLLLELQNDNLNHNRIWLRLAAQPEDHIYGCGEQFSYFDLRGKPFPLWT CC ATOM GRLLLELQNDNLNHNRIWLRLAAQPEDHIYGCGEQFSYFDLRGKPFPLWT CC ************************************************** CC SEQRES SEQGVGRNKQTYVTWQADCKENAGGDYYWTFFPQPTFVSTQKYYCHVDNS CC ATOM SEQGVGRNKQTYVTWQADCKENAGGDYYWTFFPQPTFVSTQKYYCHVDNS CC ************************************************** CC SEQRES CYMNFDFSAPEYHELALWEDKATLRFECADTYISLLEKLTALLGRQPELP CC ATOM CYMNFDFSAPEYHELALWEDKATLRFECADTYISLLEKLTALLGRQPELP CC ************************************************** CC SEQRES DWIYDGVTLGIQGGTEVCQKKLDTMRNAGVKVNGIWAQDWSGIRMTSFGK CC ATOM DWIYDGVTLGIQGGTEVCQKKLDTMRNAGVKVNGIWAQDWSGIRMTSFGK CC ************************************************** CC SEQRES RVMWNWKWNSENYPQLDSRIKQWNQEGVQFLAYINPYVASDKDLCEEAAQ CC ATOM RVMWNWKWNSENYPQLDSRIKQWNQEGVQFLAYINPYVASDKDLCEEAAQ CC ************************************************** CC SEQRES HGYLAKDASGGDYLVEFGEFYGGVVDLTNPEAYAWFKEVIKKNMIELGCG CC ATOM HGYLAKDASGGDYLVEFGEFYGGVVDLTNPEAYAWFKEVIKKNMIELGCG CC ************************************************** CC SEQRES GWMADFGEYLPTDTYLHNGVSAEIMHNAWPALWAKCNYEALEETGKLGEI CC ATOM GWMADFGEYLPTDTYLHNGVSAEIMHNAWPALWAKCNYEALEETGKLGEI CC ************************************************** CC SEQRES LFFMRAGSTGSQKYSTMMWAGDQNVDWSLDDGLASVVPAALSLAMTGHGL CC ATOM LFFMRAGSTGSQKYSTMMWAGDQNVDWSLDDGLASVVPAALSLAMTGHGL CC ************************************************** CC SEQRES HHSDIGGYTTLFEMKRSKELLLRWCDFSAFTPMMRTHEGNRPGDNWQFDG CC ATOM HHSDIGGYTTLFEMKRSKELLLRWCDFSAFTPMMRTHEGNRPGDNWQFDG CC ************************************************** CC SEQRES DAETIAHFARMTTVFTTLKPYLKEAVALNAKSGLPVMRPLFLHYEDDAHT CC ATOM DAETIAHFARMTTVFTTLKPYLKEAVALNAKSGLPVMRPLFLHYEDDAHT CC ************************************************** CC SEQRES YTLKYQYLLGRDILVAPVHEEGRSDWTLYLPEDNWVHAWTGEAFRGGEVT CC ATOM YTLKYQYLLGRDILVAPVHEEGRSDWTLYLPEDNWVHAWTGEAFRGGEVT CC ************************************************** CC SEQRES VNAPIGKPPVFYRADSEWAALFASLKSILEHHHHHH CC ATOM VNAPIGKPPVFYRADSEWAALFASLKSIL------- CC ***************************** SQ SEQUENCE 686 AA; MW; CN; MDTPRPQLLD FQFHQNNDSF TLHFQQRLIL THSKDNPCLW IGSGIADIDM FRGNFSIKDK LQEKIALTDA IVSQSPDGWL IHFSRGSDIS ATLNISADDQ GRLLLELQND NLNHNRIWLR LAAQPEDHIY GCGEQFSYFD LRGKPFPLWT SEQGVGRNKQ TYVTWQADCK ENAGGDYYWT FFPQPTFVST QKYYCHVDNS CYMNFDFSAP EYHELALWED KATLRFECAD TYISLLEKLT ALLGRQPELP DWIYDGVTLG IQGGTEVCQK KLDTMRNAGV KVNGIWAQDW SGIRMTSFGK RVMWNWKWNS ENYPQLDSRI KQWNQEGVQF LAYINPYVAS DKDLCEEAAQ HGYLAKDASG GDYLVEFGEF YGGVVDLTNP EAYAWFKEVI KKNMIELGCG GWMADFGEYL PTDTYLHNGV SAEIMHNAWP ALWAKCNYEA LEETGKLGEI LFFMRAGSTG SQKYSTMMWA GDQNVDWSLD DGLASVVPAA LSLAMTGHGL HHSDIGGYTT LFEMKRSKEL LLRWCDFSAF TPMMRTHEGN RPGDNWQFDG DAETIAHFAR MTTVFTTLKP YLKEAVALNA KSGLPVMRPL FLHYEDDAHT YTLKYQYLLG RDILVAPVHE EGRSDWTLYL PEDNWVHAWT GEAFRGGEVT VNAPIGKPPV FYRADSEWAA LFASLKSILE HHHHHH // ID 5OHTB STANDARD; PRT; 686 AA. DT CONVERTED FROM PDB (SEQRES) 5OHT DE Sulfoquinovosidase OS Escherichia coli (strain K12) CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.870 CC R-Factor 0.168 FT #SUB 40 40 TRP B 62 62 GLN A Protein S 2 FT #SUB 61 61 LEU B 61 61 LEU A Protein S 1 FT #SUB 61 61 LEU B 64 64 LYS A Protein B 3 FT #SUB 62 62 GLN B 40 40 TRP A Protein A 3 FT #SUB 62 62 GLN B 64 64 LYS A Protein B 4 FT #SUB 62 62 GLN B 120 120 ARG A Protein S 2 FT #SUB 63 63 GLU B 64 64 LYS A Protein A 4 FT #SUB 63 63 GLU B 65 65 ILE A Protein S 2 FT #SUB 63 63 GLU B 66 66 ALA A Protein S 5 FT #SUB 63 63 GLU B 85 85 ARG A Protein S 3 FT #SUB 64 64 LYS B 61 61 LEU A Protein S 3 FT #SUB 64 64 LYS B 62 62 GLN A Protein S 3 FT #SUB 64 64 LYS B 63 63 GLU A Protein B 3 FT #SUB 64 64 LYS B 64 64 LYS A Protein B 9 FT #SUB 65 65 ILE B 63 63 GLU A Protein B 2 FT #SUB 65 65 ILE B 65 65 ILE A Protein S 1 FT #SUB 66 66 ALA B 63 63 GLU A Protein A 5 FT #SUB 85 85 ARG B 63 63 GLU A Protein S 4 FT #SUB 85 85 ARG B 88 88 ASP A Protein S 3 FT #SUB 85 85 ARG B 89 89 ILE A Protein S 2 FT #SUB 89 89 ILE B 85 85 ARG A Protein S 2 FT #SUB 159 159 LYS B 165 165 TRP A Protein B 2 FT #SUB 160 160 GLN B 165 165 TRP A Protein B 3 FT #SUB 160 160 GLN B 168 168 ASP A Protein S 1 FT #SUB 160 160 GLN B 169 169 CYS A Protein S 3 FT #SUB 161 161 THR B 165 165 TRP A Protein B 2 FT #SUB 162 162 TYR B 162 162 TYR A Protein S 1 FT #SUB 162 162 TYR B 165 165 TRP A Protein B 2 FT #SUB 165 165 TRP B 159 159 LYS A Protein S 2 FT #SUB 165 165 TRP B 160 160 GLN A Protein S 4 FT #SUB 165 165 TRP B 161 161 THR A Protein S 3 FT #SUB 165 165 TRP B 162 162 TYR A Protein S 4 FT #SUB 165 165 TRP B 165 165 TRP A Protein S 8 FT #SUB 168 168 ASP B 160 160 GLN A Protein B 1 FT #SUB 169 169 CYS B 160 160 GLN A Protein B 3 FT #HET 153 153 GLN B 3 701 CA B S 2 FT #HET 154 154 GLY B 3 701 CA B B 2 FT #HET 288 288 GLN B 4 702 9VH B S 5 FT #HET 301 301 ARG B 4 702 9VH B A 14 FT #HET 302 302 VAL B 4 702 9VH B A 4 FT #HET 304 304 TRP B 4 702 9VH B S 4 FT #HET 333 333 TYR B 4 702 9VH B S 3 FT #HET 403 403 MET B 4 702 9VH B S 2 FT #HET 405 405 ASP B 4 702 9VH B S 6 FT #HET 406 406 PHE B 4 702 9VH B S 2 FT #HET 455 455 ARG B 4 702 9VH B S 2 FT #HET 472 472 ASP B 3 701 CA B B 2 FT #HET 472 472 ASP B 4 702 9VH B S 2 FT #HET 481 481 ASP B 3 701 CA B S 3 FT #HET 508 508 TYR B 4 702 9VH B S 8 FT #HET 537 537 HIS B 4 702 9VH B S 6 FT DISORDER 1 8 FT DISORDER 16 16 FT DISORDER 99 101 FT DISORDER 679 686 CC SEQUENCE 666 AA (ATOM); CC LDFQFHQNDS FTLHFQQRLI LTHSKDNPCL WIGSGIADID MFRGNFSIKD KLQEKIALTD CC AIVSQSPDGW LIHFSRGSDI SATLNISADR LLLELQNDNL NHNRIWLRLA AQPEDHIYGC CC GEQFSYFDLR GKPFPLWTSE QGVGRNKQTY VTWQADCKEN AGGDYYWTFF PQPTFVSTQK CC YYCHVDNSCY MNFDFSAPEY HELALWEDKA TLRFECADTY ISLLEKLTAL LGRQPELPDW CC IYDGVTLGIQ GGTEVCQKKL DTMRNAGVKV NGIWAQDWSG IRMTSFGKRV MWNWKWNSEN CC YPQLDSRIKQ WNQEGVQFLA YINPYVASDK DLCEEAAQHG YLAKDASGGD YLVEFGEFYG CC GVVDLTNPEA YAWFKEVIKK NMIELGCGGW MADFGEYLPT DTYLHNGVSA EIMHNAWPAL CC WAKCNYEALE ETGKLGEILF FMRAGSTGSQ KYSTMMWAGD QNVDWSLDDG LASVVPAALS CC LAMTGHGLHH SDIGGYTTLF EMKRSKELLL RWCDFSAFTP MMRTHEGNRP GDNWQFDGDA CC ETIAHFARMT TVFTTLKPYL KEAVALNAKS GLPVMRPLFL HYEDDAHTYT LKYQYLLGRD CC ILVAPVHEEG RSDWTLYLPE DNWVHAWTGE AFRGGEVTVN APIGKPPVFY RADSEWAALF CC ASLKSI CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MDTPRPQLLDFQFHQNNDSFTLHFQQRLILTHSKDNPCLWIGSGIADIDM CC ATOM --------LDFQFHQ-NDSFTLHFQQRLILTHSKDNPCLWIGSGIADIDM CC ******* ********************************** CC SEQRES FRGNFSIKDKLQEKIALTDAIVSQSPDGWLIHFSRGSDISATLNISADDQ CC ATOM FRGNFSIKDKLQEKIALTDAIVSQSPDGWLIHFSRGSDISATLNISAD-- CC ************************************************ CC SEQRES GRLLLELQNDNLNHNRIWLRLAAQPEDHIYGCGEQFSYFDLRGKPFPLWT CC ATOM -RLLLELQNDNLNHNRIWLRLAAQPEDHIYGCGEQFSYFDLRGKPFPLWT CC ************************************************* CC SEQRES SEQGVGRNKQTYVTWQADCKENAGGDYYWTFFPQPTFVSTQKYYCHVDNS CC ATOM SEQGVGRNKQTYVTWQADCKENAGGDYYWTFFPQPTFVSTQKYYCHVDNS CC ************************************************** CC SEQRES CYMNFDFSAPEYHELALWEDKATLRFECADTYISLLEKLTALLGRQPELP CC ATOM CYMNFDFSAPEYHELALWEDKATLRFECADTYISLLEKLTALLGRQPELP CC ************************************************** CC SEQRES DWIYDGVTLGIQGGTEVCQKKLDTMRNAGVKVNGIWAQDWSGIRMTSFGK CC ATOM DWIYDGVTLGIQGGTEVCQKKLDTMRNAGVKVNGIWAQDWSGIRMTSFGK CC ************************************************** CC SEQRES RVMWNWKWNSENYPQLDSRIKQWNQEGVQFLAYINPYVASDKDLCEEAAQ CC ATOM RVMWNWKWNSENYPQLDSRIKQWNQEGVQFLAYINPYVASDKDLCEEAAQ CC ************************************************** CC SEQRES HGYLAKDASGGDYLVEFGEFYGGVVDLTNPEAYAWFKEVIKKNMIELGCG CC ATOM HGYLAKDASGGDYLVEFGEFYGGVVDLTNPEAYAWFKEVIKKNMIELGCG CC ************************************************** CC SEQRES GWMADFGEYLPTDTYLHNGVSAEIMHNAWPALWAKCNYEALEETGKLGEI CC ATOM GWMADFGEYLPTDTYLHNGVSAEIMHNAWPALWAKCNYEALEETGKLGEI CC ************************************************** CC SEQRES LFFMRAGSTGSQKYSTMMWAGDQNVDWSLDDGLASVVPAALSLAMTGHGL CC ATOM LFFMRAGSTGSQKYSTMMWAGDQNVDWSLDDGLASVVPAALSLAMTGHGL CC ************************************************** CC SEQRES HHSDIGGYTTLFEMKRSKELLLRWCDFSAFTPMMRTHEGNRPGDNWQFDG CC ATOM HHSDIGGYTTLFEMKRSKELLLRWCDFSAFTPMMRTHEGNRPGDNWQFDG CC ************************************************** CC SEQRES DAETIAHFARMTTVFTTLKPYLKEAVALNAKSGLPVMRPLFLHYEDDAHT CC ATOM DAETIAHFARMTTVFTTLKPYLKEAVALNAKSGLPVMRPLFLHYEDDAHT CC ************************************************** CC SEQRES YTLKYQYLLGRDILVAPVHEEGRSDWTLYLPEDNWVHAWTGEAFRGGEVT CC ATOM YTLKYQYLLGRDILVAPVHEEGRSDWTLYLPEDNWVHAWTGEAFRGGEVT CC ************************************************** CC SEQRES VNAPIGKPPVFYRADSEWAALFASLKSILEHHHHHH CC ATOM VNAPIGKPPVFYRADSEWAALFASLKSI-------- CC **************************** SQ SEQUENCE 686 AA; MW; CN; MDTPRPQLLD FQFHQNNDSF TLHFQQRLIL THSKDNPCLW IGSGIADIDM FRGNFSIKDK LQEKIALTDA IVSQSPDGWL IHFSRGSDIS ATLNISADDQ GRLLLELQND NLNHNRIWLR LAAQPEDHIY GCGEQFSYFD LRGKPFPLWT SEQGVGRNKQ TYVTWQADCK ENAGGDYYWT FFPQPTFVST QKYYCHVDNS CYMNFDFSAP EYHELALWED KATLRFECAD TYISLLEKLT ALLGRQPELP DWIYDGVTLG IQGGTEVCQK KLDTMRNAGV KVNGIWAQDW SGIRMTSFGK RVMWNWKWNS ENYPQLDSRI KQWNQEGVQF LAYINPYVAS DKDLCEEAAQ HGYLAKDASG GDYLVEFGEF YGGVVDLTNP EAYAWFKEVI KKNMIELGCG GWMADFGEYL PTDTYLHNGV SAEIMHNAWP ALWAKCNYEA LEETGKLGEI LFFMRAGSTG SQKYSTMMWA GDQNVDWSLD DGLASVVPAA LSLAMTGHGL HHSDIGGYTT LFEMKRSKEL LLRWCDFSAF TPMMRTHEGN RPGDNWQFDG DAETIAHFAR MTTVFTTLKP YLKEAVALNA KSGLPVMRPL FLHYEDDAHT YTLKYQYLLG RDILVAPVHE EGRSDWTLYL PEDNWVHAWT GEAFRGGEVT VNAPIGKPPV FYRADSEWAA LFASLKSILE HHHHHH //