ID 2GRXA STANDARD; PRT; 725 AA. DT CONVERTED FROM PDB (SEQRES) 2GRX DE Ferrichrome-iron receptor OS Escherichia coli CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.300 CC R-Factor 0.299 FT #SUB 8 8 THR A 215 225 VAL C Protein A 3 FT #SUB 9 9 ILE A 148 158 ARG C Protein S 4 FT #SUB 9 9 ILE A 215 225 VAL C Protein B 1 FT #SUB 9 9 ILE A 216 226 VAL C Protein B 3 FT #SUB 10 10 THR A 216 226 VAL C Protein A 3 FT #SUB 10 10 THR A 217 227 ASN C Protein A 5 FT #SUB 11 11 VAL A 217 227 ASN C Protein B 1 FT #SUB 11 11 VAL A 218 228 ILE C Protein A 9 FT #SUB 11 11 VAL A 219 229 LEU C Protein B 2 FT #SUB 12 12 THR A 150 160 GLN C Protein B 3 FT #SUB 12 12 THR A 219 229 LEU C Protein A 4 FT #SUB 12 12 THR A 221 231 LYS C Protein S 2 FT #SUB 13 13 ALA A 150 160 GLN C Protein B 2 FT #SUB 13 13 ALA A 219 229 LEU C Protein B 1 FT #SUB 13 13 ALA A 220 230 PHE C Protein S 1 FT #SUB 13 13 ALA A 221 231 LYS C Protein B 6 FT #SUB 14 14 ALA A 153 163 TYR C Protein B 5 FT #SUB 14 14 ALA A 158 168 GLN C Protein B 1 FT #SUB 14 14 ALA A 221 231 LYS C Protein B 3 FT #SUB 15 15 PRO A 158 168 GLN C Protein A 3 FT #SUB 15 15 PRO A 221 231 LYS C Protein S 1 FT #SUB 15 15 PRO A 223 233 ASN C Protein S 4 FT #SUB 16 16 ALA A 158 168 GLN C Protein B 1 FT #SUB 26 26 ALA A 156 166 ARG C Protein B 2 FT #SUB 27 27 THR A 156 166 ARG C Protein S 1 FT #SUB 56 56 GLU A 156 166 ARG C Protein S 1 FT #SUB 159 159 GLU A 195 205 GLU C Protein S 1 FT #SUB 159 159 GLU A 198 208 ASN C Protein S 1 FT #SUB 160 160 PRO A 198 208 ASN C Protein S 1 FT #SUB 160 160 PRO A 201 211 ARG C Protein S 2 FT #SUB 542 542 ASP A 159 169 ALA C Protein B 1 FT #SUB 544 544 PRO A 159 169 ALA C Protein S 1 FT #SUB 544 544 PRO A 160 170 LEU C Protein S 2 FT #SUB 588 588 ALA A 160 170 LEU C Protein A 2 FT #SUB 590 590 SER A 160 170 LEU C Protein B 3 FT #SUB 591 591 ALA A 156 166 ARG C Protein B 1 FT #SUB 591 591 ALA A 160 170 LEU C Protein B 1 FT #SUB 594 594 ASN A 156 166 ARG C Protein S 1 FT #SUB 634 634 PHE A 188 198 PRO C Protein S 1 FT #SUB 634 634 PHE A 189 199 ALA C Protein S 1 FT #SUB 634 634 PHE A 191 201 MET C Protein S 3 FT #SUB 639 639 SER A 189 199 ALA C Protein S 1 FT #SUB 676 676 ASP A 190 200 ASN C Protein S 5 FT #SUB 683 683 ALA A 193 203 GLU C Protein S 2 FT #SUB 684 684 GLY A 190 200 ASN C Protein B 1 FT #SUB 724 724 ARG A 194 204 ARG C Protein B 4 FT #SUB 725 725 PHE A 194 204 ARG C Protein B 9 FT #HET 81 81 ARG A 19 1050 FCI A S 2 FT #HET 99 99 GLY A 19 1050 FCI A B 10 FT #HET 100 100 GLN A 19 1050 FCI A A 17 FT #HET 115 115 PHE A 19 1050 FCI A A 12 FT #HET 116 116 TYR A 19 1050 FCI A S 9 FT #HET 229 229 PHE A 14 902 FTT A S 1 FT #HET 231 231 PHE A 15 903 FTT A S 3 FT #HET 244 244 TYR A 19 1050 FCI A S 7 FT #HET 246 246 TRP A 19 1050 FCI A S 5 FT #HET 282 282 VAL A 15 903 FTT A S 1 FT #HET 284 284 TYR A 14 902 FTT A S 5 FT #HET 284 284 TYR A 17 930 DAO A S 2 FT #HET 300 300 LEU A 14 902 FTT A S 2 FT #HET 300 300 LEU A 17 930 DAO A S 1 FT #HET 302 302 PHE A 14 902 FTT A S 10 FT #HET 302 302 PHE A 15 903 FTT A S 1 FT #HET 304 304 GLU A 11 950 PO4 A S 4 FT #HET 304 304 GLU A 15 903 FTT A S 1 FT #HET 313 313 TYR A 19 1050 FCI A S 1 FT #HET 351 351 LYS A 11 950 PO4 A S 5 FT #HET 353 353 GLN A 3 3 KDO E S 1 FT #HET 353 353 GLN A 14 902 FTT A S 2 FT #HET 355 355 PHE A 17 930 DAO A S 3 FT #HET 359 359 THR A 12 900 FTT A S 1 FT #HET 380 380 PHE A 12 900 FTT A S 7 FT #HET 380 380 PHE A 22 901 FTT B S 5 FT #HET 382 382 ARG A 1 1 GCN E S 2 FT #HET 382 382 ARG A 12 900 FTT A S 4 FT #HET 382 382 ARG A 14 902 FTT A S 10 FT #HET 382 382 ARG A 17 930 DAO A S 2 FT #HET 384 384 ARG A 3 3 KDO E S 3 FT #HET 384 384 ARG A 4 4 KDO E S 1 FT #HET 384 384 ARG A 16 910 DPO A S 1 FT #HET 386 386 ASP A 3 3 KDO E S 3 FT #HET 391 391 PHE A 19 1050 FCI A S 3 FT #HET 437 437 LEU A 3 3 KDO E S 1 FT #HET 439 439 LYS A 16 910 DPO A S 5 FT #HET 441 441 LYS A 22 901 FTT B S 2 FT #HET 443 443 THR A 22 901 FTT B S 3 FT #HET 468 468 ASP A 9 4 KDO F S 1 FT #HET 474 474 ARG A 5 5 GMH E S 4 FT #HET 481 481 LYS A 9 4 KDO F S 4 FT #HET 481 481 LYS A 28 980 EAP B S 4 FT #HET 483 483 ASP A 9 4 KDO F S 9 FT DISORDER 1 7 FT DISORDER 403 418 CC SEQUENCE 702 AA (ATOM); CC TITVTAAPAP QESAWGPAAT IAARQSATGT KTDTPIQKVP QSISVVTAEE MALHQPKSVK CC EALSYTPGVS VGTRGASNTY DHLIIRGFAA EGQSQNNYLN GLKLQGNFYN DAVIDPYMLE CC RAEIMRGPVS VLYGKSSPGG LLNMVSKRPT TEPLKEVQFK AGTDSLFQTG FDFSDSLDDD CC GVYSYRLTGL ARSANAQQKG SEEQRYAIAP AFTWRPDDKT NFTFLSYFQN EPETGYYGWL CC PKEGTVEPLP NGKRLPTDFN EGAKNNTYSR NEKMVGYSFD HEFNDTFTVR QNLRFAENKT CC SQNSVYGYGV CSDPANAYSK QCAALAPADK GHYLARKYVV DDEKLQNFSV DTQLQSKFAT CC GDIDHTLLTG VDFMRMRNDI NAWFGYDDSV PLLNLTDFDF NAKDPANSGP YRILNKQKQT CC GVYVQDQAQW DKVLVTLGGR YDWADQESLN RVAGTTDKRD DKQFTWRGGV NYLFDNGVTP CC YFSYSESFEP SSQVGKDGNI FAPSKGKQYE VGVKYVPEDR PIVVTGAVYN LTKTNNLMAD CC PEGSFFSVEG GEIRARGVEI EAKAALSASV NVVGSYTYTD AEYTTDTTYK GNTPAQVPKH CC MASLWADYTF FDGPLSGLTL GTGGRYTGSS YGDPANSFKV GSYTVVDALV RYDLARVGMA CC GSNVALHVNN LFDREYVASC FNTYGCFWGA ERQVVATATF RF CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES AVEPKEDTITVTAAPAPQESAWGPAATIAARQSATGTKTDTPIQKVPQSI CC ATOM -------TITVTAAPAPQESAWGPAATIAARQSATGTKTDTPIQKVPQSI CC ******************************************* CC SEQRES SVVTAEEMALHQPKSVKEALSYTPGVSVGTRGASNTYDHLIIRGFAAEGQ CC ATOM SVVTAEEMALHQPKSVKEALSYTPGVSVGTRGASNTYDHLIIRGFAAEGQ CC ************************************************** CC SEQRES SQNNYLNGLKLQGNFYNDAVIDPYMLERAEIMRGPVSVLYGKSSPGGLLN CC ATOM SQNNYLNGLKLQGNFYNDAVIDPYMLERAEIMRGPVSVLYGKSSPGGLLN CC ************************************************** CC SEQRES MVSKRPTTEPLKEVQFKAGTDSLFQTGFDFSDSLDDDGVYSYRLTGLARS CC ATOM MVSKRPTTEPLKEVQFKAGTDSLFQTGFDFSDSLDDDGVYSYRLTGLARS CC ************************************************** CC SEQRES ANAQQKGSEEQRYAIAPAFTWRPDDKTNFTFLSYFQNEPETGYYGWLPKE CC ATOM ANAQQKGSEEQRYAIAPAFTWRPDDKTNFTFLSYFQNEPETGYYGWLPKE CC ************************************************** CC SEQRES GTVEPLPNGKRLPTDFNEGAKNNTYSRNEKMVGYSFDHEFNDTFTVRQNL CC ATOM GTVEPLPNGKRLPTDFNEGAKNNTYSRNEKMVGYSFDHEFNDTFTVRQNL CC ************************************************** CC SEQRES RFAENKTSQNSVYGYGVCSDPANAYSKQCAALAPADKGHYLARKYVVDDE CC ATOM RFAENKTSQNSVYGYGVCSDPANAYSKQCAALAPADKGHYLARKYVVDDE CC ************************************************** CC SEQRES KLQNFSVDTQLQSKFATGDIDHTLLTGVDFMRMRNDINAWFGYDDSVPLL CC ATOM KLQNFSVDTQLQSKFATGDIDHTLLTGVDFMRMRNDINAWFGYDDSVPLL CC ************************************************** CC SEQRES NLYNPSSHHHHHHGSSVNTDFDFNAKDPANSGPYRILNKQKQTGVYVQDQ CC ATOM NL----------------TDFDFNAKDPANSGPYRILNKQKQTGVYVQDQ CC ** ******************************** CC SEQRES AQWDKVLVTLGGRYDWADQESLNRVAGTTDKRDDKQFTWRGGVNYLFDNG CC ATOM AQWDKVLVTLGGRYDWADQESLNRVAGTTDKRDDKQFTWRGGVNYLFDNG CC ************************************************** CC SEQRES VTPYFSYSESFEPSSQVGKDGNIFAPSKGKQYEVGVKYVPEDRPIVVTGA CC ATOM VTPYFSYSESFEPSSQVGKDGNIFAPSKGKQYEVGVKYVPEDRPIVVTGA CC ************************************************** CC SEQRES VYNLTKTNNLMADPEGSFFSVEGGEIRARGVEIEAKAALSASVNVVGSYT CC ATOM VYNLTKTNNLMADPEGSFFSVEGGEIRARGVEIEAKAALSASVNVVGSYT CC ************************************************** CC SEQRES YTDAEYTTDTTYKGNTPAQVPKHMASLWADYTFFDGPLSGLTLGTGGRYT CC ATOM YTDAEYTTDTTYKGNTPAQVPKHMASLWADYTFFDGPLSGLTLGTGGRYT CC ************************************************** CC SEQRES GSSYGDPANSFKVGSYTVVDALVRYDLARVGMAGSNVALHVNNLFDREYV CC ATOM GSSYGDPANSFKVGSYTVVDALVRYDLARVGMAGSNVALHVNNLFDREYV CC ************************************************** CC SEQRES ASCFNTYGCFWGAERQVVATATFRF CC ATOM ASCFNTYGCFWGAERQVVATATFRF CC ************************* SQ SEQUENCE 725 AA; MW; CN; AVEPKEDTIT VTAAPAPQES AWGPAATIAA RQSATGTKTD TPIQKVPQSI SVVTAEEMAL HQPKSVKEAL SYTPGVSVGT RGASNTYDHL IIRGFAAEGQ SQNNYLNGLK LQGNFYNDAV IDPYMLERAE IMRGPVSVLY GKSSPGGLLN MVSKRPTTEP LKEVQFKAGT DSLFQTGFDF SDSLDDDGVY SYRLTGLARS ANAQQKGSEE QRYAIAPAFT WRPDDKTNFT FLSYFQNEPE TGYYGWLPKE GTVEPLPNGK RLPTDFNEGA KNNTYSRNEK MVGYSFDHEF NDTFTVRQNL RFAENKTSQN SVYGYGVCSD PANAYSKQCA ALAPADKGHY LARKYVVDDE KLQNFSVDTQ LQSKFATGDI DHTLLTGVDF MRMRNDINAW FGYDDSVPLL NLYNPSSHHH HHHGSSVNTD FDFNAKDPAN SGPYRILNKQ KQTGVYVQDQ AQWDKVLVTL GGRYDWADQE SLNRVAGTTD KRDDKQFTWR GGVNYLFDNG VTPYFSYSES FEPSSQVGKD GNIFAPSKGK QYEVGVKYVP EDRPIVVTGA VYNLTKTNNL MADPEGSFFS VEGGEIRARG VEIEAKAALS ASVNVVGSYT YTDAEYTTDT TYKGNTPAQV PKHMASLWAD YTFFDGPLSG LTLGTGGRYT GSSYGDPANS FKVGSYTVVD ALVRYDLARV GMAGSNVALH VNNLFDREYV ASCFNTYGCF WGAERQVVAT ATFRF // ID 2GRXB STANDARD; PRT; 725 AA. DT CONVERTED FROM PDB (SEQRES) 2GRX DE Ferrichrome-iron receptor OS Escherichia coli CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.300 CC R-Factor 0.299 FT #SUB 542 542 ASP B 159 169 ALA D Protein B 1 FT #SUB 542 542 ASP B 161 171 ARG D Protein S 1 FT #SUB 544 544 PRO B 159 169 ALA D Protein S 1 FT #SUB 544 544 PRO B 160 170 LEU D Protein S 2 FT #SUB 588 588 ALA B 160 170 LEU D Protein B 1 FT #SUB 590 590 SER B 160 170 LEU D Protein B 3 FT #SUB 591 591 ALA B 156 166 ARG D Protein A 6 FT #SUB 591 591 ALA B 160 170 LEU D Protein A 2 FT #SUB 634 634 PHE B 190 200 ASN D Protein S 1 FT #SUB 634 634 PHE B 191 201 MET D Protein S 1 FT #SUB 676 676 ASP B 190 200 ASN D Protein S 1 FT #SUB 683 683 ALA B 194 204 ARG D Protein S 2 FT #HET 81 81 ARG B 29 1050 FCI B S 3 FT #HET 99 99 GLY B 29 1050 FCI B B 3 FT #HET 100 100 GLN B 29 1050 FCI B S 8 FT #HET 115 115 PHE B 29 1050 FCI B A 7 FT #HET 116 116 TYR B 29 1050 FCI B S 11 FT #HET 229 229 PHE B 23 902 FTT B S 1 FT #HET 231 231 PHE B 24 903 FTT B S 1 FT #HET 235 235 PHE B 27 940 MYR B S 1 FT #HET 244 244 TYR B 29 1050 FCI B S 4 FT #HET 246 246 TRP B 29 1050 FCI B S 14 FT #HET 280 280 LYS B 27 940 MYR B S 1 FT #HET 284 284 TYR B 21 900 FTT B S 2 FT #HET 284 284 TYR B 23 902 FTT B S 6 FT #HET 298 298 GLN B 13 901 FTT A S 1 FT #HET 298 298 GLN B 21 900 FTT B S 1 FT #HET 300 300 LEU B 23 902 FTT B S 2 FT #HET 300 300 LEU B 26 930 DAO B S 1 FT #HET 302 302 PHE B 23 902 FTT B S 10 FT #HET 302 302 PHE B 24 903 FTT B S 4 FT #HET 302 302 PHE B 27 940 MYR B S 1 FT #HET 304 304 GLU B 20 950 PO4 B S 4 FT #HET 304 304 GLU B 24 903 FTT B S 1 FT #HET 304 304 GLU B 27 940 MYR B S 4 FT #HET 313 313 TYR B 29 1050 FCI B S 10 FT #HET 315 315 TYR B 29 1050 FCI B S 1 FT #HET 351 351 LYS B 20 950 PO4 B S 5 FT #HET 353 353 GLN B 8 3 KDO F S 1 FT #HET 353 353 GLN B 23 902 FTT B S 2 FT #HET 355 355 PHE B 26 930 DAO B S 2 FT #HET 359 359 THR B 13 901 FTT A S 2 FT #HET 380 380 PHE B 13 901 FTT A S 1 FT #HET 380 380 PHE B 21 900 FTT B S 7 FT #HET 382 382 ARG B 6 1 GCN F S 3 FT #HET 382 382 ARG B 21 900 FTT B S 4 FT #HET 382 382 ARG B 23 902 FTT B S 10 FT #HET 382 382 ARG B 26 930 DAO B S 2 FT #HET 384 384 ARG B 8 3 KDO F S 3 FT #HET 384 384 ARG B 9 4 KDO F S 1 FT #HET 384 384 ARG B 25 910 DPO B S 1 FT #HET 386 386 ASP B 8 3 KDO F S 3 FT #HET 391 391 PHE B 29 1050 FCI B S 3 FT #HET 437 437 LEU B 8 3 KDO F S 1 FT #HET 439 439 LYS B 25 910 DPO B S 5 FT #HET 441 441 LYS B 13 901 FTT A S 4 FT #HET 443 443 THR B 13 901 FTT A S 2 FT #HET 474 474 ARG B 10 5 GMH F S 4 FT #HET 481 481 LYS B 4 4 KDO E S 4 FT #HET 483 483 ASP B 4 4 KDO E S 1 FT #HET 704 704 PHE B 29 1050 FCI B S 1 FT DISORDER 1 18 FT DISORDER 403 418 CC SEQUENCE 691 AA (ATOM); CC ESAWGPAATI AARQSATGTK TDTPIQKVPQ SISVVTAEEM ALHQPKSVKE ALSYTPGVSV CC GTRGASNTYD HLIIRGFAAE GQSQNNYLNG LKLQGNFYND AVIDPYMLER AEIMRGPVSV CC LYGKSSPGGL LNMVSKRPTT EPLKEVQFKA GTDSLFQTGF DFSDSLDDDG VYSYRLTGLA CC RSANAQQKGS EEQRYAIAPA FTWRPDDKTN FTFLSYFQNE PETGYYGWLP KEGTVEPLPN CC GKRLPTDFNE GAKNNTYSRN EKMVGYSFDH EFNDTFTVRQ NLRFAENKTS QNSVYGYGVC CC SDPANAYSKQ CAALAPADKG HYLARKYVVD DEKLQNFSVD TQLQSKFATG DIDHTLLTGV CC DFMRMRNDIN AWFGYDDSVP LLNLTDFDFN AKDPANSGPY RILNKQKQTG VYVQDQAQWD CC KVLVTLGGRY DWADQESLNR VAGTTDKRDD KQFTWRGGVN YLFDNGVTPY FSYSESFEPS CC SQVGKDGNIF APSKGKQYEV GVKYVPEDRP IVVTGAVYNL TKTNNLMADP EGSFFSVEGG CC EIRARGVEIE AKAALSASVN VVGSYTYTDA EYTTDTTYKG NTPAQVPKHM ASLWADYTFF CC DGPLSGLTLG TGGRYTGSSY GDPANSFKVG SYTVVDALVR YDLARVGMAG SNVALHVNNL CC FDREYVASCF NTYGCFWGAE RQVVATATFR F CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES AVEPKEDTITVTAAPAPQESAWGPAATIAARQSATGTKTDTPIQKVPQSI CC ATOM ------------------ESAWGPAATIAARQSATGTKTDTPIQKVPQSI CC ******************************** CC SEQRES SVVTAEEMALHQPKSVKEALSYTPGVSVGTRGASNTYDHLIIRGFAAEGQ CC ATOM SVVTAEEMALHQPKSVKEALSYTPGVSVGTRGASNTYDHLIIRGFAAEGQ CC ************************************************** CC SEQRES SQNNYLNGLKLQGNFYNDAVIDPYMLERAEIMRGPVSVLYGKSSPGGLLN CC ATOM SQNNYLNGLKLQGNFYNDAVIDPYMLERAEIMRGPVSVLYGKSSPGGLLN CC ************************************************** CC SEQRES MVSKRPTTEPLKEVQFKAGTDSLFQTGFDFSDSLDDDGVYSYRLTGLARS CC ATOM MVSKRPTTEPLKEVQFKAGTDSLFQTGFDFSDSLDDDGVYSYRLTGLARS CC ************************************************** CC SEQRES ANAQQKGSEEQRYAIAPAFTWRPDDKTNFTFLSYFQNEPETGYYGWLPKE CC ATOM ANAQQKGSEEQRYAIAPAFTWRPDDKTNFTFLSYFQNEPETGYYGWLPKE CC ************************************************** CC SEQRES GTVEPLPNGKRLPTDFNEGAKNNTYSRNEKMVGYSFDHEFNDTFTVRQNL CC ATOM GTVEPLPNGKRLPTDFNEGAKNNTYSRNEKMVGYSFDHEFNDTFTVRQNL CC ************************************************** CC SEQRES RFAENKTSQNSVYGYGVCSDPANAYSKQCAALAPADKGHYLARKYVVDDE CC ATOM RFAENKTSQNSVYGYGVCSDPANAYSKQCAALAPADKGHYLARKYVVDDE CC ************************************************** CC SEQRES KLQNFSVDTQLQSKFATGDIDHTLLTGVDFMRMRNDINAWFGYDDSVPLL CC ATOM KLQNFSVDTQLQSKFATGDIDHTLLTGVDFMRMRNDINAWFGYDDSVPLL CC ************************************************** CC SEQRES NLYNPSSHHHHHHGSSVNTDFDFNAKDPANSGPYRILNKQKQTGVYVQDQ CC ATOM NL----------------TDFDFNAKDPANSGPYRILNKQKQTGVYVQDQ CC ** ******************************** CC SEQRES AQWDKVLVTLGGRYDWADQESLNRVAGTTDKRDDKQFTWRGGVNYLFDNG CC ATOM AQWDKVLVTLGGRYDWADQESLNRVAGTTDKRDDKQFTWRGGVNYLFDNG CC ************************************************** CC SEQRES VTPYFSYSESFEPSSQVGKDGNIFAPSKGKQYEVGVKYVPEDRPIVVTGA CC ATOM VTPYFSYSESFEPSSQVGKDGNIFAPSKGKQYEVGVKYVPEDRPIVVTGA CC ************************************************** CC SEQRES VYNLTKTNNLMADPEGSFFSVEGGEIRARGVEIEAKAALSASVNVVGSYT CC ATOM VYNLTKTNNLMADPEGSFFSVEGGEIRARGVEIEAKAALSASVNVVGSYT CC ************************************************** CC SEQRES YTDAEYTTDTTYKGNTPAQVPKHMASLWADYTFFDGPLSGLTLGTGGRYT CC ATOM YTDAEYTTDTTYKGNTPAQVPKHMASLWADYTFFDGPLSGLTLGTGGRYT CC ************************************************** CC SEQRES GSSYGDPANSFKVGSYTVVDALVRYDLARVGMAGSNVALHVNNLFDREYV CC ATOM GSSYGDPANSFKVGSYTVVDALVRYDLARVGMAGSNVALHVNNLFDREYV CC ************************************************** CC SEQRES ASCFNTYGCFWGAERQVVATATFRF CC ATOM ASCFNTYGCFWGAERQVVATATFRF CC ************************* SQ SEQUENCE 725 AA; MW; CN; AVEPKEDTIT VTAAPAPQES AWGPAATIAA RQSATGTKTD TPIQKVPQSI SVVTAEEMAL HQPKSVKEAL SYTPGVSVGT RGASNTYDHL IIRGFAAEGQ SQNNYLNGLK LQGNFYNDAV IDPYMLERAE IMRGPVSVLY GKSSPGGLLN MVSKRPTTEP LKEVQFKAGT DSLFQTGFDF SDSLDDDGVY SYRLTGLARS ANAQQKGSEE QRYAIAPAFT WRPDDKTNFT FLSYFQNEPE TGYYGWLPKE GTVEPLPNGK RLPTDFNEGA KNNTYSRNEK MVGYSFDHEF NDTFTVRQNL RFAENKTSQN SVYGYGVCSD PANAYSKQCA ALAPADKGHY LARKYVVDDE KLQNFSVDTQ LQSKFATGDI DHTLLTGVDF MRMRNDINAW FGYDDSVPLL NLYNPSSHHH HHHGSSVNTD FDFNAKDPAN SGPYRILNKQ KQTGVYVQDQ AQWDKVLVTL GGRYDWADQE SLNRVAGTTD KRDDKQFTWR GGVNYLFDNG VTPYFSYSES FEPSSQVGKD GNIFAPSKGK QYEVGVKYVP EDRPIVVTGA VYNLTKTNNL MADPEGSFFS VEGGEIRARG VEIEAKAALS ASVNVVGSYT YTDAEYTTDT TYKGNTPAQV PKHMASLWAD YTFFDGPLSG LTLGTGGRYT GSSYGDPANS FKVGSYTVVD ALVRYDLARV GMAGSNVALH VNNLFDREYV ASCFNTYGCF WGAERQVVAT ATFRF // ID 2GRXC STANDARD; PRT; 229 AA. DT CONVERTED FROM PDB (SEQRES) 2GRX DE Protein tonB OS Escherichia coli CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.300 CC R-Factor 0.299 FT #SUB 148 158 ARG C 9 9 ILE A Protein S 4 FT #SUB 150 160 GLN C 12 12 THR A Protein S 3 FT #SUB 150 160 GLN C 13 13 ALA A Protein S 2 FT #SUB 153 163 TYR C 14 14 ALA A Protein S 5 FT #SUB 156 166 ARG C 26 26 ALA A Protein S 2 FT #SUB 156 166 ARG C 27 27 THR A Protein S 1 FT #SUB 156 166 ARG C 56 56 GLU A Protein S 1 FT #SUB 156 166 ARG C 591 591 ALA A Protein S 1 FT #SUB 156 166 ARG C 594 594 ASN A Protein S 1 FT #SUB 158 168 GLN C 14 14 ALA A Protein S 1 FT #SUB 158 168 GLN C 15 15 PRO A Protein S 3 FT #SUB 158 168 GLN C 16 16 ALA A Protein S 1 FT #SUB 159 169 ALA C 542 542 ASP A Protein B 1 FT #SUB 159 169 ALA C 544 544 PRO A Protein B 1 FT #SUB 160 170 LEU C 544 544 PRO A Protein S 2 FT #SUB 160 170 LEU C 588 588 ALA A Protein S 2 FT #SUB 160 170 LEU C 590 590 SER A Protein S 3 FT #SUB 160 170 LEU C 591 591 ALA A Protein S 1 FT #SUB 188 198 PRO C 634 634 PHE A Protein S 1 FT #SUB 189 199 ALA C 634 634 PHE A Protein B 1 FT #SUB 189 199 ALA C 639 639 SER A Protein S 1 FT #SUB 190 200 ASN C 676 676 ASP A Protein A 5 FT #SUB 190 200 ASN C 684 684 GLY A Protein S 1 FT #SUB 191 201 MET C 634 634 PHE A Protein S 3 FT #SUB 193 203 GLU C 683 683 ALA A Protein S 2 FT #SUB 194 204 ARG C 724 724 ARG A Protein S 4 FT #SUB 194 204 ARG C 725 725 PHE A Protein S 9 FT #SUB 195 205 GLU C 159 159 GLU A Protein S 1 FT #SUB 198 208 ASN C 159 159 GLU A Protein S 1 FT #SUB 198 208 ASN C 160 160 PRO A Protein S 1 FT #SUB 201 211 ARG C 160 160 PRO A Protein S 2 FT #SUB 215 225 VAL C 8 8 THR A Protein A 3 FT #SUB 215 225 VAL C 9 9 ILE A Protein B 1 FT #SUB 216 226 VAL C 9 9 ILE A Protein B 3 FT #SUB 216 226 VAL C 10 10 THR A Protein B 3 FT #SUB 217 227 ASN C 10 10 THR A Protein B 5 FT #SUB 217 227 ASN C 11 11 VAL A Protein B 1 FT #SUB 218 228 ILE C 11 11 VAL A Protein A 9 FT #SUB 219 229 LEU C 11 11 VAL A Protein B 2 FT #SUB 219 229 LEU C 12 12 THR A Protein B 4 FT #SUB 219 229 LEU C 13 13 ALA A Protein B 1 FT #SUB 220 230 PHE C 13 13 ALA A Protein S 1 FT #SUB 221 231 LYS C 12 12 THR A Protein S 2 FT #SUB 221 231 LYS C 13 13 ALA A Protein A 6 FT #SUB 221 231 LYS C 14 14 ALA A Protein B 3 FT #SUB 221 231 LYS C 15 15 PRO A Protein B 1 FT #SUB 223 233 ASN C 15 15 PRO A Protein A 4 FT DISORDER 1 147 FT DISORDER 226 229 CC SEQUENCE 78 AA (ATOM); CC RNQPQYPARA QALRIEGQVK VKFDVTPDGR VDNVQILSAK PANMFEREVK NAMRRWRYEP CC GKPGSGIVVN ILFKINGT CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES GSSHHHHHHSSGLVPRGSHMSVHQVIELPAPAQPISVTMVTPADLEPPQA CC ATOM -------------------------------------------------- CC CC SEQRES VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQP CC ATOM -------------------------------------------------- CC CC SEQRES KRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQ CC ATOM -----------------------------------------------RNQ CC *** CC SEQRES PQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAM CC ATOM PQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAM CC ************************************************** CC SEQRES RRWRYEPGKPGSGIVVNILFKINGTTEIQ CC ATOM RRWRYEPGKPGSGIVVNILFKINGT---- CC ************************* SQ SEQUENCE 229 AA; MW; CN; GSSHHHHHHS SGLVPRGSHM SVHQVIELPA PAQPISVTMV TPADLEPPQA VQPPPEPVVE PEPEPEPIPE PPKEAPVVIE KPKPKPKPKP KPVKKVQEQP KRDVKPVESR PASPFENTAP ARLTSSTATA ATSKPVTSVA SGPRALSRNQ PQYPARAQAL RIEGQVKVKF DVTPDGRVDN VQILSAKPAN MFEREVKNAM RRWRYEPGKP GSGIVVNILF KINGTTEIQ // ID 2GRXD STANDARD; PRT; 229 AA. DT CONVERTED FROM PDB (SEQRES) 2GRX DE Protein tonB OS Escherichia coli CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.300 CC R-Factor 0.299 FT #SUB 156 166 ARG D 591 591 ALA B Protein S 6 FT #SUB 159 169 ALA D 542 542 ASP B Protein B 1 FT #SUB 159 169 ALA D 544 544 PRO B Protein B 1 FT #SUB 160 170 LEU D 544 544 PRO B Protein S 2 FT #SUB 160 170 LEU D 588 588 ALA B Protein S 1 FT #SUB 160 170 LEU D 590 590 SER B Protein S 3 FT #SUB 160 170 LEU D 591 591 ALA B Protein S 2 FT #SUB 161 171 ARG D 542 542 ASP B Protein S 1 FT #SUB 190 200 ASN D 634 634 PHE B Protein B 1 FT #SUB 190 200 ASN D 676 676 ASP B Protein S 1 FT #SUB 191 201 MET D 634 634 PHE B Protein S 1 FT #SUB 194 204 ARG D 683 683 ALA B Protein S 2 FT DISORDER 1 147 FT DISORDER 226 229 CC SEQUENCE 78 AA (ATOM); CC RNQPQYPARA QALRIEGQVK VKFDVTPDGR VDNVQILSAK PANMFEREVK NAMRRWRYEP CC GKPGSGIVVN ILFKINGT CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES GSSHHHHHHSSGLVPRGSHMSVHQVIELPAPAQPISVTMVTPADLEPPQA CC ATOM -------------------------------------------------- CC CC SEQRES VQPPPEPVVEPEPEPEPIPEPPKEAPVVIEKPKPKPKPKPKPVKKVQEQP CC ATOM -------------------------------------------------- CC CC SEQRES KRDVKPVESRPASPFENTAPARLTSSTATAATSKPVTSVASGPRALSRNQ CC ATOM -----------------------------------------------RNQ CC *** CC SEQRES PQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAM CC ATOM PQYPARAQALRIEGQVKVKFDVTPDGRVDNVQILSAKPANMFEREVKNAM CC ************************************************** CC SEQRES RRWRYEPGKPGSGIVVNILFKINGTTEIQ CC ATOM RRWRYEPGKPGSGIVVNILFKINGT---- CC ************************* SQ SEQUENCE 229 AA; MW; CN; GSSHHHHHHS SGLVPRGSHM SVHQVIELPA PAQPISVTMV TPADLEPPQA VQPPPEPVVE PEPEPEPIPE PPKEAPVVIE KPKPKPKPKP KPVKKVQEQP KRDVKPVESR PASPFENTAP ARLTSSTATA ATSKPVTSVA SGPRALSRNQ PQYPARAQAL RIEGQVKVKF DVTPDGRVDN VQILSAKPAN MFEREVKNAM RRWRYEPGKP GSGIVVNILF KINGTTEIQ //