ID 4R7UA STANDARD; PRT; 422 AA. DT CONVERTED FROM PDB (SEQRES) 4R7U DE UDP-N-acetylglucosamine 1-carboxyvinyltransferase OS Vibrio cholerae O1 biovar El Tor str. N16961 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.450 CC R-Factor 0.181 FT #SUB 88 85 TYR A 344 341 ARG B Protein S 2 FT #SUB 88 85 TYR A 368 365 GLN B Protein S 6 FT #SUB 114 111 SER A 344 341 ARG B Protein S 1 FT #SUB 115 112 LEU A 344 341 ARG B Protein A 7 FT #SUB 117 114 GLY A 341 338 GLU B Protein B 2 FT #SUB 118 115 GLY A 340 337 PRO B Protein B 3 FT #SUB 118 115 GLY A 341 338 GLU B Protein B 2 FT #SUB 123 120 ALA A 340 337 PRO B Protein B 1 FT #SUB 142 139 LEU A 343 340 LYS B Protein S 1 FT #SUB 142 139 LEU A 344 341 ARG B Protein S 1 FT #SUB 142 139 LEU A 346 343 GLY B Protein S 2 FT #SUB 144 141 ASP A 365 362 SER B Protein B 1 FT #SUB 144 141 ASP A 366 363 GLY B Protein B 1 FT #SUB 145 142 GLY A 344 341 ARG B Protein B 4 FT #SUB 334 331 ASN A 334 331 ASN B Protein S 13 FT #SUB 340 337 PRO A 117 114 GLY B Protein S 1 FT #SUB 340 337 PRO A 118 115 GLY B Protein S 3 FT #SUB 340 337 PRO A 123 120 ALA B Protein S 3 FT #SUB 341 338 GLU A 117 114 GLY B Protein S 2 FT #SUB 341 338 GLU A 118 115 GLY B Protein S 1 FT #SUB 343 340 LYS A 115 112 LEU B Protein B 2 FT #SUB 343 340 LYS A 142 139 LEU B Protein B 1 FT #SUB 344 341 ARG A 88 85 TYR B Protein S 2 FT #SUB 344 341 ARG A 114 111 SER B Protein S 2 FT #SUB 344 341 ARG A 115 112 LEU B Protein A 10 FT #SUB 344 341 ARG A 117 114 GLY B Protein S 2 FT #SUB 344 341 ARG A 145 142 GLY B Protein A 6 FT #SUB 346 343 GLY A 142 139 LEU B Protein B 2 FT #SUB 351 348 ILE A 123 120 ALA B Protein S 1 FT #SUB 365 362 SER A 144 141 ASP B Protein S 2 FT #SUB 366 363 GLY A 144 141 ASP B Protein B 1 FT #SUB 368 365 GLN A 88 85 TYR B Protein A 10 FT #SUB 299 296 ARG A 352 349 GLU C Protein S 1 FT #SUB 328 325 THR A 352 349 GLU C Protein S 4 FT #SUB 352 349 GLU A 299 296 ARG C Protein S 1 FT #SUB 352 349 GLU A 328 325 THR C Protein S 1 FT #SUB 352 349 GLU A 355 352 THR C Protein S 2 FT #SUB 355 352 THR A 352 349 GLU C Protein S 2 FT #SUB 357 354 ILE A 357 354 ILE C Protein S 1 FT #SUB 159 156 HIS A 264 261 GLU D Protein S 2 FT #SUB 161 158 VAL A 264 261 GLU D Protein S 1 FT #SUB 161 158 VAL A 265 262 ALA D Protein S 1 FT #SUB 163 160 ASP A 269 266 LYS D Protein S 5 FT #SUB 164 161 LYS A 299 296 ARG D Protein S 1 FT #SUB 187 184 ASP A 264 261 GLU D Protein S 1 FT #SUB 188 185 ASN A 304 301 PRO D Protein A 2 FT #SUB 188 185 ASN A 305 302 GLY D Protein S 1 FT #SUB 191 188 ARG A 191 188 ARG D Protein S 10 FT #SUB 191 188 ARG A 215 212 ASP D Protein S 1 FT #SUB 215 212 ASP A 191 188 ARG D Protein S 5 FT #SUB 215 212 ASP A 304 301 PRO D Protein S 6 FT #SUB 264 261 GLU A 159 156 HIS D Protein S 2 FT #SUB 264 261 GLU A 161 158 VAL D Protein A 2 FT #SUB 264 261 GLU A 187 184 ASP D Protein S 1 FT #SUB 265 262 ALA A 161 158 VAL D Protein B 1 FT #SUB 269 266 LYS A 163 160 ASP D Protein S 6 FT #SUB 299 296 ARG A 164 161 LYS D Protein S 3 FT #SUB 304 301 PRO A 188 185 ASN D Protein S 1 FT #SUB 304 301 PRO A 215 212 ASP D Protein S 5 FT #SUB 305 302 GLY A 188 185 ASN D Protein B 2 FT #HET 27 24 ASN A 3 503 UD1 A S 11 FT #HET 85 82 CYS A 1 501 PG4 A S 2 FT #HET 93 90 THR A 2 502 FFQ A B 1 FT #HET 94 91 MET A 2 502 FFQ A B 3 FT #HET 95 92 ARG A 2 502 FFQ A A 9 FT #HET 99 96 TRP A 3 503 UD1 A S 4 FT #HET 112 109 GLN A 1 501 PG4 A S 8 FT #HET 118 115 GLY A 2 502 FFQ A B 1 FT #HET 119 116 CYS A 2 502 FFQ A A 7 FT #HET 124 121 ARG A 3 503 UD1 A A 6 FT #HET 125 122 PRO A 3 503 UD1 A A 11 FT #HET 126 123 VAL A 3 503 UD1 A B 5 FT #HET 127 124 ASP A 3 503 UD1 A A 9 FT #HET 128 125 LEU A 3 503 UD1 A A 9 FT #HET 129 126 HIS A 3 503 UD1 A A 2 FT #HET 144 141 ASP A 1 501 PG4 A S 4 FT #HET 148 145 LYS A 1 501 PG4 A S 14 FT #HET 164 161 LYS A 3 503 UD1 A S 3 FT #HET 166 163 SER A 3 503 UD1 A A 8 FT #HET 167 164 VAL A 3 503 UD1 A A 11 FT #HET 168 165 GLY A 3 503 UD1 A B 3 FT #HET 308 305 THR A 3 503 UD1 A S 1 FT #HET 309 306 ASP A 3 503 UD1 A S 10 FT #HET 318 315 ASN A 4 504 NA A B 2 FT #HET 321 318 ALA A 4 504 NA A B 2 FT #HET 324 321 GLY A 4 504 NA A B 4 FT #HET 331 328 ILE A 3 503 UD1 A A 3 FT #HET 332 329 PHE A 3 503 UD1 A S 3 FT #HET 335 332 ARG A 3 503 UD1 A S 1 FT #HET 359 356 GLY A 4 504 NA A B 2 FT #HET 360 357 ASP A 4 504 NA A A 5 FT #HET 401 398 ARG A 2 502 FFQ A S 7 FT DISORDER 1 3 FT DISORDER 422 422 CC SEQUENCE 418 AA (ATOM); CC MEKFRVIGST QPLQGEVTIS GAKNAALPIL FASILAEEPV EVANVPHLRD IDTTMELLER CC LGAKVERNGS VHVDAGPINQ YCAPYDLVKT MRASIWALGP LVARFGQGQV SLPGGCAIGA CC RPVDLHIHGL EQLGATITLE DGYVKAHVDG RLQGAHIVMD KVSVGATITI MCAATLAEGT CC TVLDNAAREP EIVDTAMFLN KLGAKISGAG TDSITIEGVE RLGGGKHAVV PDRIETGTFL CC VAAAVSRGKI VCRNTHAHLL EAVLAKLEEA GAEIECGEDW ISLDMTGREL KAVTVRTAPH CC PGFPTDMQAQ FTLLNMMAKG GGVITETIFE NRFMHVPELK RMGAKAEIEG NTVICGDVDR CC LSGAQVMATD LRASASLVIA GCIAKGETIV DRIYHIDRGY ERIEDKLSAL GANIERFR CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SNAMEKFRVIGSTQPLQGEVTISGAKNAALPILFASILAEEPVEVANVPH CC ATOM ---MEKFRVIGSTQPLQGEVTISGAKNAALPILFASILAEEPVEVANVPH CC *********************************************** CC SEQRES LRDIDTTMELLERLGAKVERNGSVHVDAGPINQYCAPYDLVKTMRASIWA CC ATOM LRDIDTTMELLERLGAKVERNGSVHVDAGPINQYCAPYDLVKTMRASIWA CC ************************************************** CC SEQRES LGPLVARFGQGQVSLPGGCAIGARPVDLHIHGLEQLGATITLEDGYVKAH CC ATOM LGPLVARFGQGQVSLPGGCAIGARPVDLHIHGLEQLGATITLEDGYVKAH CC ************************************************** CC SEQRES VDGRLQGAHIVMDKVSVGATITIMCAATLAEGTTVLDNAAREPEIVDTAM CC ATOM VDGRLQGAHIVMDKVSVGATITIMCAATLAEGTTVLDNAAREPEIVDTAM CC ************************************************** CC SEQRES FLNKLGAKISGAGTDSITIEGVERLGGGKHAVVPDRIETGTFLVAAAVSR CC ATOM FLNKLGAKISGAGTDSITIEGVERLGGGKHAVVPDRIETGTFLVAAAVSR CC ************************************************** CC SEQRES GKIVCRNTHAHLLEAVLAKLEEAGAEIECGEDWISLDMTGRELKAVTVRT CC ATOM GKIVCRNTHAHLLEAVLAKLEEAGAEIECGEDWISLDMTGRELKAVTVRT CC ************************************************** CC SEQRES APHPGFPTDMQAQFTLLNMMAKGGGVITETIFENRFMHVPELKRMGAKAE CC ATOM APHPGFPTDMQAQFTLLNMMAKGGGVITETIFENRFMHVPELKRMGAKAE CC ************************************************** CC SEQRES IEGNTVICGDVDRLSGAQVMATDLRASASLVIAGCIAKGETIVDRIYHID CC ATOM IEGNTVICGDVDRLSGAQVMATDLRASASLVIAGCIAKGETIVDRIYHID CC ************************************************** CC SEQRES RGYERIEDKLSALGANIERFRD CC ATOM RGYERIEDKLSALGANIERFR- CC ********************* SQ SEQUENCE 422 AA; MW; CN; SNAMEKFRVI GSTQPLQGEV TISGAKNAAL PILFASILAE EPVEVANVPH LRDIDTTMEL LERLGAKVER NGSVHVDAGP INQYCAPYDL VKTMRASIWA LGPLVARFGQ GQVSLPGGCA IGARPVDLHI HGLEQLGATI TLEDGYVKAH VDGRLQGAHI VMDKVSVGAT ITIMCAATLA EGTTVLDNAA REPEIVDTAM FLNKLGAKIS GAGTDSITIE GVERLGGGKH AVVPDRIETG TFLVAAAVSR GKIVCRNTHA HLLEAVLAKL EEAGAEIECG EDWISLDMTG RELKAVTVRT APHPGFPTDM QAQFTLLNMM AKGGGVITET IFENRFMHVP ELKRMGAKAE IEGNTVICGD VDRLSGAQVM ATDLRASASL VIAGCIAKGE TIVDRIYHID RGYERIEDKL SALGANIERF RD // ID 4R7UB STANDARD; PRT; 422 AA. DT CONVERTED FROM PDB (SEQRES) 4R7U DE UDP-N-acetylglucosamine 1-carboxyvinyltransferase OS Vibrio cholerae O1 biovar El Tor str. N16961 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.450 CC R-Factor 0.181 FT #SUB 88 85 TYR B 344 341 ARG A Protein S 2 FT #SUB 88 85 TYR B 368 365 GLN A Protein S 10 FT #SUB 114 111 SER B 344 341 ARG A Protein S 2 FT #SUB 115 112 LEU B 343 340 LYS A Protein S 2 FT #SUB 115 112 LEU B 344 341 ARG A Protein A 10 FT #SUB 117 114 GLY B 340 337 PRO A Protein B 1 FT #SUB 117 114 GLY B 341 338 GLU A Protein B 2 FT #SUB 117 114 GLY B 344 341 ARG A Protein B 2 FT #SUB 118 115 GLY B 340 337 PRO A Protein B 3 FT #SUB 118 115 GLY B 341 338 GLU A Protein B 1 FT #SUB 123 120 ALA B 340 337 PRO A Protein A 3 FT #SUB 123 120 ALA B 351 348 ILE A Protein S 1 FT #SUB 142 139 LEU B 343 340 LYS A Protein S 1 FT #SUB 142 139 LEU B 346 343 GLY A Protein S 2 FT #SUB 144 141 ASP B 365 362 SER A Protein A 2 FT #SUB 144 141 ASP B 366 363 GLY A Protein B 1 FT #SUB 145 142 GLY B 344 341 ARG A Protein B 6 FT #SUB 334 331 ASN B 334 331 ASN A Protein S 13 FT #SUB 340 337 PRO B 118 115 GLY A Protein S 3 FT #SUB 340 337 PRO B 123 120 ALA A Protein S 1 FT #SUB 341 338 GLU B 117 114 GLY A Protein S 2 FT #SUB 341 338 GLU B 118 115 GLY A Protein S 2 FT #SUB 343 340 LYS B 142 139 LEU A Protein B 1 FT #SUB 344 341 ARG B 88 85 TYR A Protein S 2 FT #SUB 344 341 ARG B 114 111 SER A Protein S 1 FT #SUB 344 341 ARG B 115 112 LEU A Protein S 7 FT #SUB 344 341 ARG B 142 139 LEU A Protein B 1 FT #SUB 344 341 ARG B 145 142 GLY A Protein A 4 FT #SUB 346 343 GLY B 142 139 LEU A Protein B 2 FT #SUB 365 362 SER B 144 141 ASP A Protein S 1 FT #SUB 366 363 GLY B 144 141 ASP A Protein B 1 FT #SUB 368 365 GLN B 88 85 TYR A Protein A 6 FT #SUB 159 156 HIS B 264 261 GLU C Protein S 2 FT #SUB 161 158 VAL B 264 261 GLU C Protein S 1 FT #SUB 163 160 ASP B 269 266 LYS C Protein S 5 FT #SUB 164 161 LYS B 299 296 ARG C Protein S 1 FT #SUB 188 185 ASN B 305 302 GLY C Protein S 2 FT #SUB 191 188 ARG B 191 188 ARG C Protein S 8 FT #SUB 191 188 ARG B 215 212 ASP C Protein S 4 FT #SUB 215 212 ASP B 191 188 ARG C Protein S 5 FT #SUB 215 212 ASP B 304 301 PRO C Protein S 6 FT #SUB 264 261 GLU B 159 156 HIS C Protein S 2 FT #SUB 264 261 GLU B 161 158 VAL C Protein A 2 FT #SUB 265 262 ALA B 161 158 VAL C Protein B 1 FT #SUB 269 266 LYS B 163 160 ASP C Protein S 3 FT #SUB 299 296 ARG B 164 161 LYS C Protein S 3 FT #SUB 304 301 PRO B 215 212 ASP C Protein S 4 FT #SUB 299 296 ARG B 352 349 GLU D Protein S 2 FT #SUB 326 323 VAL B 357 354 ILE D Protein S 1 FT #SUB 328 325 THR B 352 349 GLU D Protein S 5 FT #SUB 352 349 GLU B 299 296 ARG D Protein S 2 FT #SUB 352 349 GLU B 328 325 THR D Protein S 3 FT #SUB 352 349 GLU B 355 352 THR D Protein S 2 FT #SUB 355 352 THR B 352 349 GLU D Protein S 2 FT #SUB 357 354 ILE B 357 354 ILE D Protein S 2 FT #HET 27 24 ASN B 5 501 UD1 B S 14 FT #HET 93 90 THR B 6 502 FFQ B B 3 FT #HET 94 91 MET B 6 502 FFQ B B 3 FT #HET 95 92 ARG B 6 502 FFQ B A 10 FT #HET 99 96 TRP B 5 501 UD1 B S 4 FT #HET 118 115 GLY B 6 502 FFQ B B 1 FT #HET 119 116 CYS B 6 502 FFQ B A 8 FT #HET 124 121 ARG B 5 501 UD1 B A 6 FT #HET 125 122 PRO B 5 501 UD1 B A 13 FT #HET 126 123 VAL B 5 501 UD1 B B 5 FT #HET 127 124 ASP B 5 501 UD1 B A 8 FT #HET 128 125 LEU B 5 501 UD1 B A 11 FT #HET 129 126 HIS B 5 501 UD1 B A 4 FT #HET 164 161 LYS B 5 501 UD1 B S 4 FT #HET 166 163 SER B 5 501 UD1 B A 8 FT #HET 167 164 VAL B 5 501 UD1 B A 11 FT #HET 168 165 GLY B 5 501 UD1 B B 3 FT #HET 308 305 THR B 5 501 UD1 B S 3 FT #HET 309 306 ASP B 5 501 UD1 B S 10 FT #HET 318 315 ASN B 7 503 NA B B 2 FT #HET 321 318 ALA B 7 503 NA B B 2 FT #HET 324 321 GLY B 7 503 NA B B 4 FT #HET 331 328 ILE B 5 501 UD1 B A 4 FT #HET 332 329 PHE B 5 501 UD1 B S 2 FT #HET 335 332 ARG B 5 501 UD1 B S 1 FT #HET 359 356 GLY B 7 503 NA B B 2 FT #HET 360 357 ASP B 7 503 NA B A 4 FT #HET 401 398 ARG B 6 502 FFQ B S 9 FT DISORDER 1 3 FT DISORDER 422 422 CC SEQUENCE 418 AA (ATOM); CC MEKFRVIGST QPLQGEVTIS GAKNAALPIL FASILAEEPV EVANVPHLRD IDTTMELLER CC LGAKVERNGS VHVDAGPINQ YCAPYDLVKT MRASIWALGP LVARFGQGQV SLPGGCAIGA CC RPVDLHIHGL EQLGATITLE DGYVKAHVDG RLQGAHIVMD KVSVGATITI MCAATLAEGT CC TVLDNAAREP EIVDTAMFLN KLGAKISGAG TDSITIEGVE RLGGGKHAVV PDRIETGTFL CC VAAAVSRGKI VCRNTHAHLL EAVLAKLEEA GAEIECGEDW ISLDMTGREL KAVTVRTAPH CC PGFPTDMQAQ FTLLNMMAKG GGVITETIFE NRFMHVPELK RMGAKAEIEG NTVICGDVDR CC LSGAQVMATD LRASASLVIA GCIAKGETIV DRIYHIDRGY ERIEDKLSAL GANIERFR CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SNAMEKFRVIGSTQPLQGEVTISGAKNAALPILFASILAEEPVEVANVPH CC ATOM ---MEKFRVIGSTQPLQGEVTISGAKNAALPILFASILAEEPVEVANVPH CC *********************************************** CC SEQRES LRDIDTTMELLERLGAKVERNGSVHVDAGPINQYCAPYDLVKTMRASIWA CC ATOM LRDIDTTMELLERLGAKVERNGSVHVDAGPINQYCAPYDLVKTMRASIWA CC ************************************************** CC SEQRES LGPLVARFGQGQVSLPGGCAIGARPVDLHIHGLEQLGATITLEDGYVKAH CC ATOM LGPLVARFGQGQVSLPGGCAIGARPVDLHIHGLEQLGATITLEDGYVKAH CC ************************************************** CC SEQRES VDGRLQGAHIVMDKVSVGATITIMCAATLAEGTTVLDNAAREPEIVDTAM CC ATOM VDGRLQGAHIVMDKVSVGATITIMCAATLAEGTTVLDNAAREPEIVDTAM CC ************************************************** CC SEQRES FLNKLGAKISGAGTDSITIEGVERLGGGKHAVVPDRIETGTFLVAAAVSR CC ATOM FLNKLGAKISGAGTDSITIEGVERLGGGKHAVVPDRIETGTFLVAAAVSR CC ************************************************** CC SEQRES GKIVCRNTHAHLLEAVLAKLEEAGAEIECGEDWISLDMTGRELKAVTVRT CC ATOM GKIVCRNTHAHLLEAVLAKLEEAGAEIECGEDWISLDMTGRELKAVTVRT CC ************************************************** CC SEQRES APHPGFPTDMQAQFTLLNMMAKGGGVITETIFENRFMHVPELKRMGAKAE CC ATOM APHPGFPTDMQAQFTLLNMMAKGGGVITETIFENRFMHVPELKRMGAKAE CC ************************************************** CC SEQRES IEGNTVICGDVDRLSGAQVMATDLRASASLVIAGCIAKGETIVDRIYHID CC ATOM IEGNTVICGDVDRLSGAQVMATDLRASASLVIAGCIAKGETIVDRIYHID CC ************************************************** CC SEQRES RGYERIEDKLSALGANIERFRD CC ATOM RGYERIEDKLSALGANIERFR- CC ********************* SQ SEQUENCE 422 AA; MW; CN; SNAMEKFRVI GSTQPLQGEV TISGAKNAAL PILFASILAE EPVEVANVPH LRDIDTTMEL LERLGAKVER NGSVHVDAGP INQYCAPYDL VKTMRASIWA LGPLVARFGQ GQVSLPGGCA IGARPVDLHI HGLEQLGATI TLEDGYVKAH VDGRLQGAHI VMDKVSVGAT ITIMCAATLA EGTTVLDNAA REPEIVDTAM FLNKLGAKIS GAGTDSITIE GVERLGGGKH AVVPDRIETG TFLVAAAVSR GKIVCRNTHA HLLEAVLAKL EEAGAEIECG EDWISLDMTG RELKAVTVRT APHPGFPTDM QAQFTLLNMM AKGGGVITET IFENRFMHVP ELKRMGAKAE IEGNTVICGD VDRLSGAQVM ATDLRASASL VIAGCIAKGE TIVDRIYHID RGYERIEDKL SALGANIERF RD // ID 4R7UC STANDARD; PRT; 422 AA. DT CONVERTED FROM PDB (SEQRES) 4R7U DE UDP-N-acetylglucosamine 1-carboxyvinyltransferase OS Vibrio cholerae O1 biovar El Tor str. N16961 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.450 CC R-Factor 0.181 FT #SUB 299 296 ARG C 352 349 GLU A Protein S 1 FT #SUB 328 325 THR C 352 349 GLU A Protein S 1 FT #SUB 352 349 GLU C 299 296 ARG A Protein S 1 FT #SUB 352 349 GLU C 328 325 THR A Protein S 4 FT #SUB 352 349 GLU C 355 352 THR A Protein S 2 FT #SUB 355 352 THR C 352 349 GLU A Protein S 2 FT #SUB 357 354 ILE C 357 354 ILE A Protein S 1 FT #SUB 159 156 HIS C 264 261 GLU B Protein S 2 FT #SUB 161 158 VAL C 264 261 GLU B Protein S 2 FT #SUB 161 158 VAL C 265 262 ALA B Protein S 1 FT #SUB 163 160 ASP C 269 266 LYS B Protein S 3 FT #SUB 164 161 LYS C 299 296 ARG B Protein S 3 FT #SUB 191 188 ARG C 191 188 ARG B Protein S 8 FT #SUB 191 188 ARG C 215 212 ASP B Protein S 5 FT #SUB 215 212 ASP C 191 188 ARG B Protein S 4 FT #SUB 215 212 ASP C 304 301 PRO B Protein S 4 FT #SUB 264 261 GLU C 159 156 HIS B Protein S 2 FT #SUB 264 261 GLU C 161 158 VAL B Protein B 1 FT #SUB 269 266 LYS C 163 160 ASP B Protein S 5 FT #SUB 299 296 ARG C 164 161 LYS B Protein S 1 FT #SUB 304 301 PRO C 215 212 ASP B Protein S 6 FT #SUB 305 302 GLY C 188 185 ASN B Protein B 2 FT #SUB 88 85 TYR C 344 341 ARG D Protein S 3 FT #SUB 88 85 TYR C 368 365 GLN D Protein S 14 FT #SUB 92 89 LYS C 370 367 MET D Protein S 1 FT #SUB 92 89 LYS C 394 391 ASP D Protein S 1 FT #SUB 114 111 SER C 344 341 ARG D Protein S 3 FT #SUB 115 112 LEU C 343 340 LYS D Protein S 2 FT #SUB 115 112 LEU C 344 341 ARG D Protein A 11 FT #SUB 117 114 GLY C 341 338 GLU D Protein B 3 FT #SUB 117 114 GLY C 344 341 ARG D Protein B 2 FT #SUB 118 115 GLY C 340 337 PRO D Protein B 4 FT #SUB 118 115 GLY C 341 338 GLU D Protein B 3 FT #SUB 123 120 ALA C 351 348 ILE D Protein S 1 FT #SUB 142 139 LEU C 343 340 LYS D Protein S 1 FT #SUB 142 139 LEU C 344 341 ARG D Protein S 1 FT #SUB 142 139 LEU C 346 343 GLY D Protein S 2 FT #SUB 144 141 ASP C 365 362 SER D Protein A 3 FT #SUB 144 141 ASP C 366 363 GLY D Protein B 2 FT #SUB 145 142 GLY C 344 341 ARG D Protein B 3 FT #SUB 334 331 ASN C 334 331 ASN D Protein S 13 FT #SUB 340 337 PRO C 117 114 GLY D Protein S 1 FT #SUB 340 337 PRO C 118 115 GLY D Protein S 4 FT #SUB 340 337 PRO C 123 120 ALA D Protein S 1 FT #SUB 341 338 GLU C 117 114 GLY D Protein S 2 FT #SUB 341 338 GLU C 118 115 GLY D Protein S 3 FT #SUB 343 340 LYS C 142 139 LEU D Protein B 1 FT #SUB 344 341 ARG C 88 85 TYR D Protein S 3 FT #SUB 344 341 ARG C 114 111 SER D Protein S 1 FT #SUB 344 341 ARG C 115 112 LEU D Protein S 7 FT #SUB 344 341 ARG C 117 114 GLY D Protein S 2 FT #SUB 344 341 ARG C 142 139 LEU D Protein B 1 FT #SUB 344 341 ARG C 145 142 GLY D Protein A 5 FT #SUB 346 343 GLY C 142 139 LEU D Protein B 2 FT #SUB 351 348 ILE C 123 120 ALA D Protein S 1 FT #SUB 365 362 SER C 144 141 ASP D Protein S 1 FT #SUB 366 363 GLY C 144 141 ASP D Protein B 1 FT #SUB 367 364 ALA C 145 142 GLY D Protein S 1 FT #SUB 368 365 GLN C 88 85 TYR D Protein A 16 FT #HET 27 24 ASN C 8 501 UD1 C S 15 FT #HET 93 90 THR C 9 502 FFQ C B 1 FT #HET 94 91 MET C 9 502 FFQ C B 3 FT #HET 95 92 ARG C 9 502 FFQ C A 9 FT #HET 99 96 TRP C 8 501 UD1 C S 4 FT #HET 118 115 GLY C 9 502 FFQ C B 1 FT #HET 119 116 CYS C 9 502 FFQ C A 8 FT #HET 124 121 ARG C 8 501 UD1 C A 6 FT #HET 125 122 PRO C 8 501 UD1 C A 13 FT #HET 126 123 VAL C 8 501 UD1 C B 5 FT #HET 127 124 ASP C 8 501 UD1 C A 6 FT #HET 128 125 LEU C 8 501 UD1 C A 13 FT #HET 129 126 HIS C 8 501 UD1 C A 2 FT #HET 164 161 LYS C 8 501 UD1 C S 2 FT #HET 166 163 SER C 8 501 UD1 C A 9 FT #HET 167 164 VAL C 8 501 UD1 C A 10 FT #HET 168 165 GLY C 8 501 UD1 C B 3 FT #HET 308 305 THR C 8 501 UD1 C S 3 FT #HET 309 306 ASP C 8 501 UD1 C S 10 FT #HET 318 315 ASN C 10 503 NA C B 2 FT #HET 321 318 ALA C 10 503 NA C B 2 FT #HET 324 321 GLY C 10 503 NA C B 4 FT #HET 331 328 ILE C 8 501 UD1 C A 4 FT #HET 332 329 PHE C 8 501 UD1 C S 5 FT #HET 333 330 GLU C 11 504 NA C B 2 FT #HET 335 332 ARG C 8 501 UD1 C S 1 FT #HET 359 356 GLY C 10 503 NA C B 2 FT #HET 360 357 ASP C 10 503 NA C A 5 FT #HET 401 398 ARG C 9 502 FFQ C S 9 FT DISORDER 1 3 CC SEQUENCE 419 AA (ATOM); CC MEKFRVIGST QPLQGEVTIS GAKNAALPIL FASILAEEPV EVANVPHLRD IDTTMELLER CC LGAKVERNGS VHVDAGPINQ YCAPYDLVKT MRASIWALGP LVARFGQGQV SLPGGCAIGA CC RPVDLHIHGL EQLGATITLE DGYVKAHVDG RLQGAHIVMD KVSVGATITI MCAATLAEGT CC TVLDNAAREP EIVDTAMFLN KLGAKISGAG TDSITIEGVE RLGGGKHAVV PDRIETGTFL CC VAAAVSRGKI VCRNTHAHLL EAVLAKLEEA GAEIECGEDW ISLDMTGREL KAVTVRTAPH CC PGFPTDMQAQ FTLLNMMAKG GGVITETIFE NRFMHVPELK RMGAKAEIEG NTVICGDVDR CC LSGAQVMATD LRASASLVIA GCIAKGETIV DRIYHIDRGY ERIEDKLSAL GANIERFRD CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SNAMEKFRVIGSTQPLQGEVTISGAKNAALPILFASILAEEPVEVANVPH CC ATOM ---MEKFRVIGSTQPLQGEVTISGAKNAALPILFASILAEEPVEVANVPH CC *********************************************** CC SEQRES LRDIDTTMELLERLGAKVERNGSVHVDAGPINQYCAPYDLVKTMRASIWA CC ATOM LRDIDTTMELLERLGAKVERNGSVHVDAGPINQYCAPYDLVKTMRASIWA CC ************************************************** CC SEQRES LGPLVARFGQGQVSLPGGCAIGARPVDLHIHGLEQLGATITLEDGYVKAH CC ATOM LGPLVARFGQGQVSLPGGCAIGARPVDLHIHGLEQLGATITLEDGYVKAH CC ************************************************** CC SEQRES VDGRLQGAHIVMDKVSVGATITIMCAATLAEGTTVLDNAAREPEIVDTAM CC ATOM VDGRLQGAHIVMDKVSVGATITIMCAATLAEGTTVLDNAAREPEIVDTAM CC ************************************************** CC SEQRES FLNKLGAKISGAGTDSITIEGVERLGGGKHAVVPDRIETGTFLVAAAVSR CC ATOM FLNKLGAKISGAGTDSITIEGVERLGGGKHAVVPDRIETGTFLVAAAVSR CC ************************************************** CC SEQRES GKIVCRNTHAHLLEAVLAKLEEAGAEIECGEDWISLDMTGRELKAVTVRT CC ATOM GKIVCRNTHAHLLEAVLAKLEEAGAEIECGEDWISLDMTGRELKAVTVRT CC ************************************************** CC SEQRES APHPGFPTDMQAQFTLLNMMAKGGGVITETIFENRFMHVPELKRMGAKAE CC ATOM APHPGFPTDMQAQFTLLNMMAKGGGVITETIFENRFMHVPELKRMGAKAE CC ************************************************** CC SEQRES IEGNTVICGDVDRLSGAQVMATDLRASASLVIAGCIAKGETIVDRIYHID CC ATOM IEGNTVICGDVDRLSGAQVMATDLRASASLVIAGCIAKGETIVDRIYHID CC ************************************************** CC SEQRES RGYERIEDKLSALGANIERFRD CC ATOM RGYERIEDKLSALGANIERFRD CC ********************** SQ SEQUENCE 422 AA; MW; CN; SNAMEKFRVI GSTQPLQGEV TISGAKNAAL PILFASILAE EPVEVANVPH LRDIDTTMEL LERLGAKVER NGSVHVDAGP INQYCAPYDL VKTMRASIWA LGPLVARFGQ GQVSLPGGCA IGARPVDLHI HGLEQLGATI TLEDGYVKAH VDGRLQGAHI VMDKVSVGAT ITIMCAATLA EGTTVLDNAA REPEIVDTAM FLNKLGAKIS GAGTDSITIE GVERLGGGKH AVVPDRIETG TFLVAAAVSR GKIVCRNTHA HLLEAVLAKL EEAGAEIECG EDWISLDMTG RELKAVTVRT APHPGFPTDM QAQFTLLNMM AKGGGVITET IFENRFMHVP ELKRMGAKAE IEGNTVICGD VDRLSGAQVM ATDLRASASL VIAGCIAKGE TIVDRIYHID RGYERIEDKL SALGANIERF RD // ID 4R7UD STANDARD; PRT; 422 AA. DT CONVERTED FROM PDB (SEQRES) 4R7U DE UDP-N-acetylglucosamine 1-carboxyvinyltransferase OS Vibrio cholerae O1 biovar El Tor str. N16961 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.450 CC R-Factor 0.181 FT #SUB 159 156 HIS D 264 261 GLU A Protein S 2 FT #SUB 161 158 VAL D 264 261 GLU A Protein S 2 FT #SUB 161 158 VAL D 265 262 ALA A Protein S 1 FT #SUB 163 160 ASP D 269 266 LYS A Protein S 6 FT #SUB 164 161 LYS D 299 296 ARG A Protein S 3 FT #SUB 187 184 ASP D 264 261 GLU A Protein S 1 FT #SUB 188 185 ASN D 304 301 PRO A Protein S 1 FT #SUB 188 185 ASN D 305 302 GLY A Protein S 2 FT #SUB 191 188 ARG D 191 188 ARG A Protein S 10 FT #SUB 191 188 ARG D 215 212 ASP A Protein S 5 FT #SUB 215 212 ASP D 191 188 ARG A Protein S 1 FT #SUB 215 212 ASP D 304 301 PRO A Protein S 5 FT #SUB 264 261 GLU D 159 156 HIS A Protein S 2 FT #SUB 264 261 GLU D 161 158 VAL A Protein B 1 FT #SUB 264 261 GLU D 187 184 ASP A Protein S 1 FT #SUB 265 262 ALA D 161 158 VAL A Protein B 1 FT #SUB 269 266 LYS D 163 160 ASP A Protein S 5 FT #SUB 299 296 ARG D 164 161 LYS A Protein S 1 FT #SUB 304 301 PRO D 188 185 ASN A Protein S 2 FT #SUB 304 301 PRO D 215 212 ASP A Protein S 6 FT #SUB 305 302 GLY D 188 185 ASN A Protein B 1 FT #SUB 299 296 ARG D 352 349 GLU B Protein S 2 FT #SUB 328 325 THR D 352 349 GLU B Protein S 3 FT #SUB 352 349 GLU D 299 296 ARG B Protein S 2 FT #SUB 352 349 GLU D 328 325 THR B Protein S 5 FT #SUB 352 349 GLU D 355 352 THR B Protein S 2 FT #SUB 355 352 THR D 352 349 GLU B Protein S 2 FT #SUB 357 354 ILE D 326 323 VAL B Protein S 1 FT #SUB 357 354 ILE D 357 354 ILE B Protein S 2 FT #SUB 88 85 TYR D 344 341 ARG C Protein S 3 FT #SUB 88 85 TYR D 368 365 GLN C Protein S 16 FT #SUB 114 111 SER D 344 341 ARG C Protein S 1 FT #SUB 115 112 LEU D 344 341 ARG C Protein B 7 FT #SUB 117 114 GLY D 340 337 PRO C Protein B 1 FT #SUB 117 114 GLY D 341 338 GLU C Protein B 2 FT #SUB 117 114 GLY D 344 341 ARG C Protein B 2 FT #SUB 118 115 GLY D 340 337 PRO C Protein B 4 FT #SUB 118 115 GLY D 341 338 GLU C Protein B 3 FT #SUB 123 120 ALA D 340 337 PRO C Protein B 1 FT #SUB 123 120 ALA D 351 348 ILE C Protein S 1 FT #SUB 142 139 LEU D 343 340 LYS C Protein S 1 FT #SUB 142 139 LEU D 344 341 ARG C Protein S 1 FT #SUB 142 139 LEU D 346 343 GLY C Protein S 2 FT #SUB 144 141 ASP D 365 362 SER C Protein B 1 FT #SUB 144 141 ASP D 366 363 GLY C Protein B 1 FT #SUB 145 142 GLY D 344 341 ARG C Protein B 5 FT #SUB 145 142 GLY D 367 364 ALA C Protein B 1 FT #SUB 334 331 ASN D 334 331 ASN C Protein S 13 FT #SUB 340 337 PRO D 118 115 GLY C Protein S 4 FT #SUB 341 338 GLU D 117 114 GLY C Protein S 3 FT #SUB 341 338 GLU D 118 115 GLY C Protein S 3 FT #SUB 343 340 LYS D 115 112 LEU C Protein B 2 FT #SUB 343 340 LYS D 142 139 LEU C Protein B 1 FT #SUB 344 341 ARG D 88 85 TYR C Protein S 3 FT #SUB 344 341 ARG D 114 111 SER C Protein S 3 FT #SUB 344 341 ARG D 115 112 LEU C Protein A 11 FT #SUB 344 341 ARG D 117 114 GLY C Protein S 2 FT #SUB 344 341 ARG D 142 139 LEU C Protein B 1 FT #SUB 344 341 ARG D 145 142 GLY C Protein A 3 FT #SUB 346 343 GLY D 142 139 LEU C Protein B 2 FT #SUB 351 348 ILE D 123 120 ALA C Protein S 1 FT #SUB 365 362 SER D 144 141 ASP C Protein S 3 FT #SUB 366 363 GLY D 144 141 ASP C Protein B 2 FT #SUB 368 365 GLN D 88 85 TYR C Protein A 14 FT #SUB 370 367 MET D 92 89 LYS C Protein S 1 FT #SUB 394 391 ASP D 92 89 LYS C Protein S 1 FT #HET 27 24 ASN D 12 501 UD1 D S 12 FT #HET 93 90 THR D 13 502 FFQ D B 1 FT #HET 94 91 MET D 13 502 FFQ D B 3 FT #HET 95 92 ARG D 13 502 FFQ D A 8 FT #HET 99 96 TRP D 12 501 UD1 D S 4 FT #HET 119 116 CYS D 13 502 FFQ D A 7 FT #HET 124 121 ARG D 12 501 UD1 D A 6 FT #HET 124 121 ARG D 13 502 FFQ D S 1 FT #HET 125 122 PRO D 12 501 UD1 D A 12 FT #HET 126 123 VAL D 12 501 UD1 D B 5 FT #HET 127 124 ASP D 12 501 UD1 D A 9 FT #HET 128 125 LEU D 12 501 UD1 D A 11 FT #HET 129 126 HIS D 12 501 UD1 D A 2 FT #HET 164 161 LYS D 12 501 UD1 D S 4 FT #HET 166 163 SER D 12 501 UD1 D A 9 FT #HET 167 164 VAL D 12 501 UD1 D A 10 FT #HET 168 165 GLY D 12 501 UD1 D B 3 FT #HET 308 305 THR D 12 501 UD1 D S 1 FT #HET 309 306 ASP D 12 501 UD1 D S 10 FT #HET 318 315 ASN D 14 503 NA D B 2 FT #HET 321 318 ALA D 14 503 NA D B 2 FT #HET 324 321 GLY D 14 503 NA D B 4 FT #HET 331 328 ILE D 12 501 UD1 D A 4 FT #HET 332 329 PHE D 12 501 UD1 D S 2 FT #HET 333 330 GLU D 11 504 NA C B 2 FT #HET 335 332 ARG D 12 501 UD1 D S 1 FT #HET 358 355 CYS D 14 503 NA D B 1 FT #HET 359 356 GLY D 14 503 NA D B 2 FT #HET 360 357 ASP D 14 503 NA D A 4 FT #HET 401 398 ARG D 13 502 FFQ D S 8 FT DISORDER 1 3 FT DISORDER 422 422 CC SEQUENCE 418 AA (ATOM); CC MEKFRVIGST QPLQGEVTIS GAKNAALPIL FASILAEEPV EVANVPHLRD IDTTMELLER CC LGAKVERNGS VHVDAGPINQ YCAPYDLVKT MRASIWALGP LVARFGQGQV SLPGGCAIGA CC RPVDLHIHGL EQLGATITLE DGYVKAHVDG RLQGAHIVMD KVSVGATITI MCAATLAEGT CC TVLDNAAREP EIVDTAMFLN KLGAKISGAG TDSITIEGVE RLGGGKHAVV PDRIETGTFL CC VAAAVSRGKI VCRNTHAHLL EAVLAKLEEA GAEIECGEDW ISLDMTGREL KAVTVRTAPH CC PGFPTDMQAQ FTLLNMMAKG GGVITETIFE NRFMHVPELK RMGAKAEIEG NTVICGDVDR CC LSGAQVMATD LRASASLVIA GCIAKGETIV DRIYHIDRGY ERIEDKLSAL GANIERFR CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SNAMEKFRVIGSTQPLQGEVTISGAKNAALPILFASILAEEPVEVANVPH CC ATOM ---MEKFRVIGSTQPLQGEVTISGAKNAALPILFASILAEEPVEVANVPH CC *********************************************** CC SEQRES LRDIDTTMELLERLGAKVERNGSVHVDAGPINQYCAPYDLVKTMRASIWA CC ATOM LRDIDTTMELLERLGAKVERNGSVHVDAGPINQYCAPYDLVKTMRASIWA CC ************************************************** CC SEQRES LGPLVARFGQGQVSLPGGCAIGARPVDLHIHGLEQLGATITLEDGYVKAH CC ATOM LGPLVARFGQGQVSLPGGCAIGARPVDLHIHGLEQLGATITLEDGYVKAH CC ************************************************** CC SEQRES VDGRLQGAHIVMDKVSVGATITIMCAATLAEGTTVLDNAAREPEIVDTAM CC ATOM VDGRLQGAHIVMDKVSVGATITIMCAATLAEGTTVLDNAAREPEIVDTAM CC ************************************************** CC SEQRES FLNKLGAKISGAGTDSITIEGVERLGGGKHAVVPDRIETGTFLVAAAVSR CC ATOM FLNKLGAKISGAGTDSITIEGVERLGGGKHAVVPDRIETGTFLVAAAVSR CC ************************************************** CC SEQRES GKIVCRNTHAHLLEAVLAKLEEAGAEIECGEDWISLDMTGRELKAVTVRT CC ATOM GKIVCRNTHAHLLEAVLAKLEEAGAEIECGEDWISLDMTGRELKAVTVRT CC ************************************************** CC SEQRES APHPGFPTDMQAQFTLLNMMAKGGGVITETIFENRFMHVPELKRMGAKAE CC ATOM APHPGFPTDMQAQFTLLNMMAKGGGVITETIFENRFMHVPELKRMGAKAE CC ************************************************** CC SEQRES IEGNTVICGDVDRLSGAQVMATDLRASASLVIAGCIAKGETIVDRIYHID CC ATOM IEGNTVICGDVDRLSGAQVMATDLRASASLVIAGCIAKGETIVDRIYHID CC ************************************************** CC SEQRES RGYERIEDKLSALGANIERFRD CC ATOM RGYERIEDKLSALGANIERFR- CC ********************* SQ SEQUENCE 422 AA; MW; CN; SNAMEKFRVI GSTQPLQGEV TISGAKNAAL PILFASILAE EPVEVANVPH LRDIDTTMEL LERLGAKVER NGSVHVDAGP INQYCAPYDL VKTMRASIWA LGPLVARFGQ GQVSLPGGCA IGARPVDLHI HGLEQLGATI TLEDGYVKAH VDGRLQGAHI VMDKVSVGAT ITIMCAATLA EGTTVLDNAA REPEIVDTAM FLNKLGAKIS GAGTDSITIE GVERLGGGKH AVVPDRIETG TFLVAAAVSR GKIVCRNTHA HLLEAVLAKL EEAGAEIECG EDWISLDMTG RELKAVTVRT APHPGFPTDM QAQFTLLNMM AKGGGVITET IFENRFMHVP ELKRMGAKAE IEGNTVICGD VDRLSGAQVM ATDLRASASL VIAGCIAKGE TIVDRIYHID RGYERIEDKL SALGANIERF RD //