ID 3ZGJA STANDARD; PRT; 371 AA. DT CONVERTED FROM PDB (SEQRES) 3ZGJ DE 4-HYDROXYPHENYLPYRUVIC ACID DIOXYGENASE OS STREPTOMYCES COELICOLOR CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.950 CC R-Factor 0.177 FT #SUB 192 192 ARG A 66 66 GLU B Protein S 6 FT #SUB 192 192 ARG A 173 173 THR B Protein S 5 FT #SUB 192 192 ARG A 174 174 GLY B Protein S 2 FT #SUB 192 192 ARG A 175 175 PRO B Protein S 2 FT #SUB 206 206 TYR A 176 176 ARG B Protein S 16 FT #SUB 206 206 TYR A 267 267 ASP B Protein S 5 FT #SUB 208 208 SER A 267 267 ASP B Protein A 3 FT #SUB 209 209 SER A 175 175 PRO B Protein S 2 FT #SUB 209 209 SER A 267 267 ASP B Protein A 6 FT #SUB 211 211 TYR A 172 172 ARG B Protein S 13 FT #SUB 220 220 ASP A 172 172 ARG B Protein S 8 FT #SUB 222 222 ILE A 175 175 PRO B Protein S 2 FT #SUB 239 239 ASP A 172 172 ARG B Protein S 3 FT #SUB 320 320 TRP A 345 345 ASN B Protein S 2 FT #HET 181 181 HIS A 1 1368 CO A S 3 FT #HET 181 181 HIS A 2 1369 RMN A S 5 FT #HET 183 183 ALA A 2 1369 RMN A S 1 FT #HET 223 223 PHE A 2 1369 RMN A S 4 FT #HET 234 234 THR A 2 1369 RMN A S 4 FT #HET 236 236 ILE A 2 1369 RMN A S 2 FT #HET 261 261 HIS A 1 1368 CO A S 3 FT #HET 261 261 HIS A 2 1369 RMN A S 5 FT #HET 325 325 GLN A 2 1369 RMN A S 3 FT #HET 327 327 PHE A 2 1369 RMN A S 2 FT #HET 340 340 GLU A 1 1368 CO A S 3 FT #HET 340 340 GLU A 2 1369 RMN A S 3 FT #HET 350 350 PHE A 2 1369 RMN A S 1 FT #HET 355 355 ILE A 2 1369 RMN A A 4 FT #HET 358 358 LEU A 2 1369 RMN A S 2 FT DISORDER 1 14 FT DISORDER 122 131 FT DISORDER 149 155 FT DISORDER 295 303 FT DISORDER 368 371 CC SEQUENCE 327 AA (ATOM); CC PPSDIAYAEL YVADDREASG FLVDSLGFVP LAVAGPATGT HDRRSTVLRS GEVTLVVTQA CC LAPDTPVARY VERHGDSIAD LAFGCDDVRS CFDRAVLAGA EALQAPTFAT VSGFGDIRHT CC LVPALLPPDR DWALLPAATG RTGPRPLLDH VAVCLESGTL RSTAEFYEAA FDMPYYSSEY CC IEVGEQAMDM IFVRNAGGGI TFTLIEPDDT RVPGQIDQFL SAHDGPGVQH LAFLVDDIVG CC SVRSLGDRGV AFLRTPGAYY DLLAIEDLRE TNVLADRDEW GYLLQIFTRS PYPRGTLFYE CC YIQRNGARGF GSSNIKALAE AVERERE CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MLPPFPFLHWRAAMPPSDIAYAELYVADDREASGFLVDSLGFVPLAVAGP CC ATOM --------------PPSDIAYAELYVADDREASGFLVDSLGFVPLAVAGP CC ************************************ CC SEQRES ATGTHDRRSTVLRSGEVTLVVTQALAPDTPVARYVERHGDSIADLAFGCD CC ATOM ATGTHDRRSTVLRSGEVTLVVTQALAPDTPVARYVERHGDSIADLAFGCD CC ************************************************** CC SEQRES DVRSCFDRAVLAGAEALQAPTPSHRAGQDAWFATVSGFGDIRHTLVPAAD CC ATOM DVRSCFDRAVLAGAEALQAPT----------FATVSGFGDIRHTLVPA-- CC ********************* ***************** CC SEQRES GDGAGLLPPDRDWALLPAATGRTGPRPLLDHVAVCLESGTLRSTAEFYEA CC ATOM -----LLPPDRDWALLPAATGRTGPRPLLDHVAVCLESGTLRSTAEFYEA CC ********************************************* CC SEQRES AFDMPYYSSEYIEVGEQAMDMIFVRNAGGGITFTLIEPDDTRVPGQIDQF CC ATOM AFDMPYYSSEYIEVGEQAMDMIFVRNAGGGITFTLIEPDDTRVPGQIDQF CC ************************************************** CC SEQRES LSAHDGPGVQHLAFLVDDIVGSVRSLGDRGVAFLRTPGAYYDLLTERVGA CC ATOM LSAHDGPGVQHLAFLVDDIVGSVRSLGDRGVAFLRTPGAYYDLL------ CC ******************************************** CC SEQRES MADAIEDLRETNVLADRDEWGYLLQIFTRSPYPRGTLFYEYIQRNGARGF CC ATOM ---AIEDLRETNVLADRDEWGYLLQIFTRSPYPRGTLFYEYIQRNGARGF CC *********************************************** CC SEQRES GSSNIKALAEAVEREREVAGR CC ATOM GSSNIKALAEAVERERE---- CC ***************** SQ SEQUENCE 371 AA; MW; CN; MLPPFPFLHW RAAMPPSDIA YAELYVADDR EASGFLVDSL GFVPLAVAGP ATGTHDRRST VLRSGEVTLV VTQALAPDTP VARYVERHGD SIADLAFGCD DVRSCFDRAV LAGAEALQAP TPSHRAGQDA WFATVSGFGD IRHTLVPAAD GDGAGLLPPD RDWALLPAAT GRTGPRPLLD HVAVCLESGT LRSTAEFYEA AFDMPYYSSE YIEVGEQAMD MIFVRNAGGG ITFTLIEPDD TRVPGQIDQF LSAHDGPGVQ HLAFLVDDIV GSVRSLGDRG VAFLRTPGAY YDLLTERVGA MADAIEDLRE TNVLADRDEW GYLLQIFTRS PYPRGTLFYE YIQRNGARGF GSSNIKALAE AVEREREVAG R // ID 3ZGJB STANDARD; PRT; 371 AA. DT CONVERTED FROM PDB (SEQRES) 3ZGJ DE 4-HYDROXYPHENYLPYRUVIC ACID DIOXYGENASE OS STREPTOMYCES COELICOLOR CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.950 CC R-Factor 0.177 FT #SUB 66 66 GLU B 192 192 ARG A Protein S 6 FT #SUB 172 172 ARG B 211 211 TYR A Protein S 13 FT #SUB 172 172 ARG B 220 220 ASP A Protein S 8 FT #SUB 172 172 ARG B 239 239 ASP A Protein S 3 FT #SUB 173 173 THR B 192 192 ARG A Protein A 5 FT #SUB 174 174 GLY B 192 192 ARG A Protein B 2 FT #SUB 175 175 PRO B 192 192 ARG A Protein B 2 FT #SUB 175 175 PRO B 209 209 SER A Protein S 2 FT #SUB 175 175 PRO B 222 222 ILE A Protein S 2 FT #SUB 176 176 ARG B 206 206 TYR A Protein S 16 FT #SUB 267 267 ASP B 206 206 TYR A Protein S 5 FT #SUB 267 267 ASP B 208 208 SER A Protein S 3 FT #SUB 267 267 ASP B 209 209 SER A Protein S 6 FT #SUB 345 345 ASN B 320 320 TRP A Protein S 2 FT #HET 181 181 HIS B 3 1368 CO B S 3 FT #HET 181 181 HIS B 4 1369 RMN B S 4 FT #HET 183 183 ALA B 4 1369 RMN B S 1 FT #HET 234 234 THR B 4 1369 RMN B S 6 FT #HET 236 236 ILE B 4 1369 RMN B S 2 FT #HET 261 261 HIS B 3 1368 CO B S 3 FT #HET 261 261 HIS B 4 1369 RMN B S 5 FT #HET 325 325 GLN B 4 1369 RMN B S 3 FT #HET 327 327 PHE B 4 1369 RMN B S 1 FT #HET 340 340 GLU B 3 1368 CO B S 3 FT #HET 340 340 GLU B 4 1369 RMN B S 3 FT #HET 350 350 PHE B 4 1369 RMN B S 1 FT #HET 355 355 ILE B 4 1369 RMN B A 4 FT #HET 358 358 LEU B 4 1369 RMN B S 3 FT DISORDER 1 13 FT DISORDER 149 155 FT DISORDER 297 300 FT DISORDER 368 371 CC SEQUENCE 343 AA (ATOM); CC MPPSDIAYAE LYVADDREAS GFLVDSLGFV PLAVAGPATG THDRRSTVLR SGEVTLVVTQ CC ALAPDTPVAR YVERHGDSIA DLAFGCDDVR SCFDRAVLAG AEALQAPTPS HRAGQDAWFA CC TVSGFGDIRH TLVPALLPPD RDWALLPAAT GRTGPRPLLD HVAVCLESGT LRSTAEFYEA CC AFDMPYYSSE YIEVGEQAMD MIFVRNAGGG ITFTLIEPDD TRVPGQIDQF LSAHDGPGVQ CC HLAFLVDDIV GSVRSLGDRG VAFLRTPGAY YDLLTEMADA IEDLRETNVL ADRDEWGYLL CC QIFTRSPYPR GTLFYEYIQR NGARGFGSSN IKALAEAVER ERE CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MLPPFPFLHWRAAMPPSDIAYAELYVADDREASGFLVDSLGFVPLAVAGP CC ATOM -------------MPPSDIAYAELYVADDREASGFLVDSLGFVPLAVAGP CC ************************************* CC SEQRES ATGTHDRRSTVLRSGEVTLVVTQALAPDTPVARYVERHGDSIADLAFGCD CC ATOM ATGTHDRRSTVLRSGEVTLVVTQALAPDTPVARYVERHGDSIADLAFGCD CC ************************************************** CC SEQRES DVRSCFDRAVLAGAEALQAPTPSHRAGQDAWFATVSGFGDIRHTLVPAAD CC ATOM DVRSCFDRAVLAGAEALQAPTPSHRAGQDAWFATVSGFGDIRHTLVPA-- CC ************************************************ CC SEQRES GDGAGLLPPDRDWALLPAATGRTGPRPLLDHVAVCLESGTLRSTAEFYEA CC ATOM -----LLPPDRDWALLPAATGRTGPRPLLDHVAVCLESGTLRSTAEFYEA CC ********************************************* CC SEQRES AFDMPYYSSEYIEVGEQAMDMIFVRNAGGGITFTLIEPDDTRVPGQIDQF CC ATOM AFDMPYYSSEYIEVGEQAMDMIFVRNAGGGITFTLIEPDDTRVPGQIDQF CC ************************************************** CC SEQRES LSAHDGPGVQHLAFLVDDIVGSVRSLGDRGVAFLRTPGAYYDLLTERVGA CC ATOM LSAHDGPGVQHLAFLVDDIVGSVRSLGDRGVAFLRTPGAYYDLLTE---- CC ********************************************** CC SEQRES MADAIEDLRETNVLADRDEWGYLLQIFTRSPYPRGTLFYEYIQRNGARGF CC ATOM MADAIEDLRETNVLADRDEWGYLLQIFTRSPYPRGTLFYEYIQRNGARGF CC ************************************************** CC SEQRES GSSNIKALAEAVEREREVAGR CC ATOM GSSNIKALAEAVERERE---- CC ***************** SQ SEQUENCE 371 AA; MW; CN; MLPPFPFLHW RAAMPPSDIA YAELYVADDR EASGFLVDSL GFVPLAVAGP ATGTHDRRST VLRSGEVTLV VTQALAPDTP VARYVERHGD SIADLAFGCD DVRSCFDRAV LAGAEALQAP TPSHRAGQDA WFATVSGFGD IRHTLVPAAD GDGAGLLPPD RDWALLPAAT GRTGPRPLLD HVAVCLESGT LRSTAEFYEA AFDMPYYSSE YIEVGEQAMD MIFVRNAGGG ITFTLIEPDD TRVPGQIDQF LSAHDGPGVQ HLAFLVDDIV GSVRSLGDRG VAFLRTPGAY YDLLTERVGA MADAIEDLRE TNVLADRDEW GYLLQIFTRS PYPRGTLFYE YIQRNGARGF GSSNIKALAE AVEREREVAG R //