ID 5YWGA STANDARD; PRT; 445 AA. DT CONVERTED FROM PDB (SEQRES) 5YWG DE 4-hydroxyphenylpyruvate dioxygenase OS Arabidopsis thaliana CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.600 CC R-Factor 0.264 FT #SUB 55 55 GLY A 132 132 PHE B Protein B 5 FT #SUB 56 56 ASP A 132 132 PHE B Protein S 1 FT #SUB 56 56 ASP A 386 386 GLY B Protein S 3 FT #SUB 56 56 ASP A 387 387 ASP B Protein A 11 FT #SUB 57 57 ALA A 387 387 ASP B Protein A 7 FT #SUB 58 58 THR A 387 387 ASP B Protein A 8 FT #SUB 59 59 ASN A 59 59 ASN B Protein B 1 FT #SUB 59 59 ASN A 60 60 VAL B Protein S 2 FT #SUB 59 59 ASN A 63 63 ARG B Protein S 8 FT #SUB 59 59 ASN A 64 64 PHE B Protein S 1 FT #SUB 59 59 ASN A 137 137 LEU B Protein S 1 FT #SUB 59 59 ASN A 385 385 LEU B Protein S 1 FT #SUB 62 62 ARG A 63 63 ARG B Protein S 2 FT #SUB 62 62 ARG A 327 327 SER B Protein S 2 FT #SUB 62 62 ARG A 330 330 GLY B Protein A 3 FT #SUB 62 62 ARG A 331 331 GLY B Protein S 4 FT #SUB 62 62 ARG A 333 333 ASP B Protein S 8 FT #SUB 63 63 ARG A 59 59 ASN B Protein S 10 FT #SUB 63 63 ARG A 62 62 ARG B Protein S 6 FT #SUB 64 64 PHE A 59 59 ASN B Protein S 2 FT #SUB 66 66 TRP A 66 66 TRP B Protein S 15 FT #SUB 66 66 TRP A 329 329 ILE B Protein S 7 FT #SUB 78 78 LEU A 389 389 PRO B Protein S 2 FT #SUB 86 86 ALA A 387 387 ASP B Protein S 3 FT #SUB 88 88 TYR A 387 387 ASP B Protein S 1 FT #SUB 101 101 ALA A 387 387 ASP B Protein S 1 FT #SUB 103 103 TYR A 387 387 ASP B Protein S 2 FT #SUB 103 103 TYR A 388 388 ARG B Protein B 1 FT #SUB 104 104 SER A 300 300 HIS B Protein S 1 FT #SUB 104 104 SER A 302 302 GLU B Protein A 3 FT #SUB 104 104 SER A 388 388 ARG B Protein A 7 FT #SUB 106 106 SER A 302 302 GLU B Protein S 3 FT #SUB 107 107 LEU A 300 300 HIS B Protein S 1 FT #SUB 129 129 ARG A 129 129 ARG B Protein A 11 FT #SUB 129 129 ARG A 133 133 SER B Protein S 5 FT #SUB 132 132 PHE A 55 55 GLY B Protein S 4 FT #SUB 133 133 SER A 129 129 ARG B Protein S 4 FT #SUB 137 137 LEU A 56 56 ASP B Protein S 2 FT #SUB 137 137 LEU A 59 59 ASN B Protein S 1 FT #SUB 212 212 ALA A 328 328 SER B Protein S 1 FT #SUB 217 217 LEU A 328 328 SER B Protein S 1 FT #SUB 217 217 LEU A 329 329 ILE B Protein S 1 FT #SUB 300 300 HIS A 78 78 LEU B Protein S 2 FT #SUB 300 300 HIS A 107 107 LEU B Protein S 4 FT #SUB 302 302 GLU A 104 104 SER B Protein S 4 FT #SUB 302 302 GLU A 106 106 SER B Protein S 2 FT #SUB 327 327 SER A 62 62 ARG B Protein B 2 FT #SUB 329 329 ILE A 66 66 TRP B Protein A 7 FT #SUB 329 329 ILE A 217 217 LEU B Protein S 2 FT #SUB 330 330 GLY A 62 62 ARG B Protein B 2 FT #SUB 331 331 GLY A 62 62 ARG B Protein B 4 FT #SUB 333 333 ASP A 62 62 ARG B Protein S 6 FT #SUB 385 385 LEU A 58 58 THR B Protein B 1 FT #SUB 385 385 LEU A 59 59 ASN B Protein B 1 FT #SUB 386 386 GLY A 56 56 ASP B Protein B 4 FT #SUB 386 386 GLY A 58 58 THR B Protein B 2 FT #SUB 386 386 GLY A 59 59 ASN B Protein B 1 FT #SUB 387 387 ASP A 55 55 GLY B Protein S 1 FT #SUB 387 387 ASP A 56 56 ASP B Protein A 13 FT #SUB 387 387 ASP A 57 57 ALA B Protein S 3 FT #SUB 387 387 ASP A 58 58 THR B Protein A 4 FT #SUB 387 387 ASP A 101 101 ALA B Protein S 1 FT #SUB 387 387 ASP A 103 103 TYR B Protein S 1 FT #SUB 388 388 ARG A 104 104 SER B Protein S 1 FT #HET 226 226 HIS A 1 501 CO A S 3 FT #HET 226 226 HIS A 2 502 92L A S 2 FT #HET 228 228 VAL A 2 502 92L A S 1 FT #HET 267 267 SER A 2 502 92L A S 3 FT #HET 280 280 PRO A 2 502 92L A S 2 FT #HET 282 282 ASN A 2 502 92L A S 1 FT #HET 308 308 HIS A 1 501 CO A S 3 FT #HET 308 308 HIS A 2 502 92L A S 15 FT #HET 368 368 LEU A 2 502 92L A S 1 FT #HET 379 379 GLN A 2 502 92L A S 2 FT #HET 381 381 PHE A 2 502 92L A S 16 FT #HET 392 392 PHE A 2 502 92L A S 2 FT #HET 394 394 GLU A 1 501 CO A S 3 FT #HET 394 394 GLU A 2 502 92L A S 2 FT #HET 419 419 PHE A 2 502 92L A A 5 FT #HET 420 420 GLY A 2 502 92L A B 5 FT #HET 421 421 LYS A 2 502 92L A S 3 FT #HET 423 423 ASN A 2 502 92L A S 1 FT #HET 424 424 PHE A 2 502 92L A S 13 FT DISORDER 1 28 FT DISORDER 108 116 FT DISORDER 195 201 FT DISORDER 213 214 FT DISORDER 254 262 FT DISORDER 286 290 FT DISORDER 404 411 FT DISORDER 435 445 CC SEQUENCE 366 AA (ATOM); CC FSKFVRKNPK SDKFKVKRFH HIEFWCGDAT NVARRFSWGL GMRFSAKSDL STGNMVHASY CC LLTSGDLRFL FTAPYSPSLT ASIPSFDHGS CRSFFSSHGL GVRAVAIEVE DAESAFSISV CC ANGAIPSSPP IVLNEAVTIA EVKLYGDVVL RYVSYKAFLP GFERVEDAFP LDYGIRRLDH CC AVGNVPELGP ALTYVAGFTG FHQFAEFSGL NSAVLASNDE MVLLPINEPV KSQIQTYLEH CC NEGAGLQHLA LMSEDIFRTL REMRKRSSIG GFDFMPSPPP TYYQNLKKRV GDVLSDDQIK CC ECEELGILVD RDDQGTLLQI FTKPLGDRPT IFIEIIQRVG CMMQSGGCGG FGKGNFSELF CC KSIEEY CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MGHQNAAVSENQNHDDGAASSPGFKLVGFSKFVRKNPKSDKFKVKRFHHI CC ATOM ----------------------------FSKFVRKNPKSDKFKVKRFHHI CC ********************** CC SEQRES EFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTSGDLRFLFT CC ATOM EFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTSGDLRFLFT CC ************************************************** CC SEQRES APYSPSLSAGEIKPTTTASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAE CC ATOM APYSPSL---------TASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAE CC ******* ********************************** CC SEQRES SAFSISVANGAIPSSPPIVLNEAVTIAEVKLYGDVVLRYVSYKAEDTEKS CC ATOM SAFSISVANGAIPSSPPIVLNEAVTIAEVKLYGDVVLRYVSYKA------ CC ******************************************** CC SEQRES EFLPGFERVEDASSFPLDYGIRRLDHAVGNVPELGPALTYVAGFTGFHQF CC ATOM -FLPGFERVEDA--FPLDYGIRRLDHAVGNVPELGPALTYVAGFTGFHQF CC *********** ************************************ CC SEQRES AEFTADDVGTAESGLNSAVLASNDEMVLLPINEPVHGTKRKSQIQTYLEH CC ATOM AEF---------SGLNSAVLASNDEMVLLPINEPV-----KSQIQTYLEH CC *** *********************** ********** CC SEQRES NEGAGLQHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRV CC ATOM NEGAGLQHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRV CC ************************************************** CC SEQRES GDVLSDDQIKECEELGILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVG CC ATOM GDVLSDDQIKECEELGILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVG CC ************************************************** CC SEQRES CMMKDEEGKAYQSGGCGGFGKGNFSELFKSIEEYEKTLEAKQLVG CC ATOM CMM--------QSGGCGGFGKGNFSELFKSIEEY----------- CC *** *********************** SQ SEQUENCE 445 AA; MW; CN; MGHQNAAVSE NQNHDDGAAS SPGFKLVGFS KFVRKNPKSD KFKVKRFHHI EFWCGDATNV ARRFSWGLGM RFSAKSDLST GNMVHASYLL TSGDLRFLFT APYSPSLSAG EIKPTTTASI PSFDHGSCRS FFSSHGLGVR AVAIEVEDAE SAFSISVANG AIPSSPPIVL NEAVTIAEVK LYGDVVLRYV SYKAEDTEKS EFLPGFERVE DASSFPLDYG IRRLDHAVGN VPELGPALTY VAGFTGFHQF AEFTADDVGT AESGLNSAVL ASNDEMVLLP INEPVHGTKR KSQIQTYLEH NEGAGLQHLA LMSEDIFRTL REMRKRSSIG GFDFMPSPPP TYYQNLKKRV GDVLSDDQIK ECEELGILVD RDDQGTLLQI FTKPLGDRPT IFIEIIQRVG CMMKDEEGKA YQSGGCGGFG KGNFSELFKS IEEYEKTLEA KQLVG // ID 5YWGB STANDARD; PRT; 445 AA. DT CONVERTED FROM PDB (SEQRES) 5YWG DE 4-hydroxyphenylpyruvate dioxygenase OS Arabidopsis thaliana CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.600 CC R-Factor 0.264 FT #SUB 55 55 GLY B 132 132 PHE A Protein B 4 FT #SUB 55 55 GLY B 387 387 ASP A Protein B 1 FT #SUB 56 56 ASP B 137 137 LEU A Protein S 2 FT #SUB 56 56 ASP B 386 386 GLY A Protein S 4 FT #SUB 56 56 ASP B 387 387 ASP A Protein A 13 FT #SUB 57 57 ALA B 387 387 ASP A Protein B 3 FT #SUB 58 58 THR B 385 385 LEU A Protein S 1 FT #SUB 58 58 THR B 386 386 GLY A Protein S 2 FT #SUB 58 58 THR B 387 387 ASP A Protein A 4 FT #SUB 59 59 ASN B 59 59 ASN A Protein B 1 FT #SUB 59 59 ASN B 63 63 ARG A Protein A 10 FT #SUB 59 59 ASN B 64 64 PHE A Protein S 2 FT #SUB 59 59 ASN B 137 137 LEU A Protein S 1 FT #SUB 59 59 ASN B 385 385 LEU A Protein S 1 FT #SUB 59 59 ASN B 386 386 GLY A Protein S 1 FT #SUB 60 60 VAL B 59 59 ASN A Protein S 2 FT #SUB 62 62 ARG B 63 63 ARG A Protein S 6 FT #SUB 62 62 ARG B 327 327 SER A Protein S 2 FT #SUB 62 62 ARG B 330 330 GLY A Protein A 2 FT #SUB 62 62 ARG B 331 331 GLY A Protein S 4 FT #SUB 62 62 ARG B 333 333 ASP A Protein S 6 FT #SUB 63 63 ARG B 59 59 ASN A Protein S 8 FT #SUB 63 63 ARG B 62 62 ARG A Protein S 2 FT #SUB 64 64 PHE B 59 59 ASN A Protein S 1 FT #SUB 66 66 TRP B 66 66 TRP A Protein S 15 FT #SUB 66 66 TRP B 329 329 ILE A Protein S 7 FT #SUB 78 78 LEU B 300 300 HIS A Protein S 2 FT #SUB 101 101 ALA B 387 387 ASP A Protein S 1 FT #SUB 103 103 TYR B 387 387 ASP A Protein S 1 FT #SUB 104 104 SER B 302 302 GLU A Protein S 4 FT #SUB 104 104 SER B 388 388 ARG A Protein S 1 FT #SUB 106 106 SER B 302 302 GLU A Protein S 2 FT #SUB 107 107 LEU B 300 300 HIS A Protein S 4 FT #SUB 129 129 ARG B 129 129 ARG A Protein S 11 FT #SUB 129 129 ARG B 133 133 SER A Protein S 4 FT #SUB 132 132 PHE B 55 55 GLY A Protein S 5 FT #SUB 132 132 PHE B 56 56 ASP A Protein S 1 FT #SUB 133 133 SER B 129 129 ARG A Protein A 5 FT #SUB 137 137 LEU B 59 59 ASN A Protein S 1 FT #SUB 217 217 LEU B 329 329 ILE A Protein S 2 FT #SUB 300 300 HIS B 104 104 SER A Protein B 1 FT #SUB 300 300 HIS B 107 107 LEU A Protein S 1 FT #SUB 302 302 GLU B 104 104 SER A Protein S 3 FT #SUB 302 302 GLU B 106 106 SER A Protein S 3 FT #SUB 327 327 SER B 62 62 ARG A Protein B 2 FT #SUB 328 328 SER B 212 212 ALA A Protein S 1 FT #SUB 328 328 SER B 217 217 LEU A Protein B 1 FT #SUB 329 329 ILE B 66 66 TRP A Protein A 7 FT #SUB 329 329 ILE B 217 217 LEU A Protein S 1 FT #SUB 330 330 GLY B 62 62 ARG A Protein B 3 FT #SUB 331 331 GLY B 62 62 ARG A Protein B 4 FT #SUB 333 333 ASP B 62 62 ARG A Protein S 8 FT #SUB 385 385 LEU B 59 59 ASN A Protein B 1 FT #SUB 386 386 GLY B 56 56 ASP A Protein B 3 FT #SUB 387 387 ASP B 56 56 ASP A Protein A 11 FT #SUB 387 387 ASP B 57 57 ALA A Protein S 7 FT #SUB 387 387 ASP B 58 58 THR A Protein A 8 FT #SUB 387 387 ASP B 86 86 ALA A Protein S 3 FT #SUB 387 387 ASP B 88 88 TYR A Protein S 1 FT #SUB 387 387 ASP B 101 101 ALA A Protein S 1 FT #SUB 387 387 ASP B 103 103 TYR A Protein A 2 FT #SUB 388 388 ARG B 103 103 TYR A Protein S 1 FT #SUB 388 388 ARG B 104 104 SER A Protein S 7 FT #SUB 389 389 PRO B 78 78 LEU A Protein S 2 FT #HET 226 226 HIS B 3 501 CO B S 3 FT #HET 226 226 HIS B 4 502 92L B S 2 FT #HET 228 228 VAL B 4 502 92L B S 1 FT #HET 267 267 SER B 4 502 92L B S 2 FT #HET 280 280 PRO B 4 502 92L B S 1 FT #HET 282 282 ASN B 4 502 92L B S 4 FT #HET 308 308 HIS B 3 501 CO B S 3 FT #HET 308 308 HIS B 4 502 92L B S 8 FT #HET 368 368 LEU B 4 502 92L B S 1 FT #HET 379 379 GLN B 4 502 92L B S 2 FT #HET 381 381 PHE B 4 502 92L B S 18 FT #HET 392 392 PHE B 4 502 92L B S 2 FT #HET 394 394 GLU B 3 501 CO B S 3 FT #HET 394 394 GLU B 4 502 92L B S 4 FT #HET 419 419 PHE B 4 502 92L B A 9 FT #HET 420 420 GLY B 4 502 92L B B 5 FT #HET 423 423 ASN B 4 502 92L B A 3 FT #HET 424 424 PHE B 4 502 92L B A 8 FT DISORDER 1 28 FT DISORDER 108 116 FT DISORDER 195 201 FT DISORDER 210 210 FT DISORDER 254 262 FT DISORDER 288 290 FT DISORDER 404 411 FT DISORDER 435 445 CC SEQUENCE 369 AA (ATOM); CC FSKFVRKNPK SDKFKVKRFH HIEFWCGDAT NVARRFSWGL GMRFSAKSDL STGNMVHASY CC LLTSGDLRFL FTAPYSPSLT ASIPSFDHGS CRSFFSSHGL GVRAVAIEVE DAESAFSISV CC ANGAIPSSPP IVLNEAVTIA EVKLYGDVVL RYVSYKAFLP GFERVDASSF PLDYGIRRLD CC HAVGNVPELG PALTYVAGFT GFHQFAEFSG LNSAVLASND EMVLLPINEP VHGKSQIQTY CC LEHNEGAGLQ HLALMSEDIF RTLREMRKRS SIGGFDFMPS PPPTYYQNLK KRVGDVLSDD CC QIKECEELGI LVDRDDQGTL LQIFTKPLGD RPTIFIEIIQ RVGCMMQSGG CGGFGKGNFS CC ELFKSIEEY CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MGHQNAAVSENQNHDDGAASSPGFKLVGFSKFVRKNPKSDKFKVKRFHHI CC ATOM ----------------------------FSKFVRKNPKSDKFKVKRFHHI CC ********************** CC SEQRES EFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTSGDLRFLFT CC ATOM EFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTSGDLRFLFT CC ************************************************** CC SEQRES APYSPSLSAGEIKPTTTASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAE CC ATOM APYSPSL---------TASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAE CC ******* ********************************** CC SEQRES SAFSISVANGAIPSSPPIVLNEAVTIAEVKLYGDVVLRYVSYKAEDTEKS CC ATOM SAFSISVANGAIPSSPPIVLNEAVTIAEVKLYGDVVLRYVSYKA------ CC ******************************************** CC SEQRES EFLPGFERVEDASSFPLDYGIRRLDHAVGNVPELGPALTYVAGFTGFHQF CC ATOM -FLPGFERV-DASSFPLDYGIRRLDHAVGNVPELGPALTYVAGFTGFHQF CC ******** **************************************** CC SEQRES AEFTADDVGTAESGLNSAVLASNDEMVLLPINEPVHGTKRKSQIQTYLEH CC ATOM AEF---------SGLNSAVLASNDEMVLLPINEPVHG---KSQIQTYLEH CC *** ************************* ********** CC SEQRES NEGAGLQHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRV CC ATOM NEGAGLQHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRV CC ************************************************** CC SEQRES GDVLSDDQIKECEELGILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVG CC ATOM GDVLSDDQIKECEELGILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVG CC ************************************************** CC SEQRES CMMKDEEGKAYQSGGCGGFGKGNFSELFKSIEEYEKTLEAKQLVG CC ATOM CMM--------QSGGCGGFGKGNFSELFKSIEEY----------- CC *** *********************** SQ SEQUENCE 445 AA; MW; CN; MGHQNAAVSE NQNHDDGAAS SPGFKLVGFS KFVRKNPKSD KFKVKRFHHI EFWCGDATNV ARRFSWGLGM RFSAKSDLST GNMVHASYLL TSGDLRFLFT APYSPSLSAG EIKPTTTASI PSFDHGSCRS FFSSHGLGVR AVAIEVEDAE SAFSISVANG AIPSSPPIVL NEAVTIAEVK LYGDVVLRYV SYKAEDTEKS EFLPGFERVE DASSFPLDYG IRRLDHAVGN VPELGPALTY VAGFTGFHQF AEFTADDVGT AESGLNSAVL ASNDEMVLLP INEPVHGTKR KSQIQTYLEH NEGAGLQHLA LMSEDIFRTL REMRKRSSIG GFDFMPSPPP TYYQNLKKRV GDVLSDDQIK ECEELGILVD RDDQGTLLQI FTKPLGDRPT IFIEIIQRVG CMMKDEEGKA YQSGGCGGFG KGNFSELFKS IEEYEKTLEA KQLVG //