ID 5YWHA STANDARD; PRT; 445 AA.
DT CONVERTED FROM PDB (SEQRES) 5YWH
DE 4-hydroxyphenylpyruvate dioxygenase
OS Arabidopsis thaliana
CC EXPDTA X-RAY DIFFRACTION
CC RESOLU 2.720
CC R-Factor 0.243
FT #SUB 55 55 GLY A 132 132 PHE B Protein B 6
FT #SUB 56 56 ASP A 132 132 PHE B Protein B 1
FT #SUB 56 56 ASP A 137 137 LEU B Protein S 1
FT #SUB 56 56 ASP A 386 386 GLY B Protein S 4
FT #SUB 56 56 ASP A 387 387 ASP B Protein A 9
FT #SUB 57 57 ALA A 387 387 ASP B Protein A 6
FT #SUB 58 58 THR A 386 386 GLY B Protein S 2
FT #SUB 58 58 THR A 387 387 ASP B Protein A 6
FT #SUB 59 59 ASN A 59 59 ASN B Protein B 1
FT #SUB 59 59 ASN A 63 63 ARG B Protein A 8
FT #SUB 59 59 ASN A 64 64 PHE B Protein S 2
FT #SUB 59 59 ASN A 137 137 LEU B Protein S 1
FT #SUB 59 59 ASN A 385 385 LEU B Protein S 2
FT #SUB 60 60 VAL A 59 59 ASN B Protein S 1
FT #SUB 62 62 ARG A 63 63 ARG B Protein S 6
FT #SUB 62 62 ARG A 327 327 SER B Protein S 1
FT #SUB 62 62 ARG A 330 330 GLY B Protein A 3
FT #SUB 62 62 ARG A 331 331 GLY B Protein S 2
FT #SUB 62 62 ARG A 333 333 ASP B Protein S 6
FT #SUB 63 63 ARG A 59 59 ASN B Protein S 8
FT #SUB 63 63 ARG A 62 62 ARG B Protein S 6
FT #SUB 64 64 PHE A 59 59 ASN B Protein S 2
FT #SUB 66 66 TRP A 66 66 TRP B Protein S 10
FT #SUB 66 66 TRP A 329 329 ILE B Protein S 7
FT #SUB 78 78 LEU A 300 300 HIS B Protein S 2
FT #SUB 78 78 LEU A 389 389 PRO B Protein S 1
FT #SUB 88 88 TYR A 387 387 ASP B Protein S 2
FT #SUB 101 101 ALA A 387 387 ASP B Protein S 1
FT #SUB 103 103 TYR A 387 387 ASP B Protein S 1
FT #SUB 104 104 SER A 302 302 GLU B Protein S 3
FT #SUB 104 104 SER A 388 388 ARG B Protein A 3
FT #SUB 106 106 SER A 302 302 GLU B Protein S 4
FT #SUB 107 107 LEU A 300 300 HIS B Protein S 1
FT #SUB 129 129 ARG A 132 132 PHE B Protein S 1
FT #SUB 129 129 ARG A 133 133 SER B Protein S 8
FT #SUB 132 132 PHE A 55 55 GLY B Protein S 6
FT #SUB 132 132 PHE A 56 56 ASP B Protein S 1
FT #SUB 133 133 SER A 129 129 ARG B Protein A 11
FT #SUB 137 137 LEU A 56 56 ASP B Protein S 1
FT #SUB 137 137 LEU A 59 59 ASN B Protein S 1
FT #SUB 217 217 LEU A 329 329 ILE B Protein S 3
FT #SUB 302 302 GLU A 104 104 SER B Protein S 5
FT #SUB 302 302 GLU A 106 106 SER B Protein S 1
FT #SUB 327 327 SER A 62 62 ARG B Protein B 1
FT #SUB 329 329 ILE A 66 66 TRP B Protein A 7
FT #SUB 329 329 ILE A 217 217 LEU B Protein S 1
FT #SUB 330 330 GLY A 62 62 ARG B Protein B 2
FT #SUB 331 331 GLY A 62 62 ARG B Protein B 4
FT #SUB 333 333 ASP A 62 62 ARG B Protein S 6
FT #SUB 385 385 LEU A 59 59 ASN B Protein B 3
FT #SUB 386 386 GLY A 56 56 ASP B Protein B 4
FT #SUB 386 386 GLY A 58 58 THR B Protein B 3
FT #SUB 386 386 GLY A 59 59 ASN B Protein B 1
FT #SUB 387 387 ASP A 55 55 GLY B Protein S 1
FT #SUB 387 387 ASP A 56 56 ASP B Protein A 10
FT #SUB 387 387 ASP A 57 57 ALA B Protein S 5
FT #SUB 387 387 ASP A 58 58 THR B Protein A 8
FT #SUB 387 387 ASP A 88 88 TYR B Protein S 2
FT #SUB 387 387 ASP A 101 101 ALA B Protein S 1
FT #SUB 387 387 ASP A 103 103 TYR B Protein B 1
FT #SUB 388 388 ARG A 104 104 SER B Protein S 1
FT #SUB 388 388 ARG A 129 129 ARG B Protein S 1
FT #SUB 389 389 PRO A 78 78 LEU B Protein S 1
FT #HET 226 226 HIS A 1 501 FE A S 3
FT #HET 226 226 HIS A 2 502 92U A S 2
FT #HET 269 269 VAL A 2 502 92U A S 1
FT #HET 280 280 PRO A 2 502 92U A S 1
FT #HET 308 308 HIS A 1 501 FE A S 3
FT #HET 308 308 HIS A 2 502 92U A S 2
FT #HET 335 335 MET A 2 502 92U A S 2
FT #HET 368 368 LEU A 2 502 92U A S 1
FT #HET 381 381 PHE A 2 502 92U A S 14
FT #HET 392 392 PHE A 2 502 92U A S 2
FT #HET 394 394 GLU A 1 501 FE A S 3
FT #HET 394 394 GLU A 2 502 92U A S 3
FT #HET 419 419 PHE A 2 502 92U A A 11
FT #HET 420 420 GLY A 2 502 92U A B 3
FT #HET 421 421 LYS A 2 502 92U A S 4
FT #HET 423 423 ASN A 2 502 92U A A 4
FT #HET 424 424 PHE A 2 502 92U A S 10
FT DISORDER 1 28
FT DISORDER 109 115
FT DISORDER 195 201
FT DISORDER 254 262
FT DISORDER 288 290
FT DISORDER 404 411
FT DISORDER 435 445
CC SEQUENCE 372 AA (ATOM);
CC FSKFVRKNPK SDKFKVKRFH HIEFWCGDAT NVARRFSWGL GMRFSAKSDL STGNMVHASY
CC LLTSGDLRFL FTAPYSPSLS TTASIPSFDH GSCRSFFSSH GLGVRAVAIE VEDAESAFSI
CC SVANGAIPSS PPIVLNEAVT IAEVKLYGDV VLRYVSYKAF LPGFERVEDA SSFPLDYGIR
CC RLDHAVGNVP ELGPALTYVA GFTGFHQFAE FSGLNSAVLA SNDEMVLLPI NEPVHGKSQI
CC QTYLEHNEGA GLQHLALMSE DIFRTLREMR KRSSIGGFDF MPSPPPTYYQ NLKKRVGDVL
CC SDDQIKECEE LGILVDRDDQ GTLLQIFTKP LGDRPTIFIE IIQRVGCMMQ SGGCGGFGKG
CC NFSELFKSIE EY
CC !----
CC ALIGNMENT OF SEQRES AND ATOMRES
CC SEQRES MGHQNAAVSENQNHDDGAASSPGFKLVGFSKFVRKNPKSDKFKVKRFHHI
CC ATOM   ----------------------------FSKFVRKNPKSDKFKVKRFHHI
CC                                     **********************
CC SEQRES EFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTSGDLRFLFT
CC ATOM   EFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTSGDLRFLFT
CC        **************************************************
CC SEQRES APYSPSLSAGEIKPTTTASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAE
CC ATOM   APYSPSLS-------TTASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAE
CC        ********       ***********************************
CC SEQRES SAFSISVANGAIPSSPPIVLNEAVTIAEVKLYGDVVLRYVSYKAEDTEKS
CC ATOM   SAFSISVANGAIPSSPPIVLNEAVTIAEVKLYGDVVLRYVSYKA------
CC        ********************************************
CC SEQRES EFLPGFERVEDASSFPLDYGIRRLDHAVGNVPELGPALTYVAGFTGFHQF
CC ATOM   -FLPGFERVEDASSFPLDYGIRRLDHAVGNVPELGPALTYVAGFTGFHQF
CC         *************************************************
CC SEQRES AEFTADDVGTAESGLNSAVLASNDEMVLLPINEPVHGTKRKSQIQTYLEH
CC ATOM   AEF---------SGLNSAVLASNDEMVLLPINEPVHG---KSQIQTYLEH
CC        ***         *************************   **********
CC SEQRES NEGAGLQHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRV
CC ATOM   NEGAGLQHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRV
CC        **************************************************
CC SEQRES GDVLSDDQIKECEELGILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVG
CC ATOM   GDVLSDDQIKECEELGILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVG
CC        **************************************************
CC SEQRES CMMKDEEGKAYQSGGCGGFGKGNFSELFKSIEEYEKTLEAKQLVG
CC ATOM   CMM--------QSGGCGGFGKGNFSELFKSIEEY-----------
CC        ***        ***********************
SQ SEQUENCE 445 AA; MW; CN;
     MGHQNAAVSE NQNHDDGAAS SPGFKLVGFS KFVRKNPKSD KFKVKRFHHI EFWCGDATNV
     ARRFSWGLGM RFSAKSDLST GNMVHASYLL TSGDLRFLFT APYSPSLSAG EIKPTTTASI
     PSFDHGSCRS FFSSHGLGVR AVAIEVEDAE SAFSISVANG AIPSSPPIVL NEAVTIAEVK
     LYGDVVLRYV SYKAEDTEKS EFLPGFERVE DASSFPLDYG IRRLDHAVGN VPELGPALTY
     VAGFTGFHQF AEFTADDVGT AESGLNSAVL ASNDEMVLLP INEPVHGTKR KSQIQTYLEH
     NEGAGLQHLA LMSEDIFRTL REMRKRSSIG GFDFMPSPPP TYYQNLKKRV GDVLSDDQIK
     ECEELGILVD RDDQGTLLQI FTKPLGDRPT IFIEIIQRVG CMMKDEEGKA YQSGGCGGFG
     KGNFSELFKS IEEYEKTLEA KQLVG
//
ID 5YWHB STANDARD; PRT; 445 AA.
DT CONVERTED FROM PDB (SEQRES) 5YWH
DE 4-hydroxyphenylpyruvate dioxygenase
OS Arabidopsis thaliana
CC EXPDTA X-RAY DIFFRACTION
CC RESOLU 2.720
CC R-Factor 0.243
FT #SUB 55 55 GLY B 132 132 PHE A Protein B 6
FT #SUB 55 55 GLY B 387 387 ASP A Protein B 1
FT #SUB 56 56 ASP B 132 132 PHE A Protein B 1
FT #SUB 56 56 ASP B 137 137 LEU A Protein S 1
FT #SUB 56 56 ASP B 386 386 GLY A Protein S 4
FT #SUB 56 56 ASP B 387 387 ASP A Protein A 10
FT #SUB 57 57 ALA B 387 387 ASP A Protein B 5
FT #SUB 58 58 THR B 386 386 GLY A Protein S 3
FT #SUB 58 58 THR B 387 387 ASP A Protein A 8
FT #SUB 59 59 ASN B 59 59 ASN A Protein B 1
FT #SUB 59 59 ASN B 60 60 VAL A Protein S 1
FT #SUB 59 59 ASN B 63 63 ARG A Protein S 8
FT #SUB 59 59 ASN B 64 64 PHE A Protein S 2
FT #SUB 59 59 ASN B 137 137 LEU A Protein S 1
FT #SUB 59 59 ASN B 385 385 LEU A Protein S 3
FT #SUB 59 59 ASN B 386 386 GLY A Protein S 1
FT #SUB 62 62 ARG B 63 63 ARG A Protein S 6
FT #SUB 62 62 ARG B 327 327 SER A Protein S 1
FT #SUB 62 62 ARG B 330 330 GLY A Protein A 2
FT #SUB 62 62 ARG B 331 331 GLY A Protein S 4
FT #SUB 62 62 ARG B 333 333 ASP A Protein S 6
FT #SUB 63 63 ARG B 59 59 ASN A Protein S 8
FT #SUB 63 63 ARG B 62 62 ARG A Protein S 6
FT #SUB 64 64 PHE B 59 59 ASN A Protein S 2
FT #SUB 66 66 TRP B 66 66 TRP A Protein S 10
FT #SUB 66 66 TRP B 329 329 ILE A Protein S 7
FT #SUB 78 78 LEU B 389 389 PRO A Protein S 1
FT #SUB 88 88 TYR B 387 387 ASP A Protein S 2
FT #SUB 101 101 ALA B 387 387 ASP A Protein S 1
FT #SUB 103 103 TYR B 387 387 ASP A Protein S 1
FT #SUB 104 104 SER B 302 302 GLU A Protein S 5
FT #SUB 104 104 SER B 388 388 ARG A Protein S 1
FT #SUB 106 106 SER B 302 302 GLU A Protein S 1
FT #SUB 129 129 ARG B 133 133 SER A Protein S 11
FT #SUB 129 129 ARG B 388 388 ARG A Protein S 1
FT #SUB 132 132 PHE B 55 55 GLY A Protein S 6
FT #SUB 132 132 PHE B 56 56 ASP A Protein S 1
FT #SUB 132 132 PHE B 129 129 ARG A Protein S 1
FT #SUB 133 133 SER B 129 129 ARG A Protein A 8
FT #SUB 137 137 LEU B 56 56 ASP A Protein S 1
FT #SUB 137 137 LEU B 59 59 ASN A Protein S 1
FT #SUB 217 217 LEU B 329 329 ILE A Protein S 1
FT #SUB 300 300 HIS B 78 78 LEU A Protein S 2
FT #SUB 300 300 HIS B 107 107 LEU A Protein S 1
FT #SUB 302 302 GLU B 104 104 SER A Protein S 3
FT #SUB 302 302 GLU B 106 106 SER A Protein S 4
FT #SUB 327 327 SER B 62 62 ARG A Protein B 1
FT #SUB 329 329 ILE B 66 66 TRP A Protein A 7
FT #SUB 329 329 ILE B 217 217 LEU A Protein S 3
FT #SUB 330 330 GLY B 62 62 ARG A Protein B 3
FT #SUB 331 331 GLY B 62 62 ARG A Protein B 2
FT #SUB 333 333 ASP B 62 62 ARG A Protein S 6
FT #SUB 385 385 LEU B 59 59 ASN A Protein B 2
FT #SUB 386 386 GLY B 56 56 ASP A Protein B 4
FT #SUB 386 386 GLY B 58 58 THR A Protein B 2
FT #SUB 387 387 ASP B 56 56 ASP A Protein A 9
FT #SUB 387 387 ASP B 57 57 ALA A Protein S 6
FT #SUB 387 387 ASP B 58 58 THR A Protein A 6
FT #SUB 387 387 ASP B 88 88 TYR A Protein S 2
FT #SUB 387 387 ASP B 101 101 ALA A Protein S 1
FT #SUB 387 387 ASP B 103 103 TYR A Protein S 1
FT #SUB 388 388 ARG B 104 104 SER A Protein S 3
FT #SUB 389 389 PRO B 78 78 LEU A Protein S 1
FT #HET 226 226 HIS B 3 501 FE B S 3
FT #HET 226 226 HIS B 4 502 92U B S 2
FT #HET 228 228 VAL B 4 502 92U B S 1
FT #HET 267 267 SER B 4 502 92U B S 3
FT #HET 269 269 VAL B 4 502 92U B S 1
FT #HET 280 280 PRO B 4 502 92U B S 2
FT #HET 293 293 GLN B 4 502 92U B S 1
FT #HET 308 308 HIS B 3 501 FE B S 3
FT #HET 308 308 HIS B 4 502 92U B S 3
FT #HET 379 379 GLN B 4 502 92U B S 3
FT #HET 381 381 PHE B 4 502 92U B S 15
FT #HET 394 394 GLU B 3 501 FE B S 3
FT #HET 394 394 GLU B 4 502 92U B S 4
FT #HET 419 419 PHE B 4 502 92U B A 10
FT #HET 420 420 GLY B 4 502 92U B B 5
FT #HET 424 424 PHE B 4 502 92U B A 10
FT #HET 427 427 LEU B 4 502 92U B S 1
FT DISORDER 1 28
FT DISORDER 108 115
FT DISORDER 195 201
FT DISORDER 254 262
FT DISORDER 286 290
FT DISORDER 404 410
FT DISORDER 435 445
CC SEQUENCE 370 AA (ATOM);
CC FSKFVRKNPK SDKFKVKRFH HIEFWCGDAT NVARRFSWGL GMRFSAKSDL STGNMVHASY
CC LLTSGDLRFL FTAPYSPSLT TASIPSFDHG SCRSFFSSHG LGVRAVAIEV EDAESAFSIS
CC VANGAIPSSP PIVLNEAVTI AEVKLYGDVV LRYVSYKAFL PGFERVEDAS SFPLDYGIRR
CC LDHAVGNVPE LGPALTYVAG FTGFHQFAEF SGLNSAVLAS NDEMVLLPIN EPVKSQIQTY
CC LEHNEGAGLQ HLALMSEDIF RTLREMRKRS SIGGFDFMPS PPPTYYQNLK KRVGDVLSDD
CC QIKECEELGI LVDRDDQGTL LQIFTKPLGD RPTIFIEIIQ RVGCMMYQSG GCGGFGKGNF
CC SELFKSIEEY
CC !----
CC ALIGNMENT OF SEQRES AND ATOMRES
CC SEQRES MGHQNAAVSENQNHDDGAASSPGFKLVGFSKFVRKNPKSDKFKVKRFHHI
CC ATOM   ----------------------------FSKFVRKNPKSDKFKVKRFHHI
CC                                     **********************
CC SEQRES EFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTSGDLRFLFT
CC ATOM   EFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTSGDLRFLFT
CC        **************************************************
CC SEQRES APYSPSLSAGEIKPTTTASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAE
CC ATOM   APYSPSL--------TTASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAE
CC        *******        ***********************************
CC SEQRES SAFSISVANGAIPSSPPIVLNEAVTIAEVKLYGDVVLRYVSYKAEDTEKS
CC ATOM   SAFSISVANGAIPSSPPIVLNEAVTIAEVKLYGDVVLRYVSYKA------
CC        ********************************************
CC SEQRES EFLPGFERVEDASSFPLDYGIRRLDHAVGNVPELGPALTYVAGFTGFHQF
CC ATOM   -FLPGFERVEDASSFPLDYGIRRLDHAVGNVPELGPALTYVAGFTGFHQF
CC         *************************************************
CC SEQRES AEFTADDVGTAESGLNSAVLASNDEMVLLPINEPVHGTKRKSQIQTYLEH
CC ATOM   AEF---------SGLNSAVLASNDEMVLLPINEPV-----KSQIQTYLEH
CC        ***         ***********************     **********
CC SEQRES NEGAGLQHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRV
CC ATOM   NEGAGLQHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRV
CC        **************************************************
CC SEQRES GDVLSDDQIKECEELGILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVG
CC ATOM   GDVLSDDQIKECEELGILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVG
CC        **************************************************
CC SEQRES CMMKDEEGKAYQSGGCGGFGKGNFSELFKSIEEYEKTLEAKQLVG
CC ATOM   CMM-------YQSGGCGGFGKGNFSELFKSIEEY-----------
CC        ***       ************************
SQ SEQUENCE 445 AA; MW; CN;
     MGHQNAAVSE NQNHDDGAAS SPGFKLVGFS KFVRKNPKSD KFKVKRFHHI EFWCGDATNV
     ARRFSWGLGM RFSAKSDLST GNMVHASYLL TSGDLRFLFT APYSPSLSAG EIKPTTTASI
     PSFDHGSCRS FFSSHGLGVR AVAIEVEDAE SAFSISVANG AIPSSPPIVL NEAVTIAEVK
     LYGDVVLRYV SYKAEDTEKS EFLPGFERVE DASSFPLDYG IRRLDHAVGN VPELGPALTY
     VAGFTGFHQF AEFTADDVGT AESGLNSAVL ASNDEMVLLP INEPVHGTKR KSQIQTYLEH
     NEGAGLQHLA LMSEDIFRTL REMRKRSSIG GFDFMPSPPP TYYQNLKKRV GDVLSDDQIK
     ECEELGILVD RDDQGTLLQI FTKPLGDRPT IFIEIIQRVG CMMKDEEGKA YQSGGCGGFG
     KGNFSELFKS IEEYEKTLEA KQLVG
//