ID 5YY7A STANDARD; PRT; 445 AA. DT CONVERTED FROM PDB (SEQRES) 5YY7 DE 4-hydroxyphenylpyruvate dioxygenase OS Arabidopsis thaliana CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.300 CC R-Factor 0.273 FT #SUB 55 34 GLY A 132 111 PHE B Protein B 5 FT #SUB 56 35 ASP A 132 111 PHE B Protein S 1 FT #SUB 56 35 ASP A 137 116 LEU B Protein S 3 FT #SUB 56 35 ASP A 386 365 GLY B Protein S 2 FT #SUB 56 35 ASP A 387 366 ASP B Protein A 8 FT #SUB 57 36 ALA A 387 366 ASP B Protein A 7 FT #SUB 58 37 THR A 385 364 LEU B Protein S 2 FT #SUB 58 37 THR A 386 365 GLY B Protein S 5 FT #SUB 58 37 THR A 387 366 ASP B Protein A 9 FT #SUB 59 38 ASN A 59 38 ASN B Protein B 1 FT #SUB 59 38 ASN A 60 39 VAL B Protein S 5 FT #SUB 59 38 ASN A 63 42 ARG B Protein A 9 FT #SUB 59 38 ASN A 64 43 PHE B Protein S 2 FT #SUB 59 38 ASN A 137 116 LEU B Protein S 2 FT #SUB 59 38 ASN A 385 364 LEU B Protein S 1 FT #SUB 62 41 ARG A 63 42 ARG B Protein S 4 FT #SUB 62 41 ARG A 327 306 SER B Protein S 4 FT #SUB 62 41 ARG A 330 309 GLY B Protein A 5 FT #SUB 62 41 ARG A 331 310 GLY B Protein S 1 FT #SUB 62 41 ARG A 333 312 ASP B Protein S 5 FT #SUB 63 42 ARG A 59 38 ASN B Protein S 8 FT #SUB 63 42 ARG A 62 41 ARG B Protein S 6 FT #SUB 64 43 PHE A 59 38 ASN B Protein S 1 FT #SUB 66 45 TRP A 66 45 TRP B Protein S 11 FT #SUB 66 45 TRP A 329 308 ILE B Protein S 6 FT #SUB 78 57 LEU A 300 279 HIS B Protein S 1 FT #SUB 78 57 LEU A 389 368 PRO B Protein S 1 FT #SUB 86 65 ALA A 387 366 ASP B Protein S 3 FT #SUB 101 80 ALA A 387 366 ASP B Protein S 1 FT #SUB 103 82 TYR A 387 366 ASP B Protein S 6 FT #SUB 103 82 TYR A 388 367 ARG B Protein A 4 FT #SUB 104 83 SER A 302 281 GLU B Protein A 5 FT #SUB 104 83 SER A 388 367 ARG B Protein B 2 FT #SUB 105 84 PRO A 302 281 GLU B Protein S 1 FT #SUB 107 86 LEU A 300 279 HIS B Protein S 5 FT #SUB 132 111 PHE A 55 34 GLY B Protein S 5 FT #SUB 133 112 SER A 129 108 ARG B Protein A 4 FT #SUB 137 116 LEU A 56 35 ASP B Protein S 1 FT #SUB 137 116 LEU A 59 38 ASN B Protein S 1 FT #SUB 212 191 ALA A 328 307 SER B Protein B 1 FT #SUB 213 192 SER A 328 307 SER B Protein B 2 FT #SUB 217 196 LEU A 329 308 ILE B Protein S 1 FT #SUB 299 278 GLU A 107 86 LEU B Protein B 1 FT #SUB 300 279 HIS A 78 57 LEU B Protein S 2 FT #SUB 300 279 HIS A 107 86 LEU B Protein S 2 FT #SUB 302 281 GLU A 104 83 SER B Protein S 5 FT #SUB 302 281 GLU A 105 84 PRO B Protein S 1 FT #SUB 302 281 GLU A 106 85 SER B Protein S 1 FT #SUB 327 306 SER A 62 41 ARG B Protein B 4 FT #SUB 328 307 SER A 214 193 SER B Protein S 2 FT #SUB 328 307 SER A 215 194 PHE B Protein A 6 FT #SUB 329 308 ILE A 66 45 TRP B Protein A 7 FT #SUB 329 308 ILE A 215 194 PHE B Protein S 2 FT #SUB 329 308 ILE A 217 196 LEU B Protein S 2 FT #SUB 330 309 GLY A 62 41 ARG B Protein B 5 FT #SUB 331 310 GLY A 62 41 ARG B Protein B 4 FT #SUB 333 312 ASP A 62 41 ARG B Protein S 6 FT #SUB 385 364 LEU A 58 37 THR B Protein B 1 FT #SUB 385 364 LEU A 59 38 ASN B Protein B 1 FT #SUB 386 365 GLY A 56 35 ASP B Protein B 4 FT #SUB 386 365 GLY A 58 37 THR B Protein B 2 FT #SUB 387 366 ASP A 56 35 ASP B Protein A 10 FT #SUB 387 366 ASP A 57 36 ALA B Protein S 4 FT #SUB 387 366 ASP A 58 37 THR B Protein A 8 FT #SUB 387 366 ASP A 86 65 ALA B Protein S 3 FT #SUB 387 366 ASP A 88 67 TYR B Protein S 2 FT #SUB 387 366 ASP A 101 80 ALA B Protein S 1 FT #SUB 387 366 ASP A 103 82 TYR B Protein B 1 FT #SUB 388 367 ARG A 129 108 ARG B Protein S 1 FT #SUB 389 368 PRO A 78 57 LEU B Protein S 2 FT #HET 226 205 HIS A 1 501 CO A S 3 FT #HET 226 205 HIS A 2 502 94L A S 1 FT #HET 228 207 VAL A 1 501 CO A S 1 FT #HET 228 207 VAL A 2 502 94L A S 3 FT #HET 267 246 SER A 2 502 94L A S 4 FT #HET 280 259 PRO A 2 502 94L A S 1 FT #HET 308 287 HIS A 1 501 CO A S 3 FT #HET 308 287 HIS A 2 502 94L A S 3 FT #HET 335 314 MET A 2 502 94L A S 2 FT #HET 368 347 LEU A 2 502 94L A S 1 FT #HET 379 358 GLN A 2 502 94L A S 1 FT #HET 381 360 PHE A 2 502 94L A S 13 FT #HET 394 373 GLU A 1 501 CO A S 3 FT #HET 394 373 GLU A 2 502 94L A S 3 FT #HET 419 398 PHE A 2 502 94L A B 3 FT #HET 420 399 GLY A 2 502 94L A B 2 FT #HET 421 400 LYS A 2 502 94L A S 3 FT #HET 424 403 PHE A 2 502 94L A S 12 FT #HET 427 406 LEU A 2 502 94L A S 1 FT DISORDER 1 35 FT DISORDER 194 200 FT DISORDER 214 215 FT DISORDER 252 262 FT DISORDER 287 290 FT DISORDER 407 409 FT DISORDER 430 445 CC SEQUENCE 367 AA (ATOM); CC NPKSDKFKVK RFHHIEFWCG DATNVARRFS WGLGMRFSAK SDLSTGNMVH ASYLLTSGDL CC RFLFTAPYSP SLSAGEIKPT TTASIPSFDH GSCRSFFSSH GLGVRAVAIE VEDAESAFSI CC SVANGAIPSS PPIVLNEAVT IAEVKLYGDV VLRYVSYKEF LPGFERVEDA SPLDYGIRRL CC DHAVGNVPEL GPALTYVAGF TGFHQFASGL NSAVLASNDE MVLLPINEPV HKSQIQTYLE CC HNEGAGLQHL ALMSEDIFRT LREMRKRSSI GGFDFMPSPP PTYYQNLKKR VGDVLSDDQI CC KECEELGILV DRDDQGTLLQ IFTKPLGDRP TIFIEIIQRV GCMMKDEAYQ SGGCGGFGKG CC NFSELFK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MGHQNAAVSENQNHDDGAASSPGFKLVGFSKFVRKNPKSDKFKVKRFHHI CC ATOM -----------------------------------NPKSDKFKVKRFHHI CC *************** CC SEQRES EFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTSGDLRFLFT CC ATOM EFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTSGDLRFLFT CC ************************************************** CC SEQRES APYSPSLSAGEIKPTTTASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAE CC ATOM APYSPSLSAGEIKPTTTASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAE CC ************************************************** CC SEQRES SAFSISVANGAIPSSPPIVLNEAVTIAEVKLYGDVVLRYVSYKAEDTEKS CC ATOM SAFSISVANGAIPSSPPIVLNEAVTIAEVKLYGDVVLRYVSYK------- CC ******************************************* CC SEQRES EFLPGFERVEDASSFPLDYGIRRLDHAVGNVPELGPALTYVAGFTGFHQF CC ATOM EFLPGFERVEDAS--PLDYGIRRLDHAVGNVPELGPALTYVAGFTGFHQF CC ************* *********************************** CC SEQRES AEFTADDVGTAESGLNSAVLASNDEMVLLPINEPVHGTKRKSQIQTYLEH CC ATOM A-----------SGLNSAVLASNDEMVLLPINEPVH----KSQIQTYLEH CC * ************************ ********** CC SEQRES NEGAGLQHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRV CC ATOM NEGAGLQHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRV CC ************************************************** CC SEQRES GDVLSDDQIKECEELGILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVG CC ATOM GDVLSDDQIKECEELGILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVG CC ************************************************** CC SEQRES CMMKDEEGKAYQSGGCGGFGKGNFSELFKSIEEYEKTLEAKQLVG CC ATOM CMMKDE---AYQSGGCGGFGKGNFSELFK---------------- CC ****** ******************** SQ SEQUENCE 445 AA; MW; CN; MGHQNAAVSE NQNHDDGAAS SPGFKLVGFS KFVRKNPKSD KFKVKRFHHI EFWCGDATNV ARRFSWGLGM RFSAKSDLST GNMVHASYLL TSGDLRFLFT APYSPSLSAG EIKPTTTASI PSFDHGSCRS FFSSHGLGVR AVAIEVEDAE SAFSISVANG AIPSSPPIVL NEAVTIAEVK LYGDVVLRYV SYKAEDTEKS EFLPGFERVE DASSFPLDYG IRRLDHAVGN VPELGPALTY VAGFTGFHQF AEFTADDVGT AESGLNSAVL ASNDEMVLLP INEPVHGTKR KSQIQTYLEH NEGAGLQHLA LMSEDIFRTL REMRKRSSIG GFDFMPSPPP TYYQNLKKRV GDVLSDDQIK ECEELGILVD RDDQGTLLQI FTKPLGDRPT IFIEIIQRVG CMMKDEEGKA YQSGGCGGFG KGNFSELFKS IEEYEKTLEA KQLVG // ID 5YY7B STANDARD; PRT; 445 AA. DT CONVERTED FROM PDB (SEQRES) 5YY7 DE 4-hydroxyphenylpyruvate dioxygenase OS Arabidopsis thaliana CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.300 CC R-Factor 0.273 FT #SUB 55 34 GLY B 132 111 PHE A Protein B 5 FT #SUB 56 35 ASP B 137 116 LEU A Protein S 1 FT #SUB 56 35 ASP B 386 365 GLY A Protein S 4 FT #SUB 56 35 ASP B 387 366 ASP A Protein A 10 FT #SUB 57 36 ALA B 387 366 ASP A Protein B 4 FT #SUB 58 37 THR B 385 364 LEU A Protein S 1 FT #SUB 58 37 THR B 386 365 GLY A Protein S 2 FT #SUB 58 37 THR B 387 366 ASP A Protein A 8 FT #SUB 59 38 ASN B 59 38 ASN A Protein B 1 FT #SUB 59 38 ASN B 63 42 ARG A Protein A 8 FT #SUB 59 38 ASN B 64 43 PHE A Protein S 1 FT #SUB 59 38 ASN B 137 116 LEU A Protein S 1 FT #SUB 59 38 ASN B 385 364 LEU A Protein S 1 FT #SUB 60 39 VAL B 59 38 ASN A Protein A 5 FT #SUB 62 41 ARG B 63 42 ARG A Protein S 6 FT #SUB 62 41 ARG B 327 306 SER A Protein S 4 FT #SUB 62 41 ARG B 330 309 GLY A Protein A 5 FT #SUB 62 41 ARG B 331 310 GLY A Protein S 4 FT #SUB 62 41 ARG B 333 312 ASP A Protein S 6 FT #SUB 63 42 ARG B 59 38 ASN A Protein S 9 FT #SUB 63 42 ARG B 62 41 ARG A Protein S 4 FT #SUB 64 43 PHE B 59 38 ASN A Protein S 2 FT #SUB 66 45 TRP B 66 45 TRP A Protein S 11 FT #SUB 66 45 TRP B 329 308 ILE A Protein S 7 FT #SUB 78 57 LEU B 300 279 HIS A Protein S 2 FT #SUB 78 57 LEU B 389 368 PRO A Protein S 2 FT #SUB 86 65 ALA B 387 366 ASP A Protein S 3 FT #SUB 88 67 TYR B 387 366 ASP A Protein S 2 FT #SUB 101 80 ALA B 387 366 ASP A Protein S 1 FT #SUB 103 82 TYR B 387 366 ASP A Protein S 1 FT #SUB 104 83 SER B 302 281 GLU A Protein A 5 FT #SUB 105 84 PRO B 302 281 GLU A Protein S 1 FT #SUB 106 85 SER B 302 281 GLU A Protein S 1 FT #SUB 107 86 LEU B 299 278 GLU A Protein S 1 FT #SUB 107 86 LEU B 300 279 HIS A Protein S 2 FT #SUB 129 108 ARG B 133 112 SER A Protein S 4 FT #SUB 129 108 ARG B 388 367 ARG A Protein S 1 FT #SUB 132 111 PHE B 55 34 GLY A Protein S 5 FT #SUB 132 111 PHE B 56 35 ASP A Protein S 1 FT #SUB 137 116 LEU B 56 35 ASP A Protein S 3 FT #SUB 137 116 LEU B 59 38 ASN A Protein S 2 FT #SUB 214 193 SER B 328 307 SER A Protein B 2 FT #SUB 215 194 PHE B 328 307 SER A Protein A 6 FT #SUB 215 194 PHE B 329 308 ILE A Protein S 2 FT #SUB 217 196 LEU B 329 308 ILE A Protein S 2 FT #SUB 300 279 HIS B 78 57 LEU A Protein S 1 FT #SUB 300 279 HIS B 107 86 LEU A Protein S 5 FT #SUB 302 281 GLU B 104 83 SER A Protein S 5 FT #SUB 302 281 GLU B 105 84 PRO A Protein S 1 FT #SUB 327 306 SER B 62 41 ARG A Protein B 4 FT #SUB 328 307 SER B 212 191 ALA A Protein S 1 FT #SUB 328 307 SER B 213 192 SER A Protein S 2 FT #SUB 329 308 ILE B 66 45 TRP A Protein A 6 FT #SUB 329 308 ILE B 217 196 LEU A Protein S 1 FT #SUB 330 309 GLY B 62 41 ARG A Protein B 5 FT #SUB 331 310 GLY B 62 41 ARG A Protein B 1 FT #SUB 333 312 ASP B 62 41 ARG A Protein S 5 FT #SUB 385 364 LEU B 58 37 THR A Protein B 2 FT #SUB 385 364 LEU B 59 38 ASN A Protein B 1 FT #SUB 386 365 GLY B 56 35 ASP A Protein B 2 FT #SUB 386 365 GLY B 58 37 THR A Protein B 5 FT #SUB 387 366 ASP B 56 35 ASP A Protein A 8 FT #SUB 387 366 ASP B 57 36 ALA A Protein S 7 FT #SUB 387 366 ASP B 58 37 THR A Protein A 9 FT #SUB 387 366 ASP B 86 65 ALA A Protein S 3 FT #SUB 387 366 ASP B 101 80 ALA A Protein S 1 FT #SUB 387 366 ASP B 103 82 TYR A Protein A 6 FT #SUB 388 367 ARG B 103 82 TYR A Protein A 4 FT #SUB 388 367 ARG B 104 83 SER A Protein S 2 FT #SUB 389 368 PRO B 78 57 LEU A Protein S 1 FT #HET 226 205 HIS B 3 501 CO B S 3 FT #HET 226 205 HIS B 4 502 94L B S 3 FT #HET 265 244 LEU B 4 502 94L B S 1 FT #HET 267 246 SER B 4 502 94L B S 2 FT #HET 280 259 PRO B 4 502 94L B S 2 FT #HET 282 261 ASN B 4 502 94L B S 7 FT #HET 293 272 GLN B 4 502 94L B S 1 FT #HET 307 286 GLN B 4 502 94L B S 1 FT #HET 308 287 HIS B 3 501 CO B S 3 FT #HET 308 287 HIS B 4 502 94L B S 3 FT #HET 335 314 MET B 4 502 94L B S 1 FT #HET 379 358 GLN B 4 502 94L B S 2 FT #HET 381 360 PHE B 4 502 94L B S 14 FT #HET 394 373 GLU B 3 501 CO B S 3 FT #HET 394 373 GLU B 4 502 94L B S 2 FT #HET 419 398 PHE B 4 502 94L B A 6 FT #HET 420 399 GLY B 4 502 94L B B 2 FT #HET 423 402 ASN B 4 502 94L B A 3 FT #HET 424 403 PHE B 4 502 94L B S 8 FT #HET 429 408 LYS B 4 502 94L B B 3 FT DISORDER 1 33 FT DISORDER 196 200 FT DISORDER 212 213 FT DISORDER 253 261 FT DISORDER 288 290 FT DISORDER 408 411 FT DISORDER 430 445 CC SEQUENCE 373 AA (ATOM); CC RKNPKSDKFK VKRFHHIEFW CGDATNVARR FSWGLGMRFS AKSDLSTGNM VHASYLLTSG CC DLRFLFTAPY SPSLSAGEIK PTTTASIPSF DHGSCRSFFS SHGLGVRAVA IEVEDAESAF CC SISVANGAIP SSPPIVLNEA VTIAEVKLYG DVVLRYVSYK AEEFLPGFER VEDSFPLDYG CC IRRLDHAVGN VPELGPALTY VAGFTGFHQF AEESGLNSAV LASNDEMVLL PINEPVHGKS CC QIQTYLEHNE GAGLQHLALM SEDIFRTLRE MRKRSSIGGF DFMPSPPPTY YQNLKKRVGD CC VLSDDQIKEC EELGILVDRD DQGTLLQIFT KPLGDRPTIF IEIIQRVGCM MKDEEQSGGC CC GGFGKGNFSE LFK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MGHQNAAVSENQNHDDGAASSPGFKLVGFSKFVRKNPKSDKFKVKRFHHI CC ATOM ---------------------------------RKNPKSDKFKVKRFHHI CC ***************** CC SEQRES EFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTSGDLRFLFT CC ATOM EFWCGDATNVARRFSWGLGMRFSAKSDLSTGNMVHASYLLTSGDLRFLFT CC ************************************************** CC SEQRES APYSPSLSAGEIKPTTTASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAE CC ATOM APYSPSLSAGEIKPTTTASIPSFDHGSCRSFFSSHGLGVRAVAIEVEDAE CC ************************************************** CC SEQRES SAFSISVANGAIPSSPPIVLNEAVTIAEVKLYGDVVLRYVSYKAEDTEKS CC ATOM SAFSISVANGAIPSSPPIVLNEAVTIAEVKLYGDVVLRYVSYKAE----- CC ********************************************* CC SEQRES EFLPGFERVEDASSFPLDYGIRRLDHAVGNVPELGPALTYVAGFTGFHQF CC ATOM EFLPGFERVED--SFPLDYGIRRLDHAVGNVPELGPALTYVAGFTGFHQF CC *********** ************************************* CC SEQRES AEFTADDVGTAESGLNSAVLASNDEMVLLPINEPVHGTKRKSQIQTYLEH CC ATOM AE---------ESGLNSAVLASNDEMVLLPINEPVHG---KSQIQTYLEH CC ** ************************** ********** CC SEQRES NEGAGLQHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRV CC ATOM NEGAGLQHLALMSEDIFRTLREMRKRSSIGGFDFMPSPPPTYYQNLKKRV CC ************************************************** CC SEQRES GDVLSDDQIKECEELGILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVG CC ATOM GDVLSDDQIKECEELGILVDRDDQGTLLQIFTKPLGDRPTIFIEIIQRVG CC ************************************************** CC SEQRES CMMKDEEGKAYQSGGCGGFGKGNFSELFKSIEEYEKTLEAKQLVG CC ATOM CMMKDEE----QSGGCGGFGKGNFSELFK---------------- CC ******* ****************** SQ SEQUENCE 445 AA; MW; CN; MGHQNAAVSE NQNHDDGAAS SPGFKLVGFS KFVRKNPKSD KFKVKRFHHI EFWCGDATNV ARRFSWGLGM RFSAKSDLST GNMVHASYLL TSGDLRFLFT APYSPSLSAG EIKPTTTASI PSFDHGSCRS FFSSHGLGVR AVAIEVEDAE SAFSISVANG AIPSSPPIVL NEAVTIAEVK LYGDVVLRYV SYKAEDTEKS EFLPGFERVE DASSFPLDYG IRRLDHAVGN VPELGPALTY VAGFTGFHQF AEFTADDVGT AESGLNSAVL ASNDEMVLLP INEPVHGTKR KSQIQTYLEH NEGAGLQHLA LMSEDIFRTL REMRKRSSIG GFDFMPSPPP TYYQNLKKRV GDVLSDDQIK ECEELGILVD RDDQGTLLQI FTKPLGDRPT IFIEIIQRVG CMMKDEEGKA YQSGGCGGFG KGNFSELFKS IEEYEKTLEA KQLVG //