ID 4Y49A STANDARD; PRT; 854 AA. DT CONVERTED FROM PDB (SEQRES) 4Y49 DE N-terminal acetyltransferase A complex subunit NAT1 OS Saccharomyces cerevisiae CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.950 CC R-Factor 0.358 FT #SUB 199 210 TYR A 39 39 SER B Protein S 1 FT #SUB 199 210 TYR A 40 40 TRP B Protein S 1 FT #SUB 203 214 GLU A 42 42 GLU B Protein S 1 FT #SUB 238 249 PHE A 143 143 GLU B Protein B 1 FT #SUB 238 249 PHE A 144 144 VAL B Protein B 4 FT #SUB 239 250 ASP A 42 42 GLU B Protein B 1 FT #SUB 239 250 ASP A 144 144 VAL B Protein B 3 FT #SUB 269 280 LYS A 202 202 SER B Protein B 1 FT #SUB 270 281 ARG A 200 200 GLN B Protein S 2 FT #SUB 270 281 ARG A 201 201 ILE B Protein B 3 FT #SUB 271 282 ASN A 5 5 ILE B Protein S 2 FT #SUB 271 282 ASN A 136 136 GLN B Protein S 1 FT #SUB 273 284 ASP A 4 4 ASN B Protein B 2 FT #SUB 273 284 ASP A 5 5 ILE B Protein B 5 FT #SUB 273 284 ASP A 132 132 ASN B Protein S 1 FT #SUB 274 285 ASN A 5 5 ILE B Protein A 7 FT #SUB 275 286 PHE A 50 50 THR B Protein S 2 FT #SUB 306 317 PRO A 204 204 PHE B Protein S 1 FT #SUB 313 324 PHE A 2 2 PRO B Protein S 4 FT #SUB 343 354 ALA A 127 127 MET B Protein S 2 FT #SUB 346 357 SER A 2 2 PRO B Protein B 2 FT #SUB 347 358 ASN A 2 2 PRO B Protein B 7 FT #SUB 354 365 ARG A 52 52 LEU B Protein S 1 FT #SUB 354 365 ARG A 53 53 ASP B Protein S 1 FT #SUB 452 463 ARG A 21 21 LEU B Protein S 3 FT #SUB 452 463 ARG A 22 22 HIS B Protein B 3 FT #SUB 452 463 ARG A 23 23 ASN B Protein B 3 FT #SUB 452 463 ARG A 24 24 LEU B Protein A 3 FT #SUB 452 463 ARG A 25 25 PRO B Protein S 1 FT #SUB 453 464 PHE A 122 122 ARG B Protein S 2 FT #SUB 456 467 CYS A 22 22 HIS B Protein S 1 FT #SUB 457 468 LYS A 22 22 HIS B Protein S 1 FT #SUB 493 504 LEU A 29 29 MET B Protein B 1 FT #SUB 494 505 VAL A 21 21 LEU B Protein B 1 FT #SUB 494 505 VAL A 27 27 ASN B Protein S 1 FT #SUB 494 505 VAL A 28 28 TYR B Protein B 1 FT #SUB 495 506 GLU A 29 29 MET B Protein S 1 FT #SUB 495 506 GLU A 30 30 MET B Protein S 4 FT #SUB 498 509 TRP A 22 22 HIS B Protein S 2 FT #SUB 568 579 PHE A 30 30 MET B Protein S 10 FT #SUB 584 595 THR A 39 39 SER B Protein S 1 FT #SUB 587 598 ALA A 41 41 PRO B Protein S 1 FT #SUB 590 601 GLU A 10 10 ILE B Protein A 3 FT #SUB 591 602 MET A 10 10 ILE B Protein A 3 FT #SUB 594 605 TRP A 10 10 ILE B Protein S 1 FT #SUB 594 605 TRP A 11 11 ASN B Protein S 1 FT #SUB 594 605 TRP A 30 30 MET B Protein S 1 FT #SUB 376 387 PRO A 101 101 ASN C Protein B 3 FT #SUB 380 391 PRO A 102 102 TYR C Protein S 1 FT #SUB 413 424 HIS A 99 99 LEU C Protein B 1 FT #SUB 413 424 HIS A 101 101 ASN C Protein S 6 FT #SUB 413 424 HIS A 102 102 TYR C Protein S 1 FT #SUB 416 427 THR A 18 18 MET C Protein B 2 FT #SUB 416 427 THR A 70 70 PRO C Protein S 1 FT #SUB 446 457 GLN A 17 17 GLY C Protein B 1 FT #SUB 447 458 LEU A 18 18 MET C Protein B 1 FT #SUB 448 459 ASP A 14 14 ASN C Protein B 1 FT #SUB 449 460 LEU A 14 14 ASN C Protein B 1 FT #SUB 184 195 GLN A 436 447 THR G Protein S 1 FT #HET 395 406 LEU A 1 901 G4P A A 2 FT #HET 426 437 ARG A 1 901 G4P A A 2 FT #HET 457 468 LYS A 1 901 G4P A S 5 FT DISORDER 1 11 FT DISORDER 22 34 FT DISORDER 86 87 FT DISORDER 481 483 FT DISORDER 523 533 FT DISORDER 625 658 FT DISORDER 758 765 FT DISORDER 806 806 FT DISORDER 825 825 FT DISORDER 854 854 CC SEQUENCE 769 AA (ATOM); CC AAKIALKKYN QYKKSLKLLD AILKKDGSHV DSLALKGLDL YSVGEKDDAA SYVANAIRKI CC ESASPICCHV LGIYMRNTKE YKESIKWFTA ALNNGSTNKQ IYRDLATLQS QIGDFKNALV CC SRKKYWEAFL GYRANWTSLA VAQDVNGERQ QAINTLSQFE KLAEGKISDS EKYEHSECLM CC YKNDIMYKAA SDNQDKLQNV LKHLNDIEPC VFDKFGLLER KATIYMKLGQ LKDASIVYRT CC LIKRNPDNFK YYKLLEVSLG IQGDNKLKKA LYGKLEQFYP RCEPPKFIPL TFLQDKEELS CC KKLREYVLPQ LERGVPATFS NVKPLYQRRK SKVSPLLEKI VLDYLSGLDP TQDPIPFIWT CC NYYLSQHFLF LKDFPKAQEY IDAALDHTPT LVEFYILKAR ILKHLGLMDT AAGILEEGRQ CC LDLQDRFINC KTVKYFLRAN NIDKAVEVAS LFTKSVNGIK DLHLVEASWF IVEQAEAYYR CC LYLDRKKKLD DLAEQIANDI KENQWLVRKY KGLALKRFNA IPKFYKQFED DQLDFHSYCM CC RKGTPRAYLE MLEWGKALYT KPMYVRAMKE ASKLYFQMHD DRLKNKRKET EAKSVAAYPS CC DQDNDVFGEK LIETSTPMED FATEFYNNYS MQVREDERDY ILDFEFNYRI GKLALCFASL CC NKFAKRFGTT SGLFGSMAIV LLHDPILKKV VTKSLEKEYS ENFPLNEISN NSFDWLNFYQ CC EKFKNDINGL LFLYRYRDDV PGSSNLKEMI ISSLSPLEPH SQNEILQYY CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSRKRSTKPKPAAKIALKKYNDQFLEALKLYEGKQYKKSLKLLDAILKKD CC ATOM -----------AAKIALKKYN-------------QYKKSLKLLDAILKKD CC ********** **************** CC SEQRES GSHVDSLALKGLDLYSVGEKDDAASYVANAIRKIEGASASPICCHVLGIY CC ATOM GSHVDSLALKGLDLYSVGEKDDAASYVANAIRKIE--SASPICCHVLGIY CC *********************************** ************* CC SEQRES MRNTKEYKESIKWFTAALNNGSTNKQIYRDLATLQSQIGDFKNALVSRKK CC ATOM MRNTKEYKESIKWFTAALNNGSTNKQIYRDLATLQSQIGDFKNALVSRKK CC ************************************************** CC SEQRES YWEAFLGYRANWTSLAVAQDVNGERQQAINTLSQFEKLAEGKISDSEKYE CC ATOM YWEAFLGYRANWTSLAVAQDVNGERQQAINTLSQFEKLAEGKISDSEKYE CC ************************************************** CC SEQRES HSECLMYKNDIMYKAASDNQDKLQNVLKHLNDIEPCVFDKFGLLERKATI CC ATOM HSECLMYKNDIMYKAASDNQDKLQNVLKHLNDIEPCVFDKFGLLERKATI CC ************************************************** CC SEQRES YMKLGQLKDASIVYRTLIKRNPDNFKYYKLLEVSLGIQGDNKLKKALYGK CC ATOM YMKLGQLKDASIVYRTLIKRNPDNFKYYKLLEVSLGIQGDNKLKKALYGK CC ************************************************** CC SEQRES LEQFYPRCEPPKFIPLTFLQDKEELSKKLREYVLPQLERGVPATFSNVKP CC ATOM LEQFYPRCEPPKFIPLTFLQDKEELSKKLREYVLPQLERGVPATFSNVKP CC ************************************************** CC SEQRES LYQRRKSKVSPLLEKIVLDYLSGLDPTQDPIPFIWTNYYLSQHFLFLKDF CC ATOM LYQRRKSKVSPLLEKIVLDYLSGLDPTQDPIPFIWTNYYLSQHFLFLKDF CC ************************************************** CC SEQRES PKAQEYIDAALDHTPTLVEFYILKARILKHLGLMDTAAGILEEGRQLDLQ CC ATOM PKAQEYIDAALDHTPTLVEFYILKARILKHLGLMDTAAGILEEGRQLDLQ CC ************************************************** CC SEQRES DRFINCKTVKYFLRANNIDKAVEVASLFTKNDDSVNGIKDLHLVEASWFI CC ATOM DRFINCKTVKYFLRANNIDKAVEVASLFTK---SVNGIKDLHLVEASWFI CC ****************************** ***************** CC SEQRES VEQAEAYYRLYLDRKKKLDDLASLKKEVESDKSEQIANDIKENQWLVRKY CC ATOM VEQAEAYYRLYLDRKKKLDDLA-----------EQIANDIKENQWLVRKY CC ********************** ***************** CC SEQRES KGLALKRFNAIPKFYKQFEDDQLDFHSYCMRKGTPRAYLEMLEWGKALYT CC ATOM KGLALKRFNAIPKFYKQFEDDQLDFHSYCMRKGTPRAYLEMLEWGKALYT CC ************************************************** CC SEQRES KPMYVRAMKEASKLYFQMHDDRLKRKSDSLDENSDEIQNNGQNSSSQKKK CC ATOM KPMYVRAMKEASKLYFQMHDDRLK-------------------------- CC ************************ CC SEQRES AKKEAAAMNKRKETEAKSVAAYPSDQDNDVFGEKLIETSTPMEDFATEFY CC ATOM --------NKRKETEAKSVAAYPSDQDNDVFGEKLIETSTPMEDFATEFY CC ****************************************** CC SEQRES NNYSMQVREDERDYILDFEFNYRIGKLALCFASLNKFAKRFGTTSGLFGS CC ATOM NNYSMQVREDERDYILDFEFNYRIGKLALCFASLNKFAKRFGTTSGLFGS CC ************************************************** CC SEQRES MAIVLLHATRNDTPFDPILKKVVTKSLEKEYSENFPLNEISNNSFDWLNF CC ATOM MAIVLLH--------DPILKKVVTKSLEKEYSENFPLNEISNNSFDWLNF CC ******* *********************************** CC SEQRES YQEKFGKNDINGLLFLYRYRDDVPIGSSNLKEMIISSLSPLEPHSQNEIL CC ATOM YQEKF-KNDINGLLFLYRYRDDVP-GSSNLKEMIISSLSPLEPHSQNEIL CC ***** ****************** ************************* CC SEQRES QYYL CC ATOM QYY- CC *** SQ SEQUENCE 854 AA; MW; CN; MSRKRSTKPK PAAKIALKKY NDQFLEALKL YEGKQYKKSL KLLDAILKKD GSHVDSLALK GLDLYSVGEK DDAASYVANA IRKIEGASAS PICCHVLGIY MRNTKEYKES IKWFTAALNN GSTNKQIYRD LATLQSQIGD FKNALVSRKK YWEAFLGYRA NWTSLAVAQD VNGERQQAIN TLSQFEKLAE GKISDSEKYE HSECLMYKND IMYKAASDNQ DKLQNVLKHL NDIEPCVFDK FGLLERKATI YMKLGQLKDA SIVYRTLIKR NPDNFKYYKL LEVSLGIQGD NKLKKALYGK LEQFYPRCEP PKFIPLTFLQ DKEELSKKLR EYVLPQLERG VPATFSNVKP LYQRRKSKVS PLLEKIVLDY LSGLDPTQDP IPFIWTNYYL SQHFLFLKDF PKAQEYIDAA LDHTPTLVEF YILKARILKH LGLMDTAAGI LEEGRQLDLQ DRFINCKTVK YFLRANNIDK AVEVASLFTK NDDSVNGIKD LHLVEASWFI VEQAEAYYRL YLDRKKKLDD LASLKKEVES DKSEQIANDI KENQWLVRKY KGLALKRFNA IPKFYKQFED DQLDFHSYCM RKGTPRAYLE MLEWGKALYT KPMYVRAMKE ASKLYFQMHD DRLKRKSDSL DENSDEIQNN GQNSSSQKKK AKKEAAAMNK RKETEAKSVA AYPSDQDNDV FGEKLIETST PMEDFATEFY NNYSMQVRED ERDYILDFEF NYRIGKLALC FASLNKFAKR FGTTSGLFGS MAIVLLHATR NDTPFDPILK KVVTKSLEKE YSENFPLNEI SNNSFDWLNF YQEKFGKNDI NGLLFLYRYR DDVPIGSSNL KEMIISSLSP LEPHSQNEIL QYYL // ID 4Y49B STANDARD; PRT; 238 AA. DT CONVERTED FROM PDB (SEQRES) 4Y49 DE N-terminal acetyltransferase A complex catalytic subunit ARD1 OS Saccharomyces cerevisiae CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.950 CC R-Factor 0.358 FT #SUB 2 2 PRO B 313 324 PHE A Protein S 4 FT #SUB 2 2 PRO B 346 357 SER A Protein A 2 FT #SUB 2 2 PRO B 347 358 ASN A Protein A 7 FT #SUB 4 4 ASN B 273 284 ASP A Protein B 2 FT #SUB 5 5 ILE B 271 282 ASN A Protein A 2 FT #SUB 5 5 ILE B 273 284 ASP A Protein A 5 FT #SUB 5 5 ILE B 274 285 ASN A Protein B 7 FT #SUB 10 10 ILE B 590 601 GLU A Protein S 3 FT #SUB 10 10 ILE B 591 602 MET A Protein S 3 FT #SUB 10 10 ILE B 594 605 TRP A Protein B 1 FT #SUB 11 11 ASN B 594 605 TRP A Protein B 1 FT #SUB 21 21 LEU B 452 463 ARG A Protein B 3 FT #SUB 21 21 LEU B 494 505 VAL A Protein S 1 FT #SUB 22 22 HIS B 452 463 ARG A Protein B 3 FT #SUB 22 22 HIS B 456 467 CYS A Protein B 1 FT #SUB 22 22 HIS B 457 468 LYS A Protein S 1 FT #SUB 22 22 HIS B 498 509 TRP A Protein S 2 FT #SUB 23 23 ASN B 452 463 ARG A Protein B 3 FT #SUB 24 24 LEU B 452 463 ARG A Protein B 3 FT #SUB 25 25 PRO B 452 463 ARG A Protein B 1 FT #SUB 27 27 ASN B 494 505 VAL A Protein S 1 FT #SUB 28 28 TYR B 494 505 VAL A Protein B 1 FT #SUB 29 29 MET B 493 504 LEU A Protein S 1 FT #SUB 29 29 MET B 495 506 GLU A Protein B 1 FT #SUB 30 30 MET B 495 506 GLU A Protein A 4 FT #SUB 30 30 MET B 568 579 PHE A Protein S 10 FT #SUB 30 30 MET B 594 605 TRP A Protein S 1 FT #SUB 39 39 SER B 199 210 TYR A Protein B 1 FT #SUB 39 39 SER B 584 595 THR A Protein B 1 FT #SUB 40 40 TRP B 199 210 TYR A Protein B 1 FT #SUB 41 41 PRO B 587 598 ALA A Protein S 1 FT #SUB 42 42 GLU B 203 214 GLU A Protein S 1 FT #SUB 42 42 GLU B 239 250 ASP A Protein S 1 FT #SUB 50 50 THR B 275 286 PHE A Protein S 2 FT #SUB 52 52 LEU B 354 365 ARG A Protein B 1 FT #SUB 53 53 ASP B 354 365 ARG A Protein B 1 FT #SUB 122 122 ARG B 453 464 PHE A Protein S 2 FT #SUB 127 127 MET B 343 354 ALA A Protein S 2 FT #SUB 132 132 ASN B 273 284 ASP A Protein S 1 FT #SUB 136 136 GLN B 271 282 ASN A Protein S 1 FT #SUB 143 143 GLU B 238 249 PHE A Protein B 1 FT #SUB 144 144 VAL B 238 249 PHE A Protein A 4 FT #SUB 144 144 VAL B 239 250 ASP A Protein S 3 FT #SUB 200 200 GLN B 270 281 ARG A Protein S 2 FT #SUB 201 201 ILE B 270 281 ARG A Protein A 3 FT #SUB 202 202 SER B 269 280 LYS A Protein S 1 FT #SUB 204 204 PHE B 306 317 PRO A Protein B 1 FT #SUB 26 26 GLU B 1 1 SER E Protein S 6 FT #SUB 28 28 TYR B 1 1 SER E Protein S 1 FT #SUB 28 28 TYR B 2 2 TYR E Protein S 7 FT #SUB 116 116 THR B 1 1 SER E Protein B 4 FT #SUB 116 116 THR B 2 2 TYR E Protein B 1 FT #SUB 117 117 SER B 1 1 SER E Protein B 1 FT #SUB 153 153 HIS B 1 1 SER E Protein B 1 FT #SUB 180 180 TYR B 6 6 HIS E Protein B 2 FT #SUB 181 181 TYR B 1 1 SER E Protein S 4 FT #HET 22 22 HIS B 1 901 G4P A S 1 FT #HET 24 24 LEU B 2 301 CMC B S 1 FT #HET 117 117 SER B 2 301 CMC B B 3 FT #HET 118 118 LEU B 2 301 CMC B A 14 FT #HET 119 119 SER B 2 301 CMC B A 5 FT #HET 120 120 VAL B 2 301 CMC B A 7 FT #HET 125 125 ARG B 2 301 CMC B A 7 FT #HET 126 126 ARG B 2 301 CMC B B 7 FT #HET 127 127 MET B 2 301 CMC B B 2 FT #HET 128 128 GLY B 2 301 CMC B B 11 FT #HET 129 129 ILE B 2 301 CMC B S 1 FT #HET 130 130 ALA B 2 301 CMC B A 12 FT #HET 131 131 GLU B 2 301 CMC B A 12 FT #HET 159 159 ARG B 2 301 CMC B S 2 FT #HET 160 160 ALA B 2 301 CMC B A 3 FT #HET 163 163 HIS B 2 301 CMC B S 1 FT #HET 164 164 LEU B 2 301 CMC B S 3 FT #HET 165 165 TYR B 2 301 CMC B S 2 FT #HET 168 168 THR B 2 301 CMC B S 1 FT DISORDER 1 1 FT DISORDER 54 88 FT DISORDER 105 108 FT DISORDER 205 238 CC SEQUENCE 164 AA (ATOM); CC PINIRRATIN DIICMQNANL HNLPENYMMK YYMYHILSWP EASFVATTTT LDGEKLVGYV CC LVKMNDDPEP PNGHITSLSV MRTYRRMGIA ENLMRQALFA LREVHQAEYV SLHVRQSNRA CC ALHLYRDTLA FEVLSIEKSY YQDGEDAYAM KKVLKLEELQ ISNF CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MPINIRRATINDIICMQNANLHNLPENYMMKYYMYHILSWPEASFVATTT CC ATOM -PINIRRATINDIICMQNANLHNLPENYMMKYYMYHILSWPEASFVATTT CC ************************************************* CC SEQRES TLDCEDSDEQDENDKLELTLDGTNDGRTIKLDPTYLAPGEKLVGYVLVKM CC ATOM TLD-----------------------------------GEKLVGYVLVKM CC *** ************ CC SEQRES NDDPDQQNEPPNGHITSLSVMRTYRRMGIAENLMRQALFALREVHQAEYV CC ATOM NDDP----EPPNGHITSLSVMRTYRRMGIAENLMRQALFALREVHQAEYV CC **** ****************************************** CC SEQRES SLHVRQSNRAALHLYRDTLAFEVLSIEKSYYQDGEDAYAMKKVLKLEELQ CC ATOM SLHVRQSNRAALHLYRDTLAFEVLSIEKSYYQDGEDAYAMKKVLKLEELQ CC ************************************************** CC SEQRES ISNFTHRRLKENEEKLEDDLESDLLEDIIKQGVNDIIV CC ATOM ISNF---------------------------------- CC **** SQ SEQUENCE 238 AA; MW; CN; MPINIRRATI NDIICMQNAN LHNLPENYMM KYYMYHILSW PEASFVATTT TLDCEDSDEQ DENDKLELTL DGTNDGRTIK LDPTYLAPGE KLVGYVLVKM NDDPDQQNEP PNGHITSLSV MRTYRRMGIA ENLMRQALFA LREVHQAEYV SLHVRQSNRA ALHLYRDTLA FEVLSIEKSY YQDGEDAYAM KKVLKLEELQ ISNFTHRRLK ENEEKLEDDL ESDLLEDIIK QGVNDIIV // ID 4Y49C STANDARD; PRT; 176 AA. DT CONVERTED FROM PDB (SEQRES) 4Y49 DE N-terminal acetyltransferase A complex subunit NAT5 OS Saccharomyces cerevisiae CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.950 CC R-Factor 0.358 FT #SUB 14 14 ASN C 448 459 ASP A Protein B 1 FT #SUB 14 14 ASN C 449 460 LEU A Protein B 1 FT #SUB 17 17 GLY C 446 457 GLN A Protein B 1 FT #SUB 18 18 MET C 416 427 THR A Protein S 2 FT #SUB 18 18 MET C 447 458 LEU A Protein S 1 FT #SUB 70 70 PRO C 416 427 THR A Protein B 1 FT #SUB 99 99 LEU C 413 424 HIS A Protein S 1 FT #SUB 101 101 ASN C 376 387 PRO A Protein A 3 FT #SUB 101 101 ASN C 413 424 HIS A Protein S 6 FT #SUB 102 102 TYR C 380 391 PRO A Protein S 1 FT #SUB 102 102 TYR C 413 424 HIS A Protein S 1 FT #SUB 153 153 THR C 48 59 LYS M Protein B 2 FT #SUB 154 154 VAL C 48 59 LYS M Protein B 1 FT #SUB 154 154 VAL C 49 60 LYS M Protein B 3 FT #SUB 155 155 ASN C 49 60 LYS M Protein S 1 FT #SUB 155 155 ASN C 51 62 GLY M Protein S 2 FT #HET 26 26 THR C 3 201 ACO C B 1 FT #HET 93 93 ILE C 3 201 ACO C B 1 FT #HET 94 94 GLU C 3 201 ACO C B 3 FT #HET 95 95 PHE C 3 201 ACO C B 4 FT #HET 96 96 LEU C 3 201 ACO C B 6 FT #HET 97 97 GLY C 3 201 ACO C B 2 FT #HET 98 98 VAL C 3 201 ACO C A 6 FT #HET 103 103 ARG C 3 201 ACO C A 23 FT #HET 104 104 HIS C 3 201 ACO C B 17 FT #HET 105 105 LYS C 3 201 ACO C B 7 FT #HET 106 106 SER C 3 201 ACO C A 7 FT #HET 108 108 GLY C 3 201 ACO C B 4 FT #HET 109 109 SER C 3 201 ACO C A 6 FT #HET 141 141 TRP C 3 201 ACO C S 11 FT DISORDER 1 2 FT DISORDER 42 55 FT DISORDER 82 84 FT DISORDER 176 176 CC SEQUENCE 156 AA (ATOM); CC RDICTLDNVY ANNLGMLTKL AHVTVPNLYQ DAFFSALFAK DVHFTQMAYY SEIPVGGLVA CC KLVPKELSLK GIQIEFLGVL PNYRHKSIGS KLLKFAEDKC SECHQHNVFV YLPAVDDLTK CC QWFIAHGFEQ VGETVNNFIK GVNGDEQDAI LLKKHI CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MGRDICTLDNVYANNLGMLTKLAHVTVPNLYQDAFFSALFAEDSLVAKNK CC ATOM --RDICTLDNVYANNLGMLTKLAHVTVPNLYQDAFFSALFA--------- CC *************************************** CC SEQRES KPSSKKDVHFTQMAYYSEIPVGGLVAKLVPKKQNELSLKGIQIEFLGVLP CC ATOM -----KDVHFTQMAYYSEIPVGGLVAKLVPK---ELSLKGIQIEFLGVLP CC ************************** **************** CC SEQRES NYRHKSIGSKLLKFAEDKCSECHQHNVFVYLPAVDDLTKQWFIAHGFEQV CC ATOM NYRHKSIGSKLLKFAEDKCSECHQHNVFVYLPAVDDLTKQWFIAHGFEQV CC ************************************************** CC SEQRES GETVNNFIKGVNGDEQDAILLKKHIS CC ATOM GETVNNFIKGVNGDEQDAILLKKHI- CC ************************* SQ SEQUENCE 176 AA; MW; CN; MGRDICTLDN VYANNLGMLT KLAHVTVPNL YQDAFFSALF AEDSLVAKNK KPSSKKDVHF TQMAYYSEIP VGGLVAKLVP KKQNELSLKG IQIEFLGVLP NYRHKSIGSK LLKFAEDKCS ECHQHNVFVY LPAVDDLTKQ WFIAHGFEQV GETVNNFIKG VNGDEQDAIL LKKHIS // ID 4Y49E STANDARD; PRT; 8 AA. DT CONVERTED FROM PDB (SEQRES) 4Y49 DE ALA-ALA-ALA-ALA-ALA-ALA OS syntetic CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.950 CC R-Factor 0.358 FT #SUB 1 1 SER E 26 26 GLU B Protein A 6 FT #SUB 1 1 SER E 28 28 TYR B Protein B 1 FT #SUB 1 1 SER E 116 116 THR B Protein B 4 FT #SUB 1 1 SER E 117 117 SER B Protein B 1 FT #SUB 1 1 SER E 153 153 HIS B Protein B 1 FT #SUB 1 1 SER E 181 181 TYR B Protein A 4 FT #SUB 2 2 TYR E 28 28 TYR B Protein A 7 FT #SUB 2 2 TYR E 116 116 THR B Protein B 1 FT #SUB 6 6 HIS E 180 180 TYR B Protein B 2 FT #HET 1 1 SER E 2 301 CMC B B 2 FT DISORDER 7 8 CC SEQUENCE 6 AA (ATOM); CC SYSMEH CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SYSMEHFR CC ATOM SYSMEH-- CC ****** SQ SEQUENCE 8 AA; MW; CN; SYSMEHFR // ID 4Y49G STANDARD; PRT; 854 AA. DT CONVERTED FROM PDB (SEQRES) 4Y49 DE N-terminal acetyltransferase A complex subunit NAT1 OS Saccharomyces cerevisiae CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.950 CC R-Factor 0.358 FT #SUB 436 447 THR G 184 195 GLN A Protein S 1 FT #SUB 199 210 TYR G 39 39 SER H Protein S 2 FT #SUB 238 249 PHE G 143 143 GLU H Protein S 1 FT #SUB 238 249 PHE G 144 144 VAL H Protein A 4 FT #SUB 239 250 ASP G 42 42 GLU H Protein S 2 FT #SUB 239 250 ASP G 143 143 GLU H Protein A 4 FT #SUB 240 251 LYS G 143 143 GLU H Protein A 7 FT #SUB 269 280 LYS G 202 202 SER H Protein B 1 FT #SUB 270 281 ARG G 201 201 ILE H Protein B 3 FT #SUB 271 282 ASN G 5 5 ILE H Protein S 2 FT #SUB 271 282 ASN G 136 136 GLN H Protein S 1 FT #SUB 273 284 ASP G 5 5 ILE H Protein B 3 FT #SUB 273 284 ASP G 132 132 ASN H Protein S 1 FT #SUB 274 285 ASN G 5 5 ILE H Protein S 7 FT #SUB 275 286 PHE G 50 50 THR H Protein S 2 FT #SUB 306 317 PRO G 204 204 PHE H Protein S 2 FT #SUB 313 324 PHE G 2 2 PRO H Protein S 5 FT #SUB 343 354 ALA G 127 127 MET H Protein S 2 FT #SUB 346 357 SER G 2 2 PRO H Protein B 2 FT #SUB 347 358 ASN G 2 2 PRO H Protein B 6 FT #SUB 354 365 ARG G 52 52 LEU H Protein S 1 FT #SUB 354 365 ARG G 53 53 ASP H Protein S 1 FT #SUB 452 463 ARG G 21 21 LEU H Protein S 4 FT #SUB 452 463 ARG G 22 22 HIS H Protein A 4 FT #SUB 452 463 ARG G 23 23 ASN H Protein B 1 FT #SUB 452 463 ARG G 24 24 LEU H Protein S 7 FT #SUB 452 463 ARG G 25 25 PRO H Protein S 4 FT #SUB 453 464 PHE G 122 122 ARG H Protein S 2 FT #SUB 494 505 VAL G 27 27 ASN H Protein S 1 FT #SUB 494 505 VAL G 28 28 TYR H Protein B 1 FT #SUB 495 506 GLU G 30 30 MET H Protein S 2 FT #SUB 498 509 TRP G 21 21 LEU H Protein S 2 FT #SUB 498 509 TRP G 22 22 HIS H Protein S 8 FT #SUB 568 579 PHE G 30 30 MET H Protein S 4 FT #SUB 571 582 ASP G 30 30 MET H Protein S 5 FT #SUB 571 582 ASP G 31 31 LYS H Protein S 2 FT #SUB 584 595 THR G 39 39 SER H Protein S 1 FT #SUB 587 598 ALA G 41 41 PRO H Protein S 1 FT #SUB 590 601 GLU G 10 10 ILE H Protein A 3 FT #SUB 594 605 TRP G 10 10 ILE H Protein S 2 FT #SUB 594 605 TRP G 11 11 ASN H Protein S 1 FT #SUB 376 387 PRO G 101 101 ASN I Protein B 2 FT #SUB 380 391 PRO G 102 102 TYR I Protein S 1 FT #SUB 413 424 HIS G 99 99 LEU I Protein B 2 FT #SUB 413 424 HIS G 101 101 ASN I Protein S 3 FT #SUB 413 424 HIS G 102 102 TYR I Protein S 1 FT #SUB 416 427 THR G 18 18 MET I Protein A 2 FT #SUB 416 427 THR G 70 70 PRO I Protein S 1 FT #SUB 446 457 GLN G 15 15 ASN I Protein B 1 FT #SUB 446 457 GLN G 17 17 GLY I Protein B 1 FT #SUB 447 458 LEU G 18 18 MET I Protein B 1 FT #SUB 449 460 LEU G 14 14 ASN I Protein S 2 FT #HET 395 406 LEU G 4 901 G4P G S 3 FT #HET 426 437 ARG G 4 901 G4P G S 24 FT DISORDER 1 11 FT DISORDER 22 35 FT DISORDER 86 87 FT DISORDER 481 483 FT DISORDER 523 534 FT DISORDER 625 658 FT DISORDER 758 763 FT DISORDER 825 825 FT DISORDER 854 854 CC SEQUENCE 770 AA (ATOM); CC AAKIALKKYN YKKSLKLLDA ILKKDGSHVD SLALKGLDLY SVGEKDDAAS YVANAIRKIE CC SASPICCHVL GIYMRNTKEY KESIKWFTAA LNNGSTNKQI YRDLATLQSQ IGDFKNALVS CC RKKYWEAFLG YRANWTSLAV AQDVNGERQQ AINTLSQFEK LAEGKISDSE KYEHSECLMY CC KNDIMYKAAS DNQDKLQNVL KHLNDIEPCV FDKFGLLERK ATIYMKLGQL KDASIVYRTL CC IKRNPDNFKY YKLLEVSLGI QGDNKLKKAL YGKLEQFYPR CEPPKFIPLT FLQDKEELSK CC KLREYVLPQL ERGVPATFSN VKPLYQRRKS KVSPLLEKIV LDYLSGLDPT QDPIPFIWTN CC YYLSQHFLFL KDFPKAQEYI DAALDHTPTL VEFYILKARI LKHLGLMDTA AGILEEGRQL CC DLQDRFINCK TVKYFLRANN IDKAVEVASL FTKSVNGIKD LHLVEASWFI VEQAEAYYRL CC YLDRKKKLDD LAQIANDIKE NQWLVRKYKG LALKRFNAIP KFYKQFEDDQ LDFHSYCMRK CC GTPRAYLEML EWGKALYTKP MYVRAMKEAS KLYFQMHDDR LKNKRKETEA KSVAAYPSDQ CC DNDVFGEKLI ETSTPMEDFA TEFYNNYSMQ VREDERDYIL DFEFNYRIGK LALCFASLNK CC FAKRFGTTSG LFGSMAIVLL HPFDPILKKV VTKSLEKEYS ENFPLNEISN NSFDWLNFYQ CC EKFGKNDING LLFLYRYRDD VPGSSNLKEM IISSLSPLEP HSQNEILQYY CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSRKRSTKPKPAAKIALKKYNDQFLEALKLYEGKQYKKSLKLLDAILKKD CC ATOM -----------AAKIALKKYN--------------YKKSLKLLDAILKKD CC ********** *************** CC SEQRES GSHVDSLALKGLDLYSVGEKDDAASYVANAIRKIEGASASPICCHVLGIY CC ATOM GSHVDSLALKGLDLYSVGEKDDAASYVANAIRKIE--SASPICCHVLGIY CC *********************************** ************* CC SEQRES MRNTKEYKESIKWFTAALNNGSTNKQIYRDLATLQSQIGDFKNALVSRKK CC ATOM MRNTKEYKESIKWFTAALNNGSTNKQIYRDLATLQSQIGDFKNALVSRKK CC ************************************************** CC SEQRES YWEAFLGYRANWTSLAVAQDVNGERQQAINTLSQFEKLAEGKISDSEKYE CC ATOM YWEAFLGYRANWTSLAVAQDVNGERQQAINTLSQFEKLAEGKISDSEKYE CC ************************************************** CC SEQRES HSECLMYKNDIMYKAASDNQDKLQNVLKHLNDIEPCVFDKFGLLERKATI CC ATOM HSECLMYKNDIMYKAASDNQDKLQNVLKHLNDIEPCVFDKFGLLERKATI CC ************************************************** CC SEQRES YMKLGQLKDASIVYRTLIKRNPDNFKYYKLLEVSLGIQGDNKLKKALYGK CC ATOM YMKLGQLKDASIVYRTLIKRNPDNFKYYKLLEVSLGIQGDNKLKKALYGK CC ************************************************** CC SEQRES LEQFYPRCEPPKFIPLTFLQDKEELSKKLREYVLPQLERGVPATFSNVKP CC ATOM LEQFYPRCEPPKFIPLTFLQDKEELSKKLREYVLPQLERGVPATFSNVKP CC ************************************************** CC SEQRES LYQRRKSKVSPLLEKIVLDYLSGLDPTQDPIPFIWTNYYLSQHFLFLKDF CC ATOM LYQRRKSKVSPLLEKIVLDYLSGLDPTQDPIPFIWTNYYLSQHFLFLKDF CC ************************************************** CC SEQRES PKAQEYIDAALDHTPTLVEFYILKARILKHLGLMDTAAGILEEGRQLDLQ CC ATOM PKAQEYIDAALDHTPTLVEFYILKARILKHLGLMDTAAGILEEGRQLDLQ CC ************************************************** CC SEQRES DRFINCKTVKYFLRANNIDKAVEVASLFTKNDDSVNGIKDLHLVEASWFI CC ATOM DRFINCKTVKYFLRANNIDKAVEVASLFTK---SVNGIKDLHLVEASWFI CC ****************************** ***************** CC SEQRES VEQAEAYYRLYLDRKKKLDDLASLKKEVESDKSEQIANDIKENQWLVRKY CC ATOM VEQAEAYYRLYLDRKKKLDDLA------------QIANDIKENQWLVRKY CC ********************** **************** CC SEQRES KGLALKRFNAIPKFYKQFEDDQLDFHSYCMRKGTPRAYLEMLEWGKALYT CC ATOM KGLALKRFNAIPKFYKQFEDDQLDFHSYCMRKGTPRAYLEMLEWGKALYT CC ************************************************** CC SEQRES KPMYVRAMKEASKLYFQMHDDRLKRKSDSLDENSDEIQNNGQNSSSQKKK CC ATOM KPMYVRAMKEASKLYFQMHDDRLK-------------------------- CC ************************ CC SEQRES AKKEAAAMNKRKETEAKSVAAYPSDQDNDVFGEKLIETSTPMEDFATEFY CC ATOM --------NKRKETEAKSVAAYPSDQDNDVFGEKLIETSTPMEDFATEFY CC ****************************************** CC SEQRES NNYSMQVREDERDYILDFEFNYRIGKLALCFASLNKFAKRFGTTSGLFGS CC ATOM NNYSMQVREDERDYILDFEFNYRIGKLALCFASLNKFAKRFGTTSGLFGS CC ************************************************** CC SEQRES MAIVLLHATRNDTPFDPILKKVVTKSLEKEYSENFPLNEISNNSFDWLNF CC ATOM MAIVLLH------PFDPILKKVVTKSLEKEYSENFPLNEISNNSFDWLNF CC ******* ************************************* CC SEQRES YQEKFGKNDINGLLFLYRYRDDVPIGSSNLKEMIISSLSPLEPHSQNEIL CC ATOM YQEKFGKNDINGLLFLYRYRDDVP-GSSNLKEMIISSLSPLEPHSQNEIL CC ************************ ************************* CC SEQRES QYYL CC ATOM QYY- CC *** SQ SEQUENCE 854 AA; MW; CN; MSRKRSTKPK PAAKIALKKY NDQFLEALKL YEGKQYKKSL KLLDAILKKD GSHVDSLALK GLDLYSVGEK DDAASYVANA IRKIEGASAS PICCHVLGIY MRNTKEYKES IKWFTAALNN GSTNKQIYRD LATLQSQIGD FKNALVSRKK YWEAFLGYRA NWTSLAVAQD VNGERQQAIN TLSQFEKLAE GKISDSEKYE HSECLMYKND IMYKAASDNQ DKLQNVLKHL NDIEPCVFDK FGLLERKATI YMKLGQLKDA SIVYRTLIKR NPDNFKYYKL LEVSLGIQGD NKLKKALYGK LEQFYPRCEP PKFIPLTFLQ DKEELSKKLR EYVLPQLERG VPATFSNVKP LYQRRKSKVS PLLEKIVLDY LSGLDPTQDP IPFIWTNYYL SQHFLFLKDF PKAQEYIDAA LDHTPTLVEF YILKARILKH LGLMDTAAGI LEEGRQLDLQ DRFINCKTVK YFLRANNIDK AVEVASLFTK NDDSVNGIKD LHLVEASWFI VEQAEAYYRL YLDRKKKLDD LASLKKEVES DKSEQIANDI KENQWLVRKY KGLALKRFNA IPKFYKQFED DQLDFHSYCM RKGTPRAYLE MLEWGKALYT KPMYVRAMKE ASKLYFQMHD DRLKRKSDSL DENSDEIQNN GQNSSSQKKK AKKEAAAMNK RKETEAKSVA AYPSDQDNDV FGEKLIETST PMEDFATEFY NNYSMQVRED ERDYILDFEF NYRIGKLALC FASLNKFAKR FGTTSGLFGS MAIVLLHATR NDTPFDPILK KVVTKSLEKE YSENFPLNEI SNNSFDWLNF YQEKFGKNDI NGLLFLYRYR DDVPIGSSNL KEMIISSLSP LEPHSQNEIL QYYL // ID 4Y49H STANDARD; PRT; 238 AA. DT CONVERTED FROM PDB (SEQRES) 4Y49 DE N-terminal acetyltransferase A complex catalytic subunit ARD1 OS Saccharomyces cerevisiae CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.950 CC R-Factor 0.358 FT #SUB 2 2 PRO H 313 324 PHE G Protein S 5 FT #SUB 2 2 PRO H 346 357 SER G Protein A 2 FT #SUB 2 2 PRO H 347 358 ASN G Protein S 6 FT #SUB 5 5 ILE H 271 282 ASN G Protein A 2 FT #SUB 5 5 ILE H 273 284 ASP G Protein A 3 FT #SUB 5 5 ILE H 274 285 ASN G Protein B 7 FT #SUB 10 10 ILE H 590 601 GLU G Protein S 3 FT #SUB 10 10 ILE H 594 605 TRP G Protein B 2 FT #SUB 11 11 ASN H 594 605 TRP G Protein B 1 FT #SUB 21 21 LEU H 452 463 ARG G Protein B 4 FT #SUB 21 21 LEU H 498 509 TRP G Protein A 2 FT #SUB 22 22 HIS H 452 463 ARG G Protein B 4 FT #SUB 22 22 HIS H 498 509 TRP G Protein S 8 FT #SUB 23 23 ASN H 452 463 ARG G Protein B 1 FT #SUB 24 24 LEU H 452 463 ARG G Protein B 7 FT #SUB 25 25 PRO H 452 463 ARG G Protein A 4 FT #SUB 27 27 ASN H 494 505 VAL G Protein S 1 FT #SUB 28 28 TYR H 494 505 VAL G Protein B 1 FT #SUB 30 30 MET H 495 506 GLU G Protein S 2 FT #SUB 30 30 MET H 568 579 PHE G Protein S 4 FT #SUB 30 30 MET H 571 582 ASP G Protein A 5 FT #SUB 31 31 LYS H 571 582 ASP G Protein B 2 FT #SUB 39 39 SER H 199 210 TYR G Protein B 2 FT #SUB 39 39 SER H 584 595 THR G Protein B 1 FT #SUB 41 41 PRO H 587 598 ALA G Protein S 1 FT #SUB 42 42 GLU H 239 250 ASP G Protein S 2 FT #SUB 50 50 THR H 275 286 PHE G Protein S 2 FT #SUB 52 52 LEU H 354 365 ARG G Protein B 1 FT #SUB 53 53 ASP H 354 365 ARG G Protein B 1 FT #SUB 122 122 ARG H 453 464 PHE G Protein S 2 FT #SUB 127 127 MET H 343 354 ALA G Protein S 2 FT #SUB 132 132 ASN H 273 284 ASP G Protein S 1 FT #SUB 136 136 GLN H 271 282 ASN G Protein S 1 FT #SUB 143 143 GLU H 238 249 PHE G Protein B 1 FT #SUB 143 143 GLU H 239 250 ASP G Protein S 4 FT #SUB 143 143 GLU H 240 251 LYS G Protein S 7 FT #SUB 144 144 VAL H 238 249 PHE G Protein A 4 FT #SUB 201 201 ILE H 270 281 ARG G Protein A 3 FT #SUB 202 202 SER H 269 280 LYS G Protein S 1 FT #SUB 204 204 PHE H 306 317 PRO G Protein B 2 FT #SUB 26 26 GLU H 1 1 SER K Protein S 5 FT #SUB 26 26 GLU H 3 3 SER K Protein S 1 FT #SUB 28 28 TYR H 1 1 SER K Protein S 1 FT #SUB 28 28 TYR H 2 2 TYR K Protein S 7 FT #SUB 116 116 THR H 1 1 SER K Protein B 4 FT #SUB 116 116 THR H 2 2 TYR K Protein B 3 FT #SUB 117 117 SER H 1 1 SER K Protein S 1 FT #SUB 180 180 TYR H 6 6 HIS K Protein B 2 FT #SUB 181 181 TYR H 1 1 SER K Protein S 7 FT #HET 22 22 HIS H 4 901 G4P G S 1 FT #HET 23 23 ASN H 5 301 CMC H B 1 FT #HET 24 24 LEU H 5 301 CMC H S 1 FT #HET 116 116 THR H 5 301 CMC H B 1 FT #HET 117 117 SER H 5 301 CMC H B 2 FT #HET 118 118 LEU H 5 301 CMC H A 12 FT #HET 119 119 SER H 5 301 CMC H B 2 FT #HET 120 120 VAL H 5 301 CMC H A 5 FT #HET 125 125 ARG H 5 301 CMC H A 6 FT #HET 126 126 ARG H 5 301 CMC H B 8 FT #HET 127 127 MET H 5 301 CMC H B 1 FT #HET 128 128 GLY H 5 301 CMC H B 6 FT #HET 130 130 ALA H 5 301 CMC H A 18 FT #HET 131 131 GLU H 5 301 CMC H B 3 FT #HET 154 154 VAL H 5 301 CMC H S 2 FT #HET 158 158 ASN H 5 301 CMC H S 1 FT #HET 159 159 ARG H 5 301 CMC H S 1 FT #HET 160 160 ALA H 5 301 CMC H A 8 FT #HET 161 161 ALA H 5 301 CMC H S 2 FT #HET 163 163 HIS H 5 301 CMC H S 3 FT #HET 164 164 LEU H 5 301 CMC H S 4 FT #HET 165 165 TYR H 5 301 CMC H S 1 FT #HET 168 168 THR H 5 301 CMC H S 1 FT DISORDER 1 1 FT DISORDER 54 88 FT DISORDER 105 108 FT DISORDER 205 238 CC SEQUENCE 164 AA (ATOM); CC PINIRRATIN DIICMQNANL HNLPENYMMK YYMYHILSWP EASFVATTTT LDGEKLVGYV CC LVKMNDDPEP PNGHITSLSV MRTYRRMGIA ENLMRQALFA LREVHQAEYV SLHVRQSNRA CC ALHLYRDTLA FEVLSIEKSY YQDGEDAYAM KKVLKLEELQ ISNF CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MPINIRRATINDIICMQNANLHNLPENYMMKYYMYHILSWPEASFVATTT CC ATOM -PINIRRATINDIICMQNANLHNLPENYMMKYYMYHILSWPEASFVATTT CC ************************************************* CC SEQRES TLDCEDSDEQDENDKLELTLDGTNDGRTIKLDPTYLAPGEKLVGYVLVKM CC ATOM TLD-----------------------------------GEKLVGYVLVKM CC *** ************ CC SEQRES NDDPDQQNEPPNGHITSLSVMRTYRRMGIAENLMRQALFALREVHQAEYV CC ATOM NDDP----EPPNGHITSLSVMRTYRRMGIAENLMRQALFALREVHQAEYV CC **** ****************************************** CC SEQRES SLHVRQSNRAALHLYRDTLAFEVLSIEKSYYQDGEDAYAMKKVLKLEELQ CC ATOM SLHVRQSNRAALHLYRDTLAFEVLSIEKSYYQDGEDAYAMKKVLKLEELQ CC ************************************************** CC SEQRES ISNFTHRRLKENEEKLEDDLESDLLEDIIKQGVNDIIV CC ATOM ISNF---------------------------------- CC **** SQ SEQUENCE 238 AA; MW; CN; MPINIRRATI NDIICMQNAN LHNLPENYMM KYYMYHILSW PEASFVATTT TLDCEDSDEQ DENDKLELTL DGTNDGRTIK LDPTYLAPGE KLVGYVLVKM NDDPDQQNEP PNGHITSLSV MRTYRRMGIA ENLMRQALFA LREVHQAEYV SLHVRQSNRA ALHLYRDTLA FEVLSIEKSY YQDGEDAYAM KKVLKLEELQ ISNFTHRRLK ENEEKLEDDL ESDLLEDIIK QGVNDIIV // ID 4Y49I STANDARD; PRT; 176 AA. DT CONVERTED FROM PDB (SEQRES) 4Y49 DE N-terminal acetyltransferase A complex subunit NAT5 OS Saccharomyces cerevisiae CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.950 CC R-Factor 0.358 FT #SUB 14 14 ASN I 449 460 LEU G Protein B 2 FT #SUB 15 15 ASN I 446 457 GLN G Protein B 1 FT #SUB 17 17 GLY I 446 457 GLN G Protein B 1 FT #SUB 18 18 MET I 416 427 THR G Protein S 2 FT #SUB 18 18 MET I 447 458 LEU G Protein S 1 FT #SUB 70 70 PRO I 416 427 THR G Protein B 1 FT #SUB 99 99 LEU I 413 424 HIS G Protein S 2 FT #SUB 101 101 ASN I 376 387 PRO G Protein A 2 FT #SUB 101 101 ASN I 413 424 HIS G Protein A 3 FT #SUB 102 102 TYR I 380 391 PRO G Protein S 1 FT #SUB 102 102 TYR I 413 424 HIS G Protein S 1 FT #HET 26 26 THR I 6 201 ACO I B 3 FT #HET 27 27 VAL I 6 201 ACO I S 1 FT #HET 93 93 ILE I 6 201 ACO I B 1 FT #HET 94 94 GLU I 6 201 ACO I B 3 FT #HET 95 95 PHE I 6 201 ACO I A 6 FT #HET 96 96 LEU I 6 201 ACO I B 4 FT #HET 97 97 GLY I 6 201 ACO I B 1 FT #HET 98 98 VAL I 6 201 ACO I A 9 FT #HET 103 103 ARG I 6 201 ACO I A 12 FT #HET 104 104 HIS I 6 201 ACO I A 25 FT #HET 105 105 LYS I 6 201 ACO I B 3 FT #HET 106 106 SER I 6 201 ACO I B 3 FT #HET 108 108 GLY I 6 201 ACO I B 6 FT #HET 109 109 SER I 6 201 ACO I A 4 FT #HET 141 141 TRP I 6 201 ACO I S 9 FT DISORDER 1 2 FT DISORDER 42 55 FT DISORDER 82 85 FT DISORDER 176 176 CC SEQUENCE 155 AA (ATOM); CC RDICTLDNVY ANNLGMLTKL AHVTVPNLYQ DAFFSALFAK DVHFTQMAYY SEIPVGGLVA CC KLVPKLSLKG IQIEFLGVLP NYRHKSIGSK LLKFAEDKCS ECHQHNVFVY LPAVDDLTKQ CC WFIAHGFEQV GETVNNFIKG VNGDEQDAIL LKKHI CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MGRDICTLDNVYANNLGMLTKLAHVTVPNLYQDAFFSALFAEDSLVAKNK CC ATOM --RDICTLDNVYANNLGMLTKLAHVTVPNLYQDAFFSALFA--------- CC *************************************** CC SEQRES KPSSKKDVHFTQMAYYSEIPVGGLVAKLVPKKQNELSLKGIQIEFLGVLP CC ATOM -----KDVHFTQMAYYSEIPVGGLVAKLVPK----LSLKGIQIEFLGVLP CC ************************** *************** CC SEQRES NYRHKSIGSKLLKFAEDKCSECHQHNVFVYLPAVDDLTKQWFIAHGFEQV CC ATOM NYRHKSIGSKLLKFAEDKCSECHQHNVFVYLPAVDDLTKQWFIAHGFEQV CC ************************************************** CC SEQRES GETVNNFIKGVNGDEQDAILLKKHIS CC ATOM GETVNNFIKGVNGDEQDAILLKKHI- CC ************************* SQ SEQUENCE 176 AA; MW; CN; MGRDICTLDN VYANNLGMLT KLAHVTVPNL YQDAFFSALF AEDSLVAKNK KPSSKKDVHF TQMAYYSEIP VGGLVAKLVP KKQNELSLKG IQIEFLGVLP NYRHKSIGSK LLKFAEDKCS ECHQHNVFVY LPAVDDLTKQ WFIAHGFEQV GETVNNFIKG VNGDEQDAIL LKKHIS // ID 4Y49K STANDARD; PRT; 8 AA. DT CONVERTED FROM PDB (SEQRES) 4Y49 DE ALA-ALA-ALA-ALA-ALA-ALA OS syntetic CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.950 CC R-Factor 0.358 FT #SUB 1 1 SER K 26 26 GLU H Protein A 5 FT #SUB 1 1 SER K 28 28 TYR H Protein B 1 FT #SUB 1 1 SER K 116 116 THR H Protein B 4 FT #SUB 1 1 SER K 117 117 SER H Protein B 1 FT #SUB 1 1 SER K 181 181 TYR H Protein A 7 FT #SUB 2 2 TYR K 28 28 TYR H Protein A 7 FT #SUB 2 2 TYR K 116 116 THR H Protein A 3 FT #SUB 3 3 SER K 26 26 GLU H Protein S 1 FT #SUB 6 6 HIS K 180 180 TYR H Protein B 2 FT #HET 1 1 SER K 5 301 CMC H B 2 FT DISORDER 7 8 CC SEQUENCE 6 AA (ATOM); CC SYSMEH CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SYSMEHFR CC ATOM SYSMEH-- CC ****** SQ SEQUENCE 8 AA; MW; CN; SYSMEHFR // ID 4Y49M STANDARD; PRT; 854 AA. DT CONVERTED FROM PDB (SEQRES) 4Y49 DE N-terminal acetyltransferase A complex subunit NAT1 OS Saccharomyces cerevisiae CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.950 CC R-Factor 0.358 FT #SUB 48 59 LYS M 153 153 THR C Protein A 2 FT #SUB 48 59 LYS M 154 154 VAL C Protein B 1 FT #SUB 49 60 LYS M 154 154 VAL C Protein B 3 FT #SUB 49 60 LYS M 155 155 ASN C Protein B 1 FT #SUB 51 62 GLY M 155 155 ASN C Protein B 2 FT #SUB 199 210 TYR M 40 40 TRP N Protein S 2 FT #SUB 203 214 GLU M 42 42 GLU N Protein S 1 FT #SUB 238 249 PHE M 143 143 GLU N Protein A 4 FT #SUB 238 249 PHE M 144 144 VAL N Protein A 7 FT #SUB 240 251 LYS M 143 143 GLU N Protein A 6 FT #SUB 269 280 LYS M 202 202 SER N Protein B 1 FT #SUB 270 281 ARG M 201 201 ILE N Protein B 4 FT #SUB 271 282 ASN M 5 5 ILE N Protein S 1 FT #SUB 271 282 ASN M 136 136 GLN N Protein S 1 FT #SUB 273 284 ASP M 5 5 ILE N Protein B 3 FT #SUB 274 285 ASN M 4 4 ASN N Protein A 3 FT #SUB 274 285 ASN M 5 5 ILE N Protein S 5 FT #SUB 275 286 PHE M 4 4 ASN N Protein B 1 FT #SUB 275 286 PHE M 50 50 THR N Protein S 2 FT #SUB 306 317 PRO M 204 204 PHE N Protein S 1 FT #SUB 313 324 PHE M 2 2 PRO N Protein S 5 FT #SUB 343 354 ALA M 127 127 MET N Protein S 2 FT #SUB 346 357 SER M 2 2 PRO N Protein B 2 FT #SUB 347 358 ASN M 2 2 PRO N Protein B 7 FT #SUB 350 361 PRO M 2 2 PRO N Protein S 2 FT #SUB 354 365 ARG M 53 53 ASP N Protein A 4 FT #SUB 452 463 ARG M 21 21 LEU N Protein S 3 FT #SUB 452 463 ARG M 22 22 HIS N Protein A 3 FT #SUB 452 463 ARG M 23 23 ASN N Protein B 1 FT #SUB 452 463 ARG M 24 24 LEU N Protein S 5 FT #SUB 452 463 ARG M 25 25 PRO N Protein S 5 FT #SUB 453 464 PHE M 122 122 ARG N Protein S 2 FT #SUB 456 467 CYS M 22 22 HIS N Protein S 2 FT #SUB 494 505 VAL M 27 27 ASN N Protein S 2 FT #SUB 495 506 GLU M 30 30 MET N Protein S 5 FT #SUB 498 509 TRP M 22 22 HIS N Protein S 1 FT #SUB 568 579 PHE M 30 30 MET N Protein S 10 FT #SUB 584 595 THR M 38 38 LEU N Protein S 1 FT #SUB 584 595 THR M 39 39 SER N Protein S 1 FT #SUB 587 598 ALA M 41 41 PRO N Protein S 1 FT #SUB 590 601 GLU M 9 9 THR N Protein S 1 FT #SUB 590 601 GLU M 10 10 ILE N Protein A 3 FT #SUB 594 605 TRP M 10 10 ILE N Protein S 2 FT #SUB 594 605 TRP M 11 11 ASN N Protein S 1 FT #SUB 594 605 TRP M 30 30 MET N Protein S 2 FT #SUB 376 387 PRO M 101 101 ASN O Protein B 2 FT #SUB 413 424 HIS M 99 99 LEU O Protein B 1 FT #SUB 413 424 HIS M 101 101 ASN O Protein S 3 FT #SUB 416 427 THR M 18 18 MET O Protein B 1 FT #SUB 416 427 THR M 70 70 PRO O Protein S 1 FT #SUB 446 457 GLN M 17 17 GLY O Protein B 1 FT #SUB 447 458 LEU M 15 15 ASN O Protein B 1 FT #SUB 447 458 LEU M 18 18 MET O Protein B 2 FT #SUB 448 459 ASP M 14 14 ASN O Protein B 1 FT #SUB 449 460 LEU M 14 14 ASN O Protein B 1 FT #HET 349 360 LYS M 7 901 G4P M S 6 FT #HET 395 406 LEU M 7 901 G4P M S 1 FT #HET 426 437 ARG M 7 901 G4P M A 2 FT DISORDER 1 11 FT DISORDER 22 35 FT DISORDER 86 87 FT DISORDER 481 483 FT DISORDER 523 534 FT DISORDER 625 658 FT DISORDER 758 764 FT DISORDER 825 825 FT DISORDER 854 854 CC SEQUENCE 769 AA (ATOM); CC AAKIALKKYN YKKSLKLLDA ILKKDGSHVD SLALKGLDLY SVGEKDDAAS YVANAIRKIE CC SASPICCHVL GIYMRNTKEY KESIKWFTAA LNNGSTNKQI YRDLATLQSQ IGDFKNALVS CC RKKYWEAFLG YRANWTSLAV AQDVNGERQQ AINTLSQFEK LAEGKISDSE KYEHSECLMY CC KNDIMYKAAS DNQDKLQNVL KHLNDIEPCV FDKFGLLERK ATIYMKLGQL KDASIVYRTL CC IKRNPDNFKY YKLLEVSLGI QGDNKLKKAL YGKLEQFYPR CEPPKFIPLT FLQDKEELSK CC KLREYVLPQL ERGVPATFSN VKPLYQRRKS KVSPLLEKIV LDYLSGLDPT QDPIPFIWTN CC YYLSQHFLFL KDFPKAQEYI DAALDHTPTL VEFYILKARI LKHLGLMDTA AGILEEGRQL CC DLQDRFINCK TVKYFLRANN IDKAVEVASL FTKSVNGIKD LHLVEASWFI VEQAEAYYRL CC YLDRKKKLDD LAQIANDIKE NQWLVRKYKG LALKRFNAIP KFYKQFEDDQ LDFHSYCMRK CC GTPRAYLEML EWGKALYTKP MYVRAMKEAS KLYFQMHDDR LKNKRKETEA KSVAAYPSDQ CC DNDVFGEKLI ETSTPMEDFA TEFYNNYSMQ VREDERDYIL DFEFNYRIGK LALCFASLNK CC FAKRFGTTSG LFGSMAIVLL HFDPILKKVV TKSLEKEYSE NFPLNEISNN SFDWLNFYQE CC KFGKNDINGL LFLYRYRDDV PGSSNLKEMI ISSLSPLEPH SQNEILQYY CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSRKRSTKPKPAAKIALKKYNDQFLEALKLYEGKQYKKSLKLLDAILKKD CC ATOM -----------AAKIALKKYN--------------YKKSLKLLDAILKKD CC ********** *************** CC SEQRES GSHVDSLALKGLDLYSVGEKDDAASYVANAIRKIEGASASPICCHVLGIY CC ATOM GSHVDSLALKGLDLYSVGEKDDAASYVANAIRKIE--SASPICCHVLGIY CC *********************************** ************* CC SEQRES MRNTKEYKESIKWFTAALNNGSTNKQIYRDLATLQSQIGDFKNALVSRKK CC ATOM MRNTKEYKESIKWFTAALNNGSTNKQIYRDLATLQSQIGDFKNALVSRKK CC ************************************************** CC SEQRES YWEAFLGYRANWTSLAVAQDVNGERQQAINTLSQFEKLAEGKISDSEKYE CC ATOM YWEAFLGYRANWTSLAVAQDVNGERQQAINTLSQFEKLAEGKISDSEKYE CC ************************************************** CC SEQRES HSECLMYKNDIMYKAASDNQDKLQNVLKHLNDIEPCVFDKFGLLERKATI CC ATOM HSECLMYKNDIMYKAASDNQDKLQNVLKHLNDIEPCVFDKFGLLERKATI CC ************************************************** CC SEQRES YMKLGQLKDASIVYRTLIKRNPDNFKYYKLLEVSLGIQGDNKLKKALYGK CC ATOM YMKLGQLKDASIVYRTLIKRNPDNFKYYKLLEVSLGIQGDNKLKKALYGK CC ************************************************** CC SEQRES LEQFYPRCEPPKFIPLTFLQDKEELSKKLREYVLPQLERGVPATFSNVKP CC ATOM LEQFYPRCEPPKFIPLTFLQDKEELSKKLREYVLPQLERGVPATFSNVKP CC ************************************************** CC SEQRES LYQRRKSKVSPLLEKIVLDYLSGLDPTQDPIPFIWTNYYLSQHFLFLKDF CC ATOM LYQRRKSKVSPLLEKIVLDYLSGLDPTQDPIPFIWTNYYLSQHFLFLKDF CC ************************************************** CC SEQRES PKAQEYIDAALDHTPTLVEFYILKARILKHLGLMDTAAGILEEGRQLDLQ CC ATOM PKAQEYIDAALDHTPTLVEFYILKARILKHLGLMDTAAGILEEGRQLDLQ CC ************************************************** CC SEQRES DRFINCKTVKYFLRANNIDKAVEVASLFTKNDDSVNGIKDLHLVEASWFI CC ATOM DRFINCKTVKYFLRANNIDKAVEVASLFTK---SVNGIKDLHLVEASWFI CC ****************************** ***************** CC SEQRES VEQAEAYYRLYLDRKKKLDDLASLKKEVESDKSEQIANDIKENQWLVRKY CC ATOM VEQAEAYYRLYLDRKKKLDDLA------------QIANDIKENQWLVRKY CC ********************** **************** CC SEQRES KGLALKRFNAIPKFYKQFEDDQLDFHSYCMRKGTPRAYLEMLEWGKALYT CC ATOM KGLALKRFNAIPKFYKQFEDDQLDFHSYCMRKGTPRAYLEMLEWGKALYT CC ************************************************** CC SEQRES KPMYVRAMKEASKLYFQMHDDRLKRKSDSLDENSDEIQNNGQNSSSQKKK CC ATOM KPMYVRAMKEASKLYFQMHDDRLK-------------------------- CC ************************ CC SEQRES AKKEAAAMNKRKETEAKSVAAYPSDQDNDVFGEKLIETSTPMEDFATEFY CC ATOM --------NKRKETEAKSVAAYPSDQDNDVFGEKLIETSTPMEDFATEFY CC ****************************************** CC SEQRES NNYSMQVREDERDYILDFEFNYRIGKLALCFASLNKFAKRFGTTSGLFGS CC ATOM NNYSMQVREDERDYILDFEFNYRIGKLALCFASLNKFAKRFGTTSGLFGS CC ************************************************** CC SEQRES MAIVLLHATRNDTPFDPILKKVVTKSLEKEYSENFPLNEISNNSFDWLNF CC ATOM MAIVLLH-------FDPILKKVVTKSLEKEYSENFPLNEISNNSFDWLNF CC ******* ************************************ CC SEQRES YQEKFGKNDINGLLFLYRYRDDVPIGSSNLKEMIISSLSPLEPHSQNEIL CC ATOM YQEKFGKNDINGLLFLYRYRDDVP-GSSNLKEMIISSLSPLEPHSQNEIL CC ************************ ************************* CC SEQRES QYYL CC ATOM QYY- CC *** SQ SEQUENCE 854 AA; MW; CN; MSRKRSTKPK PAAKIALKKY NDQFLEALKL YEGKQYKKSL KLLDAILKKD GSHVDSLALK GLDLYSVGEK DDAASYVANA IRKIEGASAS PICCHVLGIY MRNTKEYKES IKWFTAALNN GSTNKQIYRD LATLQSQIGD FKNALVSRKK YWEAFLGYRA NWTSLAVAQD VNGERQQAIN TLSQFEKLAE GKISDSEKYE HSECLMYKND IMYKAASDNQ DKLQNVLKHL NDIEPCVFDK FGLLERKATI YMKLGQLKDA SIVYRTLIKR NPDNFKYYKL LEVSLGIQGD NKLKKALYGK LEQFYPRCEP PKFIPLTFLQ DKEELSKKLR EYVLPQLERG VPATFSNVKP LYQRRKSKVS PLLEKIVLDY LSGLDPTQDP IPFIWTNYYL SQHFLFLKDF PKAQEYIDAA LDHTPTLVEF YILKARILKH LGLMDTAAGI LEEGRQLDLQ DRFINCKTVK YFLRANNIDK AVEVASLFTK NDDSVNGIKD LHLVEASWFI VEQAEAYYRL YLDRKKKLDD LASLKKEVES DKSEQIANDI KENQWLVRKY KGLALKRFNA IPKFYKQFED DQLDFHSYCM RKGTPRAYLE MLEWGKALYT KPMYVRAMKE ASKLYFQMHD DRLKRKSDSL DENSDEIQNN GQNSSSQKKK AKKEAAAMNK RKETEAKSVA AYPSDQDNDV FGEKLIETST PMEDFATEFY NNYSMQVRED ERDYILDFEF NYRIGKLALC FASLNKFAKR FGTTSGLFGS MAIVLLHATR NDTPFDPILK KVVTKSLEKE YSENFPLNEI SNNSFDWLNF YQEKFGKNDI NGLLFLYRYR DDVPIGSSNL KEMIISSLSP LEPHSQNEIL QYYL // ID 4Y49N STANDARD; PRT; 238 AA. DT CONVERTED FROM PDB (SEQRES) 4Y49 DE N-terminal acetyltransferase A complex catalytic subunit ARD1 OS Saccharomyces cerevisiae CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.950 CC R-Factor 0.358 FT #SUB 2 2 PRO N 313 324 PHE M Protein S 5 FT #SUB 2 2 PRO N 346 357 SER M Protein A 2 FT #SUB 2 2 PRO N 347 358 ASN M Protein A 7 FT #SUB 2 2 PRO N 350 361 PRO M Protein S 2 FT #SUB 4 4 ASN N 274 285 ASN M Protein S 3 FT #SUB 4 4 ASN N 275 286 PHE M Protein S 1 FT #SUB 5 5 ILE N 271 282 ASN M Protein B 1 FT #SUB 5 5 ILE N 273 284 ASP M Protein A 3 FT #SUB 5 5 ILE N 274 285 ASN M Protein B 5 FT #SUB 9 9 THR N 590 601 GLU M Protein S 1 FT #SUB 10 10 ILE N 590 601 GLU M Protein S 3 FT #SUB 10 10 ILE N 594 605 TRP M Protein S 2 FT #SUB 11 11 ASN N 594 605 TRP M Protein B 1 FT #SUB 21 21 LEU N 452 463 ARG M Protein B 3 FT #SUB 22 22 HIS N 452 463 ARG M Protein B 3 FT #SUB 22 22 HIS N 456 467 CYS M Protein B 2 FT #SUB 22 22 HIS N 498 509 TRP M Protein S 1 FT #SUB 23 23 ASN N 452 463 ARG M Protein B 1 FT #SUB 24 24 LEU N 452 463 ARG M Protein B 5 FT #SUB 25 25 PRO N 452 463 ARG M Protein B 5 FT #SUB 27 27 ASN N 494 505 VAL M Protein S 2 FT #SUB 30 30 MET N 495 506 GLU M Protein A 5 FT #SUB 30 30 MET N 568 579 PHE M Protein S 10 FT #SUB 30 30 MET N 594 605 TRP M Protein S 2 FT #SUB 38 38 LEU N 584 595 THR M Protein B 1 FT #SUB 39 39 SER N 584 595 THR M Protein B 1 FT #SUB 40 40 TRP N 199 210 TYR M Protein A 2 FT #SUB 41 41 PRO N 587 598 ALA M Protein S 1 FT #SUB 42 42 GLU N 203 214 GLU M Protein S 1 FT #SUB 50 50 THR N 275 286 PHE M Protein S 2 FT #SUB 53 53 ASP N 354 365 ARG M Protein B 4 FT #SUB 122 122 ARG N 453 464 PHE M Protein S 2 FT #SUB 127 127 MET N 343 354 ALA M Protein S 2 FT #SUB 136 136 GLN N 271 282 ASN M Protein S 1 FT #SUB 143 143 GLU N 238 249 PHE M Protein A 4 FT #SUB 143 143 GLU N 240 251 LYS M Protein S 6 FT #SUB 144 144 VAL N 238 249 PHE M Protein A 7 FT #SUB 201 201 ILE N 270 281 ARG M Protein A 4 FT #SUB 202 202 SER N 269 280 LYS M Protein S 1 FT #SUB 204 204 PHE N 306 317 PRO M Protein B 1 FT #SUB 24 24 LEU N 1 1 SER Q Protein S 1 FT #SUB 26 26 GLU N 1 1 SER Q Protein S 4 FT #SUB 26 26 GLU N 2 2 TYR Q Protein S 1 FT #SUB 28 28 TYR N 2 2 TYR Q Protein S 4 FT #SUB 28 28 TYR N 4 4 MET Q Protein S 1 FT #SUB 116 116 THR N 1 1 SER Q Protein B 3 FT #SUB 116 116 THR N 2 2 TYR Q Protein A 4 FT #SUB 117 117 SER N 1 1 SER Q Protein A 2 FT #SUB 153 153 HIS N 1 1 SER Q Protein B 1 FT #SUB 180 180 TYR N 3 3 SER Q Protein B 1 FT #SUB 180 180 TYR N 6 6 HIS Q Protein B 3 FT #SUB 181 181 TYR N 1 1 SER Q Protein S 2 FT #HET 22 22 HIS N 7 901 G4P M S 1 FT #HET 24 24 LEU N 8 301 CMC N S 2 FT #HET 116 116 THR N 8 301 CMC N B 1 FT #HET 117 117 SER N 8 301 CMC N B 2 FT #HET 118 118 LEU N 8 301 CMC N A 13 FT #HET 119 119 SER N 8 301 CMC N A 5 FT #HET 120 120 VAL N 8 301 CMC N A 6 FT #HET 125 125 ARG N 8 301 CMC N A 11 FT #HET 126 126 ARG N 8 301 CMC N B 7 FT #HET 127 127 MET N 8 301 CMC N B 1 FT #HET 128 128 GLY N 8 301 CMC N B 8 FT #HET 130 130 ALA N 8 301 CMC N A 26 FT #HET 131 131 GLU N 8 301 CMC N A 5 FT #HET 154 154 VAL N 8 301 CMC N A 3 FT #HET 160 160 ALA N 8 301 CMC N S 5 FT #HET 161 161 ALA N 8 301 CMC N S 3 FT #HET 163 163 HIS N 8 301 CMC N S 1 FT #HET 164 164 LEU N 8 301 CMC N S 3 FT #HET 165 165 TYR N 8 301 CMC N S 1 FT #HET 168 168 THR N 8 301 CMC N S 1 FT DISORDER 1 1 FT DISORDER 54 88 FT DISORDER 105 108 FT DISORDER 205 238 CC SEQUENCE 164 AA (ATOM); CC PINIRRATIN DIICMQNANL HNLPENYMMK YYMYHILSWP EASFVATTTT LDGEKLVGYV CC LVKMNDDPEP PNGHITSLSV MRTYRRMGIA ENLMRQALFA LREVHQAEYV SLHVRQSNRA CC ALHLYRDTLA FEVLSIEKSY YQDGEDAYAM KKVLKLEELQ ISNF CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MPINIRRATINDIICMQNANLHNLPENYMMKYYMYHILSWPEASFVATTT CC ATOM -PINIRRATINDIICMQNANLHNLPENYMMKYYMYHILSWPEASFVATTT CC ************************************************* CC SEQRES TLDCEDSDEQDENDKLELTLDGTNDGRTIKLDPTYLAPGEKLVGYVLVKM CC ATOM TLD-----------------------------------GEKLVGYVLVKM CC *** ************ CC SEQRES NDDPDQQNEPPNGHITSLSVMRTYRRMGIAENLMRQALFALREVHQAEYV CC ATOM NDDP----EPPNGHITSLSVMRTYRRMGIAENLMRQALFALREVHQAEYV CC **** ****************************************** CC SEQRES SLHVRQSNRAALHLYRDTLAFEVLSIEKSYYQDGEDAYAMKKVLKLEELQ CC ATOM SLHVRQSNRAALHLYRDTLAFEVLSIEKSYYQDGEDAYAMKKVLKLEELQ CC ************************************************** CC SEQRES ISNFTHRRLKENEEKLEDDLESDLLEDIIKQGVNDIIV CC ATOM ISNF---------------------------------- CC **** SQ SEQUENCE 238 AA; MW; CN; MPINIRRATI NDIICMQNAN LHNLPENYMM KYYMYHILSW PEASFVATTT TLDCEDSDEQ DENDKLELTL DGTNDGRTIK LDPTYLAPGE KLVGYVLVKM NDDPDQQNEP PNGHITSLSV MRTYRRMGIA ENLMRQALFA LREVHQAEYV SLHVRQSNRA ALHLYRDTLA FEVLSIEKSY YQDGEDAYAM KKVLKLEELQ ISNFTHRRLK ENEEKLEDDL ESDLLEDIIK QGVNDIIV // ID 4Y49O STANDARD; PRT; 176 AA. DT CONVERTED FROM PDB (SEQRES) 4Y49 DE N-terminal acetyltransferase A complex subunit NAT5 OS Saccharomyces cerevisiae CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.950 CC R-Factor 0.358 FT #SUB 14 14 ASN O 448 459 ASP M Protein B 1 FT #SUB 14 14 ASN O 449 460 LEU M Protein B 1 FT #SUB 15 15 ASN O 447 458 LEU M Protein B 1 FT #SUB 17 17 GLY O 446 457 GLN M Protein B 1 FT #SUB 18 18 MET O 416 427 THR M Protein S 1 FT #SUB 18 18 MET O 447 458 LEU M Protein A 2 FT #SUB 70 70 PRO O 416 427 THR M Protein B 1 FT #SUB 99 99 LEU O 413 424 HIS M Protein S 1 FT #SUB 101 101 ASN O 376 387 PRO M Protein A 2 FT #SUB 101 101 ASN O 413 424 HIS M Protein A 3 FT #HET 26 26 THR O 9 201 ACO O B 3 FT #HET 93 93 ILE O 9 201 ACO O B 1 FT #HET 94 94 GLU O 9 201 ACO O B 4 FT #HET 95 95 PHE O 9 201 ACO O B 5 FT #HET 96 96 LEU O 9 201 ACO O B 7 FT #HET 97 97 GLY O 9 201 ACO O B 2 FT #HET 98 98 VAL O 9 201 ACO O B 3 FT #HET 104 104 HIS O 9 201 ACO O A 19 FT #HET 105 105 LYS O 9 201 ACO O B 11 FT #HET 106 106 SER O 9 201 ACO O A 6 FT #HET 108 108 GLY O 9 201 ACO O B 7 FT #HET 109 109 SER O 9 201 ACO O A 6 FT #HET 141 141 TRP O 9 201 ACO O S 7 FT DISORDER 1 2 FT DISORDER 42 55 FT DISORDER 82 84 FT DISORDER 102 103 FT DISORDER 176 176 CC SEQUENCE 154 AA (ATOM); CC RDICTLDNVY ANNLGMLTKL AHVTVPNLYQ DAFFSALFAK DVHFTQMAYY SEIPVGGLVA CC KLVPKELSLK GIQIEFLGVL PNHKSIGSKL LKFAEDKCSE CHQHNVFVYL PAVDDLTKQW CC FIAHGFEQVG ETVNNFIKGV NGDEQDAILL KKHI CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MGRDICTLDNVYANNLGMLTKLAHVTVPNLYQDAFFSALFAEDSLVAKNK CC ATOM --RDICTLDNVYANNLGMLTKLAHVTVPNLYQDAFFSALFA--------- CC *************************************** CC SEQRES KPSSKKDVHFTQMAYYSEIPVGGLVAKLVPKKQNELSLKGIQIEFLGVLP CC ATOM -----KDVHFTQMAYYSEIPVGGLVAKLVPK---ELSLKGIQIEFLGVLP CC ************************** **************** CC SEQRES NYRHKSIGSKLLKFAEDKCSECHQHNVFVYLPAVDDLTKQWFIAHGFEQV CC ATOM N--HKSIGSKLLKFAEDKCSECHQHNVFVYLPAVDDLTKQWFIAHGFEQV CC * *********************************************** CC SEQRES GETVNNFIKGVNGDEQDAILLKKHIS CC ATOM GETVNNFIKGVNGDEQDAILLKKHI- CC ************************* SQ SEQUENCE 176 AA; MW; CN; MGRDICTLDN VYANNLGMLT KLAHVTVPNL YQDAFFSALF AEDSLVAKNK KPSSKKDVHF TQMAYYSEIP VGGLVAKLVP KKQNELSLKG IQIEFLGVLP NYRHKSIGSK LLKFAEDKCS ECHQHNVFVY LPAVDDLTKQ WFIAHGFEQV GETVNNFIKG VNGDEQDAIL LKKHIS // ID 4Y49Q STANDARD; PRT; 8 AA. DT CONVERTED FROM PDB (SEQRES) 4Y49 DE ALA-ALA-ALA-ALA-ALA-ALA OS syntetic CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.950 CC R-Factor 0.358 FT #SUB 1 1 SER Q 24 24 LEU N Protein S 1 FT #SUB 1 1 SER Q 26 26 GLU N Protein A 4 FT #SUB 1 1 SER Q 116 116 THR N Protein B 3 FT #SUB 1 1 SER Q 117 117 SER N Protein B 2 FT #SUB 1 1 SER Q 153 153 HIS N Protein B 1 FT #SUB 1 1 SER Q 181 181 TYR N Protein B 2 FT #SUB 2 2 TYR Q 26 26 GLU N Protein B 1 FT #SUB 2 2 TYR Q 28 28 TYR N Protein B 4 FT #SUB 2 2 TYR Q 116 116 THR N Protein A 4 FT #SUB 3 3 SER Q 180 180 TYR N Protein S 1 FT #SUB 4 4 MET Q 28 28 TYR N Protein S 1 FT #SUB 6 6 HIS Q 180 180 TYR N Protein B 3 FT #HET 1 1 SER Q 8 301 CMC N B 4 FT DISORDER 7 8 CC SEQUENCE 6 AA (ATOM); CC SYSMEH CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SYSMEHFR CC ATOM SYSMEH-- CC ****** SQ SEQUENCE 8 AA; MW; CN; SYSMEHFR //