ID 1AROP STANDARD; PRT; 883 AA. DT CONVERTED FROM PDB (SEQRES) 1ARO DE T7 RNA POLYMERASE OS Enterobacteria phage T7 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.800 CC R-Factor 0.262 FT #SUB 307 307 ARG P 41 1040 GLY L Protein S 3 FT #SUB 307 307 ARG P 43 1042 LEU L Protein B 1 FT #SUB 309 309 GLU P 7 1006 LYS L Protein B 2 FT #SUB 310 310 ASP P 6 1005 PHE L Protein B 4 FT #SUB 310 310 ASP P 7 1006 LYS L Protein B 3 FT #SUB 310 310 ASP P 66 1065 VAL L Protein S 1 FT #SUB 311 311 VAL P 5 1004 GLN L Protein A 3 FT #SUB 311 311 VAL P 43 1042 LEU L Protein S 1 FT #SUB 312 312 TYR P 5 1004 GLN L Protein A 6 FT #SUB 312 312 TYR P 6 1005 PHE L Protein S 1 FT #SUB 312 312 TYR P 7 1006 LYS L Protein S 1 FT #SUB 717 717 GLU P 25 1024 SER L Protein S 1 FT #SUB 717 717 GLU P 26 1025 GLN L Protein S 2 FT #SUB 717 717 GLU P 27 1026 ASN L Protein S 2 FT #SUB 720 720 ARG P 35 1034 GLN L Protein S 6 FT #SUB 720 720 ARG P 38 1037 LYS L Protein S 1 FT #SUB 721 721 LYS P 39 1038 GLU L Protein A 9 FT #SUB 723 723 SER P 39 1038 GLU L Protein S 1 FT #SUB 724 724 ALA P 38 1037 LYS L Protein B 1 FT #SUB 726 726 HIS P 43 1042 LEU L Protein S 1 FT #SUB 728 728 VAL P 3 1002 ARG L Protein B 1 FT #SUB 728 728 VAL P 4 1003 VAL L Protein S 2 FT #SUB 730 730 PRO P 3 1002 ARG L Protein B 2 FT #SUB 736 736 TRP P 38 1037 LYS L Protein S 2 FT #SUB 736 736 TRP P 41 1040 GLY L Protein S 1 FT #SUB 736 736 TRP P 42 1041 TRP L Protein S 3 FT #SUB 736 736 TRP P 43 1042 LEU L Protein S 1 FT #SUB 844 844 ASP P 3 1002 ARG L Protein S 5 FT #SUB 848 848 GLN P 3 1002 ARG L Protein S 1 FT #SUB 850 850 ALA P 31 1030 ARG L Protein B 1 FT #SUB 851 851 ASP P 34 1033 ARG L Protein A 2 FT #SUB 852 852 GLN P 38 1037 LYS L Protein B 1 FT #SUB 853 853 LEU P 31 1030 ARG L Protein B 5 FT #SUB 855 855 GLU P 31 1030 ARG L Protein A 12 FT #SUB 855 855 GLU P 32 1031 GLU L Protein S 5 FT #HET 125 125 CYS P 1 904 HG P S 2 FT #HET 140 140 ALA P 1 904 HG P S 1 FT #HET 313 313 MET P 4 907 HG P S 4 FT #HET 317 317 TYR P 4 907 HG P S 1 FT #HET 397 397 SER P 5 908 HG P B 1 FT #HET 401 401 MET P 5 908 HG P A 6 FT #HET 463 463 HIS P 3 906 HG P S 1 FT #HET 467 467 CYS P 3 906 HG P A 3 FT #HET 510 510 CYS P 3 906 HG P A 5 FT #HET 517 517 GLU P 2 905 HG P A 3 FT #HET 528 528 TYR P 2 905 HG P S 2 FT #HET 530 530 CYS P 2 905 HG P A 3 FT #HET 539 539 SER P 6 909 HG P B 1 FT #HET 540 540 CYS P 6 909 HG P A 3 FT #HET 571 571 TYR P 6 909 HG P S 1 FT #HET 635 635 MET P 6 909 HG P S 1 FT DISORDER 1 7 FT DISORDER 60 72 FT DISORDER 165 181 FT DISORDER 234 240 FT DISORDER 345 383 FT DISORDER 590 611 FT DISORDER 880 883 CC Miss-BB 5 CC Miss-SC 4 CC SEQUENCE 774 AA (ATOM); CC KNDFSDIELA AIPFNTLADH YGERLAREQL ALEHESYEMG EARFRKMFER QLLITTLLPK CC MIARINDWFE EVKAKRGKRP TAFQFLQEIK PEAVAYITIK TTLACLTSAD NTTVQAVASA CC IGRAIEDEAR FGRIRDLEAK HFKKFMQVVE ADMLSKGLLG GEAWSSWHKE DSIHVGVRCI CC EMLIESTGMV SLHRQNSETI ELAPEYAEAI ATRAGALAGI SPMFQPCVVP PKPWTGITGG CC GYWANGRRPL ALVRTHSKKA LMRYEDVYMP EVYKAINIAQ NTAWKINKKV LAVANVITKW CC VYRKDKARKS RRISLEFMLE QANKFANHKA IWFPYNMDWR GRVYAVSMFN PQGNDMTKGL CC LTLAKGKPIG KEGYYWLKIH GANCAGVDKV PFPERIKFIE ENHENIMACA KSPLENTWWA CC EQDSPFCFLA FCFEYAGVQH HGLSYNCSLP LAFDGSCSGI QHFSAMLRDE VGGRAVNLLP CC SETVQDIYGI VAKKVNEILQ ADAINGGTKA LAGQWLAYGV TRSVTKRSVM TLAYGSKEFG CC FRQQVLEDTI QPAIDSGKGL MFTQPNQAAG YMAKLIWESV SVTVVAAVEA MNWLKSAAKL CC LAAEVKDKKT GEILRKRSAV HWVTPDGFPV WQEYKKPIQT RLNLMFLGQF RLQPTINTNK CC DSEIDAHKQE SGIAPNFVHS QDGSHLRKTV VWAHEKYGIE SFALIHDSFG TIPADAANLF CC KAVRETMVDT YESSDVLADF YDQFADQLHE SQLDKMPALP AKGNLNLRDI LESD CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MNTINIAKNDFSDIELAAIPFNTLADHYGERLAREQLALEHESYEMGEAR CC ATOM -------KNDFSDIELAAIPFNTLADHYGERLAREQLALEHESYEMGEAR CC ******************************************* CC SEQRES FRKMFERQLKAGEVADNAAAKPLITTLLPKMIARINDWFEEVKAKRGKRP CC ATOM FRKMFERQL-------------LITTLLPKMIARINDWFEEVKAKRGKRP CC ********* **************************** CC SEQRES TAFQFLQEIKPEAVAYITIKTTLACLTSADNTTVQAVASAIGRAIEDEAR CC ATOM TAFQFLQEIKPEAVAYITIKTTLACLTSADNTTVQAVASAIGRAIEDEAR CC ************************************************** CC SEQRES FGRIRDLEAKHFKKNVEEQLNKRVGHVYKKAFMQVVEADMLSKGLLGGEA CC ATOM FGRIRDLEAKHFKK-----------------FMQVVEADMLSKGLLGGEA CC ************** ******************* CC SEQRES WSSWHKEDSIHVGVRCIEMLIESTGMVSLHRQNAGVVGQDSETIELAPEY CC ATOM WSSWHKEDSIHVGVRCIEMLIESTGMVSLHRQN-------SETIELAPEY CC ********************************* ********** CC SEQRES AEAIATRAGALAGISPMFQPCVVPPKPWTGITGGGYWANGRRPLALVRTH CC ATOM AEAIATRAGALAGISPMFQPCVVPPKPWTGITGGGYWANGRRPLALVRTH CC ************************************************** CC SEQRES SKKALMRYEDVYMPEVYKAINIAQNTAWKINKKVLAVANVITKWKHSPVE CC ATOM SKKALMRYEDVYMPEVYKAINIAQNTAWKINKKVLAVANVITKW------ CC ******************************************** CC SEQRES DIPAIEREELPMKPEDIDMNPEALTAWKRAAAAVYRKDKARKSRRISLEF CC ATOM ---------------------------------VYRKDKARKSRRISLEF CC ***************** CC SEQRES MLEQANKFANHKAIWFPYNMDWRGRVYAVSMFNPQGNDMTKGLLTLAKGK CC ATOM MLEQANKFANHKAIWFPYNMDWRGRVYAVSMFNPQGNDMTKGLLTLAKGK CC ************************************************** CC SEQRES PIGKEGYYWLKIHGANCAGVDKVPFPERIKFIEENHENIMACAKSPLENT CC ATOM PIGKEGYYWLKIHGANCAGVDKVPFPERIKFIEENHENIMACAKSPLENT CC ************************************************** CC SEQRES WWAEQDSPFCFLAFCFEYAGVQHHGLSYNCSLPLAFDGSCSGIQHFSAML CC ATOM WWAEQDSPFCFLAFCFEYAGVQHHGLSYNCSLPLAFDGSCSGIQHFSAML CC ************************************************** CC SEQRES RDEVGGRAVNLLPSETVQDIYGIVAKKVNEILQADAINGTDNEVVTVTDE CC ATOM RDEVGGRAVNLLPSETVQDIYGIVAKKVNEILQADAING----------- CC *************************************** CC SEQRES NTGEISEKVKLGTKALAGQWLAYGVTRSVTKRSVMTLAYGSKEFGFRQQV CC ATOM -----------GTKALAGQWLAYGVTRSVTKRSVMTLAYGSKEFGFRQQV CC *************************************** CC SEQRES LEDTIQPAIDSGKGLMFTQPNQAAGYMAKLIWESVSVTVVAAVEAMNWLK CC ATOM LEDTIQPAIDSGKGLMFTQPNQAAGYMAKLIWESVSVTVVAAVEAMNWLK CC ************************************************** CC SEQRES SAAKLLAAEVKDKKTGEILRKRSAVHWVTPDGFPVWQEYKKPIQTRLNLM CC ATOM SAAKLLAAEVKDKKTGEILRKRSAVHWVTPDGFPVWQEYKKPIQTRLNLM CC ************************************************** CC SEQRES FLGQFRLQPTINTNKDSEIDAHKQESGIAPNFVHSQDGSHLRKTVVWAHE CC ATOM FLGQFRLQPTINTNKDSEIDAHKQESGIAPNFVHSQDGSHLRKTVVWAHE CC ************************************************** CC SEQRES KYGIESFALIHDSFGTIPADAANLFKAVRETMVDTYESSDVLADFYDQFA CC ATOM KYGIESFALIHDSFGTIPADAANLFKAVRETMVDTYESSDVLADFYDQFA CC ************************************************** CC SEQRES DQLHESQLDKMPALPAKGNLNLRDILESDFAFA CC ATOM DQLHESQLDKMPALPAKGNLNLRDILESD---- CC ***************************** SQ SEQUENCE 883 AA; MW; CN; MNTINIAKND FSDIELAAIP FNTLADHYGE RLAREQLALE HESYEMGEAR FRKMFERQLK AGEVADNAAA KPLITTLLPK MIARINDWFE EVKAKRGKRP TAFQFLQEIK PEAVAYITIK TTLACLTSAD NTTVQAVASA IGRAIEDEAR FGRIRDLEAK HFKKNVEEQL NKRVGHVYKK AFMQVVEADM LSKGLLGGEA WSSWHKEDSI HVGVRCIEML IESTGMVSLH RQNAGVVGQD SETIELAPEY AEAIATRAGA LAGISPMFQP CVVPPKPWTG ITGGGYWANG RRPLALVRTH SKKALMRYED VYMPEVYKAI NIAQNTAWKI NKKVLAVANV ITKWKHSPVE DIPAIEREEL PMKPEDIDMN PEALTAWKRA AAAVYRKDKA RKSRRISLEF MLEQANKFAN HKAIWFPYNM DWRGRVYAVS MFNPQGNDMT KGLLTLAKGK PIGKEGYYWL KIHGANCAGV DKVPFPERIK FIEENHENIM ACAKSPLENT WWAEQDSPFC FLAFCFEYAG VQHHGLSYNC SLPLAFDGSC SGIQHFSAML RDEVGGRAVN LLPSETVQDI YGIVAKKVNE ILQADAINGT DNEVVTVTDE NTGEISEKVK LGTKALAGQW LAYGVTRSVT KRSVMTLAYG SKEFGFRQQV LEDTIQPAID SGKGLMFTQP NQAAGYMAKL IWESVSVTVV AAVEAMNWLK SAAKLLAAEV KDKKTGEILR KRSAVHWVTP DGFPVWQEYK KPIQTRLNLM FLGQFRLQPT INTNKDSEID AHKQESGIAP NFVHSQDGSH LRKTVVWAHE KYGIESFALI HDSFGTIPAD AANLFKAVRE TMVDTYESSD VLADFYDQFA DQLHESQLDK MPALPAKGNL NLRDILESDF AFA // ID 1AROL STANDARD; PRT; 151 AA. DT CONVERTED FROM PDB (SEQRES) 1ARO DE T7 LYSOZYME OS Enterobacteria phage T7 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.800 CC R-Factor 0.262 FT #SUB 3 1002 ARG L 728 728 VAL P Protein S 1 FT #SUB 3 1002 ARG L 730 730 PRO P Protein S 2 FT #SUB 3 1002 ARG L 844 844 ASP P Protein S 5 FT #SUB 3 1002 ARG L 848 848 GLN P Protein S 1 FT #SUB 4 1003 VAL L 728 728 VAL P Protein A 2 FT #SUB 5 1004 GLN L 311 311 VAL P Protein B 3 FT #SUB 5 1004 GLN L 312 312 TYR P Protein A 6 FT #SUB 6 1005 PHE L 310 310 ASP P Protein A 4 FT #SUB 6 1005 PHE L 312 312 TYR P Protein B 1 FT #SUB 7 1006 LYS L 309 309 GLU P Protein S 2 FT #SUB 7 1006 LYS L 310 310 ASP P Protein A 3 FT #SUB 7 1006 LYS L 312 312 TYR P Protein B 1 FT #SUB 25 1024 SER L 717 717 GLU P Protein B 1 FT #SUB 26 1025 GLN L 717 717 GLU P Protein B 2 FT #SUB 27 1026 ASN L 717 717 GLU P Protein B 2 FT #SUB 31 1030 ARG L 850 850 ALA P Protein S 1 FT #SUB 31 1030 ARG L 853 853 LEU P Protein S 5 FT #SUB 31 1030 ARG L 855 855 GLU P Protein S 12 FT #SUB 32 1031 GLU L 855 855 GLU P Protein S 5 FT #SUB 34 1033 ARG L 851 851 ASP P Protein S 2 FT #SUB 35 1034 GLN L 720 720 ARG P Protein S 6 FT #SUB 38 1037 LYS L 720 720 ARG P Protein S 1 FT #SUB 38 1037 LYS L 724 724 ALA P Protein S 1 FT #SUB 38 1037 LYS L 736 736 TRP P Protein B 2 FT #SUB 38 1037 LYS L 852 852 GLN P Protein S 1 FT #SUB 39 1038 GLU L 721 721 LYS P Protein S 9 FT #SUB 39 1038 GLU L 723 723 SER P Protein S 1 FT #SUB 41 1040 GLY L 307 307 ARG P Protein B 3 FT #SUB 41 1040 GLY L 736 736 TRP P Protein B 1 FT #SUB 42 1041 TRP L 736 736 TRP P Protein B 3 FT #SUB 43 1042 LEU L 307 307 ARG P Protein S 1 FT #SUB 43 1042 LEU L 311 311 VAL P Protein S 1 FT #SUB 43 1042 LEU L 726 726 HIS P Protein B 1 FT #SUB 43 1042 LEU L 736 736 TRP P Protein S 1 FT #SUB 66 1065 VAL L 310 310 ASP P Protein S 1 FT #HET 19 1018 CYS L 7 903 HG L A 6 FT #HET 20 1019 SER L 7 903 HG L B 1 FT #HET 131 1130 CYS L 7 903 HG L A 3 FT DISORDER 1 2 CC SEQUENCE 149 AA (ATOM); CC RVQFKQREST DAIFVHCSAT KPSQNVGVRE IRQWHKEQGW LDVGYHFIIK RDGTVEAGRD CC EMAVGSHAKG YNHNSIGVCL VGGIDDKGKF DANFTPAQMQ SLRSLLVTLL AKYEGAVLRA CC HHEVAPKACP SFDLKRWWEK NELVTSDRG CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MARVQFKQRESTDAIFVHCSATKPSQNVGVREIRQWHKEQGWLDVGYHFI CC ATOM --RVQFKQRESTDAIFVHCSATKPSQNVGVREIRQWHKEQGWLDVGYHFI CC ************************************************ CC SEQRES IKRDGTVEAGRDEMAVGSHAKGYNHNSIGVCLVGGIDDKGKFDANFTPAQ CC ATOM IKRDGTVEAGRDEMAVGSHAKGYNHNSIGVCLVGGIDDKGKFDANFTPAQ CC ************************************************** CC SEQRES MQSLRSLLVTLLAKYEGAVLRAHHEVAPKACPSFDLKRWWEKNELVTSDR CC ATOM MQSLRSLLVTLLAKYEGAVLRAHHEVAPKACPSFDLKRWWEKNELVTSDR CC ************************************************** CC SEQRES G CC ATOM G CC * SQ SEQUENCE 151 AA; MW; CN; MARVQFKQRE STDAIFVHCS ATKPSQNVGV REIRQWHKEQ GWLDVGYHFI IKRDGTVEAG RDEMAVGSHA KGYNHNSIGV CLVGGIDDKG KFDANFTPAQ MQSLRSLLVT LLAKYEGAVL RAHHEVAPKA CPSFDLKRWW EKNELVTSDR G //