ID 2AWZA STANDARD; PRT; 580 AA.
DT CONVERTED FROM PDB (SEQRES) 2AWZ
DE Genome polyprotein
OS Hepatitis C virus
CC EXPDTA X-RAY DIFFRACTION
CC RESOLU 2.150
CC R-Factor 0.222
FT #SUB 10 10 LEU A 151 151 LYS B Protein S 3
FT #SUB 46 46 GLY A 151 151 LYS B Protein B 4
FT #SUB 47 47 LEU A 151 151 LYS B Protein A 2
FT #SUB 47 47 LEU A 152 152 GLY B Protein S 2
FT #SUB 47 47 LEU A 153 153 GLY B Protein S 2
FT #SUB 50 50 LYS A 148 148 GLN B Protein S 3
FT #SUB 50 50 LYS A 152 152 GLY B Protein S 4
FT #SUB 150 150 GLU A 10 10 LEU B Protein B 2
FT #SUB 150 150 GLU A 46 46 GLY B Protein B 1
FT #SUB 151 151 LYS A 46 46 GLY B Protein B 5
FT #SUB 151 151 LYS A 47 47 LEU B Protein A 6
FT #SUB 152 152 GLY A 43 43 ARG B Protein B 4
FT #SUB 152 152 GLY A 44 44 SER B Protein B 5
FT #SUB 152 152 GLY A 46 46 GLY B Protein B 3
FT #SUB 152 152 GLY A 47 47 LEU B Protein B 4
FT #SUB 153 153 GLY A 43 43 ARG B Protein B 1
FT #SUB 337 337 ARG A 55 55 ASP B Protein S 2
FT #SUB 351 351 GLY A 50 50 LYS B Protein B 1
FT #SUB 352 352 ASP A 50 50 LYS B Protein S 1
FT #HET 1 1 SER A 1 901 SO4 A A 5
FT #HET 48 48 ARG A 2 902 SO4 A S 7
FT #HET 51 51 LYS A 2 902 SO4 A S 2
FT #HET 193 193 PHE A 5 801 5H A S 1
FT #HET 197 197 PRO A 5 801 5H A A 2
FT #HET 200 200 ARG A 5 801 5H A S 1
FT #HET 221 221 THR A 2 902 SO4 A B 1
FT #HET 222 222 ARG A 2 902 SO4 A B 2
FT #HET 223 223 CYS A 2 902 SO4 A A 6
FT #HET 368 368 SER A 5 801 5H A S 2
FT #HET 384 384 LEU A 5 801 5H A S 2
FT #HET 410 410 GLY A 5 801 5H A B 2
FT #HET 411 411 ASN A 5 801 5H A S 2
FT #HET 414 414 MET A 5 801 5H A S 11
FT #HET 415 415 TYR A 5 801 5H A S 7
FT #HET 446 446 GLN A 5 801 5H A B 2
FT #HET 447 447 ILE A 5 801 5H A B 3
FT #HET 448 448 TYR A 5 801 5H A A 11
FT #HET 449 449 GLY A 5 801 5H A B 1
FT #HET 505 505 ARG A 3 903 SO4 A S 7
FT #HET 510 510 ARG A 4 904 SO4 A S 11
FT #HET 514 514 ARG A 4 904 SO4 A S 3
FT #HET 530 530 VAL A 3 903 SO4 A A 4
FT #HET 531 531 LYS A 3 903 SO4 A A 9
FT #HET 532 532 THR A 3 903 SO4 A A 7
FT #HET 556 556 SER A 5 801 5H A S 1
FT #MOD 366 366 CYS A 5 801 5H A S
FT DISORDER 541 545
FT DISORDER 563 580
CC SEQUENCE 557 AA (ATOM);
CC SMSYTWTGAL ITPCAAEESK LPINALSNSL LRHHNMVYAT TSRSAGLRQK KVTFDRLQVL
CC DDHYRDVLKE MKAKASTVKA KLLSVEEACK LTPPHSAKSK FGYGAKDVRN LSSKAVNHIH
CC SVWKDLLEDT VTPIDTTIMA KNEVFCVQPE KGGRKPARLI VFPDLGVRVC EKMALYDVVS
CC TLPQVVMGSS YGFQYSPGQR VEFLVNTWKS KKNPMGFSYD TRCFDSTVTE NDIRVEESIY
CC QCCDLAPEAR QAIKSLTERL YIGGPLTNSK GQNCGYRRCR ASGVLTTSCG NTLTCYLKAS
CC AACRAAKLQD CTMLVNGDDL VVICESAGTQ EDAASLRVFT EAMTRYSAPP GDPPQPEYDL
CC ELITSCSSNV SVAHDASGKR VYYLTRDPTT PLARAAWETA RHTPVNSWLG NIIMYAPTLW
CC ARMILMTHFF SILLAQEQLE KALDCQIYGA CYSIEPLDLP QIIERLHGLS AFSLHSYSPG
CC EINRVASCLR KLGVPPLRAW RHRARNVRAR LLSRGGRAAI CGKYLFNWAV KTKLKLTPIA
CC DLSSWFTAGY SGGDIYH
CC !----
CC ALIGNMENT OF SEQRES AND ATOMRES
CC SEQRES SMSYTWTGALITPCAAEESKLPINALSNSLLRHHNMVYATTSRSAGLRQK
CC ATOM   SMSYTWTGALITPCAAEESKLPINALSNSLLRHHNMVYATTSRSAGLRQK
CC        **************************************************
CC SEQRES KVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSAKSK
CC ATOM   KVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSAKSK
CC        **************************************************
CC SEQRES FGYGAKDVRNLSSKAVNHIHSVWKDLLEDTVTPIDTTIMAKNEVFCVQPE
CC ATOM   FGYGAKDVRNLSSKAVNHIHSVWKDLLEDTVTPIDTTIMAKNEVFCVQPE
CC        **************************************************
CC SEQRES KGGRKPARLIVFPDLGVRVCEKMALYDVVSTLPQVVMGSSYGFQYSPGQR
CC ATOM   KGGRKPARLIVFPDLGVRVCEKMALYDVVSTLPQVVMGSSYGFQYSPGQR
CC        **************************************************
CC SEQRES VEFLVNTWKSKKNPMGFSYDTRCFDSTVTENDIRVEESIYQCCDLAPEAR
CC ATOM   VEFLVNTWKSKKNPMGFSYDTRCFDSTVTENDIRVEESIYQCCDLAPEAR
CC        **************************************************
CC SEQRES QAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAS
CC ATOM   QAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAS
CC        **************************************************
CC SEQRES AACRAAKLQDCTMLVNGDDLVVICESAGTQEDAASLRVFTEAMTRYSAPP
CC ATOM   AACRAAKLQDCTMLVNGDDLVVICESAGTQEDAASLRVFTEAMTRYSAPP
CC        **************************************************
CC SEQRES GDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA
CC ATOM   GDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA
CC        **************************************************
CC SEQRES RHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQIYGA
CC ATOM   RHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQIYGA
CC        **************************************************
CC SEQRES CYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLRAW
CC ATOM   CYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLRAW
CC        **************************************************
CC SEQRES RHRARNVRARLLSRGGRAAICGKYLFNWAVKTKLKLTPIAAAGRLDLSSW
CC ATOM   RHRARNVRARLLSRGGRAAICGKYLFNWAVKTKLKLTPIA-----DLSSW
CC        ****************************************     *****
CC SEQRES FTAGYSGGDIYHGVSHARPRHHHHHHHHHH
CC ATOM   FTAGYSGGDIYH------------------
CC        ************
SQ SEQUENCE 580 AA; MW; CN;
   SMSYTWTGAL ITPCAAEESK LPINALSNSL LRHHNMVYAT TSRSAGLRQK KVTFDRLQVL
   DDHYRDVLKE MKAKASTVKA KLLSVEEACK LTPPHSAKSK FGYGAKDVRN LSSKAVNHIH
   SVWKDLLEDT VTPIDTTIMA KNEVFCVQPE KGGRKPARLI VFPDLGVRVC EKMALYDVVS
   TLPQVVMGSS YGFQYSPGQR VEFLVNTWKS KKNPMGFSYD TRCFDSTVTE NDIRVEESIY
   QCCDLAPEAR QAIKSLTERL YIGGPLTNSK GQNCGYRRCR ASGVLTTSCG NTLTCYLKAS
   AACRAAKLQD CTMLVNGDDL VVICESAGTQ EDAASLRVFT EAMTRYSAPP GDPPQPEYDL
   ELITSCSSNV SVAHDASGKR VYYLTRDPTT PLARAAWETA RHTPVNSWLG NIIMYAPTLW
   ARMILMTHFF SILLAQEQLE KALDCQIYGA CYSIEPLDLP QIIERLHGLS AFSLHSYSPG
   EINRVASCLR KLGVPPLRAW RHRARNVRAR LLSRGGRAAI CGKYLFNWAV KTKLKLTPIA
   AAGRLDLSSW FTAGYSGGDI YHGVSHARPR HHHHHHHHHH
//
ID 2AWZB STANDARD; PRT; 580 AA.
DT CONVERTED FROM PDB (SEQRES) 2AWZ
DE Genome polyprotein
OS Hepatitis C virus
CC EXPDTA X-RAY DIFFRACTION
CC RESOLU 2.150
CC R-Factor 0.222
FT #SUB 10 10 LEU B 150 150 GLU A Protein S 2
FT #SUB 43 43 ARG B 152 152 GLY A Protein B 4
FT #SUB 43 43 ARG B 153 153 GLY A Protein B 1
FT #SUB 44 44 SER B 152 152 GLY A Protein A 5
FT #SUB 46 46 GLY B 150 150 GLU A Protein B 1
FT #SUB 46 46 GLY B 151 151 LYS A Protein B 5
FT #SUB 46 46 GLY B 152 152 GLY A Protein B 3
FT #SUB 47 47 LEU B 151 151 LYS A Protein A 6
FT #SUB 47 47 LEU B 152 152 GLY A Protein A 4
FT #SUB 50 50 LYS B 351 351 GLY A Protein S 1
FT #SUB 50 50 LYS B 352 352 ASP A Protein S 1
FT #SUB 55 55 ASP B 337 337 ARG A Protein S 2
FT #SUB 148 148 GLN B 50 50 LYS A Protein S 3
FT #SUB 151 151 LYS B 10 10 LEU A Protein S 3
FT #SUB 151 151 LYS B 46 46 GLY A Protein A 4
FT #SUB 151 151 LYS B 47 47 LEU A Protein B 2
FT #SUB 152 152 GLY B 47 47 LEU A Protein B 2
FT #SUB 152 152 GLY B 50 50 LYS A Protein B 4
FT #SUB 153 153 GLY B 47 47 LEU A Protein B 2
FT #HET 48 48 ARG B 6 905 SO4 B S 8
FT #HET 51 51 LYS B 6 905 SO4 B S 4
FT #HET 193 193 PHE B 8 802 5H B S 1
FT #HET 197 197 PRO B 8 802 5H B A 2
FT #HET 200 200 ARG B 8 802 5H B S 1
FT #HET 221 221 THR B 6 905 SO4 B B 1
FT #HET 222 222 ARG B 6 905 SO4 B B 2
FT #HET 223 223 CYS B 6 905 SO4 B A 6
FT #HET 316 316 ASN B 8 802 5H B S 1
FT #HET 368 368 SER B 8 802 5H B S 2
FT #HET 384 384 LEU B 8 802 5H B S 2
FT #HET 410 410 GLY B 8 802 5H B B 3
FT #HET 411 411 ASN B 8 802 5H B A 3
FT #HET 414 414 MET B 8 802 5H B S 8
FT #HET 415 415 TYR B 8 802 5H B S 7
FT #HET 446 446 GLN B 8 802 5H B B 1
FT #HET 447 447 ILE B 8 802 5H B B 2
FT #HET 448 448 TYR B 8 802 5H B A 10
FT #HET 505 505 ARG B 7 906 SO4 B S 9
FT #HET 530 530 VAL B 7 906 SO4 B A 4
FT #HET 531 531 LYS B 7 906 SO4 B A 10
FT #HET 532 532 THR B 7 906 SO4 B A 4
FT #HET 556 556 SER B 8 802 5H B S 1
FT #MOD 366 366 CYS B 8 802 5H B S
FT DISORDER 563 580
CC SEQUENCE 562 AA (ATOM);
CC SMSYTWTGAL ITPCAAEESK LPINALSNSL LRHHNMVYAT TSRSAGLRQK KVTFDRLQVL
CC DDHYRDVLKE MKAKASTVKA KLLSVEEACK LTPPHSAKSK FGYGAKDVRN LSSKAVNHIH
CC SVWKDLLEDT VTPIDTTIMA KNEVFCVQPE KGGRKPARLI VFPDLGVRVC EKMALYDVVS
CC TLPQVVMGSS YGFQYSPGQR VEFLVNTWKS KKNPMGFSYD TRCFDSTVTE NDIRVEESIY
CC QCCDLAPEAR QAIKSLTERL YIGGPLTNSK GQNCGYRRCR ASGVLTTSCG NTLTCYLKAS
CC AACRAAKLQD CTMLVNGDDL VVICESAGTQ EDAASLRVFT EAMTRYSAPP GDPPQPEYDL
CC ELITSCSSNV SVAHDASGKR VYYLTRDPTT PLARAAWETA RHTPVNSWLG NIIMYAPTLW
CC ARMILMTHFF SILLAQEQLE KALDCQIYGA CYSIEPLDLP QIIERLHGLS AFSLHSYSPG
CC EINRVASCLR KLGVPPLRAW RHRARNVRAR LLSRGGRAAI CGKYLFNWAV KTKLKLTPIA
CC AAGRLDLSSW FTAGYSGGDI YH
CC !----
CC ALIGNMENT OF SEQRES AND ATOMRES
CC SEQRES SMSYTWTGALITPCAAEESKLPINALSNSLLRHHNMVYATTSRSAGLRQK
CC ATOM   SMSYTWTGALITPCAAEESKLPINALSNSLLRHHNMVYATTSRSAGLRQK
CC        **************************************************
CC SEQRES KVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSAKSK
CC ATOM   KVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSAKSK
CC        **************************************************
CC SEQRES FGYGAKDVRNLSSKAVNHIHSVWKDLLEDTVTPIDTTIMAKNEVFCVQPE
CC ATOM   FGYGAKDVRNLSSKAVNHIHSVWKDLLEDTVTPIDTTIMAKNEVFCVQPE
CC        **************************************************
CC SEQRES KGGRKPARLIVFPDLGVRVCEKMALYDVVSTLPQVVMGSSYGFQYSPGQR
CC ATOM   KGGRKPARLIVFPDLGVRVCEKMALYDVVSTLPQVVMGSSYGFQYSPGQR
CC        **************************************************
CC SEQRES VEFLVNTWKSKKNPMGFSYDTRCFDSTVTENDIRVEESIYQCCDLAPEAR
CC ATOM   VEFLVNTWKSKKNPMGFSYDTRCFDSTVTENDIRVEESIYQCCDLAPEAR
CC        **************************************************
CC SEQRES QAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAS
CC ATOM   QAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAS
CC        **************************************************
CC SEQRES AACRAAKLQDCTMLVNGDDLVVICESAGTQEDAASLRVFTEAMTRYSAPP
CC ATOM   AACRAAKLQDCTMLVNGDDLVVICESAGTQEDAASLRVFTEAMTRYSAPP
CC        **************************************************
CC SEQRES GDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA
CC ATOM   GDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA
CC        **************************************************
CC SEQRES RHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQIYGA
CC ATOM   RHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQIYGA
CC        **************************************************
CC SEQRES CYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLRAW
CC ATOM   CYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLRAW
CC        **************************************************
CC SEQRES RHRARNVRARLLSRGGRAAICGKYLFNWAVKTKLKLTPIAAAGRLDLSSW
CC ATOM   RHRARNVRARLLSRGGRAAICGKYLFNWAVKTKLKLTPIAAAGRLDLSSW
CC        **************************************************
CC SEQRES FTAGYSGGDIYHGVSHARPRHHHHHHHHHH
CC ATOM   FTAGYSGGDIYH------------------
CC        ************
SQ SEQUENCE 580 AA; MW; CN;
   SMSYTWTGAL ITPCAAEESK LPINALSNSL LRHHNMVYAT TSRSAGLRQK KVTFDRLQVL
   DDHYRDVLKE MKAKASTVKA KLLSVEEACK LTPPHSAKSK FGYGAKDVRN LSSKAVNHIH
   SVWKDLLEDT VTPIDTTIMA KNEVFCVQPE KGGRKPARLI VFPDLGVRVC EKMALYDVVS
   TLPQVVMGSS YGFQYSPGQR VEFLVNTWKS KKNPMGFSYD TRCFDSTVTE NDIRVEESIY
   QCCDLAPEAR QAIKSLTERL YIGGPLTNSK GQNCGYRRCR ASGVLTTSCG NTLTCYLKAS
   AACRAAKLQD CTMLVNGDDL VVICESAGTQ EDAASLRVFT EAMTRYSAPP GDPPQPEYDL
   ELITSCSSNV SVAHDASGKR VYYLTRDPTT PLARAAWETA RHTPVNSWLG NIIMYAPTLW
   ARMILMTHFF SILLAQEQLE KALDCQIYGA CYSIEPLDLP QIIERLHGLS AFSLHSYSPG
   EINRVASCLR KLGVPPLRAW RHRARNVRAR LLSRGGRAAI CGKYLFNWAV KTKLKLTPIA
   AAGRLDLSSW FTAGYSGGDI YHGVSHARPR HHHHHHHHHH
//