ID 2DXSA STANDARD; PRT; 552 AA. DT CONVERTED FROM PDB (SEQRES) 2DXS DE Genome polyprotein OS Hepatitis C virus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.200 CC R-Factor 0.219 FT #SUB 86 86 GLU A 452 452 TYR B Protein S 6 FT #SUB 87 87 GLU A 176 176 TYR B Protein S 5 FT #SUB 90 90 LYS A 90 90 LYS B Protein A 5 FT #SUB 90 90 LYS A 91 91 LEU B Protein S 2 FT #SUB 91 91 LEU A 90 90 LYS B Protein S 2 FT #SUB 100 100 LYS A 543 543 SER B Protein B 1 FT #SUB 100 100 LYS A 544 544 GLN B Protein A 8 FT #SUB 101 101 PHE A 542 542 ALA B Protein A 4 FT #SUB 101 101 PHE A 543 543 SER B Protein S 3 FT #SUB 109 109 ARG A 109 109 ARG B Protein A 6 FT #SUB 109 109 ARG A 110 110 ASN B Protein S 3 FT #SUB 110 110 ASN A 109 109 ARG B Protein S 3 FT #SUB 110 110 ASN A 453 453 SER B Protein B 2 FT #SUB 111 111 LEU A 451 451 CYS B Protein S 3 FT #SUB 111 111 LEU A 452 452 TYR B Protein A 5 FT #SUB 111 111 LEU A 453 453 SER B Protein B 3 FT #SUB 112 112 SER A 453 453 SER B Protein A 5 FT #SUB 113 113 SER A 453 453 SER B Protein A 3 FT #SUB 113 113 SER A 454 454 ILE B Protein S 3 FT #SUB 113 113 SER A 455 455 GLU B Protein S 3 FT #SUB 113 113 SER A 458 458 ASP B Protein A 8 FT #SUB 113 113 SER A 462 462 ILE B Protein S 1 FT #SUB 114 114 LYS A 458 458 ASP B Protein A 3 FT #SUB 117 117 ASN A 458 458 ASP B Protein S 1 FT #SUB 117 117 ASN A 461 461 GLN B Protein S 2 FT #SUB 117 117 ASN A 462 462 ILE B Protein S 3 FT #SUB 117 117 ASN A 465 465 ARG B Protein S 3 FT #SUB 120 120 HIS A 465 465 ARG B Protein S 7 FT #SUB 121 121 SER A 541 541 ALA B Protein S 2 FT #SUB 176 176 TYR A 87 87 GLU B Protein S 5 FT #SUB 177 177 ASP A 81 81 LYS B Protein S 1 FT #SUB 450 450 ALA A 111 111 LEU B Protein S 1 FT #SUB 451 451 CYS A 111 111 LEU B Protein B 2 FT #SUB 452 452 TYR A 86 86 GLU B Protein S 5 FT #SUB 452 452 TYR A 111 111 LEU B Protein A 5 FT #SUB 453 453 SER A 110 110 ASN B Protein A 2 FT #SUB 453 453 SER A 111 111 LEU B Protein B 4 FT #SUB 453 453 SER A 112 112 SER B Protein A 5 FT #SUB 453 453 SER A 113 113 SER B Protein B 5 FT #SUB 454 454 ILE A 113 113 SER B Protein A 3 FT #SUB 455 455 GLU A 113 113 SER B Protein B 1 FT #SUB 458 458 ASP A 113 113 SER B Protein S 7 FT #SUB 458 458 ASP A 114 114 LYS B Protein S 3 FT #SUB 458 458 ASP A 117 117 ASN B Protein B 1 FT #SUB 461 461 GLN A 117 117 ASN B Protein S 1 FT #SUB 462 462 ILE A 113 113 SER B Protein S 1 FT #SUB 462 462 ILE A 117 117 ASN B Protein A 3 FT #SUB 465 465 ARG A 116 116 VAL B Protein S 1 FT #SUB 465 465 ARG A 117 117 ASN B Protein S 4 FT #SUB 465 465 ARG A 120 120 HIS B Protein S 9 FT #SUB 541 541 ALA A 121 121 SER B Protein S 2 FT #SUB 542 542 ALA A 101 101 PHE B Protein A 4 FT #SUB 543 543 SER A 101 101 PHE B Protein B 3 FT #SUB 544 544 GLN A 100 100 LYS B Protein A 10 FT #HET 37 37 VAL A 1 1000 JTP A S 2 FT #HET 392 392 LEU A 1 1000 JTP A A 4 FT #HET 393 393 ALA A 1 1000 JTP A B 3 FT #HET 395 395 ALA A 1 1000 JTP A A 4 FT #HET 396 396 ALA A 1 1000 JTP A A 10 FT #HET 399 399 THR A 1 1000 JTP A S 1 FT #HET 424 424 ILE A 1 1000 JTP A S 2 FT #HET 425 425 LEU A 1 1000 JTP A S 2 FT #HET 428 428 HIS A 1 1000 JTP A S 6 FT #HET 429 429 PHE A 1 1000 JTP A S 3 FT #HET 492 492 LEU A 1 1000 JTP A A 5 FT #HET 493 493 GLY A 1 1000 JTP A B 2 FT #HET 494 494 VAL A 1 1000 JTP A S 4 FT #HET 495 495 PRO A 1 1000 JTP A S 8 FT #HET 500 500 TRP A 1 1000 JTP A S 3 FT #HET 503 503 ARG A 1 1000 JTP A S 8 FT DISORDER 15 36 FT DISORDER 148 152 FT DISORDER 545 552 CC SEQUENCE 517 AA (ATOM); CC SMSYTWTGAL ITPCVYATTS RSAGLRQKKV TFDRLQVLDD HYRDVLKEMK AKASTVKAKL CC LSVEEACKLT PPHSAKSKFG YGAKDVRNLS SKAVNHIHSV WKDLLEDTVT PIDTTIMAKN CC EVFCVGRKPA RLIVFPDLGV RVCEKMALYD VVSTLPQVVM GSSYGFQYSP GQRVEFLVNT CC WKSKKNPMGF SYDTRCFDST VTENDIRVEE SIYQCCDLAP EARQAIKSLT ERLYIGGPLT CC NSKGQNCGYR RCRASGVLTT SCGNTLTCYL KASAACRAAK LQDCTMLVNG DDLVVICESA CC GTQEDAASLR VFTEAMTRYS APPGDPPQPE YDLELITSCS SNVSVAHDAS GKRVYYLTRD CC PTTPLARAAW ETARHTPVNS WLGNIIMYAP TLWARMILMT HFFSILLAQE QLEKALDCQI CC YGACYSIEPL DLPQIIERLH GLSAFSLHSY SPGEINRVAS CLRKLGVPPL RVWRHRARSV CC RARLLSQGGR AATCGKYLFN WAVKTKLKLT PIPAASQ CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMSYTWTGALITPCAAEESKLPINALSNSLLRHHNMVYATTSRSAGLRQK CC ATOM SMSYTWTGALITPC----------------------VYATTSRSAGLRQK CC ************** ************** CC SEQRES KVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSAKSK CC ATOM KVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSAKSK CC ************************************************** CC SEQRES FGYGAKDVRNLSSKAVNHIHSVWKDLLEDTVTPIDTTIMAKNEVFCVQPE CC ATOM FGYGAKDVRNLSSKAVNHIHSVWKDLLEDTVTPIDTTIMAKNEVFCV--- CC *********************************************** CC SEQRES KGGRKPARLIVFPDLGVRVCEKMALYDVVSTLPQVVMGSSYGFQYSPGQR CC ATOM --GRKPARLIVFPDLGVRVCEKMALYDVVSTLPQVVMGSSYGFQYSPGQR CC ************************************************ CC SEQRES VEFLVNTWKSKKNPMGFSYDTRCFDSTVTENDIRVEESIYQCCDLAPEAR CC ATOM VEFLVNTWKSKKNPMGFSYDTRCFDSTVTENDIRVEESIYQCCDLAPEAR CC ************************************************** CC SEQRES QAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAS CC ATOM QAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAS CC ************************************************** CC SEQRES AACRAAKLQDCTMLVNGDDLVVICESAGTQEDAASLRVFTEAMTRYSAPP CC ATOM AACRAAKLQDCTMLVNGDDLVVICESAGTQEDAASLRVFTEAMTRYSAPP CC ************************************************** CC SEQRES GDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA CC ATOM GDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA CC ************************************************** CC SEQRES RHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQIYGA CC ATOM RHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQIYGA CC ************************************************** CC SEQRES CYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLRVW CC ATOM CYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLRVW CC ************************************************** CC SEQRES RHRARSVRARLLSQGGRAATCGKYLFNWAVKTKLKLTPIPAASQGSHHDH CC ATOM RHRARSVRARLLSQGGRAATCGKYLFNWAVKTKLKLTPIPAASQ------ CC ******************************************** CC SEQRES HH CC ATOM -- CC SQ SEQUENCE 552 AA; MW; CN; SMSYTWTGAL ITPCAAEESK LPINALSNSL LRHHNMVYAT TSRSAGLRQK KVTFDRLQVL DDHYRDVLKE MKAKASTVKA KLLSVEEACK LTPPHSAKSK FGYGAKDVRN LSSKAVNHIH SVWKDLLEDT VTPIDTTIMA KNEVFCVQPE KGGRKPARLI VFPDLGVRVC EKMALYDVVS TLPQVVMGSS YGFQYSPGQR VEFLVNTWKS KKNPMGFSYD TRCFDSTVTE NDIRVEESIY QCCDLAPEAR QAIKSLTERL YIGGPLTNSK GQNCGYRRCR ASGVLTTSCG NTLTCYLKAS AACRAAKLQD CTMLVNGDDL VVICESAGTQ EDAASLRVFT EAMTRYSAPP GDPPQPEYDL ELITSCSSNV SVAHDASGKR VYYLTRDPTT PLARAAWETA RHTPVNSWLG NIIMYAPTLW ARMILMTHFF SILLAQEQLE KALDCQIYGA CYSIEPLDLP QIIERLHGLS AFSLHSYSPG EINRVASCLR KLGVPPLRVW RHRARSVRAR LLSQGGRAAT CGKYLFNWAV KTKLKLTPIP AASQGSHHDH HH // ID 2DXSB STANDARD; PRT; 552 AA. DT CONVERTED FROM PDB (SEQRES) 2DXS DE Genome polyprotein OS Hepatitis C virus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.200 CC R-Factor 0.219 FT #SUB 81 81 LYS B 177 177 ASP A Protein S 1 FT #SUB 86 86 GLU B 452 452 TYR A Protein S 5 FT #SUB 87 87 GLU B 176 176 TYR A Protein S 5 FT #SUB 90 90 LYS B 90 90 LYS A Protein A 5 FT #SUB 90 90 LYS B 91 91 LEU A Protein S 2 FT #SUB 91 91 LEU B 90 90 LYS A Protein S 2 FT #SUB 100 100 LYS B 544 544 GLN A Protein A 10 FT #SUB 101 101 PHE B 542 542 ALA A Protein A 4 FT #SUB 101 101 PHE B 543 543 SER A Protein S 3 FT #SUB 109 109 ARG B 109 109 ARG A Protein A 6 FT #SUB 109 109 ARG B 110 110 ASN A Protein S 3 FT #SUB 110 110 ASN B 109 109 ARG A Protein S 3 FT #SUB 110 110 ASN B 453 453 SER A Protein B 2 FT #SUB 111 111 LEU B 450 450 ALA A Protein S 1 FT #SUB 111 111 LEU B 451 451 CYS A Protein S 2 FT #SUB 111 111 LEU B 452 452 TYR A Protein A 5 FT #SUB 111 111 LEU B 453 453 SER A Protein B 4 FT #SUB 112 112 SER B 453 453 SER A Protein A 5 FT #SUB 113 113 SER B 453 453 SER A Protein A 5 FT #SUB 113 113 SER B 454 454 ILE A Protein S 3 FT #SUB 113 113 SER B 455 455 GLU A Protein S 1 FT #SUB 113 113 SER B 458 458 ASP A Protein A 7 FT #SUB 113 113 SER B 462 462 ILE A Protein S 1 FT #SUB 114 114 LYS B 458 458 ASP A Protein A 3 FT #SUB 116 116 VAL B 465 465 ARG A Protein S 1 FT #SUB 117 117 ASN B 458 458 ASP A Protein S 1 FT #SUB 117 117 ASN B 461 461 GLN A Protein S 1 FT #SUB 117 117 ASN B 462 462 ILE A Protein S 3 FT #SUB 117 117 ASN B 465 465 ARG A Protein S 4 FT #SUB 120 120 HIS B 465 465 ARG A Protein S 9 FT #SUB 121 121 SER B 541 541 ALA A Protein S 2 FT #SUB 176 176 TYR B 87 87 GLU A Protein S 5 FT #SUB 451 451 CYS B 111 111 LEU A Protein B 3 FT #SUB 452 452 TYR B 86 86 GLU A Protein S 6 FT #SUB 452 452 TYR B 111 111 LEU A Protein A 5 FT #SUB 453 453 SER B 110 110 ASN A Protein A 2 FT #SUB 453 453 SER B 111 111 LEU A Protein B 3 FT #SUB 453 453 SER B 112 112 SER A Protein A 5 FT #SUB 453 453 SER B 113 113 SER A Protein B 3 FT #SUB 454 454 ILE B 113 113 SER A Protein A 3 FT #SUB 455 455 GLU B 113 113 SER A Protein A 3 FT #SUB 458 458 ASP B 113 113 SER A Protein S 8 FT #SUB 458 458 ASP B 114 114 LYS A Protein S 3 FT #SUB 458 458 ASP B 117 117 ASN A Protein B 1 FT #SUB 461 461 GLN B 117 117 ASN A Protein A 2 FT #SUB 462 462 ILE B 113 113 SER A Protein S 1 FT #SUB 462 462 ILE B 117 117 ASN A Protein A 3 FT #SUB 465 465 ARG B 117 117 ASN A Protein S 3 FT #SUB 465 465 ARG B 120 120 HIS A Protein S 7 FT #SUB 541 541 ALA B 121 121 SER A Protein S 2 FT #SUB 542 542 ALA B 101 101 PHE A Protein A 4 FT #SUB 543 543 SER B 100 100 LYS A Protein B 1 FT #SUB 543 543 SER B 101 101 PHE A Protein B 3 FT #SUB 544 544 GLN B 100 100 LYS A Protein A 8 FT #HET 37 37 VAL B 2 2000 JTP B S 2 FT #HET 392 392 LEU B 2 2000 JTP B A 4 FT #HET 393 393 ALA B 2 2000 JTP B B 3 FT #HET 395 395 ALA B 2 2000 JTP B A 4 FT #HET 396 396 ALA B 2 2000 JTP B A 12 FT #HET 399 399 THR B 2 2000 JTP B S 1 FT #HET 424 424 ILE B 2 2000 JTP B S 1 FT #HET 425 425 LEU B 2 2000 JTP B S 2 FT #HET 428 428 HIS B 2 2000 JTP B S 4 FT #HET 429 429 PHE B 2 2000 JTP B S 1 FT #HET 492 492 LEU B 2 2000 JTP B A 3 FT #HET 493 493 GLY B 2 2000 JTP B B 2 FT #HET 494 494 VAL B 2 2000 JTP B S 5 FT #HET 495 495 PRO B 2 2000 JTP B S 8 FT #HET 500 500 TRP B 2 2000 JTP B S 4 FT #HET 503 503 ARG B 2 2000 JTP B S 8 FT DISORDER 15 36 FT DISORDER 148 152 FT DISORDER 545 552 CC SEQUENCE 517 AA (ATOM); CC SMSYTWTGAL ITPCVYATTS RSAGLRQKKV TFDRLQVLDD HYRDVLKEMK AKASTVKAKL CC LSVEEACKLT PPHSAKSKFG YGAKDVRNLS SKAVNHIHSV WKDLLEDTVT PIDTTIMAKN CC EVFCVGRKPA RLIVFPDLGV RVCEKMALYD VVSTLPQVVM GSSYGFQYSP GQRVEFLVNT CC WKSKKNPMGF SYDTRCFDST VTENDIRVEE SIYQCCDLAP EARQAIKSLT ERLYIGGPLT CC NSKGQNCGYR RCRASGVLTT SCGNTLTCYL KASAACRAAK LQDCTMLVNG DDLVVICESA CC GTQEDAASLR VFTEAMTRYS APPGDPPQPE YDLELITSCS SNVSVAHDAS GKRVYYLTRD CC PTTPLARAAW ETARHTPVNS WLGNIIMYAP TLWARMILMT HFFSILLAQE QLEKALDCQI CC YGACYSIEPL DLPQIIERLH GLSAFSLHSY SPGEINRVAS CLRKLGVPPL RVWRHRARSV CC RARLLSQGGR AATCGKYLFN WAVKTKLKLT PIPAASQ CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMSYTWTGALITPCAAEESKLPINALSNSLLRHHNMVYATTSRSAGLRQK CC ATOM SMSYTWTGALITPC----------------------VYATTSRSAGLRQK CC ************** ************** CC SEQRES KVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSAKSK CC ATOM KVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPPHSAKSK CC ************************************************** CC SEQRES FGYGAKDVRNLSSKAVNHIHSVWKDLLEDTVTPIDTTIMAKNEVFCVQPE CC ATOM FGYGAKDVRNLSSKAVNHIHSVWKDLLEDTVTPIDTTIMAKNEVFCV--- CC *********************************************** CC SEQRES KGGRKPARLIVFPDLGVRVCEKMALYDVVSTLPQVVMGSSYGFQYSPGQR CC ATOM --GRKPARLIVFPDLGVRVCEKMALYDVVSTLPQVVMGSSYGFQYSPGQR CC ************************************************ CC SEQRES VEFLVNTWKSKKNPMGFSYDTRCFDSTVTENDIRVEESIYQCCDLAPEAR CC ATOM VEFLVNTWKSKKNPMGFSYDTRCFDSTVTENDIRVEESIYQCCDLAPEAR CC ************************************************** CC SEQRES QAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAS CC ATOM QAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAS CC ************************************************** CC SEQRES AACRAAKLQDCTMLVNGDDLVVICESAGTQEDAASLRVFTEAMTRYSAPP CC ATOM AACRAAKLQDCTMLVNGDDLVVICESAGTQEDAASLRVFTEAMTRYSAPP CC ************************************************** CC SEQRES GDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA CC ATOM GDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA CC ************************************************** CC SEQRES RHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQIYGA CC ATOM RHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALDCQIYGA CC ************************************************** CC SEQRES CYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLRVW CC ATOM CYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGVPPLRVW CC ************************************************** CC SEQRES RHRARSVRARLLSQGGRAATCGKYLFNWAVKTKLKLTPIPAASQGSHHDH CC ATOM RHRARSVRARLLSQGGRAATCGKYLFNWAVKTKLKLTPIPAASQ------ CC ******************************************** CC SEQRES HH CC ATOM -- CC SQ SEQUENCE 552 AA; MW; CN; SMSYTWTGAL ITPCAAEESK LPINALSNSL LRHHNMVYAT TSRSAGLRQK KVTFDRLQVL DDHYRDVLKE MKAKASTVKA KLLSVEEACK LTPPHSAKSK FGYGAKDVRN LSSKAVNHIH SVWKDLLEDT VTPIDTTIMA KNEVFCVQPE KGGRKPARLI VFPDLGVRVC EKMALYDVVS TLPQVVMGSS YGFQYSPGQR VEFLVNTWKS KKNPMGFSYD TRCFDSTVTE NDIRVEESIY QCCDLAPEAR QAIKSLTERL YIGGPLTNSK GQNCGYRRCR ASGVLTTSCG NTLTCYLKAS AACRAAKLQD CTMLVNGDDL VVICESAGTQ EDAASLRVFT EAMTRYSAPP GDPPQPEYDL ELITSCSSNV SVAHDASGKR VYYLTRDPTT PLARAAWETA RHTPVNSWLG NIIMYAPTLW ARMILMTHFF SILLAQEQLE KALDCQIYGA CYSIEPLDLP QIIERLHGLS AFSLHSYSPG EINRVASCLR KLGVPPLRVW RHRARSVRAR LLSQGGRAAT CGKYLFNWAV KTKLKLTPIP AASQGSHHDH HH //