ID 3UPHA STANDARD; PRT; 576 AA. DT CONVERTED FROM PDB (SEQRES) 3UPH DE RNA-directed RNA polymerase OS Hepatitis C virus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.000 CC R-Factor 0.187 FT #SUB 69 69 LYS A 77 77 THR B Protein S 4 FT #SUB 72 72 LYS A 76 76 SER B Protein A 5 FT #SUB 72 72 LYS A 242 242 CYS B Protein S 3 FT #SUB 72 72 LYS A 243 243 CYS B Protein S 2 FT #SUB 72 72 LYS A 244 244 ASP B Protein S 2 FT #SUB 73 73 ALA A 73 73 ALA B Protein A 2 FT #SUB 73 73 ALA A 76 76 SER B Protein A 4 FT #SUB 76 76 SER A 72 72 LYS B Protein S 4 FT #SUB 76 76 SER A 73 73 ALA B Protein S 4 FT #SUB 76 76 SER A 76 76 SER B Protein S 2 FT #SUB 76 76 SER A 242 242 CYS B Protein S 1 FT #SUB 77 77 THR A 69 69 LYS B Protein B 2 FT #SUB 127 127 LEU A 130 130 THR B Protein B 4 FT #SUB 128 128 GLU A 128 128 GLU B Protein B 1 FT #SUB 128 128 GLU A 131 131 GLU B Protein S 1 FT #SUB 130 130 THR A 127 127 LEU B Protein S 4 FT #SUB 130 130 THR A 130 130 THR B Protein S 1 FT #SUB 130 130 THR A 251 251 GLN B Protein S 4 FT #SUB 131 131 GLU A 128 128 GLU B Protein S 1 FT #SUB 234 234 ARG A 247 247 PRO B Protein S 3 FT #SUB 237 237 GLU A 250 250 ARG B Protein S 1 FT #SUB 238 238 SER A 250 250 ARG B Protein A 2 FT #SUB 241 241 GLN A 241 241 GLN B Protein A 3 FT #SUB 241 241 GLN A 250 250 ARG B Protein S 2 FT #SUB 242 242 CYS A 72 72 LYS B Protein B 3 FT #SUB 242 242 CYS A 242 242 CYS B Protein B 1 FT #SUB 243 243 CYS A 72 72 LYS B Protein B 2 FT #SUB 244 244 ASP A 72 72 LYS B Protein B 2 FT #SUB 247 247 PRO A 234 234 ARG B Protein S 4 FT #SUB 247 247 PRO A 254 254 ARG B Protein A 3 FT #SUB 247 247 PRO A 258 258 GLU B Protein S 1 FT #SUB 250 250 ARG A 237 237 GLU B Protein S 3 FT #SUB 250 250 ARG A 238 238 SER B Protein S 2 FT #SUB 250 250 ARG A 241 241 GLN B Protein S 3 FT #SUB 250 250 ARG A 254 254 ARG B Protein S 1 FT #SUB 251 251 GLN A 130 130 THR B Protein S 3 FT #SUB 251 251 GLN A 251 251 GLN B Protein S 8 FT #SUB 251 251 GLN A 254 254 ARG B Protein S 1 FT #SUB 254 254 ARG A 247 247 PRO B Protein S 5 FT #SUB 254 254 ARG A 250 250 ARG B Protein S 2 FT #SUB 254 254 ARG A 251 251 GLN B Protein S 2 FT #SUB 258 258 GLU A 247 247 PRO B Protein S 1 FT #HET 193 193 PHE A 2 578 0C1 A S 5 FT #HET 197 197 PRO A 2 578 0C1 A S 2 FT #HET 200 200 ARG A 2 578 0C1 A S 1 FT #HET 316 316 ASN A 2 578 0C1 A A 2 FT #HET 319 319 ASP A 2 578 0C1 A S 1 FT #HET 366 366 CYS A 2 578 0C1 A A 19 FT #HET 367 367 SER A 2 578 0C1 A A 3 FT #HET 368 368 SER A 2 578 0C1 A S 1 FT #HET 384 384 LEU A 2 578 0C1 A S 1 FT #HET 410 410 GLY A 2 578 0C1 A B 3 FT #HET 411 411 ASN A 2 578 0C1 A S 1 FT #HET 414 414 MET A 2 578 0C1 A A 20 FT #HET 415 415 TYR A 2 578 0C1 A S 11 FT #HET 446 446 GLN A 2 578 0C1 A B 4 FT #HET 447 447 ILE A 2 578 0C1 A B 2 FT #HET 448 448 TYR A 2 578 0C1 A A 13 FT #HET 449 449 GLY A 2 578 0C1 A B 3 FT #HET 505 505 ARG A 1 577 PO4 A S 8 FT #HET 530 530 VAL A 1 577 PO4 A A 4 FT #HET 531 531 ARG A 1 577 PO4 A A 10 FT #HET 532 532 THR A 1 577 PO4 A A 4 FT #HET 556 556 SER A 2 578 0C1 A S 6 FT DISORDER 149 153 FT DISORDER 569 576 CC SEQUENCE 563 AA (ATOM); CC SMSYTWTGAL ITPCAAEESK LPINPLSNSL LRHHNMVYAT TSRSASLRQK KVTFDRLQVL CC DDHYRDVLKE MKAKASTVKA KLLSIEEACK LTPPHSAKSK FGYGAKDVRN LSSRAVNHIR CC SVWEDLLEDT ETPIDTTIMA KSEVFCVQRK PARLIVFPDL GVRVCEKMAL YDVVSTLPQA CC VMGSSYGFQY SPKQRVEFLV NTWKSKKCPM GFSYDTRCFD STVTESDIRV EESIYQCCDL CC APEARQAIRS LTERLYIGGP LTNSKGQNCG YRRCRASGVL TTSCGNTLTC YLKATAACRA CC AKLQDCTMLV NGDDLVVICE SAGTQEDAAA LRAFTEAMTR YSAPPGDPPQ PEYDLELITS CC CSSNVSVAHD ASGKRVYYLT RDPTTPLARA AWETARHTPI NSWLGNIIMY APTLWARMIL CC MTHFFSILLA QEQLGKALDC QIYGACYSIE PLDLPQIIER LHGLSAFTLH SYSPGEINRV CC ASCLRKLGVP PLRTWRHRAR SVRAKLLSQG GRAAICGRYL FNWAVRTKLK LTPIPAASQL CC DLSGWFVAGY SGGDIYHSLS RAR CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMSYTWTGALITPCAAEESKLPINPLSNSLLRHHNMVYATTSRSASLRQK CC ATOM SMSYTWTGALITPCAAEESKLPINPLSNSLLRHHNMVYATTSRSASLRQK CC ************************************************** CC SEQRES KVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSIEEACKLTPPHSAKSK CC ATOM KVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSIEEACKLTPPHSAKSK CC ************************************************** CC SEQRES FGYGAKDVRNLSSRAVNHIRSVWEDLLEDTETPIDTTIMAKSEVFCVQPE CC ATOM FGYGAKDVRNLSSRAVNHIRSVWEDLLEDTETPIDTTIMAKSEVFCVQ-- CC ************************************************ CC SEQRES KGGRKPARLIVFPDLGVRVCEKMALYDVVSTLPQAVMGSSYGFQYSPKQR CC ATOM ---RKPARLIVFPDLGVRVCEKMALYDVVSTLPQAVMGSSYGFQYSPKQR CC *********************************************** CC SEQRES VEFLVNTWKSKKCPMGFSYDTRCFDSTVTESDIRVEESIYQCCDLAPEAR CC ATOM VEFLVNTWKSKKCPMGFSYDTRCFDSTVTESDIRVEESIYQCCDLAPEAR CC ************************************************** CC SEQRES QAIRSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAT CC ATOM QAIRSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAT CC ************************************************** CC SEQRES AACRAAKLQDCTMLVNGDDLVVICESAGTQEDAAALRAFTEAMTRYSAPP CC ATOM AACRAAKLQDCTMLVNGDDLVVICESAGTQEDAAALRAFTEAMTRYSAPP CC ************************************************** CC SEQRES GDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA CC ATOM GDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA CC ************************************************** CC SEQRES RHTPINSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLGKALDCQIYGA CC ATOM RHTPINSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLGKALDCQIYGA CC ************************************************** CC SEQRES CYSIEPLDLPQIIERLHGLSAFTLHSYSPGEINRVASCLRKLGVPPLRTW CC ATOM CYSIEPLDLPQIIERLHGLSAFTLHSYSPGEINRVASCLRKLGVPPLRTW CC ************************************************** CC SEQRES RHRARSVRAKLLSQGGRAAICGRYLFNWAVRTKLKLTPIPAASQLDLSGW CC ATOM RHRARSVRAKLLSQGGRAAICGRYLFNWAVRTKLKLTPIPAASQLDLSGW CC ************************************************** CC SEQRES FVAGYSGGDIYHSLSRARPRENLYFQ CC ATOM FVAGYSGGDIYHSLSRAR-------- CC ****************** SQ SEQUENCE 576 AA; MW; CN; SMSYTWTGAL ITPCAAEESK LPINPLSNSL LRHHNMVYAT TSRSASLRQK KVTFDRLQVL DDHYRDVLKE MKAKASTVKA KLLSIEEACK LTPPHSAKSK FGYGAKDVRN LSSRAVNHIR SVWEDLLEDT ETPIDTTIMA KSEVFCVQPE KGGRKPARLI VFPDLGVRVC EKMALYDVVS TLPQAVMGSS YGFQYSPKQR VEFLVNTWKS KKCPMGFSYD TRCFDSTVTE SDIRVEESIY QCCDLAPEAR QAIRSLTERL YIGGPLTNSK GQNCGYRRCR ASGVLTTSCG NTLTCYLKAT AACRAAKLQD CTMLVNGDDL VVICESAGTQ EDAAALRAFT EAMTRYSAPP GDPPQPEYDL ELITSCSSNV SVAHDASGKR VYYLTRDPTT PLARAAWETA RHTPINSWLG NIIMYAPTLW ARMILMTHFF SILLAQEQLG KALDCQIYGA CYSIEPLDLP QIIERLHGLS AFTLHSYSPG EINRVASCLR KLGVPPLRTW RHRARSVRAK LLSQGGRAAI CGRYLFNWAV RTKLKLTPIP AASQLDLSGW FVAGYSGGDI YHSLSRARPR ENLYFQ // ID 3UPHB STANDARD; PRT; 576 AA. DT CONVERTED FROM PDB (SEQRES) 3UPH DE RNA-directed RNA polymerase OS Hepatitis C virus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.000 CC R-Factor 0.187 FT #SUB 69 69 LYS B 77 77 THR A Protein S 2 FT #SUB 72 72 LYS B 76 76 SER A Protein A 4 FT #SUB 72 72 LYS B 242 242 CYS A Protein S 3 FT #SUB 72 72 LYS B 243 243 CYS A Protein S 2 FT #SUB 72 72 LYS B 244 244 ASP A Protein S 2 FT #SUB 73 73 ALA B 73 73 ALA A Protein A 2 FT #SUB 73 73 ALA B 76 76 SER A Protein B 4 FT #SUB 76 76 SER B 72 72 LYS A Protein S 5 FT #SUB 76 76 SER B 73 73 ALA A Protein S 4 FT #SUB 76 76 SER B 76 76 SER A Protein S 2 FT #SUB 77 77 THR B 69 69 LYS A Protein B 4 FT #SUB 127 127 LEU B 130 130 THR A Protein B 4 FT #SUB 128 128 GLU B 128 128 GLU A Protein B 1 FT #SUB 128 128 GLU B 131 131 GLU A Protein S 1 FT #SUB 130 130 THR B 127 127 LEU A Protein S 4 FT #SUB 130 130 THR B 130 130 THR A Protein S 1 FT #SUB 130 130 THR B 251 251 GLN A Protein S 3 FT #SUB 131 131 GLU B 128 128 GLU A Protein S 1 FT #SUB 234 234 ARG B 247 247 PRO A Protein S 4 FT #SUB 237 237 GLU B 250 250 ARG A Protein S 3 FT #SUB 238 238 SER B 250 250 ARG A Protein A 2 FT #SUB 241 241 GLN B 241 241 GLN A Protein A 3 FT #SUB 241 241 GLN B 250 250 ARG A Protein S 3 FT #SUB 242 242 CYS B 72 72 LYS A Protein B 3 FT #SUB 242 242 CYS B 76 76 SER A Protein S 1 FT #SUB 242 242 CYS B 242 242 CYS A Protein S 1 FT #SUB 243 243 CYS B 72 72 LYS A Protein B 2 FT #SUB 244 244 ASP B 72 72 LYS A Protein B 2 FT #SUB 247 247 PRO B 234 234 ARG A Protein S 3 FT #SUB 247 247 PRO B 254 254 ARG A Protein A 5 FT #SUB 247 247 PRO B 258 258 GLU A Protein S 1 FT #SUB 250 250 ARG B 237 237 GLU A Protein S 1 FT #SUB 250 250 ARG B 238 238 SER A Protein S 2 FT #SUB 250 250 ARG B 241 241 GLN A Protein S 2 FT #SUB 250 250 ARG B 254 254 ARG A Protein S 2 FT #SUB 251 251 GLN B 130 130 THR A Protein S 4 FT #SUB 251 251 GLN B 251 251 GLN A Protein S 8 FT #SUB 251 251 GLN B 254 254 ARG A Protein S 2 FT #SUB 254 254 ARG B 247 247 PRO A Protein S 3 FT #SUB 254 254 ARG B 250 250 ARG A Protein S 1 FT #SUB 254 254 ARG B 251 251 GLN A Protein S 1 FT #SUB 258 258 GLU B 247 247 PRO A Protein S 1 FT #HET 193 193 PHE B 3 577 0C1 B S 4 FT #HET 200 200 ARG B 3 577 0C1 B S 1 FT #HET 316 316 ASN B 3 577 0C1 B A 3 FT #HET 319 319 ASP B 3 577 0C1 B S 2 FT #HET 366 366 CYS B 3 577 0C1 B A 18 FT #HET 367 367 SER B 3 577 0C1 B A 3 FT #HET 368 368 SER B 3 577 0C1 B S 2 FT #HET 384 384 LEU B 3 577 0C1 B S 1 FT #HET 410 410 GLY B 3 577 0C1 B B 3 FT #HET 411 411 ASN B 3 577 0C1 B S 1 FT #HET 414 414 MET B 3 577 0C1 B A 17 FT #HET 415 415 TYR B 3 577 0C1 B S 12 FT #HET 446 446 GLN B 3 577 0C1 B B 5 FT #HET 447 447 ILE B 3 577 0C1 B B 2 FT #HET 448 448 TYR B 3 577 0C1 B A 16 FT #HET 449 449 GLY B 3 577 0C1 B B 4 FT #HET 556 556 SER B 3 577 0C1 B S 3 FT DISORDER 149 153 FT DISORDER 564 576 CC SEQUENCE 558 AA (ATOM); CC SMSYTWTGAL ITPCAAEESK LPINPLSNSL LRHHNMVYAT TSRSASLRQK KVTFDRLQVL CC DDHYRDVLKE MKAKASTVKA KLLSIEEACK LTPPHSAKSK FGYGAKDVRN LSSRAVNHIR CC SVWEDLLEDT ETPIDTTIMA KSEVFCVQRK PARLIVFPDL GVRVCEKMAL YDVVSTLPQA CC VMGSSYGFQY SPKQRVEFLV NTWKSKKCPM GFSYDTRCFD STVTESDIRV EESIYQCCDL CC APEARQAIRS LTERLYIGGP LTNSKGQNCG YRRCRASGVL TTSCGNTLTC YLKATAACRA CC AKLQDCTMLV NGDDLVVICE SAGTQEDAAA LRAFTEAMTR YSAPPGDPPQ PEYDLELITS CC CSSNVSVAHD ASGKRVYYLT RDPTTPLARA AWETARHTPI NSWLGNIIMY APTLWARMIL CC MTHFFSILLA QEQLGKALDC QIYGACYSIE PLDLPQIIER LHGLSAFTLH SYSPGEINRV CC ASCLRKLGVP PLRTWRHRAR SVRAKLLSQG GRAAICGRYL FNWAVRTKLK LTPIPAASQL CC DLSGWFVAGY SGGDIYHS CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMSYTWTGALITPCAAEESKLPINPLSNSLLRHHNMVYATTSRSASLRQK CC ATOM SMSYTWTGALITPCAAEESKLPINPLSNSLLRHHNMVYATTSRSASLRQK CC ************************************************** CC SEQRES KVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSIEEACKLTPPHSAKSK CC ATOM KVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSIEEACKLTPPHSAKSK CC ************************************************** CC SEQRES FGYGAKDVRNLSSRAVNHIRSVWEDLLEDTETPIDTTIMAKSEVFCVQPE CC ATOM FGYGAKDVRNLSSRAVNHIRSVWEDLLEDTETPIDTTIMAKSEVFCVQ-- CC ************************************************ CC SEQRES KGGRKPARLIVFPDLGVRVCEKMALYDVVSTLPQAVMGSSYGFQYSPKQR CC ATOM ---RKPARLIVFPDLGVRVCEKMALYDVVSTLPQAVMGSSYGFQYSPKQR CC *********************************************** CC SEQRES VEFLVNTWKSKKCPMGFSYDTRCFDSTVTESDIRVEESIYQCCDLAPEAR CC ATOM VEFLVNTWKSKKCPMGFSYDTRCFDSTVTESDIRVEESIYQCCDLAPEAR CC ************************************************** CC SEQRES QAIRSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAT CC ATOM QAIRSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLTCYLKAT CC ************************************************** CC SEQRES AACRAAKLQDCTMLVNGDDLVVICESAGTQEDAAALRAFTEAMTRYSAPP CC ATOM AACRAAKLQDCTMLVNGDDLVVICESAGTQEDAAALRAFTEAMTRYSAPP CC ************************************************** CC SEQRES GDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA CC ATOM GDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLARAAWETA CC ************************************************** CC SEQRES RHTPINSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLGKALDCQIYGA CC ATOM RHTPINSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLGKALDCQIYGA CC ************************************************** CC SEQRES CYSIEPLDLPQIIERLHGLSAFTLHSYSPGEINRVASCLRKLGVPPLRTW CC ATOM CYSIEPLDLPQIIERLHGLSAFTLHSYSPGEINRVASCLRKLGVPPLRTW CC ************************************************** CC SEQRES RHRARSVRAKLLSQGGRAAICGRYLFNWAVRTKLKLTPIPAASQLDLSGW CC ATOM RHRARSVRAKLLSQGGRAAICGRYLFNWAVRTKLKLTPIPAASQLDLSGW CC ************************************************** CC SEQRES FVAGYSGGDIYHSLSRARPRENLYFQ CC ATOM FVAGYSGGDIYHS------------- CC ************* SQ SEQUENCE 576 AA; MW; CN; SMSYTWTGAL ITPCAAEESK LPINPLSNSL LRHHNMVYAT TSRSASLRQK KVTFDRLQVL DDHYRDVLKE MKAKASTVKA KLLSIEEACK LTPPHSAKSK FGYGAKDVRN LSSRAVNHIR SVWEDLLEDT ETPIDTTIMA KSEVFCVQPE KGGRKPARLI VFPDLGVRVC EKMALYDVVS TLPQAVMGSS YGFQYSPKQR VEFLVNTWKS KKCPMGFSYD TRCFDSTVTE SDIRVEESIY QCCDLAPEAR QAIRSLTERL YIGGPLTNSK GQNCGYRRCR ASGVLTTSCG NTLTCYLKAT AACRAAKLQD CTMLVNGDDL VVICESAGTQ EDAAALRAFT EAMTRYSAPP GDPPQPEYDL ELITSCSSNV SVAHDASGKR VYYLTRDPTT PLARAAWETA RHTPINSWLG NIIMYAPTLW ARMILMTHFF SILLAQEQLG KALDCQIYGA CYSIEPLDLP QIIERLHGLS AFTLHSYSPG EINRVASCLR KLGVPPLRTW RHRARSVRAK LLSQGGRAAI CGRYLFNWAV RTKLKLTPIP AASQLDLSGW FVAGYSGGDI YHSLSRARPR ENLYFQ //