ID 5W2EA STANDARD; PRT; 576 AA. DT CONVERTED FROM PDB (SEQRES) 5W2E DE Genome polyprotein OS Hepatitis C virus genotype 1b (isolate BK) CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.800 CC R-Factor 0.184 FT #SUB 75 69 LYS A 443 437 GLU B Protein S 3 FT #SUB 79 73 ALA A 441 435 ALA B Protein S 1 FT #SUB 79 73 ALA A 442 436 GLN B Protein A 5 FT #SUB 80 74 LYS A 442 436 GLN B Protein A 3 FT #SUB 82 76 SER A 31 25 PRO B Protein B 1 FT #SUB 83 77 THR A 30 24 ASN B Protein B 7 FT #SUB 83 77 THR A 31 25 PRO B Protein B 6 FT #SUB 84 78 VAL A 31 25 PRO B Protein B 1 FT #SUB 85 79 LYS A 29 23 ILE B Protein S 1 FT #SUB 85 79 LYS A 30 24 ASN B Protein S 3 FT #SUB 85 79 LYS A 31 25 PRO B Protein S 1 FT #SUB 85 79 LYS A 34 28 ASN B Protein S 3 FT #SUB 215 209 LYS A 113 107 ASP B Protein S 4 FT #SUB 250 244 ASP A 31 25 PRO B Protein S 2 FT #SUB 385 379 LYS A 106 100 LYS B Protein S 4 FT #SUB 544 538 PRO A 276 270 LYS B Protein S 2 FT #SUB 549 543 SER A 276 270 LYS B Protein S 2 FT #HET 199 193 PHE A 1 601 9VY A S 5 FT #HET 203 197 PRO A 1 601 9VY A A 9 FT #HET 206 200 ARG A 1 601 9VY A S 27 FT #HET 210 204 LEU A 1 601 9VY A S 5 FT #HET 320 314 LEU A 1 601 9VY A A 15 FT #HET 321 315 VAL A 1 601 9VY A B 3 FT #HET 322 316 ASN A 1 601 9VY A A 21 FT #HET 325 319 ASP A 1 601 9VY A A 9 FT #HET 326 320 LEU A 1 601 9VY A B 3 FT #HET 327 321 VAL A 1 601 9VY A A 12 FT #HET 369 363 ILE A 1 601 9VY A A 8 FT #HET 371 365 SER A 1 601 9VY A A 24 FT #HET 372 366 CYS A 1 601 9VY A A 11 FT #HET 374 368 SER A 1 601 9VY A A 14 FT #HET 375 369 ASN A 1 601 9VY A B 6 FT #HET 376 370 VAL A 1 601 9VY A A 2 FT #HET 390 384 LEU A 1 601 9VY A S 15 FT #HET 419 413 ILE A 1 601 9VY A A 5 FT #HET 420 414 MET A 1 601 9VY A A 15 FT #HET 421 415 TYR A 1 601 9VY A S 20 FT #HET 453 447 ILE A 1 601 9VY A A 12 FT #HET 454 448 TYR A 1 601 9VY A S 22 FT #HET 458 452 TYR A 1 601 9VY A S 13 FT #HET 472 466 LEU A 1 601 9VY A S 4 FT #HET 556 550 TRP A 1 601 9VY A S 7 FT #HET 557 551 PHE A 1 601 9VY A S 12 FT DISORDER 1 6 FT DISORDER 155 159 FT DISORDER 571 576 CC Miss-SC 1 CC SEQUENCE 559 AA (ATOM); CC HHSYTWTGAL ITPCAAEESK LPINPLSNSL LRHHNMVYAT TSRSASLRQK KVTFDRLQVL CC DDHYRDVLKE MKAKASTVKA KLLSVEEACK LTPPHSAKSK FGYGAKDVRN LSSKAVNHIH CC SVWKDLLEDT VTPIDTTIMA KNEVFCVQRK PARLIVFPDL GVRVCEKMAL YDVVSTLPQV CC VMGSSYGFQY SPGQRVEFLV NTWKSKKNPM GFSYDTRCFD STVTENDIRV EESIYQCCDL CC APEARQAIKS LTERLYIGGP LTNSKGQNCG YRRCRASGVL TTSCGNTLTC YLKASAACRA CC AKLQDCTMLV NGDDLVVICE SAGVQEDAAS LRAFTEAMTR YSAPPGDPPQ PEYDLELITS CC CSSNVSVAHD ASGKRVYYLT RDPTTPLARA AWETARHTPV NSWLGNIIMY APTLWARMIL CC MTHFFSILLA QEQLEKALDC QIYGACYSIE PLDLPQIIER LHGLSAFSLH SYSPGEINRV CC ASCLRKLGVP PLRVWRHRAR SVRARLLSQG GRAATCGKYL FNWAVKTKLK LTPIPAASQL CC DLSGWFVAGY SGGDIYHSL CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES ASHHHHHHSYTWTGALITPCAAEESKLPINPLSNSLLRHHNMVYATTSRS CC ATOM ------HHSYTWTGALITPCAAEESKLPINPLSNSLLRHHNMVYATTSRS CC ******************************************** CC SEQRES ASLRQKKVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPP CC ATOM ASLRQKKVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPP CC ************************************************** CC SEQRES HSAKSKFGYGAKDVRNLSSKAVNHIHSVWKDLLEDTVTPIDTTIMAKNEV CC ATOM HSAKSKFGYGAKDVRNLSSKAVNHIHSVWKDLLEDTVTPIDTTIMAKNEV CC ************************************************** CC SEQRES FCVQPEKGGRKPARLIVFPDLGVRVCEKMALYDVVSTLPQVVMGSSYGFQ CC ATOM FCVQ-----RKPARLIVFPDLGVRVCEKMALYDVVSTLPQVVMGSSYGFQ CC **** ***************************************** CC SEQRES YSPGQRVEFLVNTWKSKKNPMGFSYDTRCFDSTVTENDIRVEESIYQCCD CC ATOM YSPGQRVEFLVNTWKSKKNPMGFSYDTRCFDSTVTENDIRVEESIYQCCD CC ************************************************** CC SEQRES LAPEARQAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLT CC ATOM LAPEARQAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLT CC ************************************************** CC SEQRES CYLKASAACRAAKLQDCTMLVNGDDLVVICESAGVQEDAASLRAFTEAMT CC ATOM CYLKASAACRAAKLQDCTMLVNGDDLVVICESAGVQEDAASLRAFTEAMT CC ************************************************** CC SEQRES RYSAPPGDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLAR CC ATOM RYSAPPGDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLAR CC ************************************************** CC SEQRES AAWETARHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALD CC ATOM AAWETARHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALD CC ************************************************** CC SEQRES CQIYGACYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGV CC ATOM CQIYGACYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGV CC ************************************************** CC SEQRES PPLRVWRHRARSVRARLLSQGGRAATCGKYLFNWAVKTKLKLTPIPAASQ CC ATOM PPLRVWRHRARSVRARLLSQGGRAATCGKYLFNWAVKTKLKLTPIPAASQ CC ************************************************** CC SEQRES LDLSGWFVAGYSGGDIYHSLSRARPR CC ATOM LDLSGWFVAGYSGGDIYHSL------ CC ******************** SQ SEQUENCE 576 AA; MW; CN; ASHHHHHHSY TWTGALITPC AAEESKLPIN PLSNSLLRHH NMVYATTSRS ASLRQKKVTF DRLQVLDDHY RDVLKEMKAK ASTVKAKLLS VEEACKLTPP HSAKSKFGYG AKDVRNLSSK AVNHIHSVWK DLLEDTVTPI DTTIMAKNEV FCVQPEKGGR KPARLIVFPD LGVRVCEKMA LYDVVSTLPQ VVMGSSYGFQ YSPGQRVEFL VNTWKSKKNP MGFSYDTRCF DSTVTENDIR VEESIYQCCD LAPEARQAIK SLTERLYIGG PLTNSKGQNC GYRRCRASGV LTTSCGNTLT CYLKASAACR AAKLQDCTML VNGDDLVVIC ESAGVQEDAA SLRAFTEAMT RYSAPPGDPP QPEYDLELIT SCSSNVSVAH DASGKRVYYL TRDPTTPLAR AAWETARHTP VNSWLGNIIM YAPTLWARMI LMTHFFSILL AQEQLEKALD CQIYGACYSI EPLDLPQIIE RLHGLSAFSL HSYSPGEINR VASCLRKLGV PPLRVWRHRA RSVRARLLSQ GGRAATCGKY LFNWAVKTKL KLTPIPAASQ LDLSGWFVAG YSGGDIYHSL SRARPR // ID 5W2EB STANDARD; PRT; 576 AA. DT CONVERTED FROM PDB (SEQRES) 5W2E DE Genome polyprotein OS Hepatitis C virus genotype 1b (isolate BK) CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.800 CC R-Factor 0.184 FT #SUB 29 23 ILE B 85 79 LYS A Protein S 1 FT #SUB 30 24 ASN B 83 77 THR A Protein A 7 FT #SUB 30 24 ASN B 85 79 LYS A Protein B 3 FT #SUB 31 25 PRO B 82 76 SER A Protein S 1 FT #SUB 31 25 PRO B 83 77 THR A Protein A 6 FT #SUB 31 25 PRO B 84 78 VAL A Protein S 1 FT #SUB 31 25 PRO B 85 79 LYS A Protein B 1 FT #SUB 31 25 PRO B 250 244 ASP A Protein A 2 FT #SUB 34 28 ASN B 85 79 LYS A Protein S 3 FT #SUB 106 100 LYS B 385 379 LYS A Protein B 4 FT #SUB 113 107 ASP B 215 209 LYS A Protein S 4 FT #SUB 276 270 LYS B 544 538 PRO A Protein A 2 FT #SUB 276 270 LYS B 549 543 SER A Protein S 2 FT #SUB 441 435 ALA B 79 73 ALA A Protein B 1 FT #SUB 442 436 GLN B 79 73 ALA A Protein S 5 FT #SUB 442 436 GLN B 80 74 LYS A Protein S 3 FT #SUB 443 437 GLU B 75 69 LYS A Protein S 3 FT #HET 199 193 PHE B 2 601 9VY B S 6 FT #HET 203 197 PRO B 2 601 9VY B A 15 FT #HET 206 200 ARG B 2 601 9VY B S 37 FT #HET 210 204 LEU B 2 601 9VY B S 7 FT #HET 320 314 LEU B 2 601 9VY B A 16 FT #HET 321 315 VAL B 2 601 9VY B B 3 FT #HET 322 316 ASN B 2 601 9VY B A 20 FT #HET 325 319 ASP B 2 601 9VY B A 7 FT #HET 326 320 LEU B 2 601 9VY B B 3 FT #HET 327 321 VAL B 2 601 9VY B A 12 FT #HET 366 360 LEU B 2 601 9VY B S 1 FT #HET 369 363 ILE B 2 601 9VY B A 6 FT #HET 371 365 SER B 2 601 9VY B A 22 FT #HET 372 366 CYS B 2 601 9VY B A 15 FT #HET 374 368 SER B 2 601 9VY B A 16 FT #HET 375 369 ASN B 2 601 9VY B A 8 FT #HET 376 370 VAL B 2 601 9VY B A 2 FT #HET 390 384 LEU B 2 601 9VY B S 9 FT #HET 419 413 ILE B 2 601 9VY B A 5 FT #HET 420 414 MET B 2 601 9VY B A 12 FT #HET 421 415 TYR B 2 601 9VY B S 10 FT #HET 453 447 ILE B 2 601 9VY B A 14 FT #HET 454 448 TYR B 2 601 9VY B A 30 FT #HET 458 452 TYR B 2 601 9VY B S 13 FT #HET 472 466 LEU B 2 601 9VY B S 4 FT #HET 556 550 TRP B 2 601 9VY B S 7 FT #HET 557 551 PHE B 2 601 9VY B S 12 FT DISORDER 1 6 FT DISORDER 155 159 FT DISORDER 571 576 CC Miss-SC 1 CC SEQUENCE 559 AA (ATOM); CC HHSYTWTGAL ITPCAAEESK LPINPLSNSL LRHHNMVYAT TSRSASLRQK KVTFDRLQVL CC DDHYRDVLKE MKAKASTVKA KLLSVEEACK LTPPHSAKSK FGYGAKDVRN LSSKAVNHIH CC SVWKDLLEDT VTPIDTTIMA KNEVFCVQRK PARLIVFPDL GVRVCEKMAL YDVVSTLPQV CC VMGSSYGFQY SPGQRVEFLV NTWKSKKNPM GFSYDTRCFD STVTENDIRV EESIYQCCDL CC APEARQAIKS LTERLYIGGP LTNSKGQNCG YRRCRASGVL TTSCGNTLTC YLKASAACRA CC AKLQDCTMLV NGDDLVVICE SAGVQEDAAS LRAFTEAMTR YSAPPGDPPQ PEYDLELITS CC CSSNVSVAHD ASGKRVYYLT RDPTTPLARA AWETARHTPV NSWLGNIIMY APTLWARMIL CC MTHFFSILLA QEQLEKALDC QIYGACYSIE PLDLPQIIER LHGLSAFSLH SYSPGEINRV CC ASCLRKLGVP PLRVWRHRAR SVRARLLSQG GRAATCGKYL FNWAVKTKLK LTPIPAASQL CC DLSGWFVAGY SGGDIYHSL CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES ASHHHHHHSYTWTGALITPCAAEESKLPINPLSNSLLRHHNMVYATTSRS CC ATOM ------HHSYTWTGALITPCAAEESKLPINPLSNSLLRHHNMVYATTSRS CC ******************************************** CC SEQRES ASLRQKKVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPP CC ATOM ASLRQKKVTFDRLQVLDDHYRDVLKEMKAKASTVKAKLLSVEEACKLTPP CC ************************************************** CC SEQRES HSAKSKFGYGAKDVRNLSSKAVNHIHSVWKDLLEDTVTPIDTTIMAKNEV CC ATOM HSAKSKFGYGAKDVRNLSSKAVNHIHSVWKDLLEDTVTPIDTTIMAKNEV CC ************************************************** CC SEQRES FCVQPEKGGRKPARLIVFPDLGVRVCEKMALYDVVSTLPQVVMGSSYGFQ CC ATOM FCVQ-----RKPARLIVFPDLGVRVCEKMALYDVVSTLPQVVMGSSYGFQ CC **** ***************************************** CC SEQRES YSPGQRVEFLVNTWKSKKNPMGFSYDTRCFDSTVTENDIRVEESIYQCCD CC ATOM YSPGQRVEFLVNTWKSKKNPMGFSYDTRCFDSTVTENDIRVEESIYQCCD CC ************************************************** CC SEQRES LAPEARQAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLT CC ATOM LAPEARQAIKSLTERLYIGGPLTNSKGQNCGYRRCRASGVLTTSCGNTLT CC ************************************************** CC SEQRES CYLKASAACRAAKLQDCTMLVNGDDLVVICESAGVQEDAASLRAFTEAMT CC ATOM CYLKASAACRAAKLQDCTMLVNGDDLVVICESAGVQEDAASLRAFTEAMT CC ************************************************** CC SEQRES RYSAPPGDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLAR CC ATOM RYSAPPGDPPQPEYDLELITSCSSNVSVAHDASGKRVYYLTRDPTTPLAR CC ************************************************** CC SEQRES AAWETARHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALD CC ATOM AAWETARHTPVNSWLGNIIMYAPTLWARMILMTHFFSILLAQEQLEKALD CC ************************************************** CC SEQRES CQIYGACYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGV CC ATOM CQIYGACYSIEPLDLPQIIERLHGLSAFSLHSYSPGEINRVASCLRKLGV CC ************************************************** CC SEQRES PPLRVWRHRARSVRARLLSQGGRAATCGKYLFNWAVKTKLKLTPIPAASQ CC ATOM PPLRVWRHRARSVRARLLSQGGRAATCGKYLFNWAVKTKLKLTPIPAASQ CC ************************************************** CC SEQRES LDLSGWFVAGYSGGDIYHSLSRARPR CC ATOM LDLSGWFVAGYSGGDIYHSL------ CC ******************** SQ SEQUENCE 576 AA; MW; CN; ASHHHHHHSY TWTGALITPC AAEESKLPIN PLSNSLLRHH NMVYATTSRS ASLRQKKVTF DRLQVLDDHY RDVLKEMKAK ASTVKAKLLS VEEACKLTPP HSAKSKFGYG AKDVRNLSSK AVNHIHSVWK DLLEDTVTPI DTTIMAKNEV FCVQPEKGGR KPARLIVFPD LGVRVCEKMA LYDVVSTLPQ VVMGSSYGFQ YSPGQRVEFL VNTWKSKKNP MGFSYDTRCF DSTVTENDIR VEESIYQCCD LAPEARQAIK SLTERLYIGG PLTNSKGQNC GYRRCRASGV LTTSCGNTLT CYLKASAACR AAKLQDCTML VNGDDLVVIC ESAGVQEDAA SLRAFTEAMT RYSAPPGDPP QPEYDLELIT SCSSNVSVAH DASGKRVYYL TRDPTTPLAR AAWETARHTP VNSWLGNIIM YAPTLWARMI LMTHFFSILL AQEQLEKALD CQIYGACYSI EPLDLPQIIE RLHGLSAFSL HSYSPGEINR VASCLRKLGV PPLRVWRHRA RSVRARLLSQ GGRAATCGKY LFNWAVKTKL KLTPIPAASQ LDLSGWFVAG YSGGDIYHSL SRARPR //