ID 4G0RA STANDARD; PRT; 735 AA.
DT CONVERTED FROM PDB (SEQRES) 4G0R
DE Capsid protein VP1
OS H-1 parvovirus
CC EXPDTA X-RAY DIFFRACTION
CC RESOLU 2.700
CC R-Factor 0.216
FT #SUB 288 146 LEU A 2 2 DT C DNA/RNA A 3
FT #SUB 289 147 SER A 2 2 DT C DNA/RNA B 1
FT #SUB 290 148 GLN A 2 2 DT C DNA/RNA S 3
FT #SUB 326 184 LEU A 2 2 DT C DNA/RNA S 8
FT #SUB 328 186 SER A 2 2 DT C DNA/RNA A 7
FT #SUB 328 186 SER A 3 3 DG C DNA/RNA A 6
FT #SUB 329 187 ASN A 3 3 DG C DNA/RNA S 1
FT #SUB 329 187 ASN A 4 4 DA C DNA/RNA A 4
FT #SUB 329 187 ASN A 5 5 DC C DNA/RNA A 3
FT #SUB 329 187 ASN A 6 6 DT C DNA/RNA S 5
FT #SUB 329 187 ASN A 7 7 DT C DNA/RNA S 5
FT #SUB 329 187 ASN A 9 9 DA C DNA/RNA S 1
FT #SUB 330 188 ASN A 5 5 DC C DNA/RNA B 2
FT #SUB 331 189 ILE A 5 5 DC C DNA/RNA S 1
FT #SUB 331 189 ILE A 6 6 DT C DNA/RNA S 1
FT #SUB 414 272 THR A 1 1 DC C DNA/RNA B 2
FT #SUB 414 272 THR A 2 2 DT C DNA/RNA A 7
FT #SUB 415 273 GLY A 2 2 DT C DNA/RNA B 1
FT #SUB 416 274 THR A 2 2 DT C DNA/RNA B 1
FT #SUB 417 275 TYR A 2 2 DT C DNA/RNA A 4
FT #SUB 417 275 TYR A 3 3 DG C DNA/RNA A 13
FT #SUB 418 276 ILE A 3 3 DG C DNA/RNA A 12
FT #SUB 622 480 ASP A 5 5 DC C DNA/RNA S 1
FT #SUB 638 496 LYS A 4 4 DA C DNA/RNA B 2
FT #SUB 639 497 ASN A 3 3 DG C DNA/RNA S 5
FT #SUB 639 497 ASN A 4 4 DA C DNA/RNA A 5
FT #SUB 639 497 ASN A 9 9 DA C DNA/RNA S 1
FT #SUB 640 498 ASN A 3 3 DG C DNA/RNA A 5
FT #SUB 640 498 ASN A 4 4 DA C DNA/RNA A 12
FT #SUB 642 500 PRO A 2 2 DT C DNA/RNA A 5
FT #SUB 643 501 GLY A 2 2 DT C DNA/RNA B 11
FT #HET 197 55 LEU A 4 604 DC A S 1
FT #HET 198 56 GLY A 4 604 DC A B 1
FT #HET 199 57 ASP A 4 604 DC A S 1
FT #HET 201 59 TRP A 4 604 DC A S 20
FT #HET 328 186 SER A 9 101 MG C S 1
FT #HET 329 187 ASN A 10 102 MG C S 2
FT #HET 347 205 TYR A 3 603 EDO A S 1
FT #HET 350 208 LYS A 3 603 EDO A S 1
FT #HET 423 281 PRO A 4 604 DC A S 1
FT #HET 440 298 PRO A 5 605 NA A B 1
FT #HET 441 299 ARG A 1 601 EDO A S 3
FT #HET 441 299 ARG A 5 605 NA A B 1
FT #HET 441 299 ARG A 6 606 EDO A S 4
FT #HET 442 300 ILE A 6 606 EDO A B 3
FT #HET 443 301 THR A 6 606 EDO A B 2
FT #HET 456 314 THR A 1 601 EDO A S 1
FT #HET 460 318 ASP A 1 601 EDO A B 1
FT #HET 461 319 ARG A 1 601 EDO A A 3
FT #HET 461 319 ARG A 5 605 NA A S 1
FT #HET 462 320 PHE A 1 601 EDO A A 4
FT #HET 509 367 ASP A 7 607 NA A S 2
FT #HET 516 374 HIS A 8 608 CL A A 3
FT #HET 517 375 ASP A 8 608 CL A B 1
FT #HET 528 386 LYS A 3 603 EDO A B 3
FT #HET 529 387 GLN A 3 603 EDO A A 4
FT #HET 544 402 TYR A 2 602 EDO A S 2
FT #HET 545 403 THR A 2 602 EDO A B 1
FT #HET 546 404 TRP A 2 602 EDO A A 4
FT #HET 547 405 ASP A 2 602 EDO A B 3
FT #HET 548 406 ALA A 7 607 NA A B 1
FT #HET 549 407 ILE A 7 607 NA A B 2
FT #HET 550 408 ASP A 7 607 NA A B 2
FT #HET 682 540 LYS A 4 604 DC A S 6
FT DISORDER 1 179
FT DISORDER 308 309
CC SEQUENCE 554 AA (ATOM);
CC GIGVSTGTYD NQTTYKFLGD GWVEITAHAS RLLHLGMPPS ENYCRVTVHN NQTTGHGTKV
CC KGNMAYDDTH QQIWTPWSLV DANAWGVWFQ PSDWQFIQNS MESLNLDSLS QELFNVVVKT
CC VTEQQGAGAI KVYNNDLTAC MMVALDSNNI LPYTPAAQTS ETLGFYPWKP TAPAPYRYYF
CC FMPRQLSVTS SNSAEGTQIT DTIGEPQALN SQFFTIENTL PITLLRTGDE FTTGTYIFNT
CC DPLKLTHTWQ TNRHLGMPPR ITDLPTSDTA TASLTANGDR FGSTQTQNVN YVTEALRTRP
CC AQIGFMQPHD NFEANRGGPF KVPVVPLDIT AGEDHDANGA IRFNYGKQHG EDWAKQGAAP
CC ERYTWDAIDS AAGRDTARCF VQSAPISIPP NQNQILQRED AIAGRTNMHY TNVFNSYGPL
CC SAFPHPDPIY PNGQIWDKEL DLEHKPRLHV TAPFVCKNNP PGQLFVRLGP NLTDQFDPNS
CC TTVSRIVTYS TFYWKGILKF KAKLRPNLTW NPVYQATTDS VANSYMNVKK WLPSATGNMH
CC SDPLICRPVP HMTY
CC !----
CC ALIGNMENT OF SEQRES AND ATOMRES
CC SEQRES MAPPAKRAKRGWVPPGYKYLGPGNSLDQGEPTNPSDAAAKEHDEAYDQYI
CC ATOM   --------------------------------------------------
CC
CC SEQRES KSGKNPYLYFSPADQRFIDQTKDAKDWGGKVGHYFFRTKRAFAPKLSTDS
CC ATOM   --------------------------------------------------
CC
CC SEQRES EPGTSGVSRPGKRTKPPAHIFVNQARAKKKRASLAAQQRTLTMSDGTETN
CC ATOM   --------------------------------------------------
CC
CC SEQRES QPDTGIANARVERSADGGGSSGGGGSGGGGIGVSTGTYDNQTTYKFLGDG
CC ATOM   -----------------------------GIGVSTGTYDNQTTYKFLGDG
CC                                     *********************
CC SEQRES WVEITAHASRLLHLGMPPSENYCRVTVHNNQTTGHGTKVKGNMAYDDTHQ
CC ATOM   WVEITAHASRLLHLGMPPSENYCRVTVHNNQTTGHGTKVKGNMAYDDTHQ
CC        **************************************************
CC SEQRES QIWTPWSLVDANAWGVWFQPSDWQFIQNSMESLNLDSLSQELFNVVVKTV
CC ATOM   QIWTPWSLVDANAWGVWFQPSDWQFIQNSMESLNLDSLSQELFNVVVKTV
CC        **************************************************
CC SEQRES TEQQGAGQDAIKVYNNDLTACMMVALDSNNILPYTPAAQTSETLGFYPWK
CC ATOM   TEQQGAG--AIKVYNNDLTACMMVALDSNNILPYTPAAQTSETLGFYPWK
CC        *******  *****************************************
CC SEQRES PTAPAPYRYYFFMPRQLSVTSSNSAEGTQITDTIGEPQALNSQFFTIENT
CC ATOM   PTAPAPYRYYFFMPRQLSVTSSNSAEGTQITDTIGEPQALNSQFFTIENT
CC        **************************************************
CC SEQRES LPITLLRTGDEFTTGTYIFNTDPLKLTHTWQTNRHLGMPPRITDLPTSDT
CC ATOM   LPITLLRTGDEFTTGTYIFNTDPLKLTHTWQTNRHLGMPPRITDLPTSDT
CC        **************************************************
CC SEQRES ATASLTANGDRFGSTQTQNVNYVTEALRTRPAQIGFMQPHDNFEANRGGP
CC ATOM   ATASLTANGDRFGSTQTQNVNYVTEALRTRPAQIGFMQPHDNFEANRGGP
CC        **************************************************
CC SEQRES FKVPVVPLDITAGEDHDANGAIRFNYGKQHGEDWAKQGAAPERYTWDAID
CC ATOM   FKVPVVPLDITAGEDHDANGAIRFNYGKQHGEDWAKQGAAPERYTWDAID
CC        **************************************************
CC SEQRES SAAGRDTARCFVQSAPISIPPNQNQILQREDAIAGRTNMHYTNVFNSYGP
CC ATOM   SAAGRDTARCFVQSAPISIPPNQNQILQREDAIAGRTNMHYTNVFNSYGP
CC        **************************************************
CC SEQRES LSAFPHPDPIYPNGQIWDKELDLEHKPRLHVTAPFVCKNNPPGQLFVRLG
CC ATOM   LSAFPHPDPIYPNGQIWDKELDLEHKPRLHVTAPFVCKNNPPGQLFVRLG
CC        **************************************************
CC SEQRES PNLTDQFDPNSTTVSRIVTYSTFYWKGILKFKAKLRPNLTWNPVYQATTD
CC ATOM   PNLTDQFDPNSTTVSRIVTYSTFYWKGILKFKAKLRPNLTWNPVYQATTD
CC        **************************************************
CC SEQRES SVANSYMNVKKWLPSATGNMHSDPLICRPVPHMTY
CC ATOM   SVANSYMNVKKWLPSATGNMHSDPLICRPVPHMTY
CC        ***********************************
SQ SEQUENCE 735 AA; MW; CN;
     MAPPAKRAKR GWVPPGYKYL GPGNSLDQGE PTNPSDAAAK EHDEAYDQYI KSGKNPYLYF
     SPADQRFIDQ TKDAKDWGGK VGHYFFRTKR AFAPKLSTDS EPGTSGVSRP GKRTKPPAHI
     FVNQARAKKK RASLAAQQRT LTMSDGTETN QPDTGIANAR VERSADGGGS SGGGGSGGGG
     IGVSTGTYDN QTTYKFLGDG WVEITAHASR LLHLGMPPSE NYCRVTVHNN QTTGHGTKVK
     GNMAYDDTHQ QIWTPWSLVD ANAWGVWFQP SDWQFIQNSM ESLNLDSLSQ ELFNVVVKTV
     TEQQGAGQDA IKVYNNDLTA CMMVALDSNN ILPYTPAAQT SETLGFYPWK PTAPAPYRYY
     FFMPRQLSVT SSNSAEGTQI TDTIGEPQAL NSQFFTIENT LPITLLRTGD EFTTGTYIFN
     TDPLKLTHTW QTNRHLGMPP RITDLPTSDT ATASLTANGD RFGSTQTQNV NYVTEALRTR
     PAQIGFMQPH DNFEANRGGP FKVPVVPLDI TAGEDHDANG AIRFNYGKQH GEDWAKQGAA
     PERYTWDAID SAAGRDTARC FVQSAPISIP PNQNQILQRE DAIAGRTNMH YTNVFNSYGP
     LSAFPHPDPI YPNGQIWDKE LDLEHKPRLH VTAPFVCKNN PPGQLFVRLG PNLTDQFDPN
     STTVSRIVTY STFYWKGILK FKAKLRPNLT WNPVYQATTD SVANSYMNVK KWLPSATGNM
     HSDPLICRPV PHMTY
//
ID 4G0RC STANDARD; PRT; 10 AA.
DT CONVERTED FROM PDB (SEQRES) 4G0R
DE DNA (5'-D(P*CP*TP*GP*AP*CP*TP*TP*CP*AP*A)-3')
OS H-1 PARVOVIRUS
CC EXPDTA X-RAY DIFFRACTION
CC RESOLU 2.700
CC R-Factor 0.216
FT #SUB 1 1 DC C 414 272 THR A Protein S 2
FT #SUB 2 2 DT C 288 146 LEU A Protein S 3
FT #SUB 2 2 DT C 289 147 SER A Protein S 1
FT #SUB 2 2 DT C 290 148 GLN A Protein S 3
FT #SUB 2 2 DT C 326 184 LEU A Protein S 8
FT #SUB 2 2 DT C 328 186 SER A Protein S 7
FT #SUB 2 2 DT C 414 272 THR A Protein S 7
FT #SUB 2 2 DT C 415 273 GLY A Protein S 1
FT #SUB 2 2 DT C 416 274 THR A Protein S 1
FT #SUB 2 2 DT C 417 275 TYR A Protein S 4
FT #SUB 2 2 DT C 642 500 PRO A Protein S 5
FT #SUB 2 2 DT C 643 501 GLY A Protein S 11
FT #SUB 3 3 DG C 328 186 SER A Protein S 6
FT #SUB 3 3 DG C 329 187 ASN A Protein S 1
FT #SUB 3 3 DG C 417 275 TYR A Protein S 13
FT #SUB 3 3 DG C 418 276 ILE A Protein S 12
FT #SUB 3 3 DG C 639 497 ASN A Protein S 5
FT #SUB 3 3 DG C 640 498 ASN A Protein S 5
FT #SUB 4 4 DA C 329 187 ASN A Protein S 4
FT #SUB 4 4 DA C 638 496 LYS A Protein S 2
FT #SUB 4 4 DA C 639 497 ASN A Protein S 5
FT #SUB 4 4 DA C 640 498 ASN A Protein S 12
FT #SUB 5 5 DC C 329 187 ASN A Protein S 3
FT #SUB 5 5 DC C 330 188 ASN A Protein S 2
FT #SUB 5 5 DC C 331 189 ILE A Protein S 1
FT #SUB 5 5 DC C 622 480 ASP A Protein S 1
FT #SUB 6 6 DT C 329 187 ASN A Protein S 5
FT #SUB 6 6 DT C 331 189 ILE A Protein S 1
FT #SUB 7 7 DT C 329 187 ASN A Protein S 5
FT #SUB 9 9 DA C 329 187 ASN A Protein S 1
FT #SUB 9 9 DA C 639 497 ASN A Protein S 1
FT #HET 1 1 DC C 9 101 MG C S 2
FT #HET 3 3 DG C 9 101 MG C S 2
FT #HET 4 4 DA C 10 102 MG C S 2
FT #HET 6 6 DT C 10 102 MG C S 2
FT #HET 7 7 DT C 9 101 MG C S 1
FT #HET 7 7 DT C 10 102 MG C S 2
FT #HET 9 9 DA C 9 101 MG C S 3
FT #HET 9 9 DA C 10 102 MG C S 2
CC SEQUENCE 10 AA (ATOM);
CC ctgacttcaa
CC !----
CC ALIGNMENT OF SEQRES AND ATOMRES
CC SEQRES ctgacttcaa
CC ATOM   ctgacttcaa
CC        **********
SQ SEQUENCE 10 AA; MW; CN;
     ctgacttcaa
//