ID 4RSOA STANDARD; PRT; 736 AA. DT CONVERTED FROM PDB (SEQRES) 4RSO DE Capsid protein VP1 OS Non-human primate Adeno-associated virus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.500 CC R-Factor 0.217 FT #HET 253 253 ASN A 2 802 NA A S 1 FT #HET 278 278 SER A 2 802 NA A S 1 FT #HET 420 420 VAL A 1 801 DA A S 1 FT #HET 609 609 ASN A 1 801 DA A S 1 FT #HET 630 630 HIS A 1 801 DA A S 5 FT #HET 631 631 PRO A 1 801 DA A B 4 FT #HET 632 632 SER A 1 801 DA A S 3 FT #HET 637 637 GLY A 1 801 DA A B 1 FT #HET 638 638 PHE A 1 801 DA A B 1 FT #HET 639 639 GLY A 1 801 DA A B 8 FT DISORDER 1 216 CC SEQUENCE 520 AA (ATOM); CC GADGVGNSSG NWHCDSTWLG DRVITTSTRT WALPTYNNHL YKQISNGTSG GSTNDNTYFG CC YSTPWGYFDF NRFHCHFSPR DWQRLINNNW GFRPKRLNFK LFNIQVKEVT TNEGTKTIAN CC NLTSTVQVFT DSEYQLPYVL GSAHQGCLPP FPADVFMVPQ YGYLTLNNGS QALGRSSFYC CC LEYFPSQMLR TGNNFQFSYT FEDVPFHSSY AHSQSLDRLM NPLIDQYLYY LVRTQTTGTG CC GTQTLAFSQA GPSSMANQAR NWVPGPCYRQ QRVSTTTNQN NNSNFAWTGA AKFKLNGRDS CC LMNPGVAMAS HKDDDDRFFP SSGVLIFGKQ GAGNDGVDYS QVLITDEEEI KATNPVATEE CC YGAVAINNQA ANTQAQTGLV HNQGVIPGMV WQNRDVYLQG PIWAKIPHTD GNFHPSPLMG CC GFGLKHPPPQ ILIKNTPVPA DPPLTFNQAK LNSFITQYST GQVSVEIEWE LQKENSKRWN CC PEIQYTSNYY KSTNVDFAVN TEGVYSEPRP IGTRYLTRNL CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDDGRGLVLPGY CC ATOM -------------------------------------------------- CC CC SEQRES KYLGPFNGLDKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEF CC ATOM -------------------------------------------------- CC CC SEQRES QERLQEDTSFGGNLGRAVFQAKKRVLEPLGLVEEGAKTAPGKKRPVEQSP CC ATOM -------------------------------------------------- CC CC SEQRES QEPDSSSGIGKTGQQPAKKRLNFGQTGDSESVPDPQPLGEPPAAPSGLGP CC ATOM -------------------------------------------------- CC CC SEQRES NTMASGGGAPMADNNEGADGVGNSSGNWHCDSTWLGDRVITTSTRTWALP CC ATOM ----------------GADGVGNSSGNWHCDSTWLGDRVITTSTRTWALP CC ********************************** CC SEQRES TYNNHLYKQISNGTSGGSTNDNTYFGYSTPWGYFDFNRFHCHFSPRDWQR CC ATOM TYNNHLYKQISNGTSGGSTNDNTYFGYSTPWGYFDFNRFHCHFSPRDWQR CC ************************************************** CC SEQRES LINNNWGFRPKRLNFKLFNIQVKEVTTNEGTKTIANNLTSTVQVFTDSEY CC ATOM LINNNWGFRPKRLNFKLFNIQVKEVTTNEGTKTIANNLTSTVQVFTDSEY CC ************************************************** CC SEQRES QLPYVLGSAHQGCLPPFPADVFMVPQYGYLTLNNGSQALGRSSFYCLEYF CC ATOM QLPYVLGSAHQGCLPPFPADVFMVPQYGYLTLNNGSQALGRSSFYCLEYF CC ************************************************** CC SEQRES PSQMLRTGNNFQFSYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLVRT CC ATOM PSQMLRTGNNFQFSYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLVRT CC ************************************************** CC SEQRES QTTGTGGTQTLAFSQAGPSSMANQARNWVPGPCYRQQRVSTTTNQNNNSN CC ATOM QTTGTGGTQTLAFSQAGPSSMANQARNWVPGPCYRQQRVSTTTNQNNNSN CC ************************************************** CC SEQRES FAWTGAAKFKLNGRDSLMNPGVAMASHKDDDDRFFPSSGVLIFGKQGAGN CC ATOM FAWTGAAKFKLNGRDSLMNPGVAMASHKDDDDRFFPSSGVLIFGKQGAGN CC ************************************************** CC SEQRES DGVDYSQVLITDEEEIKATNPVATEEYGAVAINNQAANTQAQTGLVHNQG CC ATOM DGVDYSQVLITDEEEIKATNPVATEEYGAVAINNQAANTQAQTGLVHNQG CC ************************************************** CC SEQRES VIPGMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIK CC ATOM VIPGMVWQNRDVYLQGPIWAKIPHTDGNFHPSPLMGGFGLKHPPPQILIK CC ************************************************** CC SEQRES NTPVPADPPLTFNQAKLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQ CC ATOM NTPVPADPPLTFNQAKLNSFITQYSTGQVSVEIEWELQKENSKRWNPEIQ CC ************************************************** CC SEQRES YTSNYYKSTNVDFAVNTEGVYSEPRPIGTRYLTRNL CC ATOM YTSNYYKSTNVDFAVNTEGVYSEPRPIGTRYLTRNL CC ************************************ SQ SEQUENCE 736 AA; MW; CN; MAADGYLPDW LEDNLSEGIR EWWDLKPGAP KPKANQQKQD DGRGLVLPGY KYLGPFNGLD KGEPVNAADA AALEHDKAYD QQLKAGDNPY LRYNHADAEF QERLQEDTSF GGNLGRAVFQ AKKRVLEPLG LVEEGAKTAP GKKRPVEQSP QEPDSSSGIG KTGQQPAKKR LNFGQTGDSE SVPDPQPLGE PPAAPSGLGP NTMASGGGAP MADNNEGADG VGNSSGNWHC DSTWLGDRVI TTSTRTWALP TYNNHLYKQI SNGTSGGSTN DNTYFGYSTP WGYFDFNRFH CHFSPRDWQR LINNNWGFRP KRLNFKLFNI QVKEVTTNEG TKTIANNLTS TVQVFTDSEY QLPYVLGSAH QGCLPPFPAD VFMVPQYGYL TLNNGSQALG RSSFYCLEYF PSQMLRTGNN FQFSYTFEDV PFHSSYAHSQ SLDRLMNPLI DQYLYYLVRT QTTGTGGTQT LAFSQAGPSS MANQARNWVP GPCYRQQRVS TTTNQNNNSN FAWTGAAKFK LNGRDSLMNP GVAMASHKDD DDRFFPSSGV LIFGKQGAGN DGVDYSQVLI TDEEEIKATN PVATEEYGAV AINNQAANTQ AQTGLVHNQG VIPGMVWQNR DVYLQGPIWA KIPHTDGNFH PSPLMGGFGL KHPPPQILIK NTPVPADPPL TFNQAKLNSF ITQYSTGQVS VEIEWELQKE NSKRWNPEIQ YTSNYYKSTN VDFAVNTEGV YSEPRPIGTR YLTRNL //