ID 4DYPA STANDARD; PRT; 499 AA. DT CONVERTED FROM PDB (SEQRES) 4DYP DE Nucleocapsid protein OS Influenza A virus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.820 CC R-Factor 0.256 FT #SUB 43 50 SER A 280 287 SER B Protein S 2 FT #SUB 44 51 ASP A 280 287 SER B Protein A 10 FT #SUB 45 52 TYR A 277 284 ALA B Protein S 3 FT #SUB 45 52 TYR A 280 287 SER B Protein A 11 FT #SUB 45 52 TYR A 282 289 TYR B Protein S 2 FT #SUB 48 55 ARG A 280 287 SER B Protein S 1 FT #SUB 277 284 ALA A 45 52 TYR B Protein A 4 FT #SUB 280 287 SER A 43 50 SER B Protein B 2 FT #SUB 280 287 SER A 44 51 ASP B Protein A 11 FT #SUB 280 287 SER A 45 52 TYR B Protein A 10 FT #SUB 280 287 SER A 48 55 ARG B Protein S 1 FT #SUB 281 288 GLY A 43 50 SER B Protein B 1 FT #SUB 281 288 GLY A 45 52 TYR B Protein B 1 FT #SUB 282 289 TYR A 45 52 TYR B Protein A 2 FT #SUB 302 309 ASN A 306 313 TYR B Protein S 5 FT #SUB 302 309 ASN A 371 378 THR B Protein S 1 FT #SUB 304 311 GLN A 301 308 GLN B Protein S 4 FT #SUB 304 311 GLN A 302 309 ASN B Protein S 7 FT #SUB 304 311 GLN A 303 310 SER B Protein S 2 FT #SUB 306 313 TYR A 302 309 ASN B Protein S 6 FT #SUB 371 378 THR A 302 309 ASN B Protein S 1 FT #HET 45 52 TYR A 2 601 0MS B A 18 FT #HET 46 53 GLU A 2 601 0MS B A 4 FT #HET 49 56 LEU A 2 601 0MS B S 1 FT #HET 92 99 ARG A 2 601 0MS B S 7 FT #HET 97 104 TRP A 2 601 0MS B S 11 FT #HET 282 289 TYR A 1 601 0MS A S 25 FT #HET 287 294 GLU A 1 601 0MS A S 1 FT #HET 295 302 ASP A 1 601 0MS A A 3 FT #HET 298 305 ARG A 1 601 0MS A S 5 FT #HET 299 306 LEU A 1 601 0MS A A 3 FT #HET 302 309 ASN A 1 601 0MS A S 10 FT #HET 306 313 TYR A 2 601 0MS B S 6 FT #HET 369 376 SER A 2 601 0MS B S 4 FT DISORDER 1 14 FT DISORDER 65 83 FT DISORDER 192 204 FT DISORDER 423 431 FT DISORDER 473 477 FT DISORDER 491 499 CC Miss-BB 1 CC Miss-SC 2 CC SEQUENCE 430 AA (ATOM); CC ATEIRASVGK MIDGIGRFYI QMCTELKLSD YEGRLIQNSL TIERMVLSAF KTGGPIYRRV CC DGKWRRELIL YDKEEIRRIW RQANNGDDAT AGLTHMMIWH SNLNDATYQR TRALVRTGMD CC PRMCSLMQGS TLPRRSGAAG AAVKGVGTMV MELIRMIKGR RTRIAYERMC NILKGKFQTA CC AQRTMVDQVR ESRNPGNAEF EDLIFLARSA LILRGSVAHK SCLPACVYGS AVASGYDFER CC EGYSLVGIDP FRLLQNSQVY SLIRPNENPA HKSQLVWMAC HSAAFEDLRV SSFIRGTKVV CC PRGKLSTRGV QIASNENMET MESSTLELRS RYWAIRTRSG GNTNQQRASS GQISIQPTFS CC VQRNLPFDRP TIMAAFDMRT EIIRLMESAR PEDVSFQGRG VFELSDEKAT SPIVPSFGSY CC FFGDNAEEYD CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES RSYEQMETDGERQNATEIRASVGKMIDGIGRFYIQMCTELKLSDYEGRLI CC ATOM --------------ATEIRASVGKMIDGIGRFYIQMCTELKLSDYEGRLI CC ************************************ CC SEQRES QNSLTIERMVLSAFDERRNKYLEEHPSAGKDPKKTGGPIYRRVDGKWRRE CC ATOM QNSLTIERMVLSAF-------------------KTGGPIYRRVDGKWRRE CC ************** ***************** CC SEQRES LILYDKEEIRRIWRQANNGDDATAGLTHMMIWHSNLNDATYQRTRALVRT CC ATOM LILYDKEEIRRIWRQANNGDDATAGLTHMMIWHSNLNDATYQRTRALVRT CC ************************************************** CC SEQRES GMDPRMCSLMQGSTLPRRSGAAGAAVKGVGTMVMELIRMIKRGINDRNFW CC ATOM GMDPRMCSLMQGSTLPRRSGAAGAAVKGVGTMVMELIRMIK--------- CC ***************************************** CC SEQRES RGENGRRTRIAYERMCNILKGKFQTAAQRTMVDQVRESRNPGNAEFEDLI CC ATOM ----GRRTRIAYERMCNILKGKFQTAAQRTMVDQVRESRNPGNAEFEDLI CC ********************************************** CC SEQRES FLARSALILRGSVAHKSCLPACVYGSAVASGYDFEREGYSLVGIDPFRLL CC ATOM FLARSALILRGSVAHKSCLPACVYGSAVASGYDFEREGYSLVGIDPFRLL CC ************************************************** CC SEQRES QNSQVYSLIRPNENPAHKSQLVWMACHSAAFEDLRVSSFIRGTKVVPRGK CC ATOM QNSQVYSLIRPNENPAHKSQLVWMACHSAAFEDLRVSSFIRGTKVVPRGK CC ************************************************** CC SEQRES LSTRGVQIASNENMETMESSTLELRSRYWAIRTRSGGNTNQQRASSGQIS CC ATOM LSTRGVQIASNENMETMESSTLELRSRYWAIRTRSGGNTNQQRASSGQIS CC ************************************************** CC SEQRES IQPTFSVQRNLPFDRPTIMAAFTGNTEGRTSDMRTEIIRLMESARPEDVS CC ATOM IQPTFSVQRNLPFDRPTIMAAF---------DMRTEIIRLMESARPEDVS CC ********************** ******************* CC SEQRES FQGRGVFELSDEKATSPIVPSFDMSNEGSYFFGDNAEEYDNLEHHHHHH CC ATOM FQGRGVFELSDEKATSPIVPSF-----GSYFFGDNAEEYD--------- CC ********************** ************* SQ SEQUENCE 499 AA; MW; CN; RSYEQMETDG ERQNATEIRA SVGKMIDGIG RFYIQMCTEL KLSDYEGRLI QNSLTIERMV LSAFDERRNK YLEEHPSAGK DPKKTGGPIY RRVDGKWRRE LILYDKEEIR RIWRQANNGD DATAGLTHMM IWHSNLNDAT YQRTRALVRT GMDPRMCSLM QGSTLPRRSG AAGAAVKGVG TMVMELIRMI KRGINDRNFW RGENGRRTRI AYERMCNILK GKFQTAAQRT MVDQVRESRN PGNAEFEDLI FLARSALILR GSVAHKSCLP ACVYGSAVAS GYDFEREGYS LVGIDPFRLL QNSQVYSLIR PNENPAHKSQ LVWMACHSAA FEDLRVSSFI RGTKVVPRGK LSTRGVQIAS NENMETMESS TLELRSRYWA IRTRSGGNTN QQRASSGQIS IQPTFSVQRN LPFDRPTIMA AFTGNTEGRT SDMRTEIIRL MESARPEDVS FQGRGVFELS DEKATSPIVP SFDMSNEGSY FFGDNAEEYD NLEHHHHHH // ID 4DYPB STANDARD; PRT; 499 AA. DT CONVERTED FROM PDB (SEQRES) 4DYP DE Nucleocapsid protein OS Influenza A virus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.820 CC R-Factor 0.256 FT #SUB 43 50 SER B 280 287 SER A Protein S 2 FT #SUB 43 50 SER B 281 288 GLY A Protein S 1 FT #SUB 44 51 ASP B 280 287 SER A Protein A 11 FT #SUB 45 52 TYR B 277 284 ALA A Protein S 4 FT #SUB 45 52 TYR B 280 287 SER A Protein A 10 FT #SUB 45 52 TYR B 281 288 GLY A Protein S 1 FT #SUB 45 52 TYR B 282 289 TYR A Protein S 2 FT #SUB 48 55 ARG B 280 287 SER A Protein S 1 FT #SUB 277 284 ALA B 45 52 TYR A Protein A 3 FT #SUB 280 287 SER B 43 50 SER A Protein B 2 FT #SUB 280 287 SER B 44 51 ASP A Protein A 10 FT #SUB 280 287 SER B 45 52 TYR A Protein A 11 FT #SUB 280 287 SER B 48 55 ARG A Protein S 1 FT #SUB 282 289 TYR B 45 52 TYR A Protein A 2 FT #SUB 301 308 GLN B 304 311 GLN A Protein B 4 FT #SUB 302 309 ASN B 304 311 GLN A Protein A 7 FT #SUB 302 309 ASN B 306 313 TYR A Protein A 6 FT #SUB 302 309 ASN B 371 378 THR A Protein S 1 FT #SUB 303 310 SER B 304 311 GLN A Protein B 2 FT #SUB 306 313 TYR B 302 309 ASN A Protein S 5 FT #SUB 371 378 THR B 302 309 ASN A Protein S 1 FT #HET 45 52 TYR B 1 601 0MS A A 17 FT #HET 46 53 GLU B 1 601 0MS A A 4 FT #HET 49 56 LEU B 1 601 0MS A S 1 FT #HET 92 99 ARG B 1 601 0MS A S 9 FT #HET 97 104 TRP B 1 601 0MS A S 10 FT #HET 282 289 TYR B 2 601 0MS B S 24 FT #HET 287 294 GLU B 2 601 0MS B S 1 FT #HET 295 302 ASP B 2 601 0MS B B 1 FT #HET 298 305 ARG B 2 601 0MS B S 3 FT #HET 299 306 LEU B 2 601 0MS B A 2 FT #HET 302 309 ASN B 2 601 0MS B S 8 FT #HET 306 313 TYR B 1 601 0MS A S 6 FT #HET 369 376 SER B 1 601 0MS A S 4 FT DISORDER 1 14 FT DISORDER 76 81 FT DISORDER 193 206 FT DISORDER 387 396 FT DISORDER 421 433 FT DISORDER 471 478 FT DISORDER 484 499 CC SEQUENCE 418 AA (ATOM); CC ATEIRASVGK MIDGIGRFYI QMCTELKLSD YEGRLIQNSL TIERMVLSAF DERRNKYLEE CC HPKKTGGPIY RRVDGKWRRE LILYDKEEIR RIWRQANNGD DATAGLTHMM IWHSNLNDAT CC YQRTRALVRT GMDPRMCSLM QGSTLPRRSG AAGAAVKGVG TMVMELIRMI KRRTRIAYER CC MCNILKGKFQ TAAQRTMVDQ VRESRNPGNA EFEDLIFLAR SALILRGSVA HKSCLPACVY CC GSAVASGYDF EREGYSLVGI DPFRLLQNSQ VYSLIRPNEN PAHKSQLVWM ACHSAAFEDL CC RVSSFIRGTK VVPRGKLSTR GVQIASNENM ETMESSTLEL RSRYWAIRTR SGGQISIQPT CC FSVQRNLPFD RPTIMARTEI IRLMESARPE DVSFQGRGVF ELSDEKATSP IVPSYFFG CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES RSYEQMETDGERQNATEIRASVGKMIDGIGRFYIQMCTELKLSDYEGRLI CC ATOM --------------ATEIRASVGKMIDGIGRFYIQMCTELKLSDYEGRLI CC ************************************ CC SEQRES QNSLTIERMVLSAFDERRNKYLEEHPSAGKDPKKTGGPIYRRVDGKWRRE CC ATOM QNSLTIERMVLSAFDERRNKYLEEH------PKKTGGPIYRRVDGKWRRE CC ************************* ******************* CC SEQRES LILYDKEEIRRIWRQANNGDDATAGLTHMMIWHSNLNDATYQRTRALVRT CC ATOM LILYDKEEIRRIWRQANNGDDATAGLTHMMIWHSNLNDATYQRTRALVRT CC ************************************************** CC SEQRES GMDPRMCSLMQGSTLPRRSGAAGAAVKGVGTMVMELIRMIKRGINDRNFW CC ATOM GMDPRMCSLMQGSTLPRRSGAAGAAVKGVGTMVMELIRMIKR-------- CC ****************************************** CC SEQRES RGENGRRTRIAYERMCNILKGKFQTAAQRTMVDQVRESRNPGNAEFEDLI CC ATOM ------RTRIAYERMCNILKGKFQTAAQRTMVDQVRESRNPGNAEFEDLI CC ******************************************** CC SEQRES FLARSALILRGSVAHKSCLPACVYGSAVASGYDFEREGYSLVGIDPFRLL CC ATOM FLARSALILRGSVAHKSCLPACVYGSAVASGYDFEREGYSLVGIDPFRLL CC ************************************************** CC SEQRES QNSQVYSLIRPNENPAHKSQLVWMACHSAAFEDLRVSSFIRGTKVVPRGK CC ATOM QNSQVYSLIRPNENPAHKSQLVWMACHSAAFEDLRVSSFIRGTKVVPRGK CC ************************************************** CC SEQRES LSTRGVQIASNENMETMESSTLELRSRYWAIRTRSGGNTNQQRASSGQIS CC ATOM LSTRGVQIASNENMETMESSTLELRSRYWAIRTRSG----------GQIS CC ************************************ **** CC SEQRES IQPTFSVQRNLPFDRPTIMAAFTGNTEGRTSDMRTEIIRLMESARPEDVS CC ATOM IQPTFSVQRNLPFDRPTIMA-------------RTEIIRLMESARPEDVS CC ******************** ***************** CC SEQRES FQGRGVFELSDEKATSPIVPSFDMSNEGSYFFGDNAEEYDNLEHHHHHH CC ATOM FQGRGVFELSDEKATSPIVP--------SYFFG---------------- CC ******************** ***** SQ SEQUENCE 499 AA; MW; CN; RSYEQMETDG ERQNATEIRA SVGKMIDGIG RFYIQMCTEL KLSDYEGRLI QNSLTIERMV LSAFDERRNK YLEEHPSAGK DPKKTGGPIY RRVDGKWRRE LILYDKEEIR RIWRQANNGD DATAGLTHMM IWHSNLNDAT YQRTRALVRT GMDPRMCSLM QGSTLPRRSG AAGAAVKGVG TMVMELIRMI KRGINDRNFW RGENGRRTRI AYERMCNILK GKFQTAAQRT MVDQVRESRN PGNAEFEDLI FLARSALILR GSVAHKSCLP ACVYGSAVAS GYDFEREGYS LVGIDPFRLL QNSQVYSLIR PNENPAHKSQ LVWMACHSAA FEDLRVSSFI RGTKVVPRGK LSTRGVQIAS NENMETMESS TLELRSRYWA IRTRSGGNTN QQRASSGQIS IQPTFSVQRN LPFDRPTIMA AFTGNTEGRT SDMRTEIIRL MESARPEDVS FQGRGVFELS DEKATSPIVP SFDMSNEGSY FFGDNAEEYD NLEHHHHHH //