ID 3O02A STANDARD; PRT; 308 AA.
DT CONVERTED FROM PDB (SEQRES) 3O02
DE Cell invasion protein sipD
OS Salmonella enterica
CC EXPDTA X-RAY DIFFRACTION
CC RESOLU 1.900
CC R-Factor 0.199
FT #SUB 4 39 GLU A 72 107 SER B Protein A 6
FT #SUB 5 40 HIS A 72 107 SER B Protein B 5
FT #SUB 5 40 HIS A 75 110 SER B Protein A 4
FT #SUB 6 41 ARG A 72 107 SER B Protein A 7
FT #SUB 7 42 GLY A 72 107 SER B Protein B 3
FT #SUB 7 42 GLY A 73 108 ALA B Protein B 1
FT #SUB 7 42 GLY A 75 110 SER B Protein B 5
FT #SUB 8 43 THR A 75 110 SER B Protein A 12
FT #SUB 8 43 THR A 76 111 ALA B Protein S 6
FT #SUB 8 43 THR A 77 112 PRO B Protein S 4
FT #SUB 11 46 ILE A 77 112 PRO B Protein S 3
FT #SUB 11 46 ILE A 82 117 PHE B Protein S 2
FT #SUB 12 47 SER A 77 112 PRO B Protein S 1
FT #SUB 15 50 GLN A 81 116 LEU B Protein S 1
FT #SUB 15 50 GLN A 82 117 PHE B Protein S 1
FT #SUB 18 53 THR A 276 311 LYS B Protein S 1
FT #SUB 19 54 LYS A 201 236 SER B Protein S 2
FT #SUB 22 57 GLN A 204 239 ASN B Protein S 5
FT #SUB 22 57 GLN A 276 311 LYS B Protein S 3
FT #HET 1 36 GLY A 1 1 NI A B 3
FT #HET 2 37 HIS A 1 1 NI A A 9
FT #HET 3 38 MET A 1 1 NI A B 1
FT #HET 6 41 ARG A 2 1 JN3 B S 3
FT #HET 10 45 ILE A 2 1 JN3 B S 1
FT #HET 303 338 LYS A 2 1 JN3 B A 4
FT #HET 305 340 PHE A 2 1 JN3 B S 6
FT DISORDER 75 98
FT DISORDER 308 308
CC SEQUENCE 283 AA (ATOM);
CC GHMEHRGTDI ISLSQAATKI HQAQQTLQST PPISEENNDE RTLARQQLTS SLNALAKSGV
CC SLSAEQNENL RSAFIWDMVS QNISAIGDSY LGVYENVVAV YTDFYQAFSD ILSKMGGWLL
CC PGKDGNTVKL DVTSLKNDLN SLVNKYNQIN SNTVLFPAQS GSGVKVATEA EARQWLSELN
CC LPNSCLKSYG SGYVVTVDLT PLQKMVQDID GLGAPGKDSK LEMDNAKYQA WQSGFKAQEE
CC NMKTTLQTLT QKYSNANSLY DNLVKVLSST ISSSLETAKS FLQ
CC !----
CC ALIGNMENT OF SEQRES AND ATOMRES
CC SEQRES GHMEHRGTDIISLSQAATKIHQAQQTLQSTPPISEENNDERTLARQQLTS
CC ATOM   GHMEHRGTDIISLSQAATKIHQAQQTLQSTPPISEENNDERTLARQQLTS
CC        **************************************************
CC SEQRES SLNALAKSGVSLSAEQNENLRSAFSAPTSALFSASPMAQPRTTISDAEIW
CC ATOM   SLNALAKSGVSLSAEQNENLRSAF------------------------IW
CC        ************************                        **
CC SEQRES DMVSQNISAIGDSYLGVYENVVAVYTDFYQAFSDILSKMGGWLLPGKDGN
CC ATOM   DMVSQNISAIGDSYLGVYENVVAVYTDFYQAFSDILSKMGGWLLPGKDGN
CC        **************************************************
CC SEQRES TVKLDVTSLKNDLNSLVNKYNQINSNTVLFPAQSGSGVKVATEAEARQWL
CC ATOM   TVKLDVTSLKNDLNSLVNKYNQINSNTVLFPAQSGSGVKVATEAEARQWL
CC        **************************************************
CC SEQRES SELNLPNSCLKSYGSGYVVTVDLTPLQKMVQDIDGLGAPGKDSKLEMDNA
CC ATOM   SELNLPNSCLKSYGSGYVVTVDLTPLQKMVQDIDGLGAPGKDSKLEMDNA
CC        **************************************************
CC SEQRES KYQAWQSGFKAQEENMKTTLQTLTQKYSNANSLYDNLVKVLSSTISSSLE
CC ATOM   KYQAWQSGFKAQEENMKTTLQTLTQKYSNANSLYDNLVKVLSSTISSSLE
CC        **************************************************
CC SEQRES TAKSFLQG
CC ATOM   TAKSFLQ-
CC        *******
SQ SEQUENCE 308 AA; MW; CN;
     GHMEHRGTDI ISLSQAATKI HQAQQTLQST PPISEENNDE RTLARQQLTS SLNALAKSGV
     SLSAEQNENL RSAFSAPTSA LFSASPMAQP RTTISDAEIW DMVSQNISAI GDSYLGVYEN
     VVAVYTDFYQ AFSDILSKMG GWLLPGKDGN TVKLDVTSLK NDLNSLVNKY NQINSNTVLF
     PAQSGSGVKV ATEAEARQWL SELNLPNSCL KSYGSGYVVT VDLTPLQKMV QDIDGLGAPG
     KDSKLEMDNA KYQAWQSGFK AQEENMKTTL QTLTQKYSNA NSLYDNLVKV LSSTISSSLE
     TAKSFLQG
//
ID 3O02B STANDARD; PRT; 308 AA.
DT CONVERTED FROM PDB (SEQRES) 3O02
DE Cell invasion protein sipD
OS Salmonella enterica
CC EXPDTA X-RAY DIFFRACTION
CC RESOLU 1.900
CC R-Factor 0.199
FT #SUB 72 107 SER B 4 39 GLU A Protein A 6
FT #SUB 72 107 SER B 5 40 HIS A Protein A 5
FT #SUB 72 107 SER B 6 41 ARG A Protein A 7
FT #SUB 72 107 SER B 7 42 GLY A Protein B 3
FT #SUB 73 108 ALA B 7 42 GLY A Protein B 1
FT #SUB 75 110 SER B 5 40 HIS A Protein S 4
FT #SUB 75 110 SER B 7 42 GLY A Protein A 5
FT #SUB 75 110 SER B 8 43 THR A Protein A 12
FT #SUB 76 111 ALA B 8 43 THR A Protein B 6
FT #SUB 77 112 PRO B 8 43 THR A Protein A 4
FT #SUB 77 112 PRO B 11 46 ILE A Protein S 3
FT #SUB 77 112 PRO B 12 47 SER A Protein S 1
FT #SUB 81 116 LEU B 15 50 GLN A Protein S 1
FT #SUB 82 117 PHE B 11 46 ILE A Protein S 2
FT #SUB 82 117 PHE B 15 50 GLN A Protein S 1
FT #SUB 201 236 SER B 19 54 LYS A Protein A 2
FT #SUB 204 239 ASN B 22 57 GLN A Protein S 5
FT #SUB 276 311 LYS B 18 53 THR A Protein S 1
FT #SUB 276 311 LYS B 22 57 GLN A Protein S 3
FT #HET 69 104 ASN B 2 1 JN3 B S 4
FT #HET 73 108 ALA B 2 1 JN3 B S 2
FT #HET 283 318 LEU B 2 1 JN3 B A 3
FT #HET 286 321 ASN B 2 1 JN3 B S 7
FT #HET 287 322 LEU B 2 1 JN3 B S 1
FT #HET 290 325 VAL B 2 1 JN3 B S 2
FT DISORDER 1 11
FT DISORDER 57 59
FT DISORDER 83 99
FT DISORDER 184 186
FT DISORDER 302 308
CC SEQUENCE 267 AA (ATOM);
CC SLSQAATKIH QAQQTLQSTP PISEENNDER TLARQQLTSS LNALAVSLSA EQNENLRSAF
CC SAPTSALFWD MVSQNISAIG DSYLGVYENV VAVYTDFYQA FSDILSKMGG WLLPGKDGNT
CC VKLDVTSLKN DLNSLVNKYN QINSNTVLFP AQGVKVATEA EARQWLSELN LPNSCLKSYG
CC SGYVVTVDLT PLQKMVQDID GLGAPGKDSK LEMDNAKYQA WQSGFKAQEE NMKTTLQTLT
CC QKYSNANSLY DNLVKVLSST ISSSLET
CC !----
CC ALIGNMENT OF SEQRES AND ATOMRES
CC SEQRES GHMEHRGTDIISLSQAATKIHQAQQTLQSTPPISEENNDERTLARQQLTS
CC ATOM   -----------SLSQAATKIHQAQQTLQSTPPISEENNDERTLARQQLTS
CC                   ***************************************
CC SEQRES SLNALAKSGVSLSAEQNENLRSAFSAPTSALFSASPMAQPRTTISDAEIW
CC ATOM   SLNALA---VSLSAEQNENLRSAFSAPTSALF-----------------W
CC        ******   ***********************                 *
CC SEQRES DMVSQNISAIGDSYLGVYENVVAVYTDFYQAFSDILSKMGGWLLPGKDGN
CC ATOM   DMVSQNISAIGDSYLGVYENVVAVYTDFYQAFSDILSKMGGWLLPGKDGN
CC        **************************************************
CC SEQRES TVKLDVTSLKNDLNSLVNKYNQINSNTVLFPAQSGSGVKVATEAEARQWL
CC ATOM   TVKLDVTSLKNDLNSLVNKYNQINSNTVLFPAQ---GVKVATEAEARQWL
CC        *********************************   **************
CC SEQRES SELNLPNSCLKSYGSGYVVTVDLTPLQKMVQDIDGLGAPGKDSKLEMDNA
CC ATOM   SELNLPNSCLKSYGSGYVVTVDLTPLQKMVQDIDGLGAPGKDSKLEMDNA
CC        **************************************************
CC SEQRES KYQAWQSGFKAQEENMKTTLQTLTQKYSNANSLYDNLVKVLSSTISSSLE
CC ATOM   KYQAWQSGFKAQEENMKTTLQTLTQKYSNANSLYDNLVKVLSSTISSSLE
CC        **************************************************
CC SEQRES TAKSFLQG
CC ATOM   T-------
CC        *
SQ SEQUENCE 308 AA; MW; CN;
     GHMEHRGTDI ISLSQAATKI HQAQQTLQST PPISEENNDE RTLARQQLTS SLNALAKSGV
     SLSAEQNENL RSAFSAPTSA LFSASPMAQP RTTISDAEIW DMVSQNISAI GDSYLGVYEN
     VVAVYTDFYQ AFSDILSKMG GWLLPGKDGN TVKLDVTSLK NDLNSLVNKY NQINSNTVLF
     PAQSGSGVKV ATEAEARQWL SELNLPNSCL KSYGSGYVVT VDLTPLQKMV QDIDGLGAPG
     KDSKLEMDNA KYQAWQSGFK AQEENMKTTL QTLTQKYSNA NSLYDNLVKV LSSTISSSLE
     TAKSFLQG
//