ID 4YOJA STANDARD; PRT; 306 AA. DT CONVERTED FROM PDB (SEQRES) 4YOJ DE 3C-like proteinase OS Bat coronavirus HKU4 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.900 CC R-Factor 0.162 FT #SUB 1 1 SER A 141 141 GLY B Protein B 1 FT #SUB 1 1 SER A 142 142 SER B Protein B 2 FT #SUB 1 1 SER A 143 143 PHE B Protein B 4 FT #SUB 1 1 SER A 169 169 GLU B Protein A 11 FT #SUB 1 1 SER A 172 172 ASN B Protein S 3 FT #SUB 1 1 SER A 173 173 GLY B Protein S 3 FT #SUB 1 1 SER A 175 175 HIS B Protein B 2 FT #SUB 2 2 GLY A 141 141 GLY B Protein B 2 FT #SUB 2 2 GLY A 142 142 SER B Protein B 4 FT #SUB 2 2 GLY A 173 173 GLY B Protein B 3 FT #SUB 4 4 VAL A 129 129 PHE B Protein S 4 FT #SUB 4 4 VAL A 140 140 LYS B Protein S 1 FT #SUB 4 4 VAL A 141 141 GLY B Protein S 3 FT #SUB 4 4 VAL A 142 142 SER B Protein S 1 FT #SUB 5 5 LYS A 129 129 PHE B Protein B 1 FT #SUB 6 6 MET A 127 127 GLY B Protein S 2 FT #SUB 6 6 MET A 128 128 VAL B Protein A 4 FT #SUB 6 6 MET A 129 129 PHE B Protein S 1 FT #SUB 6 6 MET A 142 142 SER B Protein S 2 FT #SUB 7 7 SER A 127 127 GLY B Protein B 2 FT #SUB 7 7 SER A 128 128 VAL B Protein B 8 FT #SUB 9 9 PRO A 10 10 SER B Protein A 3 FT #SUB 9 9 PRO A 14 14 GLU B Protein A 5 FT #SUB 9 9 PRO A 126 126 THR B Protein S 3 FT #SUB 9 9 PRO A 127 127 GLY B Protein S 3 FT #SUB 9 9 PRO A 128 128 VAL B Protein B 2 FT #SUB 10 10 SER A 9 9 PRO B Protein S 3 FT #SUB 10 10 SER A 10 10 SER B Protein A 3 FT #SUB 10 10 SER A 14 14 GLU B Protein B 2 FT #SUB 11 11 GLY A 11 11 GLY B Protein B 1 FT #SUB 11 11 GLY A 14 14 GLU B Protein B 6 FT #SUB 14 14 GLU A 9 9 PRO B Protein S 5 FT #SUB 14 14 GLU A 10 10 SER B Protein S 2 FT #SUB 14 14 GLU A 11 11 GLY B Protein S 4 FT #SUB 125 125 PRO A 9 9 PRO B Protein S 1 FT #SUB 126 126 THR A 9 9 PRO B Protein B 3 FT #SUB 127 127 GLY A 6 6 MET B Protein B 1 FT #SUB 127 127 GLY A 7 7 SER B Protein B 2 FT #SUB 127 127 GLY A 9 9 PRO B Protein B 2 FT #SUB 128 128 VAL A 6 6 MET B Protein B 2 FT #SUB 128 128 VAL A 7 7 SER B Protein A 8 FT #SUB 128 128 VAL A 9 9 PRO B Protein S 2 FT #SUB 129 129 PHE A 4 4 VAL B Protein S 4 FT #SUB 129 129 PHE A 5 5 LYS B Protein S 1 FT #SUB 140 140 LYS A 4 4 VAL B Protein B 1 FT #SUB 141 141 GLY A 1 1 SER B Protein B 1 FT #SUB 141 141 GLY A 2 2 GLY B Protein B 2 FT #SUB 141 141 GLY A 4 4 VAL B Protein B 3 FT #SUB 142 142 SER A 1 1 SER B Protein B 2 FT #SUB 142 142 SER A 2 2 GLY B Protein S 4 FT #SUB 142 142 SER A 4 4 VAL B Protein S 1 FT #SUB 142 142 SER A 6 6 MET B Protein S 2 FT #SUB 142 142 SER A 299 299 GLN B Protein S 4 FT #SUB 143 143 PHE A 1 1 SER B Protein B 3 FT #SUB 144 144 LEU A 301 301 MET B Protein S 2 FT #SUB 169 169 GLU A 1 1 SER B Protein S 11 FT #SUB 172 172 ASN A 1 1 SER B Protein A 4 FT #SUB 172 172 ASN A 217 217 ASN B Protein S 10 FT #SUB 173 173 GLY A 1 1 SER B Protein B 3 FT #SUB 173 173 GLY A 2 2 GLY B Protein B 2 FT #SUB 175 175 HIS A 1 1 SER B Protein S 2 FT #SUB 217 217 ASN A 172 172 ASN B Protein B 4 FT #SUB 218 218 GLY A 172 172 ASN B Protein B 2 FT #SUB 283 283 GLY A 286 286 THR B Protein B 1 FT #SUB 285 285 SER A 286 286 THR B Protein A 3 FT #SUB 286 286 THR A 283 283 GLY B Protein S 2 FT #SUB 286 286 THR A 285 285 SER B Protein S 3 FT #SUB 298 298 MET A 126 126 THR B Protein S 1 FT #SUB 298 298 MET A 144 144 LEU B Protein B 1 FT #SUB 299 299 GLN A 142 142 SER B Protein S 5 FT #SUB 301 301 MET A 144 144 LEU B Protein S 5 FT #HET 1 1 SER A 4 401 RFM B B 1 FT #HET 24 24 SER A 1 401 RFM A A 7 FT #HET 25 25 MET A 1 401 RFM A A 7 FT #HET 41 41 HIS A 1 401 RFM A S 10 FT #HET 44 44 CYS A 1 401 RFM A B 2 FT #HET 45 45 PRO A 1 401 RFM A B 1 FT #HET 46 46 ALA A 1 401 RFM A B 3 FT #HET 49 49 LEU A 1 401 RFM A A 10 FT #HET 54 54 TYR A 1 401 RFM A S 2 FT #HET 134 134 ARG A 3 403 ACT A S 1 FT #HET 143 143 PHE A 1 401 RFM A B 3 FT #HET 144 144 LEU A 1 401 RFM A B 8 FT #HET 145 145 CYS A 1 401 RFM A A 17 FT #HET 148 148 CYS A 1 401 RFM A S 3 FT #HET 166 166 HIS A 1 401 RFM A S 4 FT #HET 167 167 GLN A 1 401 RFM A B 1 FT #HET 168 168 MET A 1 401 RFM A A 10 FT #HET 169 169 GLU A 1 401 RFM A A 13 FT #HET 190 190 ASP A 1 401 RFM A A 6 FT #HET 191 191 LYS A 1 401 RFM A B 4 FT #HET 192 192 GLN A 1 401 RFM A A 4 FT #HET 201 201 LYS A 3 403 ACT A B 3 FT #HET 202 202 TYR A 3 403 ACT A A 2 CC SEQUENCE 306 AA (ATOM); CC SGLVKMSAPS GAVENCIVQV TCGSMTLNGL WLDNTVWCPR HIMCPADQLT DPNYDALLIS CC KTNHSFIVQK HIGAQANLRV VAHSMVGVLL KLTVDVANPS TPAYTFSTVK PGASFSVLAC CC YNGKPTGVFT VNLRHNSTIK GSFLCGSCGS VGYTENGGVI NFVYMHQMEL SNGTHTGSSF CC DGVMYGAFED KQTHQLQLTD KYCTINVVAW LYAAVLNGCK WFVKPTRVGI VTYNEWALSN CC QFTEFVGTQS IDMLAHRTGV SVEQMLAAIQ SLHAGFQGKT ILGQSTLEDE FTPDDVNMQV CC MGVVMQ CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SGLVKMSAPSGAVENCIVQVTCGSMTLNGLWLDNTVWCPRHIMCPADQLT CC ATOM SGLVKMSAPSGAVENCIVQVTCGSMTLNGLWLDNTVWCPRHIMCPADQLT CC ************************************************** CC SEQRES DPNYDALLISKTNHSFIVQKHIGAQANLRVVAHSMVGVLLKLTVDVANPS CC ATOM DPNYDALLISKTNHSFIVQKHIGAQANLRVVAHSMVGVLLKLTVDVANPS CC ************************************************** CC SEQRES TPAYTFSTVKPGASFSVLACYNGKPTGVFTVNLRHNSTIKGSFLCGSCGS CC ATOM TPAYTFSTVKPGASFSVLACYNGKPTGVFTVNLRHNSTIKGSFLCGSCGS CC ************************************************** CC SEQRES VGYTENGGVINFVYMHQMELSNGTHTGSSFDGVMYGAFEDKQTHQLQLTD CC ATOM VGYTENGGVINFVYMHQMELSNGTHTGSSFDGVMYGAFEDKQTHQLQLTD CC ************************************************** CC SEQRES KYCTINVVAWLYAAVLNGCKWFVKPTRVGIVTYNEWALSNQFTEFVGTQS CC ATOM KYCTINVVAWLYAAVLNGCKWFVKPTRVGIVTYNEWALSNQFTEFVGTQS CC ************************************************** CC SEQRES IDMLAHRTGVSVEQMLAAIQSLHAGFQGKTILGQSTLEDEFTPDDVNMQV CC ATOM IDMLAHRTGVSVEQMLAAIQSLHAGFQGKTILGQSTLEDEFTPDDVNMQV CC ************************************************** CC SEQRES MGVVMQ CC ATOM MGVVMQ CC ****** SQ SEQUENCE 306 AA; MW; CN; SGLVKMSAPS GAVENCIVQV TCGSMTLNGL WLDNTVWCPR HIMCPADQLT DPNYDALLIS KTNHSFIVQK HIGAQANLRV VAHSMVGVLL KLTVDVANPS TPAYTFSTVK PGASFSVLAC YNGKPTGVFT VNLRHNSTIK GSFLCGSCGS VGYTENGGVI NFVYMHQMEL SNGTHTGSSF DGVMYGAFED KQTHQLQLTD KYCTINVVAW LYAAVLNGCK WFVKPTRVGI VTYNEWALSN QFTEFVGTQS IDMLAHRTGV SVEQMLAAIQ SLHAGFQGKT ILGQSTLEDE FTPDDVNMQV MGVVMQ // ID 4YOJB STANDARD; PRT; 306 AA. DT CONVERTED FROM PDB (SEQRES) 4YOJ DE 3C-like proteinase OS Bat coronavirus HKU4 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.900 CC R-Factor 0.162 FT #SUB 1 1 SER B 141 141 GLY A Protein B 1 FT #SUB 1 1 SER B 142 142 SER A Protein B 2 FT #SUB 1 1 SER B 143 143 PHE A Protein B 3 FT #SUB 1 1 SER B 169 169 GLU A Protein A 11 FT #SUB 1 1 SER B 172 172 ASN A Protein S 4 FT #SUB 1 1 SER B 173 173 GLY A Protein S 3 FT #SUB 1 1 SER B 175 175 HIS A Protein B 2 FT #SUB 2 2 GLY B 141 141 GLY A Protein B 2 FT #SUB 2 2 GLY B 142 142 SER A Protein B 4 FT #SUB 2 2 GLY B 173 173 GLY A Protein B 2 FT #SUB 4 4 VAL B 129 129 PHE A Protein S 4 FT #SUB 4 4 VAL B 140 140 LYS A Protein S 1 FT #SUB 4 4 VAL B 141 141 GLY A Protein S 3 FT #SUB 4 4 VAL B 142 142 SER A Protein S 1 FT #SUB 5 5 LYS B 129 129 PHE A Protein B 1 FT #SUB 6 6 MET B 127 127 GLY A Protein S 1 FT #SUB 6 6 MET B 128 128 VAL A Protein B 2 FT #SUB 6 6 MET B 142 142 SER A Protein S 2 FT #SUB 7 7 SER B 127 127 GLY A Protein B 2 FT #SUB 7 7 SER B 128 128 VAL A Protein A 8 FT #SUB 9 9 PRO B 10 10 SER A Protein A 3 FT #SUB 9 9 PRO B 14 14 GLU A Protein A 5 FT #SUB 9 9 PRO B 125 125 PRO A Protein S 1 FT #SUB 9 9 PRO B 126 126 THR A Protein S 3 FT #SUB 9 9 PRO B 127 127 GLY A Protein S 2 FT #SUB 9 9 PRO B 128 128 VAL A Protein B 2 FT #SUB 10 10 SER B 9 9 PRO A Protein S 3 FT #SUB 10 10 SER B 10 10 SER A Protein A 3 FT #SUB 10 10 SER B 14 14 GLU A Protein B 2 FT #SUB 11 11 GLY B 11 11 GLY A Protein B 1 FT #SUB 11 11 GLY B 14 14 GLU A Protein B 4 FT #SUB 14 14 GLU B 9 9 PRO A Protein S 5 FT #SUB 14 14 GLU B 10 10 SER A Protein S 2 FT #SUB 14 14 GLU B 11 11 GLY A Protein S 6 FT #SUB 126 126 THR B 9 9 PRO A Protein B 3 FT #SUB 126 126 THR B 298 298 MET A Protein S 1 FT #SUB 127 127 GLY B 6 6 MET A Protein B 2 FT #SUB 127 127 GLY B 7 7 SER A Protein B 2 FT #SUB 127 127 GLY B 9 9 PRO A Protein B 3 FT #SUB 128 128 VAL B 6 6 MET A Protein B 4 FT #SUB 128 128 VAL B 7 7 SER A Protein A 8 FT #SUB 128 128 VAL B 9 9 PRO A Protein S 2 FT #SUB 129 129 PHE B 4 4 VAL A Protein S 4 FT #SUB 129 129 PHE B 5 5 LYS A Protein S 1 FT #SUB 129 129 PHE B 6 6 MET A Protein S 1 FT #SUB 140 140 LYS B 4 4 VAL A Protein B 1 FT #SUB 141 141 GLY B 1 1 SER A Protein B 1 FT #SUB 141 141 GLY B 2 2 GLY A Protein B 2 FT #SUB 141 141 GLY B 4 4 VAL A Protein B 3 FT #SUB 142 142 SER B 1 1 SER A Protein B 2 FT #SUB 142 142 SER B 2 2 GLY A Protein S 4 FT #SUB 142 142 SER B 4 4 VAL A Protein S 1 FT #SUB 142 142 SER B 6 6 MET A Protein S 2 FT #SUB 142 142 SER B 299 299 GLN A Protein S 5 FT #SUB 143 143 PHE B 1 1 SER A Protein B 4 FT #SUB 144 144 LEU B 298 298 MET A Protein S 1 FT #SUB 144 144 LEU B 301 301 MET A Protein S 5 FT #SUB 169 169 GLU B 1 1 SER A Protein S 11 FT #SUB 172 172 ASN B 1 1 SER A Protein A 3 FT #SUB 172 172 ASN B 217 217 ASN A Protein S 4 FT #SUB 172 172 ASN B 218 218 GLY A Protein S 2 FT #SUB 173 173 GLY B 1 1 SER A Protein B 3 FT #SUB 173 173 GLY B 2 2 GLY A Protein B 3 FT #SUB 175 175 HIS B 1 1 SER A Protein S 2 FT #SUB 217 217 ASN B 172 172 ASN A Protein A 10 FT #SUB 283 283 GLY B 286 286 THR A Protein B 2 FT #SUB 285 285 SER B 286 286 THR A Protein A 3 FT #SUB 286 286 THR B 283 283 GLY A Protein S 1 FT #SUB 286 286 THR B 285 285 SER A Protein S 3 FT #SUB 299 299 GLN B 142 142 SER A Protein S 4 FT #SUB 301 301 MET B 144 144 LEU A Protein S 2 FT #HET 24 24 SER B 4 401 RFM B A 4 FT #HET 25 25 MET B 4 401 RFM B A 6 FT #HET 41 41 HIS B 4 401 RFM B S 10 FT #HET 44 44 CYS B 4 401 RFM B B 2 FT #HET 46 46 ALA B 4 401 RFM B B 3 FT #HET 49 49 LEU B 4 401 RFM B A 10 FT #HET 54 54 TYR B 4 401 RFM B S 2 FT #HET 134 134 ARG B 5 402 ACT B S 1 FT #HET 143 143 PHE B 4 401 RFM B B 3 FT #HET 144 144 LEU B 4 401 RFM B B 8 FT #HET 145 145 CYS B 4 401 RFM B A 13 FT #HET 148 148 CYS B 4 401 RFM B S 3 FT #HET 166 166 HIS B 4 401 RFM B S 5 FT #HET 167 167 GLN B 4 401 RFM B B 1 FT #HET 168 168 MET B 4 401 RFM B A 9 FT #HET 169 169 GLU B 4 401 RFM B A 17 FT #HET 190 190 ASP B 4 401 RFM B A 6 FT #HET 191 191 LYS B 4 401 RFM B B 4 FT #HET 192 192 GLN B 4 401 RFM B A 3 FT #HET 200 200 ASP B 5 402 ACT B A 4 FT #HET 201 201 LYS B 5 402 ACT B B 3 FT #HET 202 202 TYR B 5 402 ACT B S 2 FT #HET 253 253 MET B 6 403 FMT B S 3 FT #HET 257 257 ARG B 6 403 FMT B S 4 CC SEQUENCE 306 AA (ATOM); CC SGLVKMSAPS GAVENCIVQV TCGSMTLNGL WLDNTVWCPR HIMCPADQLT DPNYDALLIS CC KTNHSFIVQK HIGAQANLRV VAHSMVGVLL KLTVDVANPS TPAYTFSTVK PGASFSVLAC CC YNGKPTGVFT VNLRHNSTIK GSFLCGSCGS VGYTENGGVI NFVYMHQMEL SNGTHTGSSF CC DGVMYGAFED KQTHQLQLTD KYCTINVVAW LYAAVLNGCK WFVKPTRVGI VTYNEWALSN CC QFTEFVGTQS IDMLAHRTGV SVEQMLAAIQ SLHAGFQGKT ILGQSTLEDE FTPDDVNMQV CC MGVVMQ CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SGLVKMSAPSGAVENCIVQVTCGSMTLNGLWLDNTVWCPRHIMCPADQLT CC ATOM SGLVKMSAPSGAVENCIVQVTCGSMTLNGLWLDNTVWCPRHIMCPADQLT CC ************************************************** CC SEQRES DPNYDALLISKTNHSFIVQKHIGAQANLRVVAHSMVGVLLKLTVDVANPS CC ATOM DPNYDALLISKTNHSFIVQKHIGAQANLRVVAHSMVGVLLKLTVDVANPS CC ************************************************** CC SEQRES TPAYTFSTVKPGASFSVLACYNGKPTGVFTVNLRHNSTIKGSFLCGSCGS CC ATOM TPAYTFSTVKPGASFSVLACYNGKPTGVFTVNLRHNSTIKGSFLCGSCGS CC ************************************************** CC SEQRES VGYTENGGVINFVYMHQMELSNGTHTGSSFDGVMYGAFEDKQTHQLQLTD CC ATOM VGYTENGGVINFVYMHQMELSNGTHTGSSFDGVMYGAFEDKQTHQLQLTD CC ************************************************** CC SEQRES KYCTINVVAWLYAAVLNGCKWFVKPTRVGIVTYNEWALSNQFTEFVGTQS CC ATOM KYCTINVVAWLYAAVLNGCKWFVKPTRVGIVTYNEWALSNQFTEFVGTQS CC ************************************************** CC SEQRES IDMLAHRTGVSVEQMLAAIQSLHAGFQGKTILGQSTLEDEFTPDDVNMQV CC ATOM IDMLAHRTGVSVEQMLAAIQSLHAGFQGKTILGQSTLEDEFTPDDVNMQV CC ************************************************** CC SEQRES MGVVMQ CC ATOM MGVVMQ CC ****** SQ SEQUENCE 306 AA; MW; CN; SGLVKMSAPS GAVENCIVQV TCGSMTLNGL WLDNTVWCPR HIMCPADQLT DPNYDALLIS KTNHSFIVQK HIGAQANLRV VAHSMVGVLL KLTVDVANPS TPAYTFSTVK PGASFSVLAC YNGKPTGVFT VNLRHNSTIK GSFLCGSCGS VGYTENGGVI NFVYMHQMEL SNGTHTGSSF DGVMYGAFED KQTHQLQLTD KYCTINVVAW LYAAVLNGCK WFVKPTRVGI VTYNEWALSN QFTEFVGTQS IDMLAHRTGV SVEQMLAAIQ SLHAGFQGKT ILGQSTLEDE FTPDDVNMQV MGVVMQ //