ID 5TIPA STANDARD; PRT; 436 AA. DT CONVERTED FROM PDB (SEQRES) 5TIP DE Major capsid protein OS Paramecium bursaria Chlorella virus 1 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.000 CC R-Factor 0.183 FT #SUB 12 13 ALA A 220 221 GLN B Protein S 1 FT #SUB 15 16 VAL A 217 218 ARG B Protein S 1 FT #SUB 15 16 VAL A 221 222 LEU B Protein S 2 FT #SUB 16 17 TYR A 221 222 LEU B Protein S 4 FT #SUB 18 19 THR A 213 214 GLN B Protein B 1 FT #SUB 19 20 GLY A 213 214 GLN B Protein B 1 FT #SUB 19 20 GLY A 217 218 ARG B Protein B 4 FT #SUB 20 21 ASN A 34 35 THR B Protein S 1 FT #SUB 20 21 ASN A 213 214 GLN B Protein A 7 FT #SUB 20 21 ASN A 214 215 GLU B Protein S 3 FT #SUB 20 21 ASN A 217 218 ARG B Protein S 4 FT #SUB 21 22 PRO A 213 214 GLN B Protein S 3 FT #SUB 216 217 THR A 15 16 VAL B Protein S 1 FT #SUB 217 218 ARG A 15 16 VAL B Protein S 1 FT #SUB 217 218 ARG A 20 21 ASN B Protein S 9 FT #SUB 220 221 GLN A 12 13 ALA B Protein S 6 FT #SUB 220 221 GLN A 15 16 VAL B Protein S 1 FT #SUB 221 222 LEU A 15 16 VAL B Protein S 3 FT #SUB 221 222 LEU A 16 17 TYR B Protein S 4 FT #SUB 101 102 GLY A 101 102 GLY D Protein B 1 FT #SUB 101 102 GLY A 103 104 ARG D Protein B 6 FT #SUB 102 103 GLN A 102 103 GLN D Protein A 9 FT #SUB 102 103 GLN A 103 104 ARG D Protein S 4 FT #SUB 103 104 ARG A 101 102 GLY D Protein A 7 FT #SUB 103 104 ARG A 102 103 GLN D Protein A 7 FT #SUB 253 254 ASN A 169 170 TYR D Protein S 6 FT #SUB 255 256 ASN A 429 430 MET D Protein S 1 FT #HET 80 81 ASN A 12 3 XYP F S 4 FT #HET 81 82 GLY A 11 2 FUC F B 6 FT #HET 81 82 GLY A 12 3 XYP F B 2 FT #HET 81 82 GLY A 17 8 GLA F B 3 FT #HET 82 83 GLY A 10 1 BGC F B 1 FT #HET 82 83 GLY A 11 2 FUC F B 1 FT #HET 82 83 GLY A 17 8 GLA F B 4 FT #HET 137 138 ASN A 11 2 FUC F S 3 FT #HET 139 140 LEU A 12 3 XYP F S 3 FT #HET 139 140 LEU A 14 5 7CV F S 2 FT #HET 284 285 CYS A 17 8 GLA F B 1 FT #HET 285 286 SER A 17 8 GLA F A 5 FT #HET 286 287 GLY A 17 8 GLA F B 4 FT #HET 287 288 ALA A 17 8 GLA F B 1 FT #HET 288 289 GLY A 1 1 BGC E B 3 FT #HET 289 290 THR A 1 1 BGC E B 1 FT #HET 290 291 ALA A 2 2 FUC E A 2 FT #HET 292 293 ALA A 1 1 BGC E S 2 FT #HET 298 299 ASP A 10 1 BGC F S 6 FT #HET 298 299 ASP A 17 8 GLA F S 8 FT #HET 299 300 TYR A 17 8 GLA F B 1 FT #HET 319 320 LEU A 102 501 HG A S 1 FT #HET 355 356 PHE A 102 501 HG A S 2 FT #HET 368 369 CYS A 102 501 HG A S 2 FT #HET 370 371 PHE A 102 501 HG A S 1 FT #HET 373 374 ILE A 102 501 HG A S 1 FT #HET 386 387 SER A 19 1 BGC G B 1 FT #HET 387 388 ILE A 19 1 BGC G A 5 FT #HET 388 389 ASP A 19 1 BGC G A 3 FT #HET 388 389 ASP A 26 1 BGC H S 1 FT #HET 390 391 THR A 1 1 BGC E B 2 FT #HET 390 391 THR A 8 8 GLA E S 4 FT #HET 390 391 THR A 26 1 BGC H S 1 FT #HET 392 393 PRO A 1 1 BGC E S 2 FT #HET 393 394 ALA A 23 5 GLA G A 6 FT #HET 394 395 ALA A 19 1 BGC G A 2 FT #HET 397 398 GLY A 19 1 BGC G B 2 FT #HET 397 398 GLY A 23 5 GLA G B 2 FT #MOD 279 280 ASN A 1 1 BGC E S FT #MOD 301 302 ASN A 10 1 BGC F S FT #MOD 398 399 ASN A 19 1 BGC G S FT #MOD 405 406 ASN A 26 1 BGC H S CC SEQUENCE 436 AA (ATOM); CC AGGLSQLVAY GAQDVYLTGN PQITFFKTVY RRYTNFAIES IQQTINGSVG FGNKVSTQIS CC RNGDLITDIV VEFVLTKGGN GGTTYYPAEE LLQDVELEIG GQRIDKHYND WFRTYDALFR CC MNDDRYNYRR MTDWVNNELV GAQKRFYVPL IFFFNQTPGL ALPLIALQYH EVKLYFTLAS CC QVQGVNYNGS SAIAGAAQPT MSVWVDYIFL DTQERTRFAQ LPHEYLIEQL QFTGSETATP CC SATTQASQNI RLNFNHPTKY LAWNFNNPTN YGQYTALANI PGACSGAGTA AATVTTPDYG CC NTGTYNEQLA VLDSAKIQLN GQDRFATRKG SYFNKVQPYQ SIGGVTPAGV YLYSFALKPA CC GRQPSGTCNF SRIDNATLSL TYKTCSIDAT SPAAVLGNTE TVTANTATLL TALNIYAKNY CC NVLRIMSGMG GLAYAN CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES AGGLSQLVAYGAQDVYLTGNPQITFFKTVYRRYTNFAIESIQQTINGSVG CC ATOM AGGLSQLVAYGAQDVYLTGNPQITFFKTVYRRYTNFAIESIQQTINGSVG CC ************************************************** CC SEQRES FGNKVSTQISRNGDLITDIVVEFVLTKGGNGGTTYYPAEELLQDVELEIG CC ATOM FGNKVSTQISRNGDLITDIVVEFVLTKGGNGGTTYYPAEELLQDVELEIG CC ************************************************** CC SEQRES GQRIDKHYNDWFRTYDALFRMNDDRYNYRRMTDWVNNELVGAQKRFYVPL CC ATOM GQRIDKHYNDWFRTYDALFRMNDDRYNYRRMTDWVNNELVGAQKRFYVPL CC ************************************************** CC SEQRES IFFFNQTPGLALPLIALQYHEVKLYFTLASQVQGVNYNGSSAIAGAAQPT CC ATOM IFFFNQTPGLALPLIALQYHEVKLYFTLASQVQGVNYNGSSAIAGAAQPT CC ************************************************** CC SEQRES MSVWVDYIFLDTQERTRFAQLPHEYLIEQLQFTGSETATPSATTQASQNI CC ATOM MSVWVDYIFLDTQERTRFAQLPHEYLIEQLQFTGSETATPSATTQASQNI CC ************************************************** CC SEQRES RLNFNHPTKYLAWNFNNPTNYGQYTALANIPGACSGAGTAAATVTTPDYG CC ATOM RLNFNHPTKYLAWNFNNPTNYGQYTALANIPGACSGAGTAAATVTTPDYG CC ************************************************** CC SEQRES NTGTYNEQLAVLDSAKIQLNGQDRFATRKGSYFNKVQPYQSIGGVTPAGV CC ATOM NTGTYNEQLAVLDSAKIQLNGQDRFATRKGSYFNKVQPYQSIGGVTPAGV CC ************************************************** CC SEQRES YLYSFALKPAGRQPSGTCNFSRIDNATLSLTYKTCSIDATSPAAVLGNTE CC ATOM YLYSFALKPAGRQPSGTCNFSRIDNATLSLTYKTCSIDATSPAAVLGNTE CC ************************************************** CC SEQRES TVTANTATLLTALNIYAKNYNVLRIMSGMGGLAYAN CC ATOM TVTANTATLLTALNIYAKNYNVLRIMSGMGGLAYAN CC ************************************ SQ SEQUENCE 436 AA; MW; CN; AGGLSQLVAY GAQDVYLTGN PQITFFKTVY RRYTNFAIES IQQTINGSVG FGNKVSTQIS RNGDLITDIV VEFVLTKGGN GGTTYYPAEE LLQDVELEIG GQRIDKHYND WFRTYDALFR MNDDRYNYRR MTDWVNNELV GAQKRFYVPL IFFFNQTPGL ALPLIALQYH EVKLYFTLAS QVQGVNYNGS SAIAGAAQPT MSVWVDYIFL DTQERTRFAQ LPHEYLIEQL QFTGSETATP SATTQASQNI RLNFNHPTKY LAWNFNNPTN YGQYTALANI PGACSGAGTA AATVTTPDYG NTGTYNEQLA VLDSAKIQLN GQDRFATRKG SYFNKVQPYQ SIGGVTPAGV YLYSFALKPA GRQPSGTCNF SRIDNATLSL TYKTCSIDAT SPAAVLGNTE TVTANTATLL TALNIYAKNY NVLRIMSGMG GLAYAN // ID 5TIPB STANDARD; PRT; 436 AA. DT CONVERTED FROM PDB (SEQRES) 5TIP DE Major capsid protein OS Paramecium bursaria Chlorella virus 1 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.000 CC R-Factor 0.183 FT #SUB 12 13 ALA B 220 221 GLN A Protein A 6 FT #SUB 15 16 VAL B 216 217 THR A Protein S 1 FT #SUB 15 16 VAL B 217 218 ARG A Protein S 1 FT #SUB 15 16 VAL B 220 221 GLN A Protein S 1 FT #SUB 15 16 VAL B 221 222 LEU A Protein S 3 FT #SUB 16 17 TYR B 221 222 LEU A Protein S 4 FT #SUB 20 21 ASN B 217 218 ARG A Protein A 9 FT #SUB 34 35 THR B 20 21 ASN A Protein S 1 FT #SUB 213 214 GLN B 18 19 THR A Protein S 1 FT #SUB 213 214 GLN B 19 20 GLY A Protein S 1 FT #SUB 213 214 GLN B 20 21 ASN A Protein S 7 FT #SUB 213 214 GLN B 21 22 PRO A Protein S 3 FT #SUB 214 215 GLU B 20 21 ASN A Protein A 3 FT #SUB 217 218 ARG B 15 16 VAL A Protein S 1 FT #SUB 217 218 ARG B 19 20 GLY A Protein S 4 FT #SUB 217 218 ARG B 20 21 ASN A Protein S 4 FT #SUB 220 221 GLN B 12 13 ALA A Protein S 1 FT #SUB 221 222 LEU B 15 16 VAL A Protein S 2 FT #SUB 221 222 LEU B 16 17 TYR A Protein S 4 FT #SUB 326 327 ALA B 322 323 GLN C Protein S 1 FT #HET 80 81 ASN B 37 3 XYP J S 4 FT #HET 81 82 GLY B 36 2 FUC J B 6 FT #HET 81 82 GLY B 37 3 XYP J B 2 FT #HET 81 82 GLY B 42 8 GLA J B 3 FT #HET 82 83 GLY B 35 1 BGC J B 1 FT #HET 82 83 GLY B 42 8 GLA J B 4 FT #HET 83 84 THR B 42 8 GLA J S 1 FT #HET 137 138 ASN B 36 2 FUC J S 3 FT #HET 139 140 LEU B 37 3 XYP J S 3 FT #HET 139 140 LEU B 39 5 7CV J S 2 FT #HET 284 285 CYS B 42 8 GLA J B 1 FT #HET 285 286 SER B 42 8 GLA J A 5 FT #HET 286 287 GLY B 42 8 GLA J B 4 FT #HET 288 289 GLY B 28 1 BGC I B 1 FT #HET 289 290 THR B 28 1 BGC I B 1 FT #HET 290 291 ALA B 29 2 FUC I A 3 FT #HET 290 291 ALA B 30 3 XYP I S 1 FT #HET 292 293 ALA B 28 1 BGC I S 2 FT #HET 298 299 ASP B 35 1 BGC J S 7 FT #HET 298 299 ASP B 42 8 GLA J A 9 FT #HET 299 300 TYR B 42 8 GLA J B 1 FT #HET 319 320 LEU B 103 501 HG B S 1 FT #HET 355 356 PHE B 103 501 HG B S 2 FT #HET 368 369 CYS B 103 501 HG B S 2 FT #HET 373 374 ILE B 103 501 HG B S 1 FT #HET 386 387 SER B 44 1 BGC K B 2 FT #HET 387 388 ILE B 44 1 BGC K A 3 FT #HET 388 389 ASP B 44 1 BGC K A 3 FT #HET 388 389 ASP B 51 1 BGC L S 4 FT #HET 390 391 THR B 28 1 BGC I B 2 FT #HET 390 391 THR B 32 5 GLA I S 3 FT #HET 390 391 THR B 51 1 BGC L S 3 FT #HET 392 393 PRO B 28 1 BGC I S 2 FT #HET 393 394 ALA B 48 5 GLA K A 6 FT #HET 394 395 ALA B 44 1 BGC K A 3 FT #HET 397 398 GLY B 44 1 BGC K B 2 FT #HET 397 398 GLY B 48 5 GLA K B 1 FT #MOD 279 280 ASN B 28 1 BGC I S FT #MOD 301 302 ASN B 35 1 BGC J S FT #MOD 398 399 ASN B 44 1 BGC K S FT #MOD 405 406 ASN B 51 1 BGC L S CC SEQUENCE 436 AA (ATOM); CC AGGLSQLVAY GAQDVYLTGN PQITFFKTVY RRYTNFAIES IQQTINGSVG FGNKVSTQIS CC RNGDLITDIV VEFVLTKGGN GGTTYYPAEE LLQDVELEIG GQRIDKHYND WFRTYDALFR CC MNDDRYNYRR MTDWVNNELV GAQKRFYVPL IFFFNQTPGL ALPLIALQYH EVKLYFTLAS CC QVQGVNYNGS SAIAGAAQPT MSVWVDYIFL DTQERTRFAQ LPHEYLIEQL QFTGSETATP CC SATTQASQNI RLNFNHPTKY LAWNFNNPTN YGQYTALANI PGACSGAGTA AATVTTPDYG CC NTGTYNEQLA VLDSAKIQLN GQDRFATRKG SYFNKVQPYQ SIGGVTPAGV YLYSFALKPA CC GRQPSGTCNF SRIDNATLSL TYKTCSIDAT SPAAVLGNTE TVTANTATLL TALNIYAKNY CC NVLRIMSGMG GLAYAN CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES AGGLSQLVAYGAQDVYLTGNPQITFFKTVYRRYTNFAIESIQQTINGSVG CC ATOM AGGLSQLVAYGAQDVYLTGNPQITFFKTVYRRYTNFAIESIQQTINGSVG CC ************************************************** CC SEQRES FGNKVSTQISRNGDLITDIVVEFVLTKGGNGGTTYYPAEELLQDVELEIG CC ATOM FGNKVSTQISRNGDLITDIVVEFVLTKGGNGGTTYYPAEELLQDVELEIG CC ************************************************** CC SEQRES GQRIDKHYNDWFRTYDALFRMNDDRYNYRRMTDWVNNELVGAQKRFYVPL CC ATOM GQRIDKHYNDWFRTYDALFRMNDDRYNYRRMTDWVNNELVGAQKRFYVPL CC ************************************************** CC SEQRES IFFFNQTPGLALPLIALQYHEVKLYFTLASQVQGVNYNGSSAIAGAAQPT CC ATOM IFFFNQTPGLALPLIALQYHEVKLYFTLASQVQGVNYNGSSAIAGAAQPT CC ************************************************** CC SEQRES MSVWVDYIFLDTQERTRFAQLPHEYLIEQLQFTGSETATPSATTQASQNI CC ATOM MSVWVDYIFLDTQERTRFAQLPHEYLIEQLQFTGSETATPSATTQASQNI CC ************************************************** CC SEQRES RLNFNHPTKYLAWNFNNPTNYGQYTALANIPGACSGAGTAAATVTTPDYG CC ATOM RLNFNHPTKYLAWNFNNPTNYGQYTALANIPGACSGAGTAAATVTTPDYG CC ************************************************** CC SEQRES NTGTYNEQLAVLDSAKIQLNGQDRFATRKGSYFNKVQPYQSIGGVTPAGV CC ATOM NTGTYNEQLAVLDSAKIQLNGQDRFATRKGSYFNKVQPYQSIGGVTPAGV CC ************************************************** CC SEQRES YLYSFALKPAGRQPSGTCNFSRIDNATLSLTYKTCSIDATSPAAVLGNTE CC ATOM YLYSFALKPAGRQPSGTCNFSRIDNATLSLTYKTCSIDATSPAAVLGNTE CC ************************************************** CC SEQRES TVTANTATLLTALNIYAKNYNVLRIMSGMGGLAYAN CC ATOM TVTANTATLLTALNIYAKNYNVLRIMSGMGGLAYAN CC ************************************ SQ SEQUENCE 436 AA; MW; CN; AGGLSQLVAY GAQDVYLTGN PQITFFKTVY RRYTNFAIES IQQTINGSVG FGNKVSTQIS RNGDLITDIV VEFVLTKGGN GGTTYYPAEE LLQDVELEIG GQRIDKHYND WFRTYDALFR MNDDRYNYRR MTDWVNNELV GAQKRFYVPL IFFFNQTPGL ALPLIALQYH EVKLYFTLAS QVQGVNYNGS SAIAGAAQPT MSVWVDYIFL DTQERTRFAQ LPHEYLIEQL QFTGSETATP SATTQASQNI RLNFNHPTKY LAWNFNNPTN YGQYTALANI PGACSGAGTA AATVTTPDYG NTGTYNEQLA VLDSAKIQLN GQDRFATRKG SYFNKVQPYQ SIGGVTPAGV YLYSFALKPA GRQPSGTCNF SRIDNATLSL TYKTCSIDAT SPAAVLGNTE TVTANTATLL TALNIYAKNY NVLRIMSGMG GLAYAN // ID 5TIPC STANDARD; PRT; 436 AA. DT CONVERTED FROM PDB (SEQRES) 5TIP DE Major capsid protein OS Paramecium bursaria Chlorella virus 1 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.000 CC R-Factor 0.183 FT #SUB 322 323 GLN C 326 327 ALA B Protein S 1 FT #SUB 12 13 ALA C 220 221 GLN D Protein A 10 FT #SUB 15 16 VAL C 217 218 ARG D Protein S 1 FT #SUB 15 16 VAL C 220 221 GLN D Protein S 2 FT #SUB 15 16 VAL C 221 222 LEU D Protein S 2 FT #SUB 16 17 TYR C 221 222 LEU D Protein S 3 FT #SUB 18 19 THR C 213 214 GLN D Protein B 1 FT #SUB 19 20 GLY C 213 214 GLN D Protein B 2 FT #SUB 19 20 GLY C 217 218 ARG D Protein B 1 FT #SUB 20 21 ASN C 213 214 GLN D Protein B 2 FT #SUB 20 21 ASN C 217 218 ARG D Protein A 9 FT #SUB 21 22 PRO C 213 214 GLN D Protein S 1 FT #SUB 213 214 GLN C 20 21 ASN D Protein A 6 FT #SUB 213 214 GLN C 21 22 PRO D Protein S 1 FT #SUB 214 215 GLU C 20 21 ASN D Protein A 7 FT #SUB 216 217 THR C 15 16 VAL D Protein S 1 FT #SUB 217 218 ARG C 15 16 VAL D Protein S 1 FT #SUB 217 218 ARG C 19 20 GLY D Protein S 4 FT #SUB 217 218 ARG C 20 21 ASN D Protein S 9 FT #SUB 220 221 GLN C 12 13 ALA D Protein S 10 FT #SUB 220 221 GLN C 15 16 VAL D Protein S 6 FT #SUB 221 222 LEU C 15 16 VAL D Protein S 3 FT #SUB 221 222 LEU C 16 17 TYR D Protein S 4 FT #SUB 222 223 PRO C 16 17 TYR D Protein S 1 FT #HET 80 81 ASN C 62 3 XYP N S 5 FT #HET 81 82 GLY C 61 2 FUC N B 6 FT #HET 81 82 GLY C 62 3 XYP N B 2 FT #HET 81 82 GLY C 67 8 GLA N B 3 FT #HET 82 83 GLY C 60 1 BGC N B 1 FT #HET 82 83 GLY C 61 2 FUC N B 1 FT #HET 82 83 GLY C 67 8 GLA N B 4 FT #HET 83 84 THR C 67 8 GLA N A 2 FT #HET 137 138 ASN C 61 2 FUC N S 3 FT #HET 139 140 LEU C 62 3 XYP N S 2 FT #HET 139 140 LEU C 64 5 7CV N S 2 FT #HET 284 285 CYS C 67 8 GLA N B 1 FT #HET 285 286 SER C 67 8 GLA N A 5 FT #HET 286 287 GLY C 67 8 GLA N B 4 FT #HET 287 288 ALA C 67 8 GLA N B 1 FT #HET 288 289 GLY C 53 1 BGC M B 1 FT #HET 290 291 ALA C 54 2 FUC M S 1 FT #HET 292 293 ALA C 53 1 BGC M S 2 FT #HET 292 293 ALA C 57 5 GLA M B 1 FT #HET 298 299 ASP C 60 1 BGC N S 6 FT #HET 298 299 ASP C 67 8 GLA N S 7 FT #HET 299 300 TYR C 67 8 GLA N B 1 FT #HET 319 320 LEU C 104 501 HG C S 1 FT #HET 355 356 PHE C 104 501 HG C S 2 FT #HET 368 369 CYS C 104 501 HG C S 2 FT #HET 370 371 PHE C 104 501 HG C S 1 FT #HET 373 374 ILE C 104 501 HG C S 1 FT #HET 386 387 SER C 69 1 BGC O B 1 FT #HET 387 388 ILE C 69 1 BGC O A 4 FT #HET 388 389 ASP C 69 1 BGC O A 4 FT #HET 388 389 ASP C 105 525 BGC C S 5 FT #HET 390 391 THR C 53 1 BGC M B 1 FT #HET 390 391 THR C 57 5 GLA M S 5 FT #HET 390 391 THR C 105 525 BGC C S 1 FT #HET 392 393 PRO C 53 1 BGC M S 1 FT #HET 393 394 ALA C 73 5 GLA O A 6 FT #HET 394 395 ALA C 69 1 BGC O A 3 FT #HET 397 398 GLY C 69 1 BGC O B 3 FT #HET 397 398 GLY C 73 5 GLA O B 1 FT #MOD 279 280 ASN C 53 1 BGC M S FT #MOD 301 302 ASN C 60 1 BGC N S FT #MOD 398 399 ASN C 69 1 BGC O S FT #MOD 405 406 ASN C 105 525 BGC C S CC SEQUENCE 436 AA (ATOM); CC AGGLSQLVAY GAQDVYLTGN PQITFFKTVY RRYTNFAIES IQQTINGSVG FGNKVSTQIS CC RNGDLITDIV VEFVLTKGGN GGTTYYPAEE LLQDVELEIG GQRIDKHYND WFRTYDALFR CC MNDDRYNYRR MTDWVNNELV GAQKRFYVPL IFFFNQTPGL ALPLIALQYH EVKLYFTLAS CC QVQGVNYNGS SAIAGAAQPT MSVWVDYIFL DTQERTRFAQ LPHEYLIEQL QFTGSETATP CC SATTQASQNI RLNFNHPTKY LAWNFNNPTN YGQYTALANI PGACSGAGTA AATVTTPDYG CC NTGTYNEQLA VLDSAKIQLN GQDRFATRKG SYFNKVQPYQ SIGGVTPAGV YLYSFALKPA CC GRQPSGTCNF SRIDNATLSL TYKTCSIDAT SPAAVLGNTE TVTANTATLL TALNIYAKNY CC NVLRIMSGMG GLAYAN CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES AGGLSQLVAYGAQDVYLTGNPQITFFKTVYRRYTNFAIESIQQTINGSVG CC ATOM AGGLSQLVAYGAQDVYLTGNPQITFFKTVYRRYTNFAIESIQQTINGSVG CC ************************************************** CC SEQRES FGNKVSTQISRNGDLITDIVVEFVLTKGGNGGTTYYPAEELLQDVELEIG CC ATOM FGNKVSTQISRNGDLITDIVVEFVLTKGGNGGTTYYPAEELLQDVELEIG CC ************************************************** CC SEQRES GQRIDKHYNDWFRTYDALFRMNDDRYNYRRMTDWVNNELVGAQKRFYVPL CC ATOM GQRIDKHYNDWFRTYDALFRMNDDRYNYRRMTDWVNNELVGAQKRFYVPL CC ************************************************** CC SEQRES IFFFNQTPGLALPLIALQYHEVKLYFTLASQVQGVNYNGSSAIAGAAQPT CC ATOM IFFFNQTPGLALPLIALQYHEVKLYFTLASQVQGVNYNGSSAIAGAAQPT CC ************************************************** CC SEQRES MSVWVDYIFLDTQERTRFAQLPHEYLIEQLQFTGSETATPSATTQASQNI CC ATOM MSVWVDYIFLDTQERTRFAQLPHEYLIEQLQFTGSETATPSATTQASQNI CC ************************************************** CC SEQRES RLNFNHPTKYLAWNFNNPTNYGQYTALANIPGACSGAGTAAATVTTPDYG CC ATOM RLNFNHPTKYLAWNFNNPTNYGQYTALANIPGACSGAGTAAATVTTPDYG CC ************************************************** CC SEQRES NTGTYNEQLAVLDSAKIQLNGQDRFATRKGSYFNKVQPYQSIGGVTPAGV CC ATOM NTGTYNEQLAVLDSAKIQLNGQDRFATRKGSYFNKVQPYQSIGGVTPAGV CC ************************************************** CC SEQRES YLYSFALKPAGRQPSGTCNFSRIDNATLSLTYKTCSIDATSPAAVLGNTE CC ATOM YLYSFALKPAGRQPSGTCNFSRIDNATLSLTYKTCSIDATSPAAVLGNTE CC ************************************************** CC SEQRES TVTANTATLLTALNIYAKNYNVLRIMSGMGGLAYAN CC ATOM TVTANTATLLTALNIYAKNYNVLRIMSGMGGLAYAN CC ************************************ SQ SEQUENCE 436 AA; MW; CN; AGGLSQLVAY GAQDVYLTGN PQITFFKTVY RRYTNFAIES IQQTINGSVG FGNKVSTQIS RNGDLITDIV VEFVLTKGGN GGTTYYPAEE LLQDVELEIG GQRIDKHYND WFRTYDALFR MNDDRYNYRR MTDWVNNELV GAQKRFYVPL IFFFNQTPGL ALPLIALQYH EVKLYFTLAS QVQGVNYNGS SAIAGAAQPT MSVWVDYIFL DTQERTRFAQ LPHEYLIEQL QFTGSETATP SATTQASQNI RLNFNHPTKY LAWNFNNPTN YGQYTALANI PGACSGAGTA AATVTTPDYG NTGTYNEQLA VLDSAKIQLN GQDRFATRKG SYFNKVQPYQ SIGGVTPAGV YLYSFALKPA GRQPSGTCNF SRIDNATLSL TYKTCSIDAT SPAAVLGNTE TVTANTATLL TALNIYAKNY NVLRIMSGMG GLAYAN // ID 5TIPD STANDARD; PRT; 436 AA. DT CONVERTED FROM PDB (SEQRES) 5TIP DE Major capsid protein OS Paramecium bursaria Chlorella virus 1 CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.000 CC R-Factor 0.183 FT #SUB 101 102 GLY D 101 102 GLY A Protein B 1 FT #SUB 101 102 GLY D 103 104 ARG A Protein B 7 FT #SUB 102 103 GLN D 102 103 GLN A Protein A 9 FT #SUB 102 103 GLN D 103 104 ARG A Protein S 7 FT #SUB 103 104 ARG D 101 102 GLY A Protein A 6 FT #SUB 103 104 ARG D 102 103 GLN A Protein A 4 FT #SUB 169 170 TYR D 253 254 ASN A Protein S 6 FT #SUB 429 430 MET D 255 256 ASN A Protein S 1 FT #SUB 12 13 ALA D 220 221 GLN C Protein A 10 FT #SUB 15 16 VAL D 216 217 THR C Protein S 1 FT #SUB 15 16 VAL D 217 218 ARG C Protein S 1 FT #SUB 15 16 VAL D 220 221 GLN C Protein S 6 FT #SUB 15 16 VAL D 221 222 LEU C Protein S 3 FT #SUB 16 17 TYR D 221 222 LEU C Protein S 4 FT #SUB 16 17 TYR D 222 223 PRO C Protein S 1 FT #SUB 19 20 GLY D 217 218 ARG C Protein B 4 FT #SUB 20 21 ASN D 213 214 GLN C Protein A 6 FT #SUB 20 21 ASN D 214 215 GLU C Protein S 7 FT #SUB 20 21 ASN D 217 218 ARG C Protein S 9 FT #SUB 21 22 PRO D 213 214 GLN C Protein S 1 FT #SUB 213 214 GLN D 18 19 THR C Protein S 1 FT #SUB 213 214 GLN D 19 20 GLY C Protein S 2 FT #SUB 213 214 GLN D 20 21 ASN C Protein S 2 FT #SUB 213 214 GLN D 21 22 PRO C Protein S 1 FT #SUB 217 218 ARG D 15 16 VAL C Protein S 1 FT #SUB 217 218 ARG D 19 20 GLY C Protein S 1 FT #SUB 217 218 ARG D 20 21 ASN C Protein S 9 FT #SUB 220 221 GLN D 12 13 ALA C Protein S 10 FT #SUB 220 221 GLN D 15 16 VAL C Protein S 2 FT #SUB 221 222 LEU D 15 16 VAL C Protein S 2 FT #SUB 221 222 LEU D 16 17 TYR C Protein S 3 FT #HET 80 81 ASN D 86 3 XYP Q S 4 FT #HET 81 82 GLY D 85 2 FUC Q B 5 FT #HET 81 82 GLY D 86 3 XYP Q B 2 FT #HET 81 82 GLY D 91 8 GLA Q B 1 FT #HET 82 83 GLY D 84 1 BGC Q B 1 FT #HET 82 83 GLY D 85 2 FUC Q B 1 FT #HET 82 83 GLY D 91 8 GLA Q B 3 FT #HET 137 138 ASN D 85 2 FUC Q S 3 FT #HET 139 140 LEU D 86 3 XYP Q S 3 FT #HET 139 140 LEU D 88 5 7CV Q S 2 FT #HET 284 285 CYS D 91 8 GLA Q B 1 FT #HET 285 286 SER D 91 8 GLA Q A 6 FT #HET 286 287 GLY D 91 8 GLA Q B 6 FT #HET 287 288 ALA D 91 8 GLA Q B 1 FT #HET 288 289 GLY D 76 1 BGC P B 3 FT #HET 290 291 ALA D 77 2 FUC P S 1 FT #HET 292 293 ALA D 76 1 BGC P S 2 FT #HET 298 299 ASP D 84 1 BGC Q S 6 FT #HET 298 299 ASP D 91 8 GLA Q S 7 FT #HET 299 300 TYR D 91 8 GLA Q B 1 FT #HET 319 320 LEU D 106 501 HG D S 1 FT #HET 355 356 PHE D 106 501 HG D S 2 FT #HET 368 369 CYS D 106 501 HG D S 2 FT #HET 373 374 ILE D 106 501 HG D S 1 FT #HET 386 387 SER D 93 1 BGC R B 2 FT #HET 387 388 ILE D 93 1 BGC R A 3 FT #HET 388 389 ASP D 93 1 BGC R A 2 FT #HET 388 389 ASP D 100 1 BGC S S 4 FT #HET 390 391 THR D 76 1 BGC P B 1 FT #HET 390 391 THR D 82 7 GLA P S 3 FT #HET 392 393 PRO D 76 1 BGC P S 2 FT #HET 393 394 ALA D 97 5 GLA R A 6 FT #HET 394 395 ALA D 93 1 BGC R S 2 FT #HET 397 398 GLY D 93 1 BGC R B 3 FT #HET 397 398 GLY D 97 5 GLA R B 1 FT #MOD 279 280 ASN D 76 1 BGC P S FT #MOD 301 302 ASN D 84 1 BGC Q S FT #MOD 398 399 ASN D 93 1 BGC R S FT #MOD 405 406 ASN D 100 1 BGC S S CC SEQUENCE 436 AA (ATOM); CC AGGLSQLVAY GAQDVYLTGN PQITFFKTVY RRYTNFAIES IQQTINGSVG FGNKVSTQIS CC RNGDLITDIV VEFVLTKGGN GGTTYYPAEE LLQDVELEIG GQRIDKHYND WFRTYDALFR CC MNDDRYNYRR MTDWVNNELV GAQKRFYVPL IFFFNQTPGL ALPLIALQYH EVKLYFTLAS CC QVQGVNYNGS SAIAGAAQPT MSVWVDYIFL DTQERTRFAQ LPHEYLIEQL QFTGSETATP CC SATTQASQNI RLNFNHPTKY LAWNFNNPTN YGQYTALANI PGACSGAGTA AATVTTPDYG CC NTGTYNEQLA VLDSAKIQLN GQDRFATRKG SYFNKVQPYQ SIGGVTPAGV YLYSFALKPA CC GRQPSGTCNF SRIDNATLSL TYKTCSIDAT SPAAVLGNTE TVTANTATLL TALNIYAKNY CC NVLRIMSGMG GLAYAN CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES AGGLSQLVAYGAQDVYLTGNPQITFFKTVYRRYTNFAIESIQQTINGSVG CC ATOM AGGLSQLVAYGAQDVYLTGNPQITFFKTVYRRYTNFAIESIQQTINGSVG CC ************************************************** CC SEQRES FGNKVSTQISRNGDLITDIVVEFVLTKGGNGGTTYYPAEELLQDVELEIG CC ATOM FGNKVSTQISRNGDLITDIVVEFVLTKGGNGGTTYYPAEELLQDVELEIG CC ************************************************** CC SEQRES GQRIDKHYNDWFRTYDALFRMNDDRYNYRRMTDWVNNELVGAQKRFYVPL CC ATOM GQRIDKHYNDWFRTYDALFRMNDDRYNYRRMTDWVNNELVGAQKRFYVPL CC ************************************************** CC SEQRES IFFFNQTPGLALPLIALQYHEVKLYFTLASQVQGVNYNGSSAIAGAAQPT CC ATOM IFFFNQTPGLALPLIALQYHEVKLYFTLASQVQGVNYNGSSAIAGAAQPT CC ************************************************** CC SEQRES MSVWVDYIFLDTQERTRFAQLPHEYLIEQLQFTGSETATPSATTQASQNI CC ATOM MSVWVDYIFLDTQERTRFAQLPHEYLIEQLQFTGSETATPSATTQASQNI CC ************************************************** CC SEQRES RLNFNHPTKYLAWNFNNPTNYGQYTALANIPGACSGAGTAAATVTTPDYG CC ATOM RLNFNHPTKYLAWNFNNPTNYGQYTALANIPGACSGAGTAAATVTTPDYG CC ************************************************** CC SEQRES NTGTYNEQLAVLDSAKIQLNGQDRFATRKGSYFNKVQPYQSIGGVTPAGV CC ATOM NTGTYNEQLAVLDSAKIQLNGQDRFATRKGSYFNKVQPYQSIGGVTPAGV CC ************************************************** CC SEQRES YLYSFALKPAGRQPSGTCNFSRIDNATLSLTYKTCSIDATSPAAVLGNTE CC ATOM YLYSFALKPAGRQPSGTCNFSRIDNATLSLTYKTCSIDATSPAAVLGNTE CC ************************************************** CC SEQRES TVTANTATLLTALNIYAKNYNVLRIMSGMGGLAYAN CC ATOM TVTANTATLLTALNIYAKNYNVLRIMSGMGGLAYAN CC ************************************ SQ SEQUENCE 436 AA; MW; CN; AGGLSQLVAY GAQDVYLTGN PQITFFKTVY RRYTNFAIES IQQTINGSVG FGNKVSTQIS RNGDLITDIV VEFVLTKGGN GGTTYYPAEE LLQDVELEIG GQRIDKHYND WFRTYDALFR MNDDRYNYRR MTDWVNNELV GAQKRFYVPL IFFFNQTPGL ALPLIALQYH EVKLYFTLAS QVQGVNYNGS SAIAGAAQPT MSVWVDYIFL DTQERTRFAQ LPHEYLIEQL QFTGSETATP SATTQASQNI RLNFNHPTKY LAWNFNNPTN YGQYTALANI PGACSGAGTA AATVTTPDYG NTGTYNEQLA VLDSAKIQLN GQDRFATRKG SYFNKVQPYQ SIGGVTPAGV YLYSFALKPA GRQPSGTCNF SRIDNATLSL TYKTCSIDAT SPAAVLGNTE TVTANTATLL TALNIYAKNY NVLRIMSGMG GLAYAN //