ID 4CDNA STANDARD; PRT; 482 AA. DT CONVERTED FROM PDB (SEQRES) 4CDN DE DEOXYRIBODIPYRIMIDINE PHOTOLYASE OS METHANOSARCINA MAZEI CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.900 CC R-Factor 0.149 FT #SUB 28 10 ALA A 341 323 GLU B Protein S 2 FT #SUB 31 13 SER A 340 322 PHE B Protein S 3 FT #SUB 31 13 SER A 341 323 GLU B Protein S 1 FT #SUB 31 13 SER A 411 393 GLU B Protein A 10 FT #SUB 32 14 GLY A 355 337 ARG B Protein B 10 FT #SUB 32 14 GLY A 411 393 GLU B Protein B 1 FT #SUB 33 15 LYS A 355 337 ARG B Protein A 4 FT #SUB 33 15 LYS A 356 338 ASN B Protein S 5 FT #SUB 345 327 SER A 150 132 SER B Protein A 4 FT #SUB 345 327 SER A 151 133 GLY B Protein S 2 FT #SUB 345 327 SER A 152 134 ILE B Protein S 7 FT #SUB 345 327 SER A 153 135 SER B Protein S 2 FT #SUB 348 330 LYS A 150 132 SER B Protein S 3 FT #SUB 349 331 GLU A 153 135 SER B Protein S 2 FT #HET 14 -4 LEU A 9 1469 PGE A S 3 FT #HET 31 13 SER A 9 1469 PGE A A 4 FT #HET 32 14 GLY A 9 1469 PGE A B 10 FT #HET 33 15 LYS A 9 1469 PGE A B 2 FT #HET 34 16 GLN A 9 1469 PGE A A 6 FT #HET 44 26 SER A 2 999 FO1 A A 13 FT #HET 45 27 ARG A 2 999 FO1 A A 4 FT #HET 47 29 GLN A 2 999 FO1 A S 1 FT #HET 73 55 PHE A 2 999 FO1 A S 9 FT #HET 74 56 CYS A 2 999 FO1 A B 6 FT #HET 75 57 LEU A 2 999 FO1 A A 3 FT #HET 76 58 THR A 2 999 FO1 A A 10 FT #HET 77 59 ASP A 11 1471 PEG A A 4 FT #HET 79 61 PHE A 2 999 FO1 A S 21 FT #HET 82 64 ALA A 2 999 FO1 A S 1 FT #HET 87 69 TYR A 2 999 FO1 A S 1 FT #HET 90 72 MET A 2 999 FO1 A S 8 FT #HET 113 95 GLY A 11 1471 PEG A B 2 FT #HET 114 96 ASP A 11 1471 PEG A A 6 FT #HET 115 97 PRO A 11 1471 PEG A S 1 FT #HET 136 118 SER A 2 999 FO1 A S 6 FT #HET 140 122 ILE A 11 1471 PEG A S 1 FT #HET 141 123 LYS A 2 999 FO1 A S 3 FT #HET 144 126 TRP A 2 999 FO1 A S 2 FT #HET 144 126 TRP A 11 1471 PEG A S 3 FT #HET 155 137 PRO A 9 1469 PGE A S 5 FT #HET 157 139 PHE A 9 1469 PGE A S 1 FT #HET 177 159 ALA A 6 1466 SO4 A S 1 FT #HET 178 160 ALA A 6 1466 SO4 A A 6 FT #HET 179 161 HIS A 6 1466 SO4 A A 10 FT #HET 182 164 ARG A 6 1466 SO4 A S 3 FT #HET 190 172 PRO A 5 1465 GOL A B 4 FT #HET 191 173 GLU A 5 1465 GOL A B 2 FT #HET 194 176 GLU A 5 1465 GOL A S 4 FT #HET 238 220 ASN A 8 1468 PEG A S 1 FT #HET 240 222 ASP A 8 1468 PEG A A 6 FT #HET 241 223 PRO A 8 1468 PEG A A 5 FT #HET 242 224 LEU A 8 1468 PEG A A 5 FT #HET 243 225 PHE A 8 1468 PEG A B 1 FT #HET 270 252 TYR A 1 998 FAD A S 5 FT #HET 282 264 LEU A 1 998 FAD A A 4 FT #HET 282 264 LEU A 3 1463 GOL A S 1 FT #HET 283 265 SER A 1 998 FAD A A 11 FT #HET 284 266 ASN A 1 998 FAD A A 4 FT #HET 284 266 ASN A 3 1463 GOL A S 4 FT #HET 285 267 LEU A 1 998 FAD A B 2 FT #HET 286 268 SER A 1 998 FAD A A 11 FT #HET 289 271 LEU A 1 998 FAD A S 3 FT #HET 290 272 HIS A 2 999 FO1 A S 5 FT #HET 291 273 PHE A 2 999 FO1 A S 6 FT #HET 316 298 PHE A 1 998 FAD A S 4 FT #HET 319 301 GLU A 1 998 FAD A A 13 FT #HET 320 302 ILE A 1 998 FAD A A 5 FT #HET 323 305 TRP A 1 998 FAD A A 8 FT #HET 323 305 TRP A 4 1464 GOL A S 8 FT #HET 324 306 LYS A 1 998 FAD A A 6 FT #HET 327 309 SER A 1 998 FAD A S 1 FT #HET 359 341 ARG A 10 1470 PEG A A 4 FT #HET 360 342 SER A 10 1470 PEG A A 8 FT #HET 370 352 ALA A 8 1468 PEG A A 2 FT #HET 372 354 LYS A 7 1467 PGE A A 7 FT #HET 373 355 THR A 7 1467 PGE A B 4 FT #HET 374 356 HIS A 7 1467 PGE A B 5 FT #HET 374 356 HIS A 12 1472 PEG A B 2 FT #HET 375 357 ASP A 12 1472 PEG A B 1 FT #HET 376 358 PRO A 12 1472 PEG A S 8 FT #HET 379 361 ASN A 7 1467 PGE A S 2 FT #HET 384 366 GLU A 3 1463 GOL A S 3 FT #HET 386 368 LEU A 8 1468 PEG A S 2 FT #HET 388 370 THR A 3 1463 GOL A S 4 FT #HET 390 372 LYS A 1 998 FAD A S 7 FT #HET 390 372 LYS A 3 1463 GOL A A 12 FT #HET 392 374 HIS A 3 1463 GOL A A 3 FT #HET 393 375 GLY A 1 998 FAD A B 3 FT #HET 396 378 ARG A 1 998 FAD A A 16 FT #HET 397 379 MET A 1 998 FAD A A 3 FT #HET 397 379 MET A 4 1464 GOL A S 2 FT #HET 400 382 ALA A 1 998 FAD A S 1 FT #HET 405 387 GLU A 10 1470 PEG A A 11 FT #HET 406 388 TRP A 10 1470 PEG A S 1 FT #HET 421 403 ASN A 1 998 FAD A S 8 FT #HET 424 406 TYR A 8 1468 PEG A S 4 FT #HET 427 409 ASP A 1 998 FAD A A 18 FT #HET 428 410 GLY A 1 998 FAD A B 2 FT #HET 429 411 ARG A 2 999 FO1 A S 7 FT #HET 430 412 ASP A 1 998 FAD A S 1 FT #HET 432 414 ASN A 1 998 FAD A A 22 FT #HET 433 415 GLY A 1 998 FAD A B 11 FT #HET 436 418 GLY A 1 998 FAD A B 6 FT #HET 436 418 GLY A 4 1464 GOL A B 2 FT #HET 437 419 ILE A 1 998 FAD A A 3 FT #HET 439 421 TRP A 4 1464 GOL A S 8 FT #HET 440 422 SER A 1 998 FAD A S 2 FT #HET 453 435 GLU A 10 1470 PEG A S 2 FT #HET 454 436 VAL A 10 1470 PEG A S 2 FT #HET 463 445 TYR A 12 1472 PEG A S 6 FT DISORDER 1 13 FT DISORDER 209 212 FT DISORDER 481 482 CC SEQUENCE 463 AA (ATOM); CC LVPRGSHMNP KRIRALKSGK QGDGPVVYWM SRDQRAEDNW ALLFSRAIAK EANVPVVVVF CC CLTDEFLEAG IRQYEFMLKG LQELEVSLSR KKIPSFFLRG DPGEKISRFV KDYNAGTLVT CC DFSPLRIKNQ WIEKVISGIS IPFFEVDAHN VVPCWEASQK HEYAAHTFRP KLYALLPEFL CC EEFPELEPNS VTPELGMVET LSDVLETGVK ALLPERALLK NKDPLFEPWH FEPGEKAAKK CC VMESFIADRL DSYGALRNDP TKNMLSNLSP YLHFGQISSQ RVVLEVEKAE SNPGSKKAFL CC DEILIWKEIS DNFCYYNPGY DGFESFPSWA KESLNAHRND VRSHIYTLEE FEAGKTHDPL CC WNASQMELLS TGKMHGYTRM YWAKKILEWS ESPEKALEIA ICLNDRYELD GRDPNGYAGI CC AWSIGGVHDR AWGEREVTGK IRYMSYEGCK RKFDVKLYIE KYS CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MGSSHHHHHHSSGLVPRGSHMNPKRIRALKSGKQGDGPVVYWMSRDQRAE CC ATOM -------------LVPRGSHMNPKRIRALKSGKQGDGPVVYWMSRDQRAE CC ************************************* CC SEQRES DNWALLFSRAIAKEANVPVVVVFCLTDEFLEAGIRQYEFMLKGLQELEVS CC ATOM DNWALLFSRAIAKEANVPVVVVFCLTDEFLEAGIRQYEFMLKGLQELEVS CC ************************************************** CC SEQRES LSRKKIPSFFLRGDPGEKISRFVKDYNAGTLVTDFSPLRIKNQWIEKVIS CC ATOM LSRKKIPSFFLRGDPGEKISRFVKDYNAGTLVTDFSPLRIKNQWIEKVIS CC ************************************************** CC SEQRES GISIPFFEVDAHNVVPCWEASQKHEYAAHTFRPKLYALLPEFLEEFPELE CC ATOM GISIPFFEVDAHNVVPCWEASQKHEYAAHTFRPKLYALLPEFLEEFPELE CC ************************************************** CC SEQRES PNSVTPELSAGAGMVETLSDVLETGVKALLPERALLKNKDPLFEPWHFEP CC ATOM PNSVTPEL----GMVETLSDVLETGVKALLPERALLKNKDPLFEPWHFEP CC ******** ************************************** CC SEQRES GEKAAKKVMESFIADRLDSYGALRNDPTKNMLSNLSPYLHFGQISSQRVV CC ATOM GEKAAKKVMESFIADRLDSYGALRNDPTKNMLSNLSPYLHFGQISSQRVV CC ************************************************** CC SEQRES LEVEKAESNPGSKKAFLDEILIWKEISDNFCYYNPGYDGFESFPSWAKES CC ATOM LEVEKAESNPGSKKAFLDEILIWKEISDNFCYYNPGYDGFESFPSWAKES CC ************************************************** CC SEQRES LNAHRNDVRSHIYTLEEFEAGKTHDPLWNASQMELLSTGKMHGYTRMYWA CC ATOM LNAHRNDVRSHIYTLEEFEAGKTHDPLWNASQMELLSTGKMHGYTRMYWA CC ************************************************** CC SEQRES KKILEWSESPEKALEIAICLNDRYELDGRDPNGYAGIAWSIGGVHDRAWG CC ATOM KKILEWSESPEKALEIAICLNDRYELDGRDPNGYAGIAWSIGGVHDRAWG CC ************************************************** CC SEQRES EREVTGKIRYMSYEGCKRKFDVKLYIEKYSAL CC ATOM EREVTGKIRYMSYEGCKRKFDVKLYIEKYS-- CC ****************************** SQ SEQUENCE 482 AA; MW; CN; MGSSHHHHHH SSGLVPRGSH MNPKRIRALK SGKQGDGPVV YWMSRDQRAE DNWALLFSRA IAKEANVPVV VVFCLTDEFL EAGIRQYEFM LKGLQELEVS LSRKKIPSFF LRGDPGEKIS RFVKDYNAGT LVTDFSPLRI KNQWIEKVIS GISIPFFEVD AHNVVPCWEA SQKHEYAAHT FRPKLYALLP EFLEEFPELE PNSVTPELSA GAGMVETLSD VLETGVKALL PERALLKNKD PLFEPWHFEP GEKAAKKVME SFIADRLDSY GALRNDPTKN MLSNLSPYLH FGQISSQRVV LEVEKAESNP GSKKAFLDEI LIWKEISDNF CYYNPGYDGF ESFPSWAKES LNAHRNDVRS HIYTLEEFEA GKTHDPLWNA SQMELLSTGK MHGYTRMYWA KKILEWSESP EKALEIAICL NDRYELDGRD PNGYAGIAWS IGGVHDRAWG EREVTGKIRY MSYEGCKRKF DVKLYIEKYS AL // ID 4CDNB STANDARD; PRT; 482 AA. DT CONVERTED FROM PDB (SEQRES) 4CDN DE DEOXYRIBODIPYRIMIDINE PHOTOLYASE OS METHANOSARCINA MAZEI CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.900 CC R-Factor 0.149 FT #SUB 150 132 SER B 345 327 SER A Protein B 4 FT #SUB 150 132 SER B 348 330 LYS A Protein B 3 FT #SUB 151 133 GLY B 345 327 SER A Protein B 2 FT #SUB 152 134 ILE B 345 327 SER A Protein B 7 FT #SUB 153 135 SER B 345 327 SER A Protein S 2 FT #SUB 153 135 SER B 349 331 GLU A Protein S 2 FT #SUB 340 322 PHE B 31 13 SER A Protein S 3 FT #SUB 341 323 GLU B 28 10 ALA A Protein S 2 FT #SUB 341 323 GLU B 31 13 SER A Protein S 1 FT #SUB 355 337 ARG B 32 14 GLY A Protein S 10 FT #SUB 355 337 ARG B 33 15 LYS A Protein A 4 FT #SUB 356 338 ASN B 33 15 LYS A Protein S 5 FT #SUB 411 393 GLU B 31 13 SER A Protein S 10 FT #SUB 411 393 GLU B 32 14 GLY A Protein S 1 FT #HET 44 26 SER B 14 999 FO1 B A 12 FT #HET 45 27 ARG B 14 999 FO1 B A 4 FT #HET 47 29 GLN B 14 999 FO1 B S 1 FT #HET 73 55 PHE B 14 999 FO1 B S 10 FT #HET 74 56 CYS B 14 999 FO1 B B 6 FT #HET 75 57 LEU B 14 999 FO1 B A 3 FT #HET 76 58 THR B 14 999 FO1 B A 8 FT #HET 79 61 PHE B 14 999 FO1 B S 22 FT #HET 82 64 ALA B 14 999 FO1 B S 2 FT #HET 87 69 TYR B 14 999 FO1 B S 1 FT #HET 90 72 MET B 14 999 FO1 B S 8 FT #HET 136 118 SER B 14 999 FO1 B S 5 FT #HET 141 123 LYS B 14 999 FO1 B S 3 FT #HET 144 126 TRP B 14 999 FO1 B S 2 FT #HET 177 159 ALA B 17 1465 SO4 B S 1 FT #HET 178 160 ALA B 17 1465 SO4 B A 7 FT #HET 179 161 HIS B 17 1465 SO4 B A 12 FT #HET 182 164 ARG B 17 1465 SO4 B S 3 FT #HET 248 230 PHE B 15 1463 GOL B S 1 FT #HET 270 252 TYR B 13 998 FAD B S 5 FT #HET 282 264 LEU B 13 998 FAD B A 5 FT #HET 282 264 LEU B 15 1463 GOL B S 1 FT #HET 283 265 SER B 13 998 FAD B A 11 FT #HET 284 266 ASN B 13 998 FAD B B 3 FT #HET 284 266 ASN B 15 1463 GOL B S 5 FT #HET 285 267 LEU B 13 998 FAD B B 2 FT #HET 286 268 SER B 13 998 FAD B A 11 FT #HET 289 271 LEU B 13 998 FAD B S 3 FT #HET 290 272 HIS B 14 999 FO1 B S 6 FT #HET 291 273 PHE B 14 999 FO1 B S 6 FT #HET 316 298 PHE B 13 998 FAD B S 4 FT #HET 319 301 GLU B 13 998 FAD B A 14 FT #HET 320 302 ILE B 13 998 FAD B A 6 FT #HET 323 305 TRP B 13 998 FAD B A 6 FT #HET 323 305 TRP B 16 1464 GOL B S 6 FT #HET 324 306 LYS B 13 998 FAD B A 6 FT #HET 327 309 SER B 13 998 FAD B S 1 FT #HET 340 322 PHE B 9 1469 PGE A S 2 FT #HET 341 323 GLU B 9 1469 PGE A S 2 FT #HET 352 334 ASN B 9 1469 PGE A S 7 FT #HET 355 337 ARG B 9 1469 PGE A S 4 FT #HET 384 366 GLU B 15 1463 GOL B S 4 FT #HET 388 370 THR B 15 1463 GOL B S 3 FT #HET 390 372 LYS B 13 998 FAD B S 6 FT #HET 390 372 LYS B 15 1463 GOL B A 11 FT #HET 392 374 HIS B 15 1463 GOL B A 3 FT #HET 393 375 GLY B 13 998 FAD B B 3 FT #HET 396 378 ARG B 13 998 FAD B A 18 FT #HET 397 379 MET B 13 998 FAD B A 3 FT #HET 397 379 MET B 16 1464 GOL B S 2 FT #HET 400 382 ALA B 13 998 FAD B S 1 FT #HET 421 403 ASN B 13 998 FAD B S 7 FT #HET 427 409 ASP B 13 998 FAD B A 18 FT #HET 428 410 GLY B 13 998 FAD B B 2 FT #HET 429 411 ARG B 14 999 FO1 B S 6 FT #HET 430 412 ASP B 13 998 FAD B S 1 FT #HET 432 414 ASN B 13 998 FAD B A 21 FT #HET 433 415 GLY B 13 998 FAD B B 10 FT #HET 436 418 GLY B 13 998 FAD B B 7 FT #HET 436 418 GLY B 16 1464 GOL B B 2 FT #HET 437 419 ILE B 13 998 FAD B A 3 FT #HET 439 421 TRP B 16 1464 GOL B S 10 FT #HET 440 422 SER B 13 998 FAD B S 2 FT DISORDER 1 13 FT DISORDER 207 235 FT DISORDER 481 482 CC SEQUENCE 438 AA (ATOM); CC LVPRGSHMNP KRIRALKSGK QGDGPVVYWM SRDQRAEDNW ALLFSRAIAK EANVPVVVVF CC CLTDEFLEAG IRQYEFMLKG LQELEVSLSR KKIPSFFLRG DPGEKISRFV KDYNAGTLVT CC DFSPLRIKNQ WIEKVISGIS IPFFEVDAHN VVPCWEASQK HEYAAHTFRP KLYALLPEFL CC EEFPELEPNS VTPLKNKDPL FEPWHFEPGE KAAKKVMESF IADRLDSYGA LRNDPTKNML CC SNLSPYLHFG QISSQRVVLE VEKAESNPGS KKAFLDEILI WKEISDNFCY YNPGYDGFES CC FPSWAKESLN AHRNDVRSHI YTLEEFEAGK THDPLWNASQ MELLSTGKMH GYTRMYWAKK CC ILEWSESPEK ALEIAICLND RYELDGRDPN GYAGIAWSIG GVHDRAWGER EVTGKIRYMS CC YEGCKRKFDV KLYIEKYS CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MGSSHHHHHHSSGLVPRGSHMNPKRIRALKSGKQGDGPVVYWMSRDQRAE CC ATOM -------------LVPRGSHMNPKRIRALKSGKQGDGPVVYWMSRDQRAE CC ************************************* CC SEQRES DNWALLFSRAIAKEANVPVVVVFCLTDEFLEAGIRQYEFMLKGLQELEVS CC ATOM DNWALLFSRAIAKEANVPVVVVFCLTDEFLEAGIRQYEFMLKGLQELEVS CC ************************************************** CC SEQRES LSRKKIPSFFLRGDPGEKISRFVKDYNAGTLVTDFSPLRIKNQWIEKVIS CC ATOM LSRKKIPSFFLRGDPGEKISRFVKDYNAGTLVTDFSPLRIKNQWIEKVIS CC ************************************************** CC SEQRES GISIPFFEVDAHNVVPCWEASQKHEYAAHTFRPKLYALLPEFLEEFPELE CC ATOM GISIPFFEVDAHNVVPCWEASQKHEYAAHTFRPKLYALLPEFLEEFPELE CC ************************************************** CC SEQRES PNSVTPELSAGAGMVETLSDVLETGVKALLPERALLKNKDPLFEPWHFEP CC ATOM PNSVTP-----------------------------LKNKDPLFEPWHFEP CC ****** *************** CC SEQRES GEKAAKKVMESFIADRLDSYGALRNDPTKNMLSNLSPYLHFGQISSQRVV CC ATOM GEKAAKKVMESFIADRLDSYGALRNDPTKNMLSNLSPYLHFGQISSQRVV CC ************************************************** CC SEQRES LEVEKAESNPGSKKAFLDEILIWKEISDNFCYYNPGYDGFESFPSWAKES CC ATOM LEVEKAESNPGSKKAFLDEILIWKEISDNFCYYNPGYDGFESFPSWAKES CC ************************************************** CC SEQRES LNAHRNDVRSHIYTLEEFEAGKTHDPLWNASQMELLSTGKMHGYTRMYWA CC ATOM LNAHRNDVRSHIYTLEEFEAGKTHDPLWNASQMELLSTGKMHGYTRMYWA CC ************************************************** CC SEQRES KKILEWSESPEKALEIAICLNDRYELDGRDPNGYAGIAWSIGGVHDRAWG CC ATOM KKILEWSESPEKALEIAICLNDRYELDGRDPNGYAGIAWSIGGVHDRAWG CC ************************************************** CC SEQRES EREVTGKIRYMSYEGCKRKFDVKLYIEKYSAL CC ATOM EREVTGKIRYMSYEGCKRKFDVKLYIEKYS-- CC ****************************** SQ SEQUENCE 482 AA; MW; CN; MGSSHHHHHH SSGLVPRGSH MNPKRIRALK SGKQGDGPVV YWMSRDQRAE DNWALLFSRA IAKEANVPVV VVFCLTDEFL EAGIRQYEFM LKGLQELEVS LSRKKIPSFF LRGDPGEKIS RFVKDYNAGT LVTDFSPLRI KNQWIEKVIS GISIPFFEVD AHNVVPCWEA SQKHEYAAHT FRPKLYALLP EFLEEFPELE PNSVTPELSA GAGMVETLSD VLETGVKALL PERALLKNKD PLFEPWHFEP GEKAAKKVME SFIADRLDSY GALRNDPTKN MLSNLSPYLH FGQISSQRVV LEVEKAESNP GSKKAFLDEI LIWKEISDNF CYYNPGYDGF ESFPSWAKES LNAHRNDVRS HIYTLEEFEA GKTHDPLWNA SQMELLSTGK MHGYTRMYWA KKILEWSESP EKALEIAICL NDRYELDGRD PNGYAGIAWS IGGVHDRAWG EREVTGKIRY MSYEGCKRKF DVKLYIEKYS AL //