ID 5C3SA STANDARD; PRT; 343 AA. DT CONVERTED FROM PDB (SEQRES) 5C3S DE Thymine dioxygenase OS Neurospora crassa CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.150 CC R-Factor 0.186 FT #SUB 40 38 HIS A 239 237 ASN B Protein S 2 FT #SUB 44 42 THR A 238 236 PRO B Protein A 3 FT #SUB 44 42 THR A 239 237 ASN B Protein S 3 FT #SUB 215 213 GLU A 315 313 GLU B Protein B 2 FT #SUB 219 217 TYR A 314 312 SER B Protein S 3 FT #SUB 238 236 PRO A 44 42 THR B Protein B 3 FT #SUB 239 237 ASN A 40 38 HIS B Protein B 2 FT #SUB 239 237 ASN A 44 42 THR B Protein B 3 FT #SUB 258 256 ASP A 317 315 LYS B Protein S 6 FT #SUB 313 311 GLU A 333 331 ALA B Protein A 6 FT #SUB 313 311 GLU A 334 332 THR B Protein A 4 FT #SUB 314 312 SER A 219 217 TYR B Protein A 4 FT #SUB 314 312 SER A 330 328 ARG B Protein A 6 FT #SUB 314 312 SER A 333 331 ALA B Protein B 1 FT #SUB 314 312 SER A 334 332 THR B Protein A 2 FT #SUB 315 313 GLU A 215 213 GLU B Protein S 2 FT #SUB 315 313 GLU A 330 328 ARG B Protein B 3 FT #SUB 316 314 ARG A 330 328 ARG B Protein B 3 FT #SUB 317 315 LYS A 258 256 ASP B Protein S 7 FT #SUB 317 315 LYS A 326 324 TYR B Protein B 3 FT #SUB 318 316 TYR A 318 316 TYR B Protein S 5 FT #SUB 319 317 GLU A 329 327 GLN B Protein S 5 FT #SUB 325 323 LYS A 319 317 GLU B Protein S 1 FT #SUB 326 324 TYR A 317 315 LYS B Protein S 5 FT #SUB 329 327 GLN A 319 317 GLU B Protein S 6 FT #SUB 330 328 ARG A 314 312 SER B Protein A 5 FT #SUB 330 328 ARG A 315 313 GLU B Protein S 3 FT #SUB 330 328 ARG A 316 314 ARG B Protein S 4 FT #SUB 330 328 ARG A 317 315 LYS B Protein S 1 FT #SUB 333 331 ALA A 313 311 GLU B Protein A 6 FT #SUB 333 331 ALA A 314 312 SER B Protein S 2 FT #SUB 334 332 THR A 312 310 ALA B Protein S 2 FT #SUB 334 332 THR A 313 311 GLU B Protein A 5 FT #SUB 334 332 THR A 314 312 SER B Protein S 4 FT #SUB 2 0 SER A 12 10 GLY C Protein S 1 FT #SUB 3 1 MET A 9 7 ASN C Protein S 1 FT #SUB 3 1 MET A 10 8 GLU C Protein B 2 FT #SUB 3 1 MET A 11 9 ASP C Protein B 3 FT #SUB 4 2 GLU A 10 8 GLU C Protein B 1 FT #SUB 4 2 GLU A 11 9 ASP C Protein B 1 FT #SUB 5 3 LYS A 10 8 GLU C Protein A 3 FT #SUB 5 3 LYS A 11 9 ASP C Protein B 1 FT #SUB 5 3 LYS A 241 239 GLN C Protein S 5 FT #SUB 5 3 LYS A 242 240 PHE C Protein S 1 FT #SUB 5 3 LYS A 243 241 ILE C Protein S 1 FT #SUB 6 4 ALA A 10 8 GLU C Protein A 4 FT #SUB 9 7 ASN A 244 242 ASP C Protein S 3 FT #SUB 10 8 GLU A 279 277 PRO C Protein A 6 FT #SUB 10 8 GLU A 280 278 LYS C Protein S 1 FT #SUB 11 9 ASP A 279 277 PRO C Protein B 3 FT #SUB 11 9 ASP A 280 278 LYS C Protein S 3 FT #SUB 11 9 ASP A 281 279 GLN C Protein S 7 FT #SUB 241 239 GLN A 281 279 GLN C Protein S 4 FT #SUB 242 240 PHE A 281 279 GLN C Protein B 1 FT #SUB 243 241 ILE A 281 279 GLN C Protein S 1 FT #HET 89 87 ASN A 3 403 FYU A S 1 FT #HET 192 190 ARG A 2 402 AKG A S 5 FT #HET 192 190 ARG A 3 403 FYU A S 4 FT #HET 194 192 LEU A 2 402 AKG A S 3 FT #HET 196 194 TYR A 2 402 AKG A S 5 FT #HET 216 214 HIS A 1 401 NI A S 3 FT #HET 216 214 HIS A 2 402 AKG A S 8 FT #HET 216 214 HIS A 3 403 FYU A S 2 FT #HET 218 216 ASP A 1 401 NI A S 3 FT #HET 218 216 ASP A 2 402 AKG A S 1 FT #HET 218 216 ASP A 3 403 FYU A A 7 FT #HET 219 217 TYR A 3 403 FYU A A 19 FT #HET 225 223 LEU A 2 402 AKG A S 6 FT #HET 233 231 LEU A 2 402 AKG A S 2 FT #HET 273 271 HIS A 1 401 NI A S 3 FT #HET 273 271 HIS A 2 402 AKG A S 2 FT #HET 275 273 VAL A 2 402 AKG A S 4 FT #HET 288 286 ARG A 2 402 AKG A S 8 FT #HET 290 288 SER A 2 402 AKG A S 4 FT #HET 294 292 PHE A 2 402 AKG A S 4 FT #HET 294 292 PHE A 3 403 FYU A S 23 FT #HET 331 329 LEU A 3 403 FYU A S 2 FT DISORDER 335 343 CC SEQUENCE 334 AA (ATOM); CC GSMEKAAVNE DGLVIPLIDF SKFLEGDETL KLETAKAILH GFQTAGFIYL KNIPIQPDFR CC EHVFNTSAKF FKLPKEKKLE VGWTTPEANR GYSAPGREKV TQLTDPAEIE KIRSAAPDIK CC ESYEIGREDE PGHPNPWPAE QDDLVGFKST MNNFFDQCKA LHIEVMRAIA VGMGIDANYF CC DSFVDVGDNI LRLLHYPAVK SEVFKINPGQ VRAGEHTDYG SITLLFQDSR GGLQVKSPNG CC QFIDATPIEN TVVVNAGDLL ARWSNDTIKS TVHRVVEPPK QEDVHPPRYS IAYFCNPNHK CC SYIEAIPGTY AAESERKYEG INSGKYLVQR LAAT CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES GSMEKAAVNEDGLVIPLIDFSKFLEGDETLKLETAKAILHGFQTAGFIYL CC ATOM GSMEKAAVNEDGLVIPLIDFSKFLEGDETLKLETAKAILHGFQTAGFIYL CC ************************************************** CC SEQRES KNIPIQPDFREHVFNTSAKFFKLPKEKKLEVGWTTPEANRGYSAPGREKV CC ATOM KNIPIQPDFREHVFNTSAKFFKLPKEKKLEVGWTTPEANRGYSAPGREKV CC ************************************************** CC SEQRES TQLTDPAEIEKIRSAAPDIKESYEIGREDEPGHPNPWPAEQDDLVGFKST CC ATOM TQLTDPAEIEKIRSAAPDIKESYEIGREDEPGHPNPWPAEQDDLVGFKST CC ************************************************** CC SEQRES MNNFFDQCKALHIEVMRAIAVGMGIDANYFDSFVDVGDNILRLLHYPAVK CC ATOM MNNFFDQCKALHIEVMRAIAVGMGIDANYFDSFVDVGDNILRLLHYPAVK CC ************************************************** CC SEQRES SEVFKINPGQVRAGEHTDYGSITLLFQDSRGGLQVKSPNGQFIDATPIEN CC ATOM SEVFKINPGQVRAGEHTDYGSITLLFQDSRGGLQVKSPNGQFIDATPIEN CC ************************************************** CC SEQRES TVVVNAGDLLARWSNDTIKSTVHRVVEPPKQEDVHPPRYSIAYFCNPNHK CC ATOM TVVVNAGDLLARWSNDTIKSTVHRVVEPPKQEDVHPPRYSIAYFCNPNHK CC ************************************************** CC SEQRES SYIEAIPGTYAAESERKYEGINSGKYLVQRLAATYLEHHHHHH CC ATOM SYIEAIPGTYAAESERKYEGINSGKYLVQRLAAT--------- CC ********************************** SQ SEQUENCE 343 AA; MW; CN; GSMEKAAVNE DGLVIPLIDF SKFLEGDETL KLETAKAILH GFQTAGFIYL KNIPIQPDFR EHVFNTSAKF FKLPKEKKLE VGWTTPEANR GYSAPGREKV TQLTDPAEIE KIRSAAPDIK ESYEIGREDE PGHPNPWPAE QDDLVGFKST MNNFFDQCKA LHIEVMRAIA VGMGIDANYF DSFVDVGDNI LRLLHYPAVK SEVFKINPGQ VRAGEHTDYG SITLLFQDSR GGLQVKSPNG QFIDATPIEN TVVVNAGDLL ARWSNDTIKS TVHRVVEPPK QEDVHPPRYS IAYFCNPNHK SYIEAIPGTY AAESERKYEG INSGKYLVQR LAATYLEHHH HHH // ID 5C3SB STANDARD; PRT; 343 AA. DT CONVERTED FROM PDB (SEQRES) 5C3S DE Thymine dioxygenase OS Neurospora crassa CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.150 CC R-Factor 0.186 FT #SUB 40 38 HIS B 239 237 ASN A Protein S 2 FT #SUB 44 42 THR B 238 236 PRO A Protein A 3 FT #SUB 44 42 THR B 239 237 ASN A Protein S 3 FT #SUB 215 213 GLU B 315 313 GLU A Protein B 2 FT #SUB 219 217 TYR B 314 312 SER A Protein S 4 FT #SUB 238 236 PRO B 44 42 THR A Protein B 3 FT #SUB 239 237 ASN B 40 38 HIS A Protein B 2 FT #SUB 239 237 ASN B 44 42 THR A Protein B 3 FT #SUB 258 256 ASP B 317 315 LYS A Protein S 7 FT #SUB 312 310 ALA B 334 332 THR A Protein A 2 FT #SUB 313 311 GLU B 333 331 ALA A Protein A 6 FT #SUB 313 311 GLU B 334 332 THR A Protein A 5 FT #SUB 314 312 SER B 219 217 TYR A Protein A 3 FT #SUB 314 312 SER B 330 328 ARG A Protein A 5 FT #SUB 314 312 SER B 333 331 ALA A Protein B 2 FT #SUB 314 312 SER B 334 332 THR A Protein A 4 FT #SUB 315 313 GLU B 215 213 GLU A Protein S 2 FT #SUB 315 313 GLU B 330 328 ARG A Protein B 3 FT #SUB 316 314 ARG B 330 328 ARG A Protein B 4 FT #SUB 317 315 LYS B 258 256 ASP A Protein S 6 FT #SUB 317 315 LYS B 326 324 TYR A Protein B 5 FT #SUB 317 315 LYS B 330 328 ARG A Protein B 1 FT #SUB 318 316 TYR B 318 316 TYR A Protein S 5 FT #SUB 319 317 GLU B 325 323 LYS A Protein S 1 FT #SUB 319 317 GLU B 329 327 GLN A Protein S 6 FT #SUB 326 324 TYR B 317 315 LYS A Protein S 3 FT #SUB 329 327 GLN B 319 317 GLU A Protein S 5 FT #SUB 330 328 ARG B 314 312 SER A Protein A 6 FT #SUB 330 328 ARG B 315 313 GLU A Protein S 3 FT #SUB 330 328 ARG B 316 314 ARG A Protein S 3 FT #SUB 333 331 ALA B 313 311 GLU A Protein A 6 FT #SUB 333 331 ALA B 314 312 SER A Protein S 1 FT #SUB 334 332 THR B 313 311 GLU A Protein S 4 FT #SUB 334 332 THR B 314 312 SER A Protein S 2 FT #SUB 5 3 LYS B 61 59 GLU C Protein S 1 FT #SUB 9 7 ASN B 229 227 SER C Protein S 1 FT #SUB 10 8 GLU B 249 247 GLU C Protein S 7 FT #SUB 230 228 ARG B 249 247 GLU D Protein S 4 FT #SUB 249 247 GLU B 51 49 LYS D Protein S 2 FT #SUB 249 247 GLU B 52 50 ASN D Protein S 5 FT #SUB 280 278 LYS B 61 59 GLU D Protein S 8 FT #HET 44 42 THR B 6 403 EDO B B 2 FT #HET 45 43 ALA B 6 403 EDO B B 2 FT #HET 89 87 ASN B 7 404 FYU B S 1 FT #HET 192 190 ARG B 5 402 AKG B S 5 FT #HET 192 190 ARG B 7 404 FYU B S 4 FT #HET 194 192 LEU B 5 402 AKG B S 3 FT #HET 196 194 TYR B 5 402 AKG B S 5 FT #HET 216 214 HIS B 4 401 NI B S 3 FT #HET 216 214 HIS B 5 402 AKG B S 6 FT #HET 216 214 HIS B 7 404 FYU B S 2 FT #HET 218 216 ASP B 4 401 NI B S 3 FT #HET 218 216 ASP B 5 402 AKG B S 1 FT #HET 218 216 ASP B 7 404 FYU B A 7 FT #HET 219 217 TYR B 7 404 FYU B A 16 FT #HET 225 223 LEU B 5 402 AKG B S 6 FT #HET 233 231 LEU B 5 402 AKG B S 2 FT #HET 236 234 LYS B 6 403 EDO B B 1 FT #HET 237 235 SER B 6 403 EDO B B 2 FT #HET 238 236 PRO B 6 403 EDO B A 4 FT #HET 269 267 LYS B 6 403 EDO B S 3 FT #HET 271 269 THR B 6 403 EDO B S 2 FT #HET 273 271 HIS B 4 401 NI B S 3 FT #HET 273 271 HIS B 5 402 AKG B S 2 FT #HET 275 273 VAL B 5 402 AKG B S 4 FT #HET 288 286 ARG B 5 402 AKG B S 8 FT #HET 290 288 SER B 5 402 AKG B S 4 FT #HET 294 292 PHE B 5 402 AKG B S 4 FT #HET 294 292 PHE B 7 404 FYU B S 24 FT #HET 331 329 LEU B 7 404 FYU B S 2 FT DISORDER 1 2 FT DISORDER 96 117 FT DISORDER 335 343 CC SEQUENCE 310 AA (ATOM); CC MEKAAVNEDG LVIPLIDFSK FLEGDETLKL ETAKAILHGF QTAGFIYLKN IPIQPDFREH CC VFNTSAKFFK LPKEKKLEVG WTTPEANRGY SAPDIKESYE IGREDEPGHP NPWPAEQDDL CC VGFKSTMNNF FDQCKALHIE VMRAIAVGMG IDANYFDSFV DVGDNILRLL HYPAVKSEVF CC KINPGQVRAG EHTDYGSITL LFQDSRGGLQ VKSPNGQFID ATPIENTVVV NAGDLLARWS CC NDTIKSTVHR VVEPPKQEDV HPPRYSIAYF CNPNHKSYIE AIPGTYAAES ERKYEGINSG CC KYLVQRLAAT CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES GSMEKAAVNEDGLVIPLIDFSKFLEGDETLKLETAKAILHGFQTAGFIYL CC ATOM --MEKAAVNEDGLVIPLIDFSKFLEGDETLKLETAKAILHGFQTAGFIYL CC ************************************************ CC SEQRES KNIPIQPDFREHVFNTSAKFFKLPKEKKLEVGWTTPEANRGYSAPGREKV CC ATOM KNIPIQPDFREHVFNTSAKFFKLPKEKKLEVGWTTPEANRGYSAP----- CC ********************************************* CC SEQRES TQLTDPAEIEKIRSAAPDIKESYEIGREDEPGHPNPWPAEQDDLVGFKST CC ATOM -----------------DIKESYEIGREDEPGHPNPWPAEQDDLVGFKST CC ********************************* CC SEQRES MNNFFDQCKALHIEVMRAIAVGMGIDANYFDSFVDVGDNILRLLHYPAVK CC ATOM MNNFFDQCKALHIEVMRAIAVGMGIDANYFDSFVDVGDNILRLLHYPAVK CC ************************************************** CC SEQRES SEVFKINPGQVRAGEHTDYGSITLLFQDSRGGLQVKSPNGQFIDATPIEN CC ATOM SEVFKINPGQVRAGEHTDYGSITLLFQDSRGGLQVKSPNGQFIDATPIEN CC ************************************************** CC SEQRES TVVVNAGDLLARWSNDTIKSTVHRVVEPPKQEDVHPPRYSIAYFCNPNHK CC ATOM TVVVNAGDLLARWSNDTIKSTVHRVVEPPKQEDVHPPRYSIAYFCNPNHK CC ************************************************** CC SEQRES SYIEAIPGTYAAESERKYEGINSGKYLVQRLAATYLEHHHHHH CC ATOM SYIEAIPGTYAAESERKYEGINSGKYLVQRLAAT--------- CC ********************************** SQ SEQUENCE 343 AA; MW; CN; GSMEKAAVNE DGLVIPLIDF SKFLEGDETL KLETAKAILH GFQTAGFIYL KNIPIQPDFR EHVFNTSAKF FKLPKEKKLE VGWTTPEANR GYSAPGREKV TQLTDPAEIE KIRSAAPDIK ESYEIGREDE PGHPNPWPAE QDDLVGFKST MNNFFDQCKA LHIEVMRAIA VGMGIDANYF DSFVDVGDNI LRLLHYPAVK SEVFKINPGQ VRAGEHTDYG SITLLFQDSR GGLQVKSPNG QFIDATPIEN TVVVNAGDLL ARWSNDTIKS TVHRVVEPPK QEDVHPPRYS IAYFCNPNHK SYIEAIPGTY AAESERKYEG INSGKYLVQR LAATYLEHHH HHH // ID 5C3SC STANDARD; PRT; 343 AA. DT CONVERTED FROM PDB (SEQRES) 5C3S DE Thymine dioxygenase OS Neurospora crassa CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.150 CC R-Factor 0.186 FT #SUB 9 7 ASN C 3 1 MET A Protein B 1 FT #SUB 10 8 GLU C 3 1 MET A Protein B 2 FT #SUB 10 8 GLU C 4 2 GLU A Protein S 1 FT #SUB 10 8 GLU C 5 3 LYS A Protein S 3 FT #SUB 10 8 GLU C 6 4 ALA A Protein S 4 FT #SUB 11 9 ASP C 3 1 MET A Protein B 3 FT #SUB 11 9 ASP C 4 2 GLU A Protein B 1 FT #SUB 11 9 ASP C 5 3 LYS A Protein B 1 FT #SUB 12 10 GLY C 2 0 SER A Protein B 1 FT #SUB 241 239 GLN C 5 3 LYS A Protein S 5 FT #SUB 242 240 PHE C 5 3 LYS A Protein B 1 FT #SUB 243 241 ILE C 5 3 LYS A Protein S 1 FT #SUB 244 242 ASP C 9 7 ASN A Protein S 3 FT #SUB 279 277 PRO C 10 8 GLU A Protein A 6 FT #SUB 279 277 PRO C 11 9 ASP A Protein B 3 FT #SUB 280 278 LYS C 10 8 GLU A Protein B 1 FT #SUB 280 278 LYS C 11 9 ASP A Protein A 3 FT #SUB 281 279 GLN C 11 9 ASP A Protein A 7 FT #SUB 281 279 GLN C 241 239 GLN A Protein A 4 FT #SUB 281 279 GLN C 242 240 PHE A Protein S 1 FT #SUB 281 279 GLN C 243 241 ILE A Protein S 1 FT #SUB 61 59 GLU C 5 3 LYS B Protein S 1 FT #SUB 229 227 SER C 9 7 ASN B Protein S 1 FT #SUB 249 247 GLU C 10 8 GLU B Protein S 7 FT #SUB 56 54 GLN C 62 60 HIS D Protein S 11 FT #SUB 58 56 ASP C 56 54 GLN D Protein S 5 FT #SUB 58 56 ASP C 58 56 ASP D Protein S 1 FT #SUB 58 56 ASP C 59 57 PHE D Protein S 1 FT #SUB 59 57 PHE C 153 151 ASN D Protein S 5 FT #SUB 149 147 SER C 156 154 ASP D Protein S 2 FT #SUB 153 151 ASN C 149 147 SER D Protein S 3 FT #SUB 153 151 ASN C 153 151 ASN D Protein S 1 FT #SUB 157 155 GLN C 149 147 SER D Protein S 2 FT #HET 44 42 THR C 10 403 EDO C B 1 FT #HET 89 87 ASN C 11 404 FYU C S 1 FT #HET 192 190 ARG C 9 402 AKG C S 5 FT #HET 192 190 ARG C 11 404 FYU C S 5 FT #HET 194 192 LEU C 9 402 AKG C S 3 FT #HET 196 194 TYR C 9 402 AKG C S 5 FT #HET 216 214 HIS C 8 401 NI C S 3 FT #HET 216 214 HIS C 9 402 AKG C S 7 FT #HET 216 214 HIS C 11 404 FYU C S 3 FT #HET 218 216 ASP C 8 401 NI C S 3 FT #HET 218 216 ASP C 9 402 AKG C S 1 FT #HET 218 216 ASP C 11 404 FYU C A 6 FT #HET 219 217 TYR C 11 404 FYU C A 16 FT #HET 225 223 LEU C 9 402 AKG C S 6 FT #HET 233 231 LEU C 9 402 AKG C S 2 FT #HET 236 234 LYS C 10 403 EDO C B 3 FT #HET 237 235 SER C 10 403 EDO C B 2 FT #HET 238 236 PRO C 10 403 EDO C B 3 FT #HET 269 267 LYS C 10 403 EDO C S 1 FT #HET 271 269 THR C 10 403 EDO C A 8 FT #HET 273 271 HIS C 8 401 NI C S 3 FT #HET 273 271 HIS C 9 402 AKG C S 2 FT #HET 275 273 VAL C 9 402 AKG C S 4 FT #HET 288 286 ARG C 9 402 AKG C S 8 FT #HET 290 288 SER C 9 402 AKG C S 3 FT #HET 294 292 PHE C 9 402 AKG C S 4 FT #HET 294 292 PHE C 11 404 FYU C S 23 FT #HET 331 329 LEU C 11 404 FYU C S 2 FT DISORDER 1 2 FT DISORDER 96 116 FT DISORDER 335 343 CC SEQUENCE 311 AA (ATOM); CC MEKAAVNEDG LVIPLIDFSK FLEGDETLKL ETAKAILHGF QTAGFIYLKN IPIQPDFREH CC VFNTSAKFFK LPKEKKLEVG WTTPEANRGY SAPPDIKESY EIGREDEPGH PNPWPAEQDD CC LVGFKSTMNN FFDQCKALHI EVMRAIAVGM GIDANYFDSF VDVGDNILRL LHYPAVKSEV CC FKINPGQVRA GEHTDYGSIT LLFQDSRGGL QVKSPNGQFI DATPIENTVV VNAGDLLARW CC SNDTIKSTVH RVVEPPKQED VHPPRYSIAY FCNPNHKSYI EAIPGTYAAE SERKYEGINS CC GKYLVQRLAA T CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES GSMEKAAVNEDGLVIPLIDFSKFLEGDETLKLETAKAILHGFQTAGFIYL CC ATOM --MEKAAVNEDGLVIPLIDFSKFLEGDETLKLETAKAILHGFQTAGFIYL CC ************************************************ CC SEQRES KNIPIQPDFREHVFNTSAKFFKLPKEKKLEVGWTTPEANRGYSAPGREKV CC ATOM KNIPIQPDFREHVFNTSAKFFKLPKEKKLEVGWTTPEANRGYSAP----- CC ********************************************* CC SEQRES TQLTDPAEIEKIRSAAPDIKESYEIGREDEPGHPNPWPAEQDDLVGFKST CC ATOM ----------------PDIKESYEIGREDEPGHPNPWPAEQDDLVGFKST CC ********************************** CC SEQRES MNNFFDQCKALHIEVMRAIAVGMGIDANYFDSFVDVGDNILRLLHYPAVK CC ATOM MNNFFDQCKALHIEVMRAIAVGMGIDANYFDSFVDVGDNILRLLHYPAVK CC ************************************************** CC SEQRES SEVFKINPGQVRAGEHTDYGSITLLFQDSRGGLQVKSPNGQFIDATPIEN CC ATOM SEVFKINPGQVRAGEHTDYGSITLLFQDSRGGLQVKSPNGQFIDATPIEN CC ************************************************** CC SEQRES TVVVNAGDLLARWSNDTIKSTVHRVVEPPKQEDVHPPRYSIAYFCNPNHK CC ATOM TVVVNAGDLLARWSNDTIKSTVHRVVEPPKQEDVHPPRYSIAYFCNPNHK CC ************************************************** CC SEQRES SYIEAIPGTYAAESERKYEGINSGKYLVQRLAATYLEHHHHHH CC ATOM SYIEAIPGTYAAESERKYEGINSGKYLVQRLAAT--------- CC ********************************** SQ SEQUENCE 343 AA; MW; CN; GSMEKAAVNE DGLVIPLIDF SKFLEGDETL KLETAKAILH GFQTAGFIYL KNIPIQPDFR EHVFNTSAKF FKLPKEKKLE VGWTTPEANR GYSAPGREKV TQLTDPAEIE KIRSAAPDIK ESYEIGREDE PGHPNPWPAE QDDLVGFKST MNNFFDQCKA LHIEVMRAIA VGMGIDANYF DSFVDVGDNI LRLLHYPAVK SEVFKINPGQ VRAGEHTDYG SITLLFQDSR GGLQVKSPNG QFIDATPIEN TVVVNAGDLL ARWSNDTIKS TVHRVVEPPK QEDVHPPRYS IAYFCNPNHK SYIEAIPGTY AAESERKYEG INSGKYLVQR LAATYLEHHH HHH // ID 5C3SD STANDARD; PRT; 343 AA. DT CONVERTED FROM PDB (SEQRES) 5C3S DE Thymine dioxygenase OS Neurospora crassa CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.150 CC R-Factor 0.186 FT #SUB 51 49 LYS D 249 247 GLU B Protein S 2 FT #SUB 52 50 ASN D 249 247 GLU B Protein S 5 FT #SUB 61 59 GLU D 280 278 LYS B Protein S 8 FT #SUB 249 247 GLU D 230 228 ARG B Protein S 4 FT #SUB 56 54 GLN D 58 56 ASP C Protein S 5 FT #SUB 58 56 ASP D 58 56 ASP C Protein S 1 FT #SUB 59 57 PHE D 58 56 ASP C Protein B 1 FT #SUB 62 60 HIS D 56 54 GLN C Protein S 11 FT #SUB 149 147 SER D 153 151 ASN C Protein A 3 FT #SUB 149 147 SER D 157 155 GLN C Protein S 2 FT #SUB 153 151 ASN D 59 57 PHE C Protein S 5 FT #SUB 153 151 ASN D 153 151 ASN C Protein B 1 FT #SUB 156 154 ASP D 149 147 SER C Protein S 2 FT #HET 44 42 THR D 14 403 EDO D B 1 FT #HET 89 87 ASN D 15 404 FYU D S 1 FT #HET 192 190 ARG D 13 402 AKG D S 5 FT #HET 192 190 ARG D 15 404 FYU D S 4 FT #HET 194 192 LEU D 13 402 AKG D S 3 FT #HET 196 194 TYR D 13 402 AKG D S 5 FT #HET 216 214 HIS D 12 401 NI D S 3 FT #HET 216 214 HIS D 13 402 AKG D S 9 FT #HET 216 214 HIS D 15 404 FYU D S 2 FT #HET 218 216 ASP D 12 401 NI D S 3 FT #HET 218 216 ASP D 13 402 AKG D S 1 FT #HET 218 216 ASP D 15 404 FYU D A 7 FT #HET 219 217 TYR D 15 404 FYU D A 15 FT #HET 225 223 LEU D 13 402 AKG D S 5 FT #HET 233 231 LEU D 13 402 AKG D S 2 FT #HET 236 234 LYS D 14 403 EDO D B 3 FT #HET 237 235 SER D 14 403 EDO D B 2 FT #HET 238 236 PRO D 14 403 EDO D B 3 FT #HET 269 267 LYS D 14 403 EDO D S 1 FT #HET 271 269 THR D 14 403 EDO D A 8 FT #HET 273 271 HIS D 12 401 NI D S 3 FT #HET 273 271 HIS D 13 402 AKG D S 2 FT #HET 275 273 VAL D 13 402 AKG D S 4 FT #HET 288 286 ARG D 13 402 AKG D S 7 FT #HET 290 288 SER D 13 402 AKG D S 4 FT #HET 294 292 PHE D 13 402 AKG D S 4 FT #HET 294 292 PHE D 15 404 FYU D S 11 FT #HET 331 329 LEU D 15 404 FYU D S 1 FT DISORDER 1 4 FT DISORDER 334 343 CC SEQUENCE 329 AA (ATOM); CC KAAVNEDGLV IPLIDFSKFL EGDETLKLET AKAILHGFQT AGFIYLKNIP IQPDFREHVF CC NTSAKFFKLP KEKKLEVGWT TPEANRGYSA PGREKVTQLT DPAEIEKIRS AAPDIKESYE CC IGREDEPGHP NPWPAEQDDL VGFKSTMNNF FDQCKALHIE VMRAIAVGMG IDANYFDSFV CC DVGDNILRLL HYPAVKSEVF KINPGQVRAG EHTDYGSITL LFQDSRGGLQ VKSPNGQFID CC ATPIENTVVV NAGDLLARWS NDTIKSTVHR VVEPPKQEDV HPPRYSIAYF CNPNHKSYIE CC AIPGTYAAES ERKYEGINSG KYLVQRLAA CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES GSMEKAAVNEDGLVIPLIDFSKFLEGDETLKLETAKAILHGFQTAGFIYL CC ATOM ----KAAVNEDGLVIPLIDFSKFLEGDETLKLETAKAILHGFQTAGFIYL CC ********************************************** CC SEQRES KNIPIQPDFREHVFNTSAKFFKLPKEKKLEVGWTTPEANRGYSAPGREKV CC ATOM KNIPIQPDFREHVFNTSAKFFKLPKEKKLEVGWTTPEANRGYSAPGREKV CC ************************************************** CC SEQRES TQLTDPAEIEKIRSAAPDIKESYEIGREDEPGHPNPWPAEQDDLVGFKST CC ATOM TQLTDPAEIEKIRSAAPDIKESYEIGREDEPGHPNPWPAEQDDLVGFKST CC ************************************************** CC SEQRES MNNFFDQCKALHIEVMRAIAVGMGIDANYFDSFVDVGDNILRLLHYPAVK CC ATOM MNNFFDQCKALHIEVMRAIAVGMGIDANYFDSFVDVGDNILRLLHYPAVK CC ************************************************** CC SEQRES SEVFKINPGQVRAGEHTDYGSITLLFQDSRGGLQVKSPNGQFIDATPIEN CC ATOM SEVFKINPGQVRAGEHTDYGSITLLFQDSRGGLQVKSPNGQFIDATPIEN CC ************************************************** CC SEQRES TVVVNAGDLLARWSNDTIKSTVHRVVEPPKQEDVHPPRYSIAYFCNPNHK CC ATOM TVVVNAGDLLARWSNDTIKSTVHRVVEPPKQEDVHPPRYSIAYFCNPNHK CC ************************************************** CC SEQRES SYIEAIPGTYAAESERKYEGINSGKYLVQRLAATYLEHHHHHH CC ATOM SYIEAIPGTYAAESERKYEGINSGKYLVQRLAA---------- CC ********************************* SQ SEQUENCE 343 AA; MW; CN; GSMEKAAVNE DGLVIPLIDF SKFLEGDETL KLETAKAILH GFQTAGFIYL KNIPIQPDFR EHVFNTSAKF FKLPKEKKLE VGWTTPEANR GYSAPGREKV TQLTDPAEIE KIRSAAPDIK ESYEIGREDE PGHPNPWPAE QDDLVGFKST MNNFFDQCKA LHIEVMRAIA VGMGIDANYF DSFVDVGDNI LRLLHYPAVK SEVFKINPGQ VRAGEHTDYG SITLLFQDSR GGLQVKSPNG QFIDATPIEN TVVVNAGDLL ARWSNDTIKS TVHRVVEPPK QEDVHPPRYS IAYFCNPNHK SYIEAIPGTY AAESERKYEG INSGKYLVQR LAATYLEHHH HHH //