ID 4ZQDA STANDARD; PRT; 384 AA. DT CONVERTED FROM PDB (SEQRES) 4ZQD DE Aryl hydrocarbon receptor nuclear translocator OS Mus musculus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.870 CC R-Factor 0.218 FT #SUB 25 105 MET A 56 57 MET B Protein S 3 FT #SUB 28 108 TYR A 56 57 MET B Protein S 4 FT #SUB 28 108 TYR A 60 61 ILE B Protein A 5 FT #SUB 32 112 LEU A 60 61 ILE B Protein S 1 FT #SUB 32 112 LEU A 63 64 LEU B Protein S 1 FT #SUB 35 115 MET A 67 68 LYS B Protein S 4 FT #SUB 48 128 LYS A 29 30 GLU B Protein S 4 FT #SUB 56 136 VAL A 36 37 LEU B Protein S 1 FT #SUB 59 139 MET A 62 63 PHE B Protein S 1 FT #SUB 60 140 LYS A 39 40 GLU B Protein S 10 FT #SUB 81 161 ASP A 87 88 ASP B Protein A 7 FT #SUB 85 165 LYS A 90 91 TYR B Protein S 1 FT #SUB 86 166 HIS A 73 74 CYS B Protein S 2 FT #SUB 87 167 LEU A 222 223 ILE B Protein S 1 FT #SUB 90 170 GLU A 197 198 GLN B Protein B 1 FT #SUB 91 171 ALA A 195 196 THR B Protein B 1 FT #SUB 91 171 ALA A 196 197 GLY B Protein B 1 FT #SUB 91 171 ALA A 222 223 ILE B Protein S 1 FT #SUB 92 172 ALA A 224 225 MET B Protein S 1 FT #SUB 96 176 LEU A 90 91 TYR B Protein S 3 FT #SUB 96 176 LEU A 93 94 ALA B Protein S 1 FT #SUB 110 190 SER A 90 91 TYR B Protein S 1 FT #SUB 136 216 ASP A 345 346 GLU B Protein S 8 FT #SUB 137 217 ASP A 343 344 LEU B Protein S 1 FT #SUB 139 219 ASP A 241 242 LYS B Protein S 1 FT #SUB 140 220 LYS A 239 240 ASP B Protein A 5 FT #SUB 140 220 LYS A 342 343 VAL B Protein S 1 FT #SUB 143 223 GLU A 239 240 ASP B Protein S 4 FT #SUB 143 223 GLU A 240 241 SER B Protein S 7 FT #SUB 144 224 GLN A 237 238 PRO B Protein S 1 FT #SUB 144 224 GLN A 239 240 ASP B Protein S 4 FT #SUB 180 260 ARG A 92 93 LYS B Protein S 5 FT #SUB 180 260 ARG A 93 94 ALA B Protein S 4 FT #SUB 180 260 ARG A 94 95 LEU B Protein S 4 FT #SUB 180 260 ARG A 95 96 GLU B Protein S 3 FT #SUB 180 260 ARG A 237 238 PRO B Protein B 2 FT #SUB 181 261 ARG A 95 96 GLU B Protein B 3 FT #SUB 181 261 ARG A 237 238 PRO B Protein S 4 FT #SUB 182 262 SER A 95 96 GLU B Protein S 4 FT #SUB 184 264 ILE A 319 320 GLU B Protein S 1 FT #SUB 184 264 ILE A 343 344 LEU B Protein S 1 FT #SUB 186 266 ARG A 343 344 LEU B Protein S 5 FT #SUB 186 266 ARG A 344 345 SER B Protein S 2 FT #SUB 225 305 VAL A 305 306 GLN B Protein S 1 FT #SUB 227 307 HIS A 319 320 GLU B Protein S 5 FT #SUB 229 309 THR A 93 94 ALA B Protein A 7 FT #SUB 229 309 THR A 95 96 GLU B Protein A 9 FT #SUB 230 310 GLY A 93 94 ALA B Protein B 3 FT #SUB 230 310 GLY A 95 96 GLU B Protein B 2 FT #SUB 231 311 TYR A 89 90 LEU B Protein S 4 FT #SUB 231 311 TYR A 92 93 LYS B Protein S 8 FT #SUB 231 311 TYR A 93 94 ALA B Protein S 5 FT #SUB 258 338 VAL A 89 90 LEU B Protein S 2 FT #SUB 258 338 VAL A 93 94 ALA B Protein S 1 FT #SUB 260 340 ILE A 90 91 TYR B Protein S 1 FT #SUB 260 340 ILE A 93 94 ALA B Protein S 3 FT #SUB 260 340 ILE A 94 95 LEU B Protein S 2 FT #SUB 262 342 ARG A 193 194 HIS B Protein S 2 FT #SUB 262 342 ARG A 195 196 THR B Protein S 4 FT #SUB 262 342 ARG A 224 225 MET B Protein S 2 FT #SUB 262 342 ARG A 226 227 GLU B Protein S 9 FT #SUB 264 344 GLN A 166 167 ASP B Protein S 1 FT #SUB 264 344 GLN A 193 194 HIS B Protein S 1 FT #SUB 264 344 GLN A 305 306 GLN B Protein A 4 FT #SUB 264 344 GLN A 319 320 GLU B Protein S 1 FT #SUB 286 366 ARG A 277 278 TYR B Protein S 1 FT #SUB 286 366 ARG A 278 279 GLU B Protein S 1 FT #SUB 286 366 ARG A 280 281 TYR B Protein S 4 FT #SUB 286 366 ARG A 282 283 ALA B Protein S 3 FT #SUB 294 374 THR A 354 355 SER B Protein B 2 FT #SUB 294 374 THR A 355 356 MET B Protein B 1 FT #SUB 295 375 PHE A 281 282 HIS B Protein S 1 FT #SUB 295 375 PHE A 282 283 ALA B Protein S 2 FT #SUB 295 375 PHE A 353 354 PHE B Protein A 4 FT #SUB 296 376 VAL A 353 354 PHE B Protein A 7 FT #SUB 298 378 HIS A 351 352 VAL B Protein S 1 FT #SUB 298 378 HIS A 353 354 PHE B Protein S 1 FT #SUB 308 388 PRO A 352 353 VAL B Protein S 2 FT #SUB 312 392 LEU A 353 354 PHE B Protein S 3 FT #SUB 312 392 LEU A 354 355 SER B Protein S 4 FT #SUB 312 392 LEU A 355 356 MET B Protein S 1 FT #SUB 313 393 GLY A 355 356 MET B Protein B 2 FT #SUB 366 446 PHE A 277 278 TYR B Protein S 2 FT #SUB 366 446 PHE A 289 290 THR B Protein S 1 FT #SUB 368 448 ASN A 250 251 ASP B Protein S 1 FT #SUB 369 449 PRO A 277 278 TYR B Protein S 2 FT #SUB 369 449 PRO A 289 290 THR B Protein S 1 FT #SUB 369 449 PRO A 292 293 HIS B Protein S 3 FT #SUB 369 449 PRO A 293 294 GLN B Protein S 1 FT #SUB 376 456 TYR A 285 286 SER B Protein S 1 FT #SUB 289 369 ILE A 44 45 HIS D Protein S 1 FT #SUB 289 369 ILE A 45 46 SER D Protein S 1 FT #SUB 289 369 ILE A 48 49 SER D Protein S 2 FT #SUB 289 369 ILE A 49 50 HIS D Protein A 8 FT #SUB 290 370 GLU A 34 35 TYR D Protein S 1 FT #SUB 290 370 GLU A 48 49 SER D Protein S 2 FT #SUB 290 370 GLU A 49 50 HIS D Protein B 2 FT #SUB 336 416 VAL A 49 50 HIS D Protein S 3 FT #SUB 337 417 LYS A 124 125 GLU D Protein S 3 FT #SUB 338 418 LEU A 181 182 THR D Protein S 3 FT #SUB 374 454 ILE A 45 46 SER D Protein S 1 FT DISORDER 1 19 FT DISORDER 42 47 FT DISORDER 63 79 FT DISORDER 146 178 FT DISORDER 190 220 FT DISORDER 235 254 FT DISORDER 266 280 CC SEQUENCE 243 AA (ATOM); CC RRRNKMTAYI TELSDMVPTC SAKLTILRMA VSHMKSLTDQ ELKHLILEAA DGFLFIVSCE CC TGRVVYVSDS VTPVLNQPQS EWFGSTLYDQ VHPDDVDKLR EQLSRRSFIC RMRCPHFVVV CC HCTGYIKAFC LVAIGRLQVT EFISRHNIEG IFTFVDHRCV ATVGYQPQEL LGKNIVEFCH CC PEDQQLLRDS FQQVVKLKGQ VLSVMFRFRS KTREWLWMRT SSFTFQNPYS DEIEYIICTN CC TNV CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSSADKERLARENHSEIERRRRNKMTAYITELSDMVPTCSALARKPDKLT CC ATOM -------------------RRRNKMTAYITELSDMVPTCSA------KLT CC ********************** *** CC SEQRES ILRMAVSHMKSLRGTGNTSTDGSYKPSFLTDQELKHLILEAADGFLFIVS CC ATOM ILRMAVSHMKSL-----------------TDQELKHLILEAADGFLFIVS CC ************ ********************* CC SEQRES CETGRVVYVSDSVTPVLNQPQSEWFGSTLYDQVHPDDVDKLREQLSTSEN CC ATOM CETGRVVYVSDSVTPVLNQPQSEWFGSTLYDQVHPDDVDKLREQL----- CC ********************************************* CC SEQRES ALTGRVLDLKTGTVKKEGQQSSMRMCMGSRRSFICRMRCGTSSVDPVSMN CC ATOM ----------------------------SRRSFICRMRC----------- CC *********** CC SEQRES RLSFLRNRCRNGLGSVKEGEPHFVVVHCTGYIKAWPPAGVSLPDDDPEAG CC ATOM --------------------PHFVVVHCTGYIKA---------------- CC ************** CC SEQRES QGSKFCLVAIGRLQVTSSPNCTDMSNICQPTEFISRHNIEGIFTFVDHRC CC ATOM ----FCLVAIGRLQV---------------TEFISRHNIEGIFTFVDHRC CC *********** ******************** CC SEQRES VATVGYQPQELLGKNIVEFCHPEDQQLLRDSFQQVVKLKGQVLSVMFRFR CC ATOM VATVGYQPQELLGKNIVEFCHPEDQQLLRDSFQQVVKLKGQVLSVMFRFR CC ************************************************** CC SEQRES SKTREWLWMRTSSFTFQNPYSDEIEYIICTNTNV CC ATOM SKTREWLWMRTSSFTFQNPYSDEIEYIICTNTNV CC ********************************** SQ SEQUENCE 384 AA; MW; CN; MSSADKERLA RENHSEIERR RRNKMTAYIT ELSDMVPTCS ALARKPDKLT ILRMAVSHMK SLRGTGNTST DGSYKPSFLT DQELKHLILE AADGFLFIVS CETGRVVYVS DSVTPVLNQP QSEWFGSTLY DQVHPDDVDK LREQLSTSEN ALTGRVLDLK TGTVKKEGQQ SSMRMCMGSR RSFICRMRCG TSSVDPVSMN RLSFLRNRCR NGLGSVKEGE PHFVVVHCTG YIKAWPPAGV SLPDDDPEAG QGSKFCLVAI GRLQVTSSPN CTDMSNICQP TEFISRHNIE GIFTFVDHRC VATVGYQPQE LLGKNIVEFC HPEDQQLLRD SFQQVVKLKG QVLSVMFRFR SKTREWLWMR TSSFTFQNPY SDEIEYIICT NTNV // ID 4ZQDB STANDARD; PRT; 360 AA. DT CONVERTED FROM PDB (SEQRES) 4ZQD DE Endothelial PAS domain-containing protein 1 OS Mus musculus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.870 CC R-Factor 0.218 FT #SUB 29 30 GLU B 48 128 LYS A Protein S 4 FT #SUB 36 37 LEU B 56 136 VAL A Protein S 1 FT #SUB 39 40 GLU B 60 140 LYS A Protein S 10 FT #SUB 56 57 MET B 25 105 MET A Protein S 3 FT #SUB 56 57 MET B 28 108 TYR A Protein S 4 FT #SUB 60 61 ILE B 28 108 TYR A Protein S 5 FT #SUB 60 61 ILE B 32 112 LEU A Protein S 1 FT #SUB 62 63 PHE B 59 139 MET A Protein S 1 FT #SUB 63 64 LEU B 32 112 LEU A Protein S 1 FT #SUB 67 68 LYS B 35 115 MET A Protein S 4 FT #SUB 73 74 CYS B 86 166 HIS A Protein S 2 FT #SUB 87 88 ASP B 81 161 ASP A Protein S 7 FT #SUB 89 90 LEU B 231 311 TYR A Protein A 4 FT #SUB 89 90 LEU B 258 338 VAL A Protein A 2 FT #SUB 90 91 TYR B 85 165 LYS A Protein S 1 FT #SUB 90 91 TYR B 96 176 LEU A Protein S 3 FT #SUB 90 91 TYR B 110 190 SER A Protein S 1 FT #SUB 90 91 TYR B 260 340 ILE A Protein B 1 FT #SUB 92 93 LYS B 180 260 ARG A Protein B 5 FT #SUB 92 93 LYS B 231 311 TYR A Protein A 8 FT #SUB 93 94 ALA B 96 176 LEU A Protein S 1 FT #SUB 93 94 ALA B 180 260 ARG A Protein B 4 FT #SUB 93 94 ALA B 229 309 THR A Protein A 7 FT #SUB 93 94 ALA B 230 310 GLY A Protein B 3 FT #SUB 93 94 ALA B 231 311 TYR A Protein A 5 FT #SUB 93 94 ALA B 258 338 VAL A Protein S 1 FT #SUB 93 94 ALA B 260 340 ILE A Protein A 3 FT #SUB 94 95 LEU B 180 260 ARG A Protein B 4 FT #SUB 94 95 LEU B 260 340 ILE A Protein A 2 FT #SUB 95 96 GLU B 180 260 ARG A Protein S 3 FT #SUB 95 96 GLU B 181 261 ARG A Protein S 3 FT #SUB 95 96 GLU B 182 262 SER A Protein S 4 FT #SUB 95 96 GLU B 229 309 THR A Protein S 9 FT #SUB 95 96 GLU B 230 310 GLY A Protein S 2 FT #SUB 166 167 ASP B 264 344 GLN A Protein S 1 FT #SUB 193 194 HIS B 262 342 ARG A Protein S 2 FT #SUB 193 194 HIS B 264 344 GLN A Protein S 1 FT #SUB 195 196 THR B 91 171 ALA A Protein B 1 FT #SUB 195 196 THR B 262 342 ARG A Protein S 4 FT #SUB 196 197 GLY B 91 171 ALA A Protein B 1 FT #SUB 197 198 GLN B 90 170 GLU A Protein S 1 FT #SUB 222 223 ILE B 87 167 LEU A Protein S 1 FT #SUB 222 223 ILE B 91 171 ALA A Protein S 1 FT #SUB 224 225 MET B 92 172 ALA A Protein S 1 FT #SUB 224 225 MET B 262 342 ARG A Protein S 2 FT #SUB 226 227 GLU B 262 342 ARG A Protein S 9 FT #SUB 237 238 PRO B 144 224 GLN A Protein B 1 FT #SUB 237 238 PRO B 180 260 ARG A Protein S 2 FT #SUB 237 238 PRO B 181 261 ARG A Protein S 4 FT #SUB 239 240 ASP B 140 220 LYS A Protein S 5 FT #SUB 239 240 ASP B 143 223 GLU A Protein A 4 FT #SUB 239 240 ASP B 144 224 GLN A Protein S 4 FT #SUB 240 241 SER B 143 223 GLU A Protein A 7 FT #SUB 241 242 LYS B 139 219 ASP A Protein S 1 FT #SUB 250 251 ASP B 368 448 ASN A Protein B 1 FT #SUB 277 278 TYR B 286 366 ARG A Protein B 1 FT #SUB 277 278 TYR B 366 446 PHE A Protein S 2 FT #SUB 277 278 TYR B 369 449 PRO A Protein S 2 FT #SUB 278 279 GLU B 286 366 ARG A Protein B 1 FT #SUB 280 281 TYR B 286 366 ARG A Protein B 4 FT #SUB 281 282 HIS B 295 375 PHE A Protein S 1 FT #SUB 282 283 ALA B 286 366 ARG A Protein A 3 FT #SUB 282 283 ALA B 295 375 PHE A Protein A 2 FT #SUB 285 286 SER B 376 456 TYR A Protein S 1 FT #SUB 289 290 THR B 366 446 PHE A Protein S 1 FT #SUB 289 290 THR B 369 449 PRO A Protein S 1 FT #SUB 292 293 HIS B 369 449 PRO A Protein S 3 FT #SUB 293 294 GLN B 369 449 PRO A Protein S 1 FT #SUB 305 306 GLN B 225 305 VAL A Protein S 1 FT #SUB 305 306 GLN B 264 344 GLN A Protein S 4 FT #SUB 319 320 GLU B 184 264 ILE A Protein S 1 FT #SUB 319 320 GLU B 227 307 HIS A Protein S 5 FT #SUB 319 320 GLU B 264 344 GLN A Protein S 1 FT #SUB 342 343 VAL B 140 220 LYS A Protein B 1 FT #SUB 343 344 LEU B 137 217 ASP A Protein B 1 FT #SUB 343 344 LEU B 184 264 ILE A Protein S 1 FT #SUB 343 344 LEU B 186 266 ARG A Protein B 5 FT #SUB 344 345 SER B 186 266 ARG A Protein A 2 FT #SUB 345 346 GLU B 136 216 ASP A Protein S 8 FT #SUB 351 352 VAL B 298 378 HIS A Protein S 1 FT #SUB 352 353 VAL B 308 388 PRO A Protein B 2 FT #SUB 353 354 PHE B 295 375 PHE A Protein A 4 FT #SUB 353 354 PHE B 296 376 VAL A Protein A 7 FT #SUB 353 354 PHE B 298 378 HIS A Protein S 1 FT #SUB 353 354 PHE B 312 392 LEU A Protein B 3 FT #SUB 354 355 SER B 294 374 THR A Protein A 2 FT #SUB 354 355 SER B 312 392 LEU A Protein B 4 FT #SUB 355 356 MET B 294 374 THR A Protein B 1 FT #SUB 355 356 MET B 312 392 LEU A Protein B 1 FT #SUB 355 356 MET B 313 393 GLY A Protein S 2 FT #HET 243 244 PHE B 1 401 0X3 B S 3 FT #HET 245 246 SER B 1 401 0X3 B S 2 FT #HET 247 248 HIS B 1 401 0X3 B S 23 FT #HET 251 252 MET B 1 401 0X3 B A 8 FT #HET 253 254 PHE B 1 401 0X3 B S 10 FT #HET 260 261 ILE B 1 401 0X3 B S 1 FT #HET 276 277 ALA B 1 401 0X3 B A 7 FT #HET 279 280 PHE B 1 401 0X3 B S 1 FT #HET 280 281 TYR B 1 401 0X3 B S 7 FT #HET 306 307 TYR B 1 401 0X3 B S 2 FT #HET 308 309 MET B 1 401 0X3 B S 2 FT #HET 320 321 THR B 1 401 0X3 B S 7 FT #HET 336 337 ILE B 1 401 0X3 B S 1 FT #HET 338 339 CYS B 1 401 0X3 B S 7 FT #HET 340 341 ASN B 1 401 0X3 B S 5 FT DISORDER 1 24 FT DISORDER 75 85 FT DISORDER 149 161 FT DISORDER 176 180 FT DISORDER 201 218 CC SEQUENCE 289 AA (ATOM); CC RRSKETEVFY ELAHELPLPH SVSSHLDKAS IMRLAISFLR THKLLSSVCS MDNLYLKALE CC GFIAVVTQDG DMIFLSENIS KFMGLTQVEL TGHSIFDFTH PCDHEEIREN LTLSTERDFF CC MRMKCTVTVN LKSATWKVLH CTGQVRVSCL IIMCEPIQHP SHMDIPLDSK TFLSRHSMDM CC KFTYCDDRIL ELIGYHPEEL LGRSAYEFYH ALDSENMTKS HQNLCTKGQV VSGQYRMLAK CC HGGYVWLETQ GTVIYNPRNL QPQCIMCVNY VLSEIEKNDV VFSMDQTES CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MADKEKKRSSSELRKEKSRDAARCRRSKETEVFYELAHELPLPHSVSSHL CC ATOM ------------------------RRSKETEVFYELAHELPLPHSVSSHL CC ************************** CC SEQRES DKASIMRLAISFLRTHKLLSSVCSENESEAEADQQMDNLYLKALEGFIAV CC ATOM DKASIMRLAISFLRTHKLLSSVCS-----------MDNLYLKALEGFIAV CC ************************ *************** CC SEQRES VTQDGDMIFLSENISKFMGLTQVELTGHSIFDFTHPCDHEEIRENLTLKN CC ATOM VTQDGDMIFLSENISKFMGLTQVELTGHSIFDFTHPCDHEEIRENLTL-- CC ************************************************ CC SEQRES GSGFGKKSKDVSTERDFFMRMKCTVTNRGRTVNLKSATWKVLHCTGQVRV CC ATOM -----------STERDFFMRMKCTV-----TVNLKSATWKVLHCTGQVRV CC ************** ******************** CC SEQRES YNNCPPHSSLCGSKEPLLSCLIIMCEPIQHPSHMDIPLDSKTFLSRHSMD CC ATOM ------------------SCLIIMCEPIQHPSHMDIPLDSKTFLSRHSMD CC ******************************** CC SEQRES MKFTYCDDRILELIGYHPEELLGRSAYEFYHALDSENMTKSHQNLCTKGQ CC ATOM MKFTYCDDRILELIGYHPEELLGRSAYEFYHALDSENMTKSHQNLCTKGQ CC ************************************************** CC SEQRES VVSGQYRMLAKHGGYVWLETQGTVIYNPRNLQPQCIMCVNYVLSEIEKND CC ATOM VVSGQYRMLAKHGGYVWLETQGTVIYNPRNLQPQCIMCVNYVLSEIEKND CC ************************************************** CC SEQRES VVFSMDQTES CC ATOM VVFSMDQTES CC ********** SQ SEQUENCE 360 AA; MW; CN; MADKEKKRSS SELRKEKSRD AARCRRSKET EVFYELAHEL PLPHSVSSHL DKASIMRLAI SFLRTHKLLS SVCSENESEA EADQQMDNLY LKALEGFIAV VTQDGDMIFL SENISKFMGL TQVELTGHSI FDFTHPCDHE EIRENLTLKN GSGFGKKSKD VSTERDFFMR MKCTVTNRGR TVNLKSATWK VLHCTGQVRV YNNCPPHSSL CGSKEPLLSC LIIMCEPIQH PSHMDIPLDS KTFLSRHSMD MKFTYCDDRI LELIGYHPEE LLGRSAYEFY HALDSENMTK SHQNLCTKGQ VVSGQYRMLA KHGGYVWLET QGTVIYNPRN LQPQCIMCVN YVLSEIEKND VVFSMDQTES // ID 4ZQDC STANDARD; PRT; 384 AA. DT CONVERTED FROM PDB (SEQRES) 4ZQD DE Aryl hydrocarbon receptor nuclear translocator OS Mus musculus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.870 CC R-Factor 0.218 FT #SUB 25 105 MET C 52 53 LYS D Protein S 2 FT #SUB 25 105 MET C 53 54 ALA D Protein S 2 FT #SUB 25 105 MET C 56 57 MET D Protein S 1 FT #SUB 28 108 TYR C 53 54 ALA D Protein S 5 FT #SUB 28 108 TYR C 56 57 MET D Protein A 11 FT #SUB 28 108 TYR C 57 58 ARG D Protein S 5 FT #SUB 28 108 TYR C 60 61 ILE D Protein B 1 FT #SUB 29 109 ILE C 56 57 MET D Protein B 2 FT #SUB 32 112 LEU C 56 57 MET D Protein S 1 FT #SUB 32 112 LEU C 59 60 ALA D Protein S 1 FT #SUB 32 112 LEU C 60 61 ILE D Protein A 5 FT #SUB 32 112 LEU C 63 64 LEU D Protein S 1 FT #SUB 35 115 MET C 63 64 LEU D Protein S 4 FT #SUB 35 115 MET C 64 65 ARG D Protein S 2 FT #SUB 47 127 ASP C 25 26 ARG D Protein S 3 FT #SUB 49 129 LEU C 25 26 ARG D Protein S 7 FT #SUB 49 129 LEU C 28 29 LYS D Protein S 5 FT #SUB 49 129 LEU C 29 30 GLU D Protein S 7 FT #SUB 49 129 LEU C 32 33 VAL D Protein S 1 FT #SUB 52 132 LEU C 32 33 VAL D Protein S 2 FT #SUB 52 132 LEU C 33 34 PHE D Protein S 2 FT #SUB 53 133 ARG C 32 33 VAL D Protein S 2 FT #SUB 56 136 VAL C 36 37 LEU D Protein A 6 FT #SUB 59 139 MET C 39 40 GLU D Protein S 3 FT #SUB 59 139 MET C 40 41 LEU D Protein S 2 FT #SUB 59 139 MET C 62 63 PHE D Protein S 4 FT #SUB 81 161 ASP C 87 88 ASP D Protein A 12 FT #SUB 83 163 GLU C 69 70 LEU D Protein S 1 FT #SUB 84 164 LEU C 87 88 ASP D Protein S 1 FT #SUB 84 164 LEU C 90 91 TYR D Protein S 2 FT #SUB 85 165 LYS C 90 91 TYR D Protein S 4 FT #SUB 87 167 LEU C 98 99 ILE D Protein S 3 FT #SUB 87 167 LEU C 109 110 PHE D Protein S 1 FT #SUB 88 168 ILE C 90 91 TYR D Protein S 2 FT #SUB 90 170 GLU C 197 198 GLN D Protein A 4 FT #SUB 90 170 GLU C 199 200 ARG D Protein S 3 FT #SUB 91 171 ALA C 98 99 ILE D Protein S 1 FT #SUB 91 171 ALA C 195 196 THR D Protein B 4 FT #SUB 91 171 ALA C 196 197 GLY D Protein B 4 FT #SUB 91 171 ALA C 222 223 ILE D Protein A 4 FT #SUB 91 171 ALA C 223 224 ILE D Protein S 4 FT #SUB 91 171 ALA C 224 225 MET D Protein S 1 FT #SUB 92 172 ALA C 224 225 MET D Protein S 1 FT #SUB 96 176 LEU C 89 90 LEU D Protein S 1 FT #SUB 96 176 LEU C 90 91 TYR D Protein S 3 FT #SUB 110 190 SER C 90 91 TYR D Protein S 1 FT #SUB 136 216 ASP C 345 346 GLU D Protein S 3 FT #SUB 140 220 LYS C 342 343 VAL D Protein S 1 FT #SUB 143 223 GLU C 239 240 ASP D Protein S 3 FT #SUB 143 223 GLU C 240 241 SER D Protein S 10 FT #SUB 180 260 ARG C 92 93 LYS D Protein S 11 FT #SUB 180 260 ARG C 93 94 ALA D Protein S 4 FT #SUB 180 260 ARG C 94 95 LEU D Protein S 3 FT #SUB 180 260 ARG C 95 96 GLU D Protein S 13 FT #SUB 180 260 ARG C 237 238 PRO D Protein B 3 FT #SUB 181 261 ARG C 95 96 GLU D Protein B 1 FT #SUB 181 261 ARG C 237 238 PRO D Protein S 4 FT #SUB 184 264 ILE C 319 320 GLU D Protein S 1 FT #SUB 184 264 ILE C 343 344 LEU D Protein S 1 FT #SUB 186 266 ARG C 343 344 LEU D Protein S 3 FT #SUB 225 305 VAL C 305 306 GLN D Protein S 3 FT #SUB 227 307 HIS C 319 320 GLU D Protein S 6 FT #SUB 229 309 THR C 93 94 ALA D Protein A 7 FT #SUB 229 309 THR C 95 96 GLU D Protein S 3 FT #SUB 230 310 GLY C 93 94 ALA D Protein B 3 FT #SUB 230 310 GLY C 95 96 GLU D Protein B 2 FT #SUB 231 311 TYR C 89 90 LEU D Protein S 5 FT #SUB 231 311 TYR C 92 93 LYS D Protein S 12 FT #SUB 231 311 TYR C 93 94 ALA D Protein S 5 FT #SUB 260 340 ILE C 90 91 TYR D Protein S 1 FT #SUB 260 340 ILE C 93 94 ALA D Protein S 2 FT #SUB 260 340 ILE C 94 95 LEU D Protein S 3 FT #SUB 264 344 GLN C 305 306 GLN D Protein S 3 FT #SUB 286 366 ARG C 278 279 GLU D Protein S 2 FT #SUB 286 366 ARG C 280 281 TYR D Protein S 4 FT #SUB 286 366 ARG C 281 282 HIS D Protein S 1 FT #SUB 286 366 ARG C 282 283 ALA D Protein S 3 FT #SUB 292 372 ILE C 355 356 MET D Protein S 1 FT #SUB 293 373 PHE C 355 356 MET D Protein B 2 FT #SUB 294 374 THR C 353 354 PHE D Protein B 1 FT #SUB 294 374 THR C 354 355 SER D Protein B 3 FT #SUB 294 374 THR C 355 356 MET D Protein A 4 FT #SUB 295 375 PHE C 281 282 HIS D Protein S 3 FT #SUB 295 375 PHE C 282 283 ALA D Protein S 4 FT #SUB 295 375 PHE C 309 310 LEU D Protein S 1 FT #SUB 295 375 PHE C 353 354 PHE D Protein A 6 FT #SUB 296 376 VAL C 353 354 PHE D Protein A 4 FT #SUB 298 378 HIS C 348 349 LYS D Protein S 2 FT #SUB 308 388 PRO C 352 353 VAL D Protein S 1 FT #SUB 309 389 GLN C 350 351 ASP D Protein S 1 FT #SUB 309 389 GLN C 352 353 VAL D Protein S 2 FT #SUB 312 392 LEU C 352 353 VAL D Protein S 1 FT #SUB 312 392 LEU C 353 354 PHE D Protein S 3 FT #SUB 312 392 LEU C 354 355 SER D Protein S 3 FT #SUB 312 392 LEU C 358 359 THR D Protein S 2 FT #SUB 313 393 GLY C 355 356 MET D Protein B 2 FT #SUB 366 446 PHE C 277 278 TYR D Protein S 4 FT #SUB 368 448 ASN C 250 251 ASP D Protein S 10 FT #SUB 368 448 ASN C 277 278 TYR D Protein S 2 FT #SUB 369 449 PRO C 277 278 TYR D Protein S 4 FT #SUB 369 449 PRO C 289 290 THR D Protein S 1 FT #SUB 369 449 PRO C 292 293 HIS D Protein S 1 FT #SUB 370 450 TYR C 249 250 MET D Protein S 14 FT #SUB 370 450 TYR C 250 251 ASP D Protein S 2 FT #SUB 370 450 TYR C 292 293 HIS D Protein A 3 FT #SUB 371 451 SER C 250 251 ASP D Protein A 7 FT #SUB 375 455 GLU C 275 276 SER D Protein S 4 FT #SUB 375 455 GLU C 277 278 TYR D Protein S 4 FT #SUB 376 456 TYR C 277 278 TYR D Protein S 2 FT DISORDER 1 17 FT DISORDER 62 79 FT DISORDER 148 178 FT DISORDER 190 220 FT DISORDER 236 254 FT DISORDER 265 280 CC SEQUENCE 252 AA (ATOM); CC ERRRRNKMTA YITELSDMVP TCSALARKPD KLTILRMAVS HMKSTDQELK HLILEAADGF CC LFIVSCETGR VVYVSDSVTP VLNQPQSEWF GSTLYDQVHP DDVDKLREQL STSRRSFICR CC MRCPHFVVVH CTGYIKAWFC LVAIGRLQTE FISRHNIEGI FTFVDHRCVA TVGYQPQELL CC GKNIVEFCHP EDQQLLRDSF QQVVKLKGQV LSVMFRFRSK TREWLWMRTS SFTFQNPYSD CC EIEYIICTNT NV CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSSADKERLARENHSEIERRRRNKMTAYITELSDMVPTCSALARKPDKLT CC ATOM -----------------ERRRRNKMTAYITELSDMVPTCSALARKPDKLT CC ********************************* CC SEQRES ILRMAVSHMKSLRGTGNTSTDGSYKPSFLTDQELKHLILEAADGFLFIVS CC ATOM ILRMAVSHMKS------------------TDQELKHLILEAADGFLFIVS CC *********** ********************* CC SEQRES CETGRVVYVSDSVTPVLNQPQSEWFGSTLYDQVHPDDVDKLREQLSTSEN CC ATOM CETGRVVYVSDSVTPVLNQPQSEWFGSTLYDQVHPDDVDKLREQLST--- CC *********************************************** CC SEQRES ALTGRVLDLKTGTVKKEGQQSSMRMCMGSRRSFICRMRCGTSSVDPVSMN CC ATOM ----------------------------SRRSFICRMRC----------- CC *********** CC SEQRES RLSFLRNRCRNGLGSVKEGEPHFVVVHCTGYIKAWPPAGVSLPDDDPEAG CC ATOM --------------------PHFVVVHCTGYIKAW--------------- CC *************** CC SEQRES QGSKFCLVAIGRLQVTSSPNCTDMSNICQPTEFISRHNIEGIFTFVDHRC CC ATOM ----FCLVAIGRLQ----------------TEFISRHNIEGIFTFVDHRC CC ********** ******************** CC SEQRES VATVGYQPQELLGKNIVEFCHPEDQQLLRDSFQQVVKLKGQVLSVMFRFR CC ATOM VATVGYQPQELLGKNIVEFCHPEDQQLLRDSFQQVVKLKGQVLSVMFRFR CC ************************************************** CC SEQRES SKTREWLWMRTSSFTFQNPYSDEIEYIICTNTNV CC ATOM SKTREWLWMRTSSFTFQNPYSDEIEYIICTNTNV CC ********************************** SQ SEQUENCE 384 AA; MW; CN; MSSADKERLA RENHSEIERR RRNKMTAYIT ELSDMVPTCS ALARKPDKLT ILRMAVSHMK SLRGTGNTST DGSYKPSFLT DQELKHLILE AADGFLFIVS CETGRVVYVS DSVTPVLNQP QSEWFGSTLY DQVHPDDVDK LREQLSTSEN ALTGRVLDLK TGTVKKEGQQ SSMRMCMGSR RSFICRMRCG TSSVDPVSMN RLSFLRNRCR NGLGSVKEGE PHFVVVHCTG YIKAWPPAGV SLPDDDPEAG QGSKFCLVAI GRLQVTSSPN CTDMSNICQP TEFISRHNIE GIFTFVDHRC VATVGYQPQE LLGKNIVEFC HPEDQQLLRD SFQQVVKLKG QVLSVMFRFR SKTREWLWMR TSSFTFQNPY SDEIEYIICT NTNV // ID 4ZQDD STANDARD; PRT; 360 AA. DT CONVERTED FROM PDB (SEQRES) 4ZQD DE Endothelial PAS domain-containing protein 1 OS Mus musculus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.870 CC R-Factor 0.218 FT #SUB 34 35 TYR D 290 370 GLU A Protein S 1 FT #SUB 44 45 HIS D 289 369 ILE A Protein B 1 FT #SUB 45 46 SER D 289 369 ILE A Protein B 1 FT #SUB 45 46 SER D 374 454 ILE A Protein S 1 FT #SUB 48 49 SER D 289 369 ILE A Protein S 2 FT #SUB 48 49 SER D 290 370 GLU A Protein S 2 FT #SUB 49 50 HIS D 289 369 ILE A Protein S 8 FT #SUB 49 50 HIS D 290 370 GLU A Protein S 2 FT #SUB 49 50 HIS D 336 416 VAL A Protein S 3 FT #SUB 124 125 GLU D 337 417 LYS A Protein S 3 FT #SUB 181 182 THR D 338 418 LEU A Protein S 3 FT #SUB 25 26 ARG D 47 127 ASP C Protein S 3 FT #SUB 25 26 ARG D 49 129 LEU C Protein S 7 FT #SUB 28 29 LYS D 49 129 LEU C Protein A 5 FT #SUB 29 30 GLU D 49 129 LEU C Protein A 7 FT #SUB 32 33 VAL D 49 129 LEU C Protein S 1 FT #SUB 32 33 VAL D 52 132 LEU C Protein S 2 FT #SUB 32 33 VAL D 53 133 ARG C Protein S 2 FT #SUB 33 34 PHE D 52 132 LEU C Protein S 2 FT #SUB 36 37 LEU D 56 136 VAL C Protein A 6 FT #SUB 39 40 GLU D 59 139 MET C Protein B 3 FT #SUB 40 41 LEU D 59 139 MET C Protein A 2 FT #SUB 52 53 LYS D 25 105 MET C Protein A 2 FT #SUB 53 54 ALA D 25 105 MET C Protein B 2 FT #SUB 53 54 ALA D 28 108 TYR C Protein A 5 FT #SUB 56 57 MET D 25 105 MET C Protein S 1 FT #SUB 56 57 MET D 28 108 TYR C Protein S 11 FT #SUB 56 57 MET D 29 109 ILE C Protein S 2 FT #SUB 56 57 MET D 32 112 LEU C Protein S 1 FT #SUB 57 58 ARG D 28 108 TYR C Protein S 5 FT #SUB 59 60 ALA D 32 112 LEU C Protein S 1 FT #SUB 60 61 ILE D 28 108 TYR C Protein S 1 FT #SUB 60 61 ILE D 32 112 LEU C Protein A 5 FT #SUB 62 63 PHE D 59 139 MET C Protein S 4 FT #SUB 63 64 LEU D 32 112 LEU C Protein S 1 FT #SUB 63 64 LEU D 35 115 MET C Protein A 4 FT #SUB 64 65 ARG D 35 115 MET C Protein B 2 FT #SUB 69 70 LEU D 83 163 GLU C Protein S 1 FT #SUB 87 88 ASP D 81 161 ASP C Protein S 12 FT #SUB 87 88 ASP D 84 164 LEU C Protein B 1 FT #SUB 89 90 LEU D 96 176 LEU C Protein S 1 FT #SUB 89 90 LEU D 231 311 TYR C Protein A 5 FT #SUB 90 91 TYR D 84 164 LEU C Protein S 2 FT #SUB 90 91 TYR D 85 165 LYS C Protein S 4 FT #SUB 90 91 TYR D 88 168 ILE C Protein S 2 FT #SUB 90 91 TYR D 96 176 LEU C Protein S 3 FT #SUB 90 91 TYR D 110 190 SER C Protein S 1 FT #SUB 90 91 TYR D 260 340 ILE C Protein B 1 FT #SUB 92 93 LYS D 180 260 ARG C Protein A 11 FT #SUB 92 93 LYS D 231 311 TYR C Protein A 12 FT #SUB 93 94 ALA D 180 260 ARG C Protein B 4 FT #SUB 93 94 ALA D 229 309 THR C Protein A 7 FT #SUB 93 94 ALA D 230 310 GLY C Protein B 3 FT #SUB 93 94 ALA D 231 311 TYR C Protein A 5 FT #SUB 93 94 ALA D 260 340 ILE C Protein S 2 FT #SUB 94 95 LEU D 180 260 ARG C Protein B 3 FT #SUB 94 95 LEU D 260 340 ILE C Protein A 3 FT #SUB 95 96 GLU D 180 260 ARG C Protein S 13 FT #SUB 95 96 GLU D 181 261 ARG C Protein S 1 FT #SUB 95 96 GLU D 229 309 THR C Protein S 3 FT #SUB 95 96 GLU D 230 310 GLY C Protein S 2 FT #SUB 98 99 ILE D 87 167 LEU C Protein S 3 FT #SUB 98 99 ILE D 91 171 ALA C Protein S 1 FT #SUB 109 110 PHE D 87 167 LEU C Protein S 1 FT #SUB 195 196 THR D 91 171 ALA C Protein B 4 FT #SUB 196 197 GLY D 91 171 ALA C Protein B 4 FT #SUB 197 198 GLN D 90 170 GLU C Protein S 4 FT #SUB 199 200 ARG D 90 170 GLU C Protein S 3 FT #SUB 222 223 ILE D 91 171 ALA C Protein A 4 FT #SUB 223 224 ILE D 91 171 ALA C Protein B 4 FT #SUB 224 225 MET D 91 171 ALA C Protein B 1 FT #SUB 224 225 MET D 92 172 ALA C Protein S 1 FT #SUB 237 238 PRO D 180 260 ARG C Protein S 3 FT #SUB 237 238 PRO D 181 261 ARG C Protein S 4 FT #SUB 239 240 ASP D 143 223 GLU C Protein A 3 FT #SUB 240 241 SER D 143 223 GLU C Protein A 10 FT #SUB 249 250 MET D 370 450 TYR C Protein A 14 FT #SUB 250 251 ASP D 368 448 ASN C Protein A 10 FT #SUB 250 251 ASP D 370 450 TYR C Protein B 2 FT #SUB 250 251 ASP D 371 451 SER C Protein S 7 FT #SUB 275 276 SER D 375 455 GLU C Protein S 4 FT #SUB 277 278 TYR D 366 446 PHE C Protein S 4 FT #SUB 277 278 TYR D 368 448 ASN C Protein S 2 FT #SUB 277 278 TYR D 369 449 PRO C Protein S 4 FT #SUB 277 278 TYR D 375 455 GLU C Protein S 4 FT #SUB 277 278 TYR D 376 456 TYR C Protein B 2 FT #SUB 278 279 GLU D 286 366 ARG C Protein B 2 FT #SUB 280 281 TYR D 286 366 ARG C Protein B 4 FT #SUB 281 282 HIS D 286 366 ARG C Protein B 1 FT #SUB 281 282 HIS D 295 375 PHE C Protein S 3 FT #SUB 282 283 ALA D 286 366 ARG C Protein A 3 FT #SUB 282 283 ALA D 295 375 PHE C Protein A 4 FT #SUB 289 290 THR D 369 449 PRO C Protein S 1 FT #SUB 292 293 HIS D 369 449 PRO C Protein S 1 FT #SUB 292 293 HIS D 370 450 TYR C Protein S 3 FT #SUB 305 306 GLN D 225 305 VAL C Protein S 3 FT #SUB 305 306 GLN D 264 344 GLN C Protein S 3 FT #SUB 309 310 LEU D 295 375 PHE C Protein S 1 FT #SUB 319 320 GLU D 184 264 ILE C Protein S 1 FT #SUB 319 320 GLU D 227 307 HIS C Protein S 6 FT #SUB 342 343 VAL D 140 220 LYS C Protein B 1 FT #SUB 343 344 LEU D 184 264 ILE C Protein S 1 FT #SUB 343 344 LEU D 186 266 ARG C Protein B 3 FT #SUB 345 346 GLU D 136 216 ASP C Protein S 3 FT #SUB 348 349 LYS D 298 378 HIS C Protein S 2 FT #SUB 350 351 ASP D 309 389 GLN C Protein B 1 FT #SUB 352 353 VAL D 308 388 PRO C Protein B 1 FT #SUB 352 353 VAL D 309 389 GLN C Protein S 2 FT #SUB 352 353 VAL D 312 392 LEU C Protein S 1 FT #SUB 353 354 PHE D 294 374 THR C Protein B 1 FT #SUB 353 354 PHE D 295 375 PHE C Protein A 6 FT #SUB 353 354 PHE D 296 376 VAL C Protein A 4 FT #SUB 353 354 PHE D 312 392 LEU C Protein B 3 FT #SUB 354 355 SER D 294 374 THR C Protein A 3 FT #SUB 354 355 SER D 312 392 LEU C Protein B 3 FT #SUB 355 356 MET D 292 372 ILE C Protein S 1 FT #SUB 355 356 MET D 293 373 PHE C Protein S 2 FT #SUB 355 356 MET D 294 374 THR C Protein A 4 FT #SUB 355 356 MET D 313 393 GLY C Protein S 2 FT #SUB 358 359 THR D 312 392 LEU C Protein S 2 FT DISORDER 1 23 FT DISORDER 75 85 FT DISORDER 149 162 FT DISORDER 177 178 FT DISORDER 182 183 FT DISORDER 201 218 CC SEQUENCE 290 AA (ATOM); CC CRRSKETEVF YELAHELPLP HSVSSHLDKA SIMRLAISFL RTHKLLSSVC SMDNLYLKAL CC EGFIAVVTQD GDMIFLSENI SKFMGLTQVE LTGHSIFDFT HPCDHEEIRE NLTLTERDFF CC MRMKCTVTGR TLKSATWKVL HCTGQVRVSC LIIMCEPIQH PSHMDIPLDS KTFLSRHSMD CC MKFTYCDDRI LELIGYHPEE LLGRSAYEFY HALDSENMTK SHQNLCTKGQ VVSGQYRMLA CC KHGGYVWLET QGTVIYNPRN LQPQCIMCVN YVLSEIEKND VVFSMDQTES CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MADKEKKRSSSELRKEKSRDAARCRRSKETEVFYELAHELPLPHSVSSHL CC ATOM -----------------------CRRSKETEVFYELAHELPLPHSVSSHL CC *************************** CC SEQRES DKASIMRLAISFLRTHKLLSSVCSENESEAEADQQMDNLYLKALEGFIAV CC ATOM DKASIMRLAISFLRTHKLLSSVCS-----------MDNLYLKALEGFIAV CC ************************ *************** CC SEQRES VTQDGDMIFLSENISKFMGLTQVELTGHSIFDFTHPCDHEEIRENLTLKN CC ATOM VTQDGDMIFLSENISKFMGLTQVELTGHSIFDFTHPCDHEEIRENLTL-- CC ************************************************ CC SEQRES GSGFGKKSKDVSTERDFFMRMKCTVTNRGRTVNLKSATWKVLHCTGQVRV CC ATOM ------------TERDFFMRMKCTVT--GRT--LKSATWKVLHCTGQVRV CC ************** *** ***************** CC SEQRES YNNCPPHSSLCGSKEPLLSCLIIMCEPIQHPSHMDIPLDSKTFLSRHSMD CC ATOM ------------------SCLIIMCEPIQHPSHMDIPLDSKTFLSRHSMD CC ******************************** CC SEQRES MKFTYCDDRILELIGYHPEELLGRSAYEFYHALDSENMTKSHQNLCTKGQ CC ATOM MKFTYCDDRILELIGYHPEELLGRSAYEFYHALDSENMTKSHQNLCTKGQ CC ************************************************** CC SEQRES VVSGQYRMLAKHGGYVWLETQGTVIYNPRNLQPQCIMCVNYVLSEIEKND CC ATOM VVSGQYRMLAKHGGYVWLETQGTVIYNPRNLQPQCIMCVNYVLSEIEKND CC ************************************************** CC SEQRES VVFSMDQTES CC ATOM VVFSMDQTES CC ********** SQ SEQUENCE 360 AA; MW; CN; MADKEKKRSS SELRKEKSRD AARCRRSKET EVFYELAHEL PLPHSVSSHL DKASIMRLAI SFLRTHKLLS SVCSENESEA EADQQMDNLY LKALEGFIAV VTQDGDMIFL SENISKFMGL TQVELTGHSI FDFTHPCDHE EIRENLTLKN GSGFGKKSKD VSTERDFFMR MKCTVTNRGR TVNLKSATWK VLHCTGQVRV YNNCPPHSSL CGSKEPLLSC LIIMCEPIQH PSHMDIPLDS KTFLSRHSMD MKFTYCDDRI LELIGYHPEE LLGRSAYEFY HALDSENMTK SHQNLCTKGQ VVSGQYRMLA KHGGYVWLET QGTVIYNPRN LQPQCIMCVN YVLSEIEKND VVFSMDQTES //