ID 2WVTA STANDARD; PRT; 443 AA. DT CONVERTED FROM PDB (SEQRES) 2WVT DE ALPHA-L-FUCOSIDASE OS BACTEROIDES THETAIOTAOMICRON CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.800 CC R-Factor 0.209 FT #SUB 266 296 LYS A 265 295 VAL B Protein S 1 FT #SUB 283 313 GLU A 415 445 THR B Protein S 2 FT #SUB 283 313 GLU A 416 446 THR B Protein S 4 FT #SUB 296 326 TYR A 387 417 ARG B Protein S 2 FT #SUB 296 326 TYR A 413 443 VAL B Protein S 2 FT #SUB 296 326 TYR A 414 444 GLU B Protein S 1 FT #SUB 296 326 TYR A 415 445 THR B Protein S 5 FT #SUB 296 326 TYR A 421 451 ASN B Protein S 4 FT #SUB 299 329 THR A 387 417 ARG B Protein S 4 FT #SUB 301 331 ILE A 384 414 TYR B Protein S 2 FT #SUB 301 331 ILE A 385 415 SER B Protein S 3 FT #SUB 302 332 GLU A 387 417 ARG B Protein S 1 FT #SUB 302 332 GLU A 421 451 ASN B Protein S 4 FT #SUB 361 391 LYS A 363 393 ASP B Protein S 1 FT #SUB 363 393 ASP A 361 391 LYS B Protein S 1 FT #SUB 363 393 ASP A 363 393 ASP B Protein A 5 FT #SUB 363 393 ASP A 381 411 ASN B Protein S 1 FT #SUB 379 409 VAL A 384 414 TYR B Protein B 2 FT #SUB 380 410 PHE A 382 412 GLN B Protein B 1 FT #SUB 380 410 PHE A 383 413 PRO B Protein B 2 FT #SUB 380 410 PHE A 384 414 TYR B Protein B 5 FT #SUB 381 411 ASN A 363 393 ASP B Protein S 1 FT #SUB 381 411 ASN A 381 411 ASN B Protein S 1 FT #SUB 381 411 ASN A 382 412 GLN B Protein B 3 FT #SUB 381 411 ASN A 383 413 PRO B Protein S 3 FT #SUB 382 412 GLN A 380 410 PHE B Protein B 1 FT #SUB 382 412 GLN A 381 411 ASN B Protein B 2 FT #SUB 382 412 GLN A 382 412 GLN B Protein A 10 FT #SUB 382 412 GLN A 384 414 TYR B Protein S 6 FT #SUB 383 413 PRO A 380 410 PHE B Protein B 2 FT #SUB 383 413 PRO A 381 411 ASN B Protein A 2 FT #SUB 384 414 TYR A 301 331 ILE B Protein B 2 FT #SUB 384 414 TYR A 380 410 PHE B Protein A 4 FT #SUB 384 414 TYR A 382 412 GLN B Protein S 5 FT #SUB 384 414 TYR A 428 458 ASN B Protein S 5 FT #SUB 384 414 TYR A 430 460 GLY B Protein S 5 FT #SUB 384 414 TYR A 431 461 GLU B Protein S 12 FT #SUB 384 414 TYR A 432 462 PRO B Protein S 4 FT #SUB 384 414 TYR A 433 463 TYR B Protein S 3 FT #SUB 385 415 SER A 301 331 ILE B Protein A 3 FT #SUB 387 417 ARG A 296 326 TYR B Protein S 2 FT #SUB 387 417 ARG A 299 329 THR B Protein S 4 FT #SUB 387 417 ARG A 302 332 GLU B Protein S 1 FT #SUB 413 443 VAL A 296 326 TYR B Protein S 2 FT #SUB 414 444 GLU A 296 326 TYR B Protein B 2 FT #SUB 415 445 THR A 283 313 GLU B Protein B 2 FT #SUB 415 445 THR A 296 326 TYR B Protein A 4 FT #SUB 416 446 THR A 283 313 GLU B Protein A 3 FT #SUB 421 451 ASN A 296 326 TYR B Protein S 2 FT #SUB 421 451 ASN A 302 332 GLU B Protein S 4 FT #SUB 428 458 ASN A 384 414 TYR B Protein S 5 FT #SUB 430 460 GLY A 384 414 TYR B Protein B 4 FT #SUB 431 461 GLU A 384 414 TYR B Protein B 13 FT #SUB 432 462 PRO A 384 414 TYR B Protein A 7 FT #SUB 433 463 TYR A 384 414 TYR B Protein S 4 FT #HET 36 66 HIS A 1 1473 FHN A S 4 FT #HET 57 87 GLU A 1 1473 FHN A S 4 FT #HET 58 88 TRP A 1 1473 FHN A S 7 FT #HET 58 88 TRP A 4 1476 IMD A S 2 FT #HET 105 135 HIS A 1 1473 FHN A S 7 FT #HET 106 136 HIS A 1 1473 FHN A S 5 FT #HET 148 178 TYR A 1 1473 FHN A S 2 FT #HET 197 227 TRP A 1 1473 FHN A S 1 FT #HET 199 229 ASP A 1 1473 FHN A S 10 FT #HET 202 232 TRP A 1 1473 FHN A S 1 FT #HET 202 232 TRP A 2 1474 GOL A S 1 FT #HET 202 232 TRP A 4 1476 IMD A S 1 FT #HET 232 262 ARG A 1 1473 FHN A S 2 FT #HET 232 262 ARG A 2 1474 GOL A S 9 FT #HET 241 271 ARG A 2 1474 GOL A S 5 FT #HET 242 272 HIS A 2 1474 GOL A S 6 FT #HET 254 284 GLU A 3 1475 GOL A S 5 FT #HET 256 286 GLY A 2 1474 GOL A B 1 FT #HET 256 286 GLY A 3 1475 GOL A B 3 FT #HET 257 287 TYR A 3 1475 GOL A A 14 FT #HET 258 288 GLU A 1 1473 FHN A S 13 FT #HET 260 290 ARG A 3 1475 GOL A S 1 FT #HET 278 308 CYS A 1 1473 FHN A S 1 FT #HET 286 316 TRP A 1 1473 FHN A S 17 FT #HET 342 372 LYS A 5 1477 IMD A S 1 FT #HET 346 376 ARG A 5 1477 IMD A S 1 FT #HET 363 393 ASP A 8 1476 GOL B S 4 FT #HET 363 393 ASP A 9 1477 GOL B S 11 FT DISORDER 1 4 FT DISORDER 443 443 CC SEQUENCE 438 AA (ATOM); CC EIPLKYGATN EGKRQDPAMQ KFRDNRLGAF IHWGLYAIPG GEWNGKVYGG AAEWLKSWAK CC VPADEWLKLM DQWNPTKFDA KKWAKMAKEM GTKYVKITTK HHEGFCLWPS KYTKYTVANT CC PYKRDILGEL VKAYNDEGID VHFYFSVMDW SNPDYRYDIK SKEDSIAFSR FLEFTDNQLK CC ELATRYPTVK DFWFDGTWDA SVKKNGWWTA HAEQMLKELV PGVAINSRLR ADDKGKRHFD CC SNGRLMGDYE SGYERRLPDP VKDLKVTQWD WEACMTIPEN QWGYHKDWSL SYVKTPIEVI CC DRIVHAVSMG GNMVVNFGPQ ADGDFRPEEK AMATAIGKWM NRYGKAVYAC DYAGFEKQDW CC GYYTRGKNDE VYMVVFNQPY SERLIVKTPK GITVEKATLL TTGEDITVVE TTRNEYNVSV CC PKKNPGEPYV IQLKVRAA CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES EAKKEIPLKYGATNEGKRQDPAMQKFRDNRLGAFIHWGLYAIPGGEWNGK CC ATOM ----EIPLKYGATNEGKRQDPAMQKFRDNRLGAFIHWGLYAIPGGEWNGK CC ********************************************** CC SEQRES VYGGAAEWLKSWAKVPADEWLKLMDQWNPTKFDAKKWAKMAKEMGTKYVK CC ATOM VYGGAAEWLKSWAKVPADEWLKLMDQWNPTKFDAKKWAKMAKEMGTKYVK CC ************************************************** CC SEQRES ITTKHHEGFCLWPSKYTKYTVANTPYKRDILGELVKAYNDEGIDVHFYFS CC ATOM ITTKHHEGFCLWPSKYTKYTVANTPYKRDILGELVKAYNDEGIDVHFYFS CC ************************************************** CC SEQRES VMDWSNPDYRYDIKSKEDSIAFSRFLEFTDNQLKELATRYPTVKDFWFDG CC ATOM VMDWSNPDYRYDIKSKEDSIAFSRFLEFTDNQLKELATRYPTVKDFWFDG CC ************************************************** CC SEQRES TWDASVKKNGWWTAHAEQMLKELVPGVAINSRLRADDKGKRHFDSNGRLM CC ATOM TWDASVKKNGWWTAHAEQMLKELVPGVAINSRLRADDKGKRHFDSNGRLM CC ************************************************** CC SEQRES GDYESGYERRLPDPVKDLKVTQWDWEACMTIPENQWGYHKDWSLSYVKTP CC ATOM GDYESGYERRLPDPVKDLKVTQWDWEACMTIPENQWGYHKDWSLSYVKTP CC ************************************************** CC SEQRES IEVIDRIVHAVSMGGNMVVNFGPQADGDFRPEEKAMATAIGKWMNRYGKA CC ATOM IEVIDRIVHAVSMGGNMVVNFGPQADGDFRPEEKAMATAIGKWMNRYGKA CC ************************************************** CC SEQRES VYACDYAGFEKQDWGYYTRGKNDEVYMVVFNQPYSERLIVKTPKGITVEK CC ATOM VYACDYAGFEKQDWGYYTRGKNDEVYMVVFNQPYSERLIVKTPKGITVEK CC ************************************************** CC SEQRES ATLLTTGEDITVVETTRNEYNVSVPKKNPGEPYVIQLKVRAAK CC ATOM ATLLTTGEDITVVETTRNEYNVSVPKKNPGEPYVIQLKVRAA- CC ****************************************** SQ SEQUENCE 443 AA; MW; CN; EAKKEIPLKY GATNEGKRQD PAMQKFRDNR LGAFIHWGLY AIPGGEWNGK VYGGAAEWLK SWAKVPADEW LKLMDQWNPT KFDAKKWAKM AKEMGTKYVK ITTKHHEGFC LWPSKYTKYT VANTPYKRDI LGELVKAYND EGIDVHFYFS VMDWSNPDYR YDIKSKEDSI AFSRFLEFTD NQLKELATRY PTVKDFWFDG TWDASVKKNG WWTAHAEQML KELVPGVAIN SRLRADDKGK RHFDSNGRLM GDYESGYERR LPDPVKDLKV TQWDWEACMT IPENQWGYHK DWSLSYVKTP IEVIDRIVHA VSMGGNMVVN FGPQADGDFR PEEKAMATAI GKWMNRYGKA VYACDYAGFE KQDWGYYTRG KNDEVYMVVF NQPYSERLIV KTPKGITVEK ATLLTTGEDI TVVETTRNEY NVSVPKKNPG EPYVIQLKVR AAK // ID 2WVTB STANDARD; PRT; 443 AA. DT CONVERTED FROM PDB (SEQRES) 2WVT DE ALPHA-L-FUCOSIDASE OS BACTEROIDES THETAIOTAOMICRON CC EXPDTA X-RAY DIFFRACTION CC RESOLU 1.800 CC R-Factor 0.209 FT #SUB 265 295 VAL B 266 296 LYS A Protein S 1 FT #SUB 283 313 GLU B 415 445 THR A Protein S 2 FT #SUB 283 313 GLU B 416 446 THR A Protein S 3 FT #SUB 296 326 TYR B 387 417 ARG A Protein S 2 FT #SUB 296 326 TYR B 413 443 VAL A Protein S 2 FT #SUB 296 326 TYR B 414 444 GLU A Protein S 2 FT #SUB 296 326 TYR B 415 445 THR A Protein S 4 FT #SUB 296 326 TYR B 421 451 ASN A Protein S 2 FT #SUB 299 329 THR B 387 417 ARG A Protein S 4 FT #SUB 301 331 ILE B 384 414 TYR A Protein S 2 FT #SUB 301 331 ILE B 385 415 SER A Protein S 3 FT #SUB 302 332 GLU B 387 417 ARG A Protein S 1 FT #SUB 302 332 GLU B 421 451 ASN A Protein S 4 FT #SUB 361 391 LYS B 363 393 ASP A Protein S 1 FT #SUB 363 393 ASP B 361 391 LYS A Protein S 1 FT #SUB 363 393 ASP B 363 393 ASP A Protein A 5 FT #SUB 363 393 ASP B 381 411 ASN A Protein S 1 FT #SUB 380 410 PHE B 382 412 GLN A Protein B 1 FT #SUB 380 410 PHE B 383 413 PRO A Protein B 2 FT #SUB 380 410 PHE B 384 414 TYR A Protein B 4 FT #SUB 381 411 ASN B 363 393 ASP A Protein S 1 FT #SUB 381 411 ASN B 381 411 ASN A Protein S 1 FT #SUB 381 411 ASN B 382 412 GLN A Protein B 2 FT #SUB 381 411 ASN B 383 413 PRO A Protein S 2 FT #SUB 382 412 GLN B 380 410 PHE A Protein B 1 FT #SUB 382 412 GLN B 381 411 ASN A Protein B 3 FT #SUB 382 412 GLN B 382 412 GLN A Protein A 10 FT #SUB 382 412 GLN B 384 414 TYR A Protein S 5 FT #SUB 383 413 PRO B 380 410 PHE A Protein B 2 FT #SUB 383 413 PRO B 381 411 ASN A Protein A 3 FT #SUB 384 414 TYR B 301 331 ILE A Protein B 2 FT #SUB 384 414 TYR B 379 409 VAL A Protein S 2 FT #SUB 384 414 TYR B 380 410 PHE A Protein A 5 FT #SUB 384 414 TYR B 382 412 GLN A Protein S 6 FT #SUB 384 414 TYR B 428 458 ASN A Protein S 5 FT #SUB 384 414 TYR B 430 460 GLY A Protein S 4 FT #SUB 384 414 TYR B 431 461 GLU A Protein S 13 FT #SUB 384 414 TYR B 432 462 PRO A Protein S 7 FT #SUB 384 414 TYR B 433 463 TYR A Protein S 4 FT #SUB 385 415 SER B 301 331 ILE A Protein A 3 FT #SUB 387 417 ARG B 296 326 TYR A Protein S 2 FT #SUB 387 417 ARG B 299 329 THR A Protein S 4 FT #SUB 387 417 ARG B 302 332 GLU A Protein S 1 FT #SUB 413 443 VAL B 296 326 TYR A Protein S 2 FT #SUB 414 444 GLU B 296 326 TYR A Protein B 1 FT #SUB 415 445 THR B 283 313 GLU A Protein B 2 FT #SUB 415 445 THR B 296 326 TYR A Protein A 5 FT #SUB 416 446 THR B 283 313 GLU A Protein A 4 FT #SUB 421 451 ASN B 296 326 TYR A Protein S 4 FT #SUB 421 451 ASN B 302 332 GLU A Protein S 4 FT #SUB 428 458 ASN B 384 414 TYR A Protein S 5 FT #SUB 430 460 GLY B 384 414 TYR A Protein B 5 FT #SUB 431 461 GLU B 384 414 TYR A Protein B 12 FT #SUB 432 462 PRO B 384 414 TYR A Protein A 4 FT #SUB 433 463 TYR B 384 414 TYR A Protein S 3 FT #HET 36 66 HIS B 6 1474 FHN B S 4 FT #HET 55 85 ALA B 7 1475 GOL B S 2 FT #HET 57 87 GLU B 6 1474 FHN B S 4 FT #HET 57 87 GLU B 7 1475 GOL B S 1 FT #HET 58 88 TRP B 6 1474 FHN B S 7 FT #HET 58 88 TRP B 7 1475 GOL B S 8 FT #HET 60 90 LYS B 11 1479 IMD B S 1 FT #HET 61 91 SER B 11 1479 IMD B A 6 FT #HET 62 92 TRP B 7 1475 GOL B S 1 FT #HET 105 135 HIS B 6 1474 FHN B S 7 FT #HET 106 136 HIS B 6 1474 FHN B S 4 FT #HET 148 178 TYR B 6 1474 FHN B S 3 FT #HET 154 184 TRP B 11 1479 IMD B S 7 FT #HET 161 191 TYR B 11 1479 IMD B S 2 FT #HET 197 227 TRP B 6 1474 FHN B S 1 FT #HET 199 229 ASP B 6 1474 FHN B S 10 FT #HET 202 232 TRP B 6 1474 FHN B S 1 FT #HET 202 232 TRP B 10 1478 IMD B S 6 FT #HET 232 262 ARG B 6 1474 FHN B S 2 FT #HET 258 288 GLU B 6 1474 FHN B S 13 FT #HET 265 295 VAL B 8 1476 GOL B S 2 FT #HET 278 308 CYS B 6 1474 FHN B S 1 FT #HET 284 314 ASN B 7 1475 GOL B S 9 FT #HET 286 316 TRP B 6 1474 FHN B S 16 FT #HET 286 316 TRP B 7 1475 GOL B S 2 FT #HET 305 335 ASP B 9 1477 GOL B S 6 FT #HET 361 391 LYS B 8 1476 GOL B A 13 FT #HET 361 391 LYS B 9 1477 GOL B S 2 FT #HET 362 392 GLN B 8 1476 GOL B B 6 FT #HET 363 393 ASP B 8 1476 GOL B A 8 FT #HET 365 395 GLY B 9 1477 GOL B B 2 FT #HET 366 396 TYR B 9 1477 GOL B S 10 FT #HET 381 411 ASN B 9 1477 GOL B S 6 FT DISORDER 1 4 CC SEQUENCE 439 AA (ATOM); CC EIPLKYGATN EGKRQDPAMQ KFRDNRLGAF IHWGLYAIPG GEWNGKVYGG AAEWLKSWAK CC VPADEWLKLM DQWNPTKFDA KKWAKMAKEM GTKYVKITTK HHEGFCLWPS KYTKYTVANT CC PYKRDILGEL VKAYNDEGID VHFYFSVMDW SNPDYRYDIK SKEDSIAFSR FLEFTDNQLK CC ELATRYPTVK DFWFDGTWDA SVKKNGWWTA HAEQMLKELV PGVAINSRLR ADDKGKRHFD CC SNGRLMGDYE SGYERRLPDP VKDLKVTQWD WEACMTIPEN QWGYHKDWSL SYVKTPIEVI CC DRIVHAVSMG GNMVVNFGPQ ADGDFRPEEK AMATAIGKWM NRYGKAVYAC DYAGFEKQDW CC GYYTRGKNDE VYMVVFNQPY SERLIVKTPK GITVEKATLL TTGEDITVVE TTRNEYNVSV CC PKKNPGEPYV IQLKVRAAK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES EAKKEIPLKYGATNEGKRQDPAMQKFRDNRLGAFIHWGLYAIPGGEWNGK CC ATOM ----EIPLKYGATNEGKRQDPAMQKFRDNRLGAFIHWGLYAIPGGEWNGK CC ********************************************** CC SEQRES VYGGAAEWLKSWAKVPADEWLKLMDQWNPTKFDAKKWAKMAKEMGTKYVK CC ATOM VYGGAAEWLKSWAKVPADEWLKLMDQWNPTKFDAKKWAKMAKEMGTKYVK CC ************************************************** CC SEQRES ITTKHHEGFCLWPSKYTKYTVANTPYKRDILGELVKAYNDEGIDVHFYFS CC ATOM ITTKHHEGFCLWPSKYTKYTVANTPYKRDILGELVKAYNDEGIDVHFYFS CC ************************************************** CC SEQRES VMDWSNPDYRYDIKSKEDSIAFSRFLEFTDNQLKELATRYPTVKDFWFDG CC ATOM VMDWSNPDYRYDIKSKEDSIAFSRFLEFTDNQLKELATRYPTVKDFWFDG CC ************************************************** CC SEQRES TWDASVKKNGWWTAHAEQMLKELVPGVAINSRLRADDKGKRHFDSNGRLM CC ATOM TWDASVKKNGWWTAHAEQMLKELVPGVAINSRLRADDKGKRHFDSNGRLM CC ************************************************** CC SEQRES GDYESGYERRLPDPVKDLKVTQWDWEACMTIPENQWGYHKDWSLSYVKTP CC ATOM GDYESGYERRLPDPVKDLKVTQWDWEACMTIPENQWGYHKDWSLSYVKTP CC ************************************************** CC SEQRES IEVIDRIVHAVSMGGNMVVNFGPQADGDFRPEEKAMATAIGKWMNRYGKA CC ATOM IEVIDRIVHAVSMGGNMVVNFGPQADGDFRPEEKAMATAIGKWMNRYGKA CC ************************************************** CC SEQRES VYACDYAGFEKQDWGYYTRGKNDEVYMVVFNQPYSERLIVKTPKGITVEK CC ATOM VYACDYAGFEKQDWGYYTRGKNDEVYMVVFNQPYSERLIVKTPKGITVEK CC ************************************************** CC SEQRES ATLLTTGEDITVVETTRNEYNVSVPKKNPGEPYVIQLKVRAAK CC ATOM ATLLTTGEDITVVETTRNEYNVSVPKKNPGEPYVIQLKVRAAK CC ******************************************* SQ SEQUENCE 443 AA; MW; CN; EAKKEIPLKY GATNEGKRQD PAMQKFRDNR LGAFIHWGLY AIPGGEWNGK VYGGAAEWLK SWAKVPADEW LKLMDQWNPT KFDAKKWAKM AKEMGTKYVK ITTKHHEGFC LWPSKYTKYT VANTPYKRDI LGELVKAYND EGIDVHFYFS VMDWSNPDYR YDIKSKEDSI AFSRFLEFTD NQLKELATRY PTVKDFWFDG TWDASVKKNG WWTAHAEQML KELVPGVAIN SRLRADDKGK RHFDSNGRLM GDYESGYERR LPDPVKDLKV TQWDWEACMT IPENQWGYHK DWSLSYVKTP IEVIDRIVHA VSMGGNMVVN FGPQADGDFR PEEKAMATAI GKWMNRYGKA VYACDYAGFE KQDWGYYTRG KNDEVYMVVF NQPYSERLIV KTPKGITVEK ATLLTTGEDI TVVETTRNEY NVSVPKKNPG EPYVIQLKVR AAK //