ID 4MIVA STANDARD; PRT; 510 AA. DT CONVERTED FROM PDB (SEQRES) 4MIV DE N-sulphoglucosamine sulphohydrolase OS Homo sapiens CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.400 CC R-Factor 0.217 FT #SUB 89 89 TYR A 95 95 VAL B Protein S 4 FT #SUB 89 89 TYR A 370 370 VAL B Protein S 2 FT #SUB 94 94 ASP A 101 101 PHE B Protein B 1 FT #SUB 94 94 ASP A 336 336 PHE B Protein A 7 FT #SUB 95 95 VAL A 89 89 TYR B Protein A 5 FT #SUB 95 95 VAL A 101 101 PHE B Protein B 2 FT #SUB 95 95 VAL A 336 336 PHE B Protein A 2 FT #SUB 97 97 HIS A 99 99 ASN B Protein S 2 FT #SUB 97 97 HIS A 100 100 SER B Protein S 2 FT #SUB 97 97 HIS A 101 101 PHE B Protein S 6 FT #SUB 99 99 ASN A 97 97 HIS B Protein B 2 FT #SUB 100 100 SER A 97 97 HIS B Protein B 1 FT #SUB 101 101 PHE A 94 94 ASP B Protein S 1 FT #SUB 101 101 PHE A 97 97 HIS B Protein S 6 FT #SUB 185 185 HIS A 488 488 GLU B Protein S 2 FT #SUB 336 336 PHE A 94 94 ASP B Protein S 7 FT #SUB 336 336 PHE A 95 95 VAL B Protein S 2 FT #SUB 367 367 HIS A 367 367 HIS B Protein S 4 FT #SUB 370 370 VAL A 89 89 TYR B Protein S 2 FT #SUB 370 370 VAL A 479 479 TRP B Protein A 2 FT #SUB 371 371 THR A 478 478 PRO B Protein S 3 FT #SUB 371 371 THR A 479 479 TRP B Protein A 7 FT #SUB 373 373 SER A 373 373 SER B Protein S 4 FT #SUB 390 390 LEU A 394 394 MET B Protein B 1 FT #SUB 391 391 ASN A 394 394 MET B Protein A 4 FT #SUB 393 393 LYS A 393 393 LYS B Protein A 4 FT #SUB 393 393 LYS A 394 394 MET B Protein A 4 FT #SUB 393 393 LYS A 432 432 TYR B Protein S 3 FT #SUB 394 394 MET A 390 390 LEU B Protein S 1 FT #SUB 394 394 MET A 391 391 ASN B Protein S 8 FT #SUB 394 394 MET A 393 393 LYS B Protein S 4 FT #SUB 394 394 MET A 483 483 PRO B Protein S 1 FT #SUB 394 394 MET A 502 502 LEU B Protein S 1 FT #SUB 395 395 PRO A 500 500 ASN B Protein S 3 FT #SUB 395 395 PRO A 502 502 LEU B Protein S 1 FT #SUB 398 398 ILE A 486 486 VAL B Protein B 1 FT #SUB 398 398 ILE A 498 498 LEU B Protein S 2 FT #SUB 400 400 GLN A 486 486 VAL B Protein A 2 FT #SUB 400 400 GLN A 487 487 LEU B Protein S 4 FT #SUB 400 400 GLN A 488 488 GLU B Protein A 6 FT #SUB 403 403 TYR A 486 486 VAL B Protein S 1 FT #SUB 403 403 TYR A 488 488 GLU B Protein S 2 FT #SUB 403 403 TYR A 496 496 GLN B Protein S 8 FT #SUB 403 403 TYR A 497 497 PRO B Protein S 6 FT #SUB 403 403 TYR A 498 498 LEU B Protein S 2 FT #SUB 404 404 VAL A 488 488 GLU B Protein S 3 FT #SUB 412 412 LEU A 497 497 PRO B Protein S 1 FT #SUB 412 412 LEU A 498 498 LEU B Protein S 1 FT #SUB 416 416 THR A 499 499 HIS B Protein S 2 FT #SUB 431 431 TYR A 498 498 LEU B Protein S 6 FT #SUB 431 431 TYR A 499 499 HIS B Protein S 1 FT #SUB 431 431 TYR A 500 500 ASN B Protein S 4 FT #SUB 432 432 TYR A 393 393 LYS B Protein S 1 FT #SUB 432 432 TYR A 500 500 ASN B Protein S 4 FT #SUB 432 432 TYR A 501 501 GLU B Protein S 2 FT #SUB 432 432 TYR A 502 502 LEU B Protein S 1 FT #SUB 478 478 PRO A 95 95 VAL B Protein S 2 FT #SUB 478 478 PRO A 371 371 THR B Protein A 3 FT #SUB 479 479 TRP A 370 370 VAL B Protein S 2 FT #SUB 479 479 TRP A 371 371 THR B Protein S 7 FT #SUB 483 483 PRO A 394 394 MET B Protein S 1 FT #SUB 486 486 VAL A 398 398 ILE B Protein S 1 FT #SUB 486 486 VAL A 400 400 GLN B Protein S 2 FT #SUB 487 487 LEU A 400 400 GLN B Protein A 6 FT #SUB 488 488 GLU A 185 185 HIS B Protein S 2 FT #SUB 488 488 GLU A 400 400 GLN B Protein S 6 FT #SUB 488 488 GLU A 403 403 TYR B Protein S 2 FT #SUB 488 488 GLU A 404 404 VAL B Protein S 3 FT #SUB 490 490 LYS A 185 185 HIS B Protein S 4 FT #SUB 496 496 GLN A 403 403 TYR B Protein A 7 FT #SUB 497 497 PRO A 403 403 TYR B Protein A 6 FT #SUB 497 497 PRO A 412 412 LEU B Protein B 1 FT #SUB 498 498 LEU A 398 398 ILE B Protein S 2 FT #SUB 498 498 LEU A 403 403 TYR B Protein S 2 FT #SUB 498 498 LEU A 412 412 LEU B Protein B 1 FT #SUB 498 498 LEU A 431 431 TYR B Protein A 6 FT #SUB 499 499 HIS A 412 412 LEU B Protein S 2 FT #SUB 499 499 HIS A 416 416 THR B Protein S 4 FT #SUB 499 499 HIS A 431 431 TYR B Protein B 3 FT #SUB 500 500 ASN A 395 395 PRO B Protein S 3 FT #SUB 500 500 ASN A 431 431 TYR B Protein A 5 FT #SUB 500 500 ASN A 432 432 TYR B Protein B 4 FT #SUB 501 501 GLU A 428 428 ARG B Protein S 1 FT #SUB 501 501 GLU A 432 432 TYR B Protein S 3 FT #SUB 502 502 LEU A 395 395 PRO B Protein S 1 FT #SUB 502 502 LEU A 432 432 TYR B Protein S 2 FT #HET 31 31 ASP A 39 601 CA A S 2 FT #HET 32 32 ASP A 39 601 CA A A 3 FT #HET 39 39 ALA A 1 1 NAG I B 1 FT #HET 59 59 LEU A 1 1 NAG I S 2 FT #HET 61 61 ARG A 1 1 NAG I S 1 FT #HET 70 70 FGP A 39 601 CA A A 5 FT #HET 97 97 HIS A 41 609 CL A S 2 FT #HET 147 147 GLN A 3 1 NAG J S 6 FT #HET 154 154 ARG A 3 1 NAG J S 4 FT #HET 157 157 LEU A 4 2 NAG J S 1 FT #HET 197 197 PHE A 3 1 NAG J S 4 FT #HET 198 198 GLY A 3 1 NAG J B 1 FT #HET 204 204 MET A 3 1 NAG J B 1 FT #HET 205 205 GLY A 3 1 NAG J B 2 FT #HET 206 206 ARG A 3 1 NAG J B 8 FT #HET 207 207 ILE A 3 1 NAG J S 1 FT #HET 273 273 ASP A 39 601 CA A S 3 FT #HET 274 274 ASN A 39 601 CA A S 3 FT #HET 299 299 PRO A 5 1 NAG K B 2 FT #HET 300 300 GLU A 6 2 NAG K B 1 FT #HET 302 302 PRO A 6 2 NAG K S 2 FT #HET 406 406 PRO A 40 608 NAG A B 1 FT #HET 409 409 GLN A 40 608 NAG A A 2 FT #HET 410 410 ASP A 40 608 NAG A B 2 FT #MOD 41 41 ASN A 1 1 NAG I S FT #MOD 151 151 ASN A 3 1 NAG J S FT #MOD 264 264 ASN A 5 1 NAG K S FT #MOD 413 413 ASN A 40 608 NAG A S FT DISORDER 1 21 FT DISORDER 506 510 CC SEQUENCE 484 AA (ATOM); CC PRNALLLLAD DGGFESGAYN NSAIATPHLD ALARRSLLFR NAFTSVSSXS PSRASLLTGL CC PQHQNGMYGL HQDVHHFNSF DKVRSLPLLL SQAGVRTGII GKKHVGPETV YPFDFAYTEE CC NGSVLQVGRN ITRIKLLVRK FLQTQDDRPF FLYVAFHDPH RCGHSQPQYG TFCEKFGNGE CC SGMGRIPDWT PQAYDPLDVL VPYFVPNTPA ARADLAAQYT TVGRMDQGVG LVLQELRDAG CC VLNDTLVIFT SDNGIPFPSG RTNLYWPGTA EPLLVSSPEH PKRWGQVSEA YVSLLDLTPT CC ILDWFSIPYP SYAIFGSKTI HLTGRSLLPA LEAEPLWATV FGSQSHHEVT MSYPMRSVQH CC RHFRLVHNLN FKMPFPIDQD FYVSPTFQDL LNRTTAGQPT GWYKDLRHYY YRARWELYDR CC SRDPHETQNL ATDPRFAQLL EMLRDQLAKW QWETHDPWVC APDGVLEEKL SPQCQPLHNE CC LRSH CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSAIATPHL CC ATOM ---------------------PRNALLLLADDGGFESGAYNNSAIATPHL CC ***************************** CC SEQRES DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ATOM DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ************************************************** CC SEQRES FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ATOM FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ************************************************** CC SEQRES NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ATOM NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ************************************************** CC SEQRES ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ATOM ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ************************************************** CC SEQRES GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ATOM GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ************************************************** CC SEQRES HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ATOM HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ************************************************** CC SEQRES ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ATOM ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ************************************************** CC SEQRES DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ATOM DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ************************************************** CC SEQRES LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ATOM LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ************************************************** CC SEQRES ELRSHHHHHH CC ATOM ELRSH----- CC ***** SQ SEQUENCE 510 AA; MW; CN; MSCPVPACCA LLLVLGLCRA RPRNALLLLA DDGGFESGAY NNSAIATPHL DALARRSLLF RNAFTSVSSX SPSRASLLTG LPQHQNGMYG LHQDVHHFNS FDKVRSLPLL LSQAGVRTGI IGKKHVGPET VYPFDFAYTE ENGSVLQVGR NITRIKLLVR KFLQTQDDRP FFLYVAFHDP HRCGHSQPQY GTFCEKFGNG ESGMGRIPDW TPQAYDPLDV LVPYFVPNTP AARADLAAQY TTVGRMDQGV GLVLQELRDA GVLNDTLVIF TSDNGIPFPS GRTNLYWPGT AEPLLVSSPE HPKRWGQVSE AYVSLLDLTP TILDWFSIPY PSYAIFGSKT IHLTGRSLLP ALEAEPLWAT VFGSQSHHEV TMSYPMRSVQ HRHFRLVHNL NFKMPFPIDQ DFYVSPTFQD LLNRTTAGQP TGWYKDLRHY YYRARWELYD RSRDPHETQN LATDPRFAQL LEMLRDQLAK WQWETHDPWV CAPDGVLEEK LSPQCQPLHN ELRSHHHHHH // ID 4MIVB STANDARD; PRT; 510 AA. DT CONVERTED FROM PDB (SEQRES) 4MIV DE N-sulphoglucosamine sulphohydrolase OS Homo sapiens CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.400 CC R-Factor 0.217 FT #SUB 89 89 TYR B 95 95 VAL A Protein S 5 FT #SUB 89 89 TYR B 370 370 VAL A Protein S 2 FT #SUB 94 94 ASP B 101 101 PHE A Protein B 1 FT #SUB 94 94 ASP B 336 336 PHE A Protein A 7 FT #SUB 95 95 VAL B 89 89 TYR A Protein B 4 FT #SUB 95 95 VAL B 336 336 PHE A Protein A 2 FT #SUB 95 95 VAL B 478 478 PRO A Protein S 2 FT #SUB 97 97 HIS B 99 99 ASN A Protein S 2 FT #SUB 97 97 HIS B 100 100 SER A Protein S 1 FT #SUB 97 97 HIS B 101 101 PHE A Protein S 6 FT #SUB 99 99 ASN B 97 97 HIS A Protein B 2 FT #SUB 100 100 SER B 97 97 HIS A Protein B 2 FT #SUB 101 101 PHE B 94 94 ASP A Protein S 1 FT #SUB 101 101 PHE B 95 95 VAL A Protein S 2 FT #SUB 101 101 PHE B 97 97 HIS A Protein S 6 FT #SUB 185 185 HIS B 488 488 GLU A Protein S 2 FT #SUB 185 185 HIS B 490 490 LYS A Protein S 4 FT #SUB 336 336 PHE B 94 94 ASP A Protein S 7 FT #SUB 336 336 PHE B 95 95 VAL A Protein S 2 FT #SUB 367 367 HIS B 367 367 HIS A Protein S 4 FT #SUB 370 370 VAL B 89 89 TYR A Protein S 2 FT #SUB 370 370 VAL B 479 479 TRP A Protein A 2 FT #SUB 371 371 THR B 478 478 PRO A Protein S 3 FT #SUB 371 371 THR B 479 479 TRP A Protein A 7 FT #SUB 373 373 SER B 373 373 SER A Protein S 4 FT #SUB 390 390 LEU B 394 394 MET A Protein B 1 FT #SUB 391 391 ASN B 394 394 MET A Protein A 8 FT #SUB 393 393 LYS B 393 393 LYS A Protein A 4 FT #SUB 393 393 LYS B 394 394 MET A Protein A 4 FT #SUB 393 393 LYS B 432 432 TYR A Protein S 1 FT #SUB 394 394 MET B 390 390 LEU A Protein S 1 FT #SUB 394 394 MET B 391 391 ASN A Protein S 4 FT #SUB 394 394 MET B 393 393 LYS A Protein S 4 FT #SUB 394 394 MET B 483 483 PRO A Protein S 1 FT #SUB 395 395 PRO B 500 500 ASN A Protein S 3 FT #SUB 395 395 PRO B 502 502 LEU A Protein S 1 FT #SUB 398 398 ILE B 486 486 VAL A Protein B 1 FT #SUB 398 398 ILE B 498 498 LEU A Protein S 2 FT #SUB 400 400 GLN B 486 486 VAL A Protein A 2 FT #SUB 400 400 GLN B 487 487 LEU A Protein S 6 FT #SUB 400 400 GLN B 488 488 GLU A Protein A 6 FT #SUB 403 403 TYR B 488 488 GLU A Protein S 2 FT #SUB 403 403 TYR B 496 496 GLN A Protein S 7 FT #SUB 403 403 TYR B 497 497 PRO A Protein S 6 FT #SUB 403 403 TYR B 498 498 LEU A Protein S 2 FT #SUB 404 404 VAL B 488 488 GLU A Protein S 3 FT #SUB 412 412 LEU B 497 497 PRO A Protein S 1 FT #SUB 412 412 LEU B 498 498 LEU A Protein S 1 FT #SUB 412 412 LEU B 499 499 HIS A Protein S 2 FT #SUB 416 416 THR B 499 499 HIS A Protein S 4 FT #SUB 428 428 ARG B 501 501 GLU A Protein S 1 FT #SUB 431 431 TYR B 498 498 LEU A Protein S 6 FT #SUB 431 431 TYR B 499 499 HIS A Protein S 3 FT #SUB 431 431 TYR B 500 500 ASN A Protein S 5 FT #SUB 432 432 TYR B 393 393 LYS A Protein S 3 FT #SUB 432 432 TYR B 500 500 ASN A Protein S 4 FT #SUB 432 432 TYR B 501 501 GLU A Protein S 3 FT #SUB 432 432 TYR B 502 502 LEU A Protein S 2 FT #SUB 478 478 PRO B 371 371 THR A Protein A 3 FT #SUB 479 479 TRP B 370 370 VAL A Protein S 2 FT #SUB 479 479 TRP B 371 371 THR A Protein S 7 FT #SUB 483 483 PRO B 394 394 MET A Protein S 1 FT #SUB 486 486 VAL B 398 398 ILE A Protein S 1 FT #SUB 486 486 VAL B 400 400 GLN A Protein S 2 FT #SUB 486 486 VAL B 403 403 TYR A Protein S 1 FT #SUB 487 487 LEU B 400 400 GLN A Protein B 4 FT #SUB 488 488 GLU B 185 185 HIS A Protein S 2 FT #SUB 488 488 GLU B 400 400 GLN A Protein S 6 FT #SUB 488 488 GLU B 403 403 TYR A Protein S 2 FT #SUB 488 488 GLU B 404 404 VAL A Protein S 3 FT #SUB 496 496 GLN B 403 403 TYR A Protein A 8 FT #SUB 497 497 PRO B 403 403 TYR A Protein A 6 FT #SUB 497 497 PRO B 412 412 LEU A Protein B 1 FT #SUB 498 498 LEU B 398 398 ILE A Protein S 2 FT #SUB 498 498 LEU B 403 403 TYR A Protein S 2 FT #SUB 498 498 LEU B 412 412 LEU A Protein B 1 FT #SUB 498 498 LEU B 431 431 TYR A Protein A 6 FT #SUB 499 499 HIS B 416 416 THR A Protein S 2 FT #SUB 499 499 HIS B 431 431 TYR A Protein B 1 FT #SUB 500 500 ASN B 395 395 PRO A Protein S 3 FT #SUB 500 500 ASN B 431 431 TYR A Protein A 4 FT #SUB 500 500 ASN B 432 432 TYR A Protein B 4 FT #SUB 501 501 GLU B 432 432 TYR A Protein S 2 FT #SUB 502 502 LEU B 394 394 MET A Protein S 1 FT #SUB 502 502 LEU B 395 395 PRO A Protein S 1 FT #SUB 502 502 LEU B 432 432 TYR A Protein S 1 FT #SUB 167 167 ASP B 130 130 THR D Protein S 4 FT #SUB 303 303 LYS B 490 490 LYS G Protein S 3 FT #SUB 346 346 ARG B 470 470 LYS G Protein S 1 FT #SUB 353 353 GLU B 490 490 LYS G Protein B 1 FT #SUB 354 354 ALA B 489 489 GLU G Protein S 1 FT #SUB 354 354 ALA B 491 491 LEU G Protein S 2 FT #SUB 354 354 ALA B 492 492 SER G Protein B 2 FT #SUB 354 354 ALA B 494 494 GLN G Protein S 1 FT #SUB 355 355 GLU B 492 492 SER G Protein A 3 FT #SUB 357 357 LEU B 473 473 TRP G Protein S 1 FT #SUB 381 381 HIS B 463 463 MET G Protein S 1 FT #SUB 381 381 HIS B 466 466 ASP G Protein S 7 FT #SUB 382 382 ARG B 466 466 ASP G Protein S 8 FT #SUB 384 384 PHE B 459 459 GLN G Protein S 3 FT #SUB 459 459 GLN B 384 384 PHE G Protein S 5 FT #SUB 459 459 GLN B 440 440 ASP G Protein S 1 FT #SUB 459 459 GLN B 460 460 LEU G Protein S 2 FT #SUB 460 460 LEU B 463 463 MET G Protein S 2 FT #SUB 463 463 MET B 381 381 HIS G Protein A 3 FT #SUB 463 463 MET B 460 460 LEU G Protein S 1 FT #SUB 463 463 MET B 463 463 MET G Protein S 4 FT #SUB 463 463 MET B 467 467 GLN G Protein S 2 FT #SUB 466 466 ASP B 381 381 HIS G Protein S 7 FT #SUB 466 466 ASP B 382 382 ARG G Protein S 6 FT #SUB 467 467 GLN B 463 463 MET G Protein S 1 FT #SUB 470 470 LYS B 346 346 ARG G Protein S 1 FT #SUB 473 473 TRP B 357 357 LEU G Protein S 1 FT #SUB 491 491 LEU B 354 354 ALA G Protein B 1 FT #SUB 492 492 SER B 355 355 GLU G Protein S 4 FT #HET 31 31 ASP B 42 600 CA B S 2 FT #HET 32 32 ASP B 42 600 CA B A 3 FT #HET 39 39 ALA B 7 1 NAG L B 1 FT #HET 61 61 ARG B 7 1 NAG L S 10 FT #HET 61 61 ARG B 8 2 NAG L S 1 FT #HET 70 70 FGP B 42 600 CA B S 4 FT #HET 101 101 PHE B 41 609 CL A A 3 FT #HET 102 102 ASP B 41 609 CL A B 1 FT #HET 147 147 GLN B 9 1 NAG M S 6 FT #HET 153 153 THR B 9 1 NAG M S 2 FT #HET 154 154 ARG B 9 1 NAG M S 1 FT #HET 197 197 PHE B 9 1 NAG M S 5 FT #HET 198 198 GLY B 9 1 NAG M B 1 FT #HET 204 204 MET B 9 1 NAG M B 1 FT #HET 205 205 GLY B 9 1 NAG M B 2 FT #HET 206 206 ARG B 9 1 NAG M B 8 FT #HET 208 208 PRO B 10 2 NAG M S 3 FT #HET 273 273 ASP B 42 600 CA B S 3 FT #HET 274 274 ASN B 42 600 CA B S 3 FT #HET 299 299 PRO B 11 1 NAG N B 2 FT #HET 300 300 GLU B 11 1 NAG N S 2 FT #HET 300 300 GLU B 12 2 NAG N B 1 FT #HET 302 302 PRO B 12 2 NAG N S 2 FT #HET 406 406 PRO B 43 607 NAG B B 1 FT #HET 409 409 GLN B 43 607 NAG B A 2 FT #HET 410 410 ASP B 43 607 NAG B B 2 FT #MOD 41 41 ASN B 7 1 NAG L S FT #MOD 151 151 ASN B 9 1 NAG M S FT #MOD 264 264 ASN B 11 1 NAG N S FT #MOD 413 413 ASN B 43 607 NAG B S FT DISORDER 1 21 FT DISORDER 503 510 CC SEQUENCE 481 AA (ATOM); CC PRNALLLLAD DGGFESGAYN NSAIATPHLD ALARRSLLFR NAFTSVSSXS PSRASLLTGL CC PQHQNGMYGL HQDVHHFNSF DKVRSLPLLL SQAGVRTGII GKKHVGPETV YPFDFAYTEE CC NGSVLQVGRN ITRIKLLVRK FLQTQDDRPF FLYVAFHDPH RCGHSQPQYG TFCEKFGNGE CC SGMGRIPDWT PQAYDPLDVL VPYFVPNTPA ARADLAAQYT TVGRMDQGVG LVLQELRDAG CC VLNDTLVIFT SDNGIPFPSG RTNLYWPGTA EPLLVSSPEH PKRWGQVSEA YVSLLDLTPT CC ILDWFSIPYP SYAIFGSKTI HLTGRSLLPA LEAEPLWATV FGSQSHHEVT MSYPMRSVQH CC RHFRLVHNLN FKMPFPIDQD FYVSPTFQDL LNRTTAGQPT GWYKDLRHYY YRARWELYDR CC SRDPHETQNL ATDPRFAQLL EMLRDQLAKW QWETHDPWVC APDGVLEEKL SPQCQPLHNE CC L CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSAIATPHL CC ATOM ---------------------PRNALLLLADDGGFESGAYNNSAIATPHL CC ***************************** CC SEQRES DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ATOM DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ************************************************** CC SEQRES FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ATOM FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ************************************************** CC SEQRES NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ATOM NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ************************************************** CC SEQRES ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ATOM ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ************************************************** CC SEQRES GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ATOM GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ************************************************** CC SEQRES HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ATOM HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ************************************************** CC SEQRES ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ATOM ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ************************************************** CC SEQRES DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ATOM DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ************************************************** CC SEQRES LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ATOM LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ************************************************** CC SEQRES ELRSHHHHHH CC ATOM EL-------- CC ** SQ SEQUENCE 510 AA; MW; CN; MSCPVPACCA LLLVLGLCRA RPRNALLLLA DDGGFESGAY NNSAIATPHL DALARRSLLF RNAFTSVSSX SPSRASLLTG LPQHQNGMYG LHQDVHHFNS FDKVRSLPLL LSQAGVRTGI IGKKHVGPET VYPFDFAYTE ENGSVLQVGR NITRIKLLVR KFLQTQDDRP FFLYVAFHDP HRCGHSQPQY GTFCEKFGNG ESGMGRIPDW TPQAYDPLDV LVPYFVPNTP AARADLAAQY TTVGRMDQGV GLVLQELRDA GVLNDTLVIF TSDNGIPFPS GRTNLYWPGT AEPLLVSSPE HPKRWGQVSE AYVSLLDLTP TILDWFSIPY PSYAIFGSKT IHLTGRSLLP ALEAEPLWAT VFGSQSHHEV TMSYPMRSVQ HRHFRLVHNL NFKMPFPIDQ DFYVSPTFQD LLNRTTAGQP TGWYKDLRHY YYRARWELYD RSRDPHETQN LATDPRFAQL LEMLRDQLAK WQWETHDPWV CAPDGVLEEK LSPQCQPLHN ELRSHHHHHH // ID 4MIVC STANDARD; PRT; 510 AA. DT CONVERTED FROM PDB (SEQRES) 4MIV DE N-sulphoglucosamine sulphohydrolase OS Homo sapiens CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.400 CC R-Factor 0.217 FT #SUB 89 89 TYR C 95 95 VAL D Protein S 4 FT #SUB 89 89 TYR C 370 370 VAL D Protein S 2 FT #SUB 94 94 ASP C 101 101 PHE D Protein B 1 FT #SUB 94 94 ASP C 336 336 PHE D Protein A 7 FT #SUB 95 95 VAL C 89 89 TYR D Protein B 4 FT #SUB 95 95 VAL C 336 336 PHE D Protein A 2 FT #SUB 95 95 VAL C 478 478 PRO D Protein S 1 FT #SUB 97 97 HIS C 99 99 ASN D Protein S 2 FT #SUB 97 97 HIS C 100 100 SER D Protein S 1 FT #SUB 97 97 HIS C 101 101 PHE D Protein S 6 FT #SUB 99 99 ASN C 97 97 HIS D Protein B 2 FT #SUB 100 100 SER C 97 97 HIS D Protein B 1 FT #SUB 101 101 PHE C 94 94 ASP D Protein S 1 FT #SUB 101 101 PHE C 97 97 HIS D Protein S 6 FT #SUB 185 185 HIS C 488 488 GLU D Protein S 2 FT #SUB 336 336 PHE C 94 94 ASP D Protein S 7 FT #SUB 336 336 PHE C 95 95 VAL D Protein S 2 FT #SUB 367 367 HIS C 367 367 HIS D Protein S 4 FT #SUB 370 370 VAL C 89 89 TYR D Protein S 2 FT #SUB 370 370 VAL C 479 479 TRP D Protein A 2 FT #SUB 371 371 THR C 478 478 PRO D Protein S 3 FT #SUB 371 371 THR C 479 479 TRP D Protein A 7 FT #SUB 373 373 SER C 373 373 SER D Protein S 4 FT #SUB 390 390 LEU C 394 394 MET D Protein B 1 FT #SUB 391 391 ASN C 394 394 MET D Protein A 5 FT #SUB 393 393 LYS C 393 393 LYS D Protein A 6 FT #SUB 393 393 LYS C 394 394 MET D Protein A 3 FT #SUB 393 393 LYS C 432 432 TYR D Protein S 3 FT #SUB 394 394 MET C 390 390 LEU D Protein S 1 FT #SUB 394 394 MET C 391 391 ASN D Protein S 10 FT #SUB 394 394 MET C 392 392 PHE D Protein S 1 FT #SUB 394 394 MET C 393 393 LYS D Protein S 3 FT #SUB 395 395 PRO C 500 500 ASN D Protein S 2 FT #SUB 398 398 ILE C 486 486 VAL D Protein B 1 FT #SUB 398 398 ILE C 498 498 LEU D Protein S 2 FT #SUB 400 400 GLN C 486 486 VAL D Protein A 2 FT #SUB 400 400 GLN C 487 487 LEU D Protein S 6 FT #SUB 400 400 GLN C 488 488 GLU D Protein A 6 FT #SUB 403 403 TYR C 486 486 VAL D Protein S 1 FT #SUB 403 403 TYR C 488 488 GLU D Protein S 2 FT #SUB 403 403 TYR C 496 496 GLN D Protein S 7 FT #SUB 403 403 TYR C 497 497 PRO D Protein S 6 FT #SUB 403 403 TYR C 498 498 LEU D Protein S 2 FT #SUB 404 404 VAL C 488 488 GLU D Protein S 3 FT #SUB 412 412 LEU C 497 497 PRO D Protein S 1 FT #SUB 412 412 LEU C 498 498 LEU D Protein S 1 FT #SUB 416 416 THR C 499 499 HIS D Protein S 2 FT #SUB 431 431 TYR C 498 498 LEU D Protein S 6 FT #SUB 431 431 TYR C 499 499 HIS D Protein S 1 FT #SUB 431 431 TYR C 500 500 ASN D Protein S 4 FT #SUB 432 432 TYR C 393 393 LYS D Protein S 3 FT #SUB 432 432 TYR C 500 500 ASN D Protein S 4 FT #SUB 432 432 TYR C 501 501 GLU D Protein S 4 FT #SUB 478 478 PRO C 95 95 VAL D Protein S 1 FT #SUB 478 478 PRO C 371 371 THR D Protein A 3 FT #SUB 479 479 TRP C 370 370 VAL D Protein S 2 FT #SUB 479 479 TRP C 371 371 THR D Protein S 7 FT #SUB 483 483 PRO C 394 394 MET D Protein S 1 FT #SUB 486 486 VAL C 398 398 ILE D Protein S 1 FT #SUB 486 486 VAL C 400 400 GLN D Protein S 2 FT #SUB 486 486 VAL C 403 403 TYR D Protein S 1 FT #SUB 487 487 LEU C 400 400 GLN D Protein B 4 FT #SUB 488 488 GLU C 185 185 HIS D Protein S 2 FT #SUB 488 488 GLU C 400 400 GLN D Protein S 6 FT #SUB 488 488 GLU C 403 403 TYR D Protein S 2 FT #SUB 488 488 GLU C 404 404 VAL D Protein S 3 FT #SUB 496 496 GLN C 403 403 TYR D Protein A 8 FT #SUB 497 497 PRO C 403 403 TYR D Protein A 7 FT #SUB 497 497 PRO C 412 412 LEU D Protein B 1 FT #SUB 498 498 LEU C 398 398 ILE D Protein S 2 FT #SUB 498 498 LEU C 403 403 TYR D Protein S 2 FT #SUB 498 498 LEU C 412 412 LEU D Protein B 1 FT #SUB 498 498 LEU C 431 431 TYR D Protein A 6 FT #SUB 499 499 HIS C 412 412 LEU D Protein S 1 FT #SUB 499 499 HIS C 416 416 THR D Protein S 3 FT #SUB 499 499 HIS C 431 431 TYR D Protein B 1 FT #SUB 500 500 ASN C 395 395 PRO D Protein S 2 FT #SUB 500 500 ASN C 431 431 TYR D Protein A 5 FT #SUB 500 500 ASN C 432 432 TYR D Protein B 4 FT #SUB 501 501 GLU C 428 428 ARG D Protein S 1 FT #SUB 501 501 GLU C 432 432 TYR D Protein S 3 FT #SUB 502 502 LEU C 395 395 PRO D Protein S 1 FT #SUB 502 502 LEU C 432 432 TYR D Protein S 2 FT #HET 31 31 ASP C 44 601 CA C S 2 FT #HET 32 32 ASP C 44 601 CA C A 3 FT #HET 39 39 ALA C 13 1 NAG O B 1 FT #HET 59 59 LEU C 13 1 NAG O S 1 FT #HET 61 61 ARG C 13 1 NAG O S 6 FT #HET 70 70 FGP C 44 601 CA C A 7 FT #HET 147 147 GLN C 15 1 NAG P S 6 FT #HET 154 154 ARG C 15 1 NAG P S 2 FT #HET 157 157 LEU C 16 2 NAG P S 1 FT #HET 197 197 PHE C 15 1 NAG P S 5 FT #HET 198 198 GLY C 15 1 NAG P B 1 FT #HET 204 204 MET C 15 1 NAG P B 1 FT #HET 205 205 GLY C 15 1 NAG P B 2 FT #HET 206 206 ARG C 15 1 NAG P B 9 FT #HET 207 207 ILE C 15 1 NAG P S 2 FT #HET 273 273 ASP C 44 601 CA C S 3 FT #HET 274 274 ASN C 44 601 CA C S 3 FT #HET 299 299 PRO C 17 1 NAG Q B 5 FT #HET 300 300 GLU C 17 1 NAG Q A 2 FT #HET 300 300 GLU C 18 2 NAG Q B 3 FT #HET 308 308 VAL C 13 1 NAG O S 1 FT #HET 393 393 LYS C 46 609 CL C S 2 FT #HET 406 406 PRO C 45 608 NAG C B 1 FT #HET 409 409 GLN C 45 608 NAG C A 2 FT #HET 410 410 ASP C 45 608 NAG C A 3 FT #MOD 41 41 ASN C 13 1 NAG O S FT #MOD 151 151 ASN C 15 1 NAG P S FT #MOD 264 264 ASN C 17 1 NAG Q S FT #MOD 413 413 ASN C 45 608 NAG C S FT DISORDER 1 20 FT DISORDER 505 510 CC Miss-BB 1 CC Miss-SC 1 CC SEQUENCE 484 AA (ATOM); CC RPRNALLLLA DDGGFESGAY NNSAIATPHL DALARRSLLF RNAFTSVSSX SPSRASLLTG CC LPQHQNGMYG LHQDVHHFNS FDKVRSLPLL LSQAGVRTGI IGKKHVGPET VYPFDFAYTE CC ENGSVLQVGR NITRIKLLVR KFLQTQDDRP FFLYVAFHDP HRCGHSQPQY GTFCEKFGNG CC ESGMGRIPDW TPQAYDPLDV LVPYFVPNTP AARADLAAQY TTVGRMDQGV GLVLQELRDA CC GVLNDTLVIF TSDNGIPFPS GRTNLYWPGT AEPLLVSSPE HPKRWGQVSE AYVSLLDLTP CC TILDWFSIPY PSYAIFGSKT IHLTGRSLLP ALEAEPLWAT VFGSQSHHEV TMSYPMRSVQ CC HRHFRLVHNL NFKMPFPIDQ DFYVSPTFQD LLNRTTAGQP TGWYKDLRHY YYRARWELYD CC RSRDPHETQN LATDPRFAQL LEMLRDQLAK WQWETHDPWV CAPDGVLEEK LSPQCQPLHN CC ELRS CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSAIATPHL CC ATOM --------------------RPRNALLLLADDGGFESGAYNNSAIATPHL CC ****************************** CC SEQRES DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ATOM DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ************************************************** CC SEQRES FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ATOM FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ************************************************** CC SEQRES NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ATOM NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ************************************************** CC SEQRES ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ATOM ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ************************************************** CC SEQRES GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ATOM GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ************************************************** CC SEQRES HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ATOM HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ************************************************** CC SEQRES ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ATOM ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ************************************************** CC SEQRES DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ATOM DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ************************************************** CC SEQRES LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ATOM LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ************************************************** CC SEQRES ELRSHHHHHH CC ATOM ELRS------ CC **** SQ SEQUENCE 510 AA; MW; CN; MSCPVPACCA LLLVLGLCRA RPRNALLLLA DDGGFESGAY NNSAIATPHL DALARRSLLF RNAFTSVSSX SPSRASLLTG LPQHQNGMYG LHQDVHHFNS FDKVRSLPLL LSQAGVRTGI IGKKHVGPET VYPFDFAYTE ENGSVLQVGR NITRIKLLVR KFLQTQDDRP FFLYVAFHDP HRCGHSQPQY GTFCEKFGNG ESGMGRIPDW TPQAYDPLDV LVPYFVPNTP AARADLAAQY TTVGRMDQGV GLVLQELRDA GVLNDTLVIF TSDNGIPFPS GRTNLYWPGT AEPLLVSSPE HPKRWGQVSE AYVSLLDLTP TILDWFSIPY PSYAIFGSKT IHLTGRSLLP ALEAEPLWAT VFGSQSHHEV TMSYPMRSVQ HRHFRLVHNL NFKMPFPIDQ DFYVSPTFQD LLNRTTAGQP TGWYKDLRHY YYRARWELYD RSRDPHETQN LATDPRFAQL LEMLRDQLAK WQWETHDPWV CAPDGVLEEK LSPQCQPLHN ELRSHHHHHH // ID 4MIVD STANDARD; PRT; 510 AA. DT CONVERTED FROM PDB (SEQRES) 4MIV DE N-sulphoglucosamine sulphohydrolase OS Homo sapiens CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.400 CC R-Factor 0.217 FT #SUB 130 130 THR D 167 167 ASP B Protein S 4 FT #SUB 89 89 TYR D 95 95 VAL C Protein S 4 FT #SUB 89 89 TYR D 370 370 VAL C Protein S 2 FT #SUB 94 94 ASP D 101 101 PHE C Protein B 1 FT #SUB 94 94 ASP D 336 336 PHE C Protein A 7 FT #SUB 95 95 VAL D 89 89 TYR C Protein B 4 FT #SUB 95 95 VAL D 336 336 PHE C Protein A 2 FT #SUB 95 95 VAL D 478 478 PRO C Protein S 1 FT #SUB 97 97 HIS D 99 99 ASN C Protein S 2 FT #SUB 97 97 HIS D 100 100 SER C Protein S 1 FT #SUB 97 97 HIS D 101 101 PHE C Protein S 6 FT #SUB 99 99 ASN D 97 97 HIS C Protein B 2 FT #SUB 100 100 SER D 97 97 HIS C Protein B 1 FT #SUB 101 101 PHE D 94 94 ASP C Protein S 1 FT #SUB 101 101 PHE D 97 97 HIS C Protein S 6 FT #SUB 185 185 HIS D 488 488 GLU C Protein S 2 FT #SUB 336 336 PHE D 94 94 ASP C Protein S 7 FT #SUB 336 336 PHE D 95 95 VAL C Protein S 2 FT #SUB 367 367 HIS D 367 367 HIS C Protein S 4 FT #SUB 370 370 VAL D 89 89 TYR C Protein S 2 FT #SUB 370 370 VAL D 479 479 TRP C Protein A 2 FT #SUB 371 371 THR D 478 478 PRO C Protein S 3 FT #SUB 371 371 THR D 479 479 TRP C Protein A 7 FT #SUB 373 373 SER D 373 373 SER C Protein S 4 FT #SUB 390 390 LEU D 394 394 MET C Protein B 1 FT #SUB 391 391 ASN D 394 394 MET C Protein A 10 FT #SUB 392 392 PHE D 394 394 MET C Protein B 1 FT #SUB 393 393 LYS D 393 393 LYS C Protein A 6 FT #SUB 393 393 LYS D 394 394 MET C Protein A 3 FT #SUB 393 393 LYS D 432 432 TYR C Protein S 3 FT #SUB 394 394 MET D 390 390 LEU C Protein S 1 FT #SUB 394 394 MET D 391 391 ASN C Protein S 5 FT #SUB 394 394 MET D 393 393 LYS C Protein S 3 FT #SUB 394 394 MET D 483 483 PRO C Protein S 1 FT #SUB 395 395 PRO D 500 500 ASN C Protein S 2 FT #SUB 395 395 PRO D 502 502 LEU C Protein S 1 FT #SUB 398 398 ILE D 486 486 VAL C Protein B 1 FT #SUB 398 398 ILE D 498 498 LEU C Protein S 2 FT #SUB 400 400 GLN D 486 486 VAL C Protein A 2 FT #SUB 400 400 GLN D 487 487 LEU C Protein S 4 FT #SUB 400 400 GLN D 488 488 GLU C Protein A 6 FT #SUB 403 403 TYR D 486 486 VAL C Protein S 1 FT #SUB 403 403 TYR D 488 488 GLU C Protein S 2 FT #SUB 403 403 TYR D 496 496 GLN C Protein S 8 FT #SUB 403 403 TYR D 497 497 PRO C Protein S 7 FT #SUB 403 403 TYR D 498 498 LEU C Protein S 2 FT #SUB 404 404 VAL D 488 488 GLU C Protein S 3 FT #SUB 412 412 LEU D 497 497 PRO C Protein S 1 FT #SUB 412 412 LEU D 498 498 LEU C Protein S 1 FT #SUB 412 412 LEU D 499 499 HIS C Protein S 1 FT #SUB 416 416 THR D 499 499 HIS C Protein S 3 FT #SUB 428 428 ARG D 501 501 GLU C Protein S 1 FT #SUB 431 431 TYR D 498 498 LEU C Protein S 6 FT #SUB 431 431 TYR D 499 499 HIS C Protein S 1 FT #SUB 431 431 TYR D 500 500 ASN C Protein S 5 FT #SUB 432 432 TYR D 393 393 LYS C Protein S 3 FT #SUB 432 432 TYR D 500 500 ASN C Protein S 4 FT #SUB 432 432 TYR D 501 501 GLU C Protein S 3 FT #SUB 432 432 TYR D 502 502 LEU C Protein S 2 FT #SUB 478 478 PRO D 95 95 VAL C Protein S 1 FT #SUB 478 478 PRO D 371 371 THR C Protein A 3 FT #SUB 479 479 TRP D 370 370 VAL C Protein S 2 FT #SUB 479 479 TRP D 371 371 THR C Protein S 7 FT #SUB 486 486 VAL D 398 398 ILE C Protein S 1 FT #SUB 486 486 VAL D 400 400 GLN C Protein S 2 FT #SUB 486 486 VAL D 403 403 TYR C Protein S 1 FT #SUB 487 487 LEU D 400 400 GLN C Protein A 6 FT #SUB 488 488 GLU D 185 185 HIS C Protein S 2 FT #SUB 488 488 GLU D 400 400 GLN C Protein S 6 FT #SUB 488 488 GLU D 403 403 TYR C Protein S 2 FT #SUB 488 488 GLU D 404 404 VAL C Protein S 3 FT #SUB 496 496 GLN D 403 403 TYR C Protein A 7 FT #SUB 497 497 PRO D 403 403 TYR C Protein A 6 FT #SUB 497 497 PRO D 412 412 LEU C Protein B 1 FT #SUB 498 498 LEU D 398 398 ILE C Protein S 2 FT #SUB 498 498 LEU D 403 403 TYR C Protein S 2 FT #SUB 498 498 LEU D 412 412 LEU C Protein B 1 FT #SUB 498 498 LEU D 431 431 TYR C Protein A 6 FT #SUB 499 499 HIS D 416 416 THR C Protein S 2 FT #SUB 499 499 HIS D 431 431 TYR C Protein B 1 FT #SUB 500 500 ASN D 395 395 PRO C Protein S 2 FT #SUB 500 500 ASN D 431 431 TYR C Protein A 4 FT #SUB 500 500 ASN D 432 432 TYR C Protein B 4 FT #SUB 501 501 GLU D 432 432 TYR C Protein S 4 FT #SUB 354 354 ALA D 332 332 SER E Protein S 1 FT #SUB 355 355 GLU D 332 332 SER E Protein B 1 FT #SUB 338 338 SER D 202 202 SER F Protein A 5 FT #SUB 339 339 LYS D 202 202 SER F Protein A 2 FT #HET 31 31 ASP D 47 601 CA D S 2 FT #HET 32 32 ASP D 47 601 CA D A 3 FT #HET 39 39 ALA D 19 1 NAG R B 1 FT #HET 59 59 LEU D 19 1 NAG R S 1 FT #HET 61 61 ARG D 19 1 NAG R S 5 FT #HET 70 70 FGP D 47 601 CA D S 4 FT #HET 147 147 GLN D 21 1 NAG S S 6 FT #HET 153 153 THR D 21 1 NAG S S 2 FT #HET 154 154 ARG D 21 1 NAG S A 3 FT #HET 157 157 LEU D 22 2 NAG S S 1 FT #HET 197 197 PHE D 21 1 NAG S S 4 FT #HET 198 198 GLY D 21 1 NAG S B 1 FT #HET 204 204 MET D 21 1 NAG S B 1 FT #HET 205 205 GLY D 21 1 NAG S B 2 FT #HET 206 206 ARG D 21 1 NAG S B 8 FT #HET 208 208 PRO D 22 2 NAG S S 1 FT #HET 273 273 ASP D 47 601 CA D S 3 FT #HET 274 274 ASN D 47 601 CA D S 3 FT #HET 299 299 PRO D 23 1 NAG T B 2 FT #HET 300 300 GLU D 24 2 NAG T B 2 FT #HET 302 302 PRO D 24 2 NAG T S 1 FT #HET 406 406 PRO D 48 608 NAG D B 1 FT #HET 409 409 GLN D 48 608 NAG D S 1 FT #HET 410 410 ASP D 48 608 NAG D B 1 FT #HET 428 428 ARG D 46 609 CL C S 1 FT #HET 429 429 HIS D 49 609 PEG D A 4 FT #HET 432 432 TYR D 46 609 CL C S 1 FT #HET 432 432 TYR D 49 609 PEG D B 1 FT #HET 433 433 ARG D 49 609 PEG D A 7 FT #HET 434 434 ALA D 49 609 PEG D A 5 FT #HET 437 437 GLU D 49 609 PEG D S 7 FT #HET 439 439 TYR D 49 609 PEG D S 1 FT #HET 450 450 ASN D 49 609 PEG D S 3 FT #HET 492 492 SER D 30 2 NAG W S 2 FT #HET 493 493 PRO D 30 2 NAG W S 1 FT #MOD 41 41 ASN D 19 1 NAG R S FT #MOD 151 151 ASN D 21 1 NAG S S FT #MOD 264 264 ASN D 23 1 NAG T S FT #MOD 413 413 ASN D 48 608 NAG D S FT DISORDER 1 21 FT DISORDER 502 510 CC SEQUENCE 480 AA (ATOM); CC PRNALLLLAD DGGFESGAYN NSAIATPHLD ALARRSLLFR NAFTSVSSXS PSRASLLTGL CC PQHQNGMYGL HQDVHHFNSF DKVRSLPLLL SQAGVRTGII GKKHVGPETV YPFDFAYTEE CC NGSVLQVGRN ITRIKLLVRK FLQTQDDRPF FLYVAFHDPH RCGHSQPQYG TFCEKFGNGE CC SGMGRIPDWT PQAYDPLDVL VPYFVPNTPA ARADLAAQYT TVGRMDQGVG LVLQELRDAG CC VLNDTLVIFT SDNGIPFPSG RTNLYWPGTA EPLLVSSPEH PKRWGQVSEA YVSLLDLTPT CC ILDWFSIPYP SYAIFGSKTI HLTGRSLLPA LEAEPLWATV FGSQSHHEVT MSYPMRSVQH CC RHFRLVHNLN FKMPFPIDQD FYVSPTFQDL LNRTTAGQPT GWYKDLRHYY YRARWELYDR CC SRDPHETQNL ATDPRFAQLL EMLRDQLAKW QWETHDPWVC APDGVLEEKL SPQCQPLHNE CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSAIATPHL CC ATOM ---------------------PRNALLLLADDGGFESGAYNNSAIATPHL CC ***************************** CC SEQRES DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ATOM DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ************************************************** CC SEQRES FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ATOM FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ************************************************** CC SEQRES NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ATOM NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ************************************************** CC SEQRES ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ATOM ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ************************************************** CC SEQRES GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ATOM GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ************************************************** CC SEQRES HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ATOM HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ************************************************** CC SEQRES ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ATOM ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ************************************************** CC SEQRES DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ATOM DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ************************************************** CC SEQRES LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ATOM LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ************************************************** CC SEQRES ELRSHHHHHH CC ATOM E--------- CC * SQ SEQUENCE 510 AA; MW; CN; MSCPVPACCA LLLVLGLCRA RPRNALLLLA DDGGFESGAY NNSAIATPHL DALARRSLLF RNAFTSVSSX SPSRASLLTG LPQHQNGMYG LHQDVHHFNS FDKVRSLPLL LSQAGVRTGI IGKKHVGPET VYPFDFAYTE ENGSVLQVGR NITRIKLLVR KFLQTQDDRP FFLYVAFHDP HRCGHSQPQY GTFCEKFGNG ESGMGRIPDW TPQAYDPLDV LVPYFVPNTP AARADLAAQY TTVGRMDQGV GLVLQELRDA GVLNDTLVIF TSDNGIPFPS GRTNLYWPGT AEPLLVSSPE HPKRWGQVSE AYVSLLDLTP TILDWFSIPY PSYAIFGSKT IHLTGRSLLP ALEAEPLWAT VFGSQSHHEV TMSYPMRSVQ HRHFRLVHNL NFKMPFPIDQ DFYVSPTFQD LLNRTTAGQP TGWYKDLRHY YYRARWELYD RSRDPHETQN LATDPRFAQL LEMLRDQLAK WQWETHDPWV CAPDGVLEEK LSPQCQPLHN ELRSHHHHHH // ID 4MIVE STANDARD; PRT; 510 AA. DT CONVERTED FROM PDB (SEQRES) 4MIV DE N-sulphoglucosamine sulphohydrolase OS Homo sapiens CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.400 CC R-Factor 0.217 FT #SUB 332 332 SER E 354 354 ALA D Protein S 1 FT #SUB 332 332 SER E 355 355 GLU D Protein S 1 FT #SUB 89 89 TYR E 95 95 VAL F Protein S 4 FT #SUB 89 89 TYR E 370 370 VAL F Protein S 2 FT #SUB 94 94 ASP E 101 101 PHE F Protein B 1 FT #SUB 94 94 ASP E 336 336 PHE F Protein A 10 FT #SUB 95 95 VAL E 89 89 TYR F Protein B 4 FT #SUB 95 95 VAL E 101 101 PHE F Protein B 2 FT #SUB 95 95 VAL E 336 336 PHE F Protein A 3 FT #SUB 97 97 HIS E 99 99 ASN F Protein S 2 FT #SUB 97 97 HIS E 100 100 SER F Protein S 2 FT #SUB 97 97 HIS E 101 101 PHE F Protein S 6 FT #SUB 99 99 ASN E 97 97 HIS F Protein B 2 FT #SUB 100 100 SER E 97 97 HIS F Protein B 1 FT #SUB 101 101 PHE E 94 94 ASP F Protein S 1 FT #SUB 101 101 PHE E 97 97 HIS F Protein S 6 FT #SUB 185 185 HIS E 488 488 GLU F Protein S 2 FT #SUB 367 367 HIS E 367 367 HIS F Protein S 4 FT #SUB 370 370 VAL E 89 89 TYR F Protein S 2 FT #SUB 370 370 VAL E 479 479 TRP F Protein A 2 FT #SUB 371 371 THR E 478 478 PRO F Protein S 3 FT #SUB 371 371 THR E 479 479 TRP F Protein A 7 FT #SUB 373 373 SER E 373 373 SER F Protein S 4 FT #SUB 390 390 LEU E 394 394 MET F Protein B 1 FT #SUB 391 391 ASN E 394 394 MET F Protein S 1 FT #SUB 393 393 LYS E 393 393 LYS F Protein A 4 FT #SUB 393 393 LYS E 394 394 MET F Protein A 3 FT #SUB 393 393 LYS E 432 432 TYR F Protein S 3 FT #SUB 394 394 MET E 390 390 LEU F Protein S 1 FT #SUB 394 394 MET E 391 391 ASN F Protein S 4 FT #SUB 394 394 MET E 393 393 LYS F Protein S 3 FT #SUB 395 395 PRO E 500 500 ASN F Protein S 2 FT #SUB 398 398 ILE E 486 486 VAL F Protein B 1 FT #SUB 400 400 GLN E 486 486 VAL F Protein A 2 FT #SUB 400 400 GLN E 487 487 LEU F Protein S 4 FT #SUB 400 400 GLN E 488 488 GLU F Protein A 6 FT #SUB 403 403 TYR E 486 486 VAL F Protein S 1 FT #SUB 403 403 TYR E 488 488 GLU F Protein S 2 FT #SUB 403 403 TYR E 496 496 GLN F Protein S 5 FT #SUB 403 403 TYR E 497 497 PRO F Protein S 6 FT #SUB 404 404 VAL E 488 488 GLU F Protein S 3 FT #SUB 412 412 LEU E 497 497 PRO F Protein S 1 FT #SUB 412 412 LEU E 498 498 LEU F Protein S 1 FT #SUB 416 416 THR E 499 499 HIS F Protein S 3 FT #SUB 431 431 TYR E 498 498 LEU F Protein S 5 FT #SUB 431 431 TYR E 499 499 HIS F Protein S 1 FT #SUB 431 431 TYR E 500 500 ASN F Protein S 2 FT #SUB 432 432 TYR E 393 393 LYS F Protein S 1 FT #SUB 432 432 TYR E 500 500 ASN F Protein S 4 FT #SUB 432 432 TYR E 501 501 GLU F Protein S 2 FT #SUB 478 478 PRO E 371 371 THR F Protein A 3 FT #SUB 479 479 TRP E 370 370 VAL F Protein S 1 FT #SUB 479 479 TRP E 371 371 THR F Protein S 5 FT #SUB 486 486 VAL E 398 398 ILE F Protein S 1 FT #SUB 486 486 VAL E 400 400 GLN F Protein S 2 FT #SUB 487 487 LEU E 400 400 GLN F Protein B 5 FT #SUB 488 488 GLU E 400 400 GLN F Protein S 6 FT #SUB 488 488 GLU E 403 403 TYR F Protein S 1 FT #SUB 496 496 GLN E 403 403 TYR F Protein A 4 FT #SUB 497 497 PRO E 403 403 TYR F Protein A 6 FT #SUB 498 498 LEU E 398 398 ILE F Protein S 2 FT #SUB 498 498 LEU E 403 403 TYR F Protein S 2 FT #SUB 498 498 LEU E 412 412 LEU F Protein B 1 FT #SUB 498 498 LEU E 431 431 TYR F Protein B 4 FT #SUB 499 499 HIS E 416 416 THR F Protein S 1 FT #SUB 499 499 HIS E 431 431 TYR F Protein B 2 FT #SUB 500 500 ASN E 395 395 PRO F Protein S 4 FT #SUB 500 500 ASN E 431 431 TYR F Protein A 3 FT #SUB 500 500 ASN E 432 432 TYR F Protein B 4 FT #SUB 501 501 GLU E 432 432 TYR F Protein S 3 FT #SUB 502 502 LEU E 432 432 TYR F Protein S 1 FT #SUB 452 452 ALA E 167 167 ASP G Protein B 1 FT #SUB 452 452 ALA E 168 168 ASP G Protein B 4 FT #SUB 453 453 THR E 166 166 GLN G Protein S 5 FT #SUB 453 453 THR E 167 167 ASP G Protein S 2 FT #SUB 453 453 THR E 168 168 ASP G Protein B 1 FT #SUB 455 455 PRO E 22 22 PRO G Protein A 6 FT #SUB 461 461 LEU E 168 168 ASP G Protein S 1 FT #HET 31 31 ASP E 50 600 CA E S 2 FT #HET 32 32 ASP E 50 600 CA E A 3 FT #HET 39 39 ALA E 25 1 NAG U B 1 FT #HET 70 70 FGP E 50 600 CA E A 6 FT #HET 147 147 GLN E 27 1 NAG V S 6 FT #HET 197 197 PHE E 27 1 NAG V S 4 FT #HET 198 198 GLY E 27 1 NAG V B 1 FT #HET 204 204 MET E 27 1 NAG V B 1 FT #HET 206 206 ARG E 27 1 NAG V B 9 FT #HET 207 207 ILE E 27 1 NAG V S 2 FT #HET 208 208 PRO E 28 2 NAG V S 1 FT #HET 273 273 ASP E 50 600 CA E S 3 FT #HET 274 274 ASN E 50 600 CA E S 3 FT #HET 308 308 VAL E 25 1 NAG U S 1 FT #MOD 41 41 ASN E 25 1 NAG U S FT #MOD 151 151 ASN E 27 1 NAG V S FT DISORDER 1 21 FT DISORDER 505 510 CC Miss-BB 1 CC Miss-SC 1 CC SEQUENCE 483 AA (ATOM); CC PRNALLLLAD DGGFESGAYN NSAIATPHLD ALARRSLLFR NAFTSVSSXS PSRASLLTGL CC PQHQNGMYGL HQDVHHFNSF DKVRSLPLLL SQAGVRTGII GKKHVGPETV YPFDFAYTEE CC NGSVLQVGRN ITRIKLLVRK FLQTQDDRPF FLYVAFHDPH RCGHSQPQYG TFCEKFGNGE CC SGMGRIPDWT PQAYDPLDVL VPYFVPNTPA ARADLAAQYT TVGRMDQGVG LVLQELRDAG CC VLNDTLVIFT SDNGIPFPSG RTNLYWPGTA EPLLVSSPEH PKRWGQVSEA YVSLLDLTPT CC ILDWFSIPYP SYAIFGSKTI HLTGRSLLPA LEAEPLWATV FGSQSHHEVT MSYPMRSVQH CC RHFRLVHNLN FKMPFPIDQD FYVSPTFQDL LNRTTAGQPT GWYKDLRHYY YRARWELYDR CC SRDPHETQNL ATDPRFAQLL EMLRDQLAKW QWETHDPWVC APDGVLEEKL SPQCQPLHNE CC LRS CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSAIATPHL CC ATOM ---------------------PRNALLLLADDGGFESGAYNNSAIATPHL CC ***************************** CC SEQRES DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ATOM DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ************************************************** CC SEQRES FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ATOM FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ************************************************** CC SEQRES NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ATOM NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ************************************************** CC SEQRES ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ATOM ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ************************************************** CC SEQRES GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ATOM GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ************************************************** CC SEQRES HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ATOM HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ************************************************** CC SEQRES ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ATOM ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ************************************************** CC SEQRES DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ATOM DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ************************************************** CC SEQRES LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ATOM LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ************************************************** CC SEQRES ELRSHHHHHH CC ATOM ELRS------ CC **** SQ SEQUENCE 510 AA; MW; CN; MSCPVPACCA LLLVLGLCRA RPRNALLLLA DDGGFESGAY NNSAIATPHL DALARRSLLF RNAFTSVSSX SPSRASLLTG LPQHQNGMYG LHQDVHHFNS FDKVRSLPLL LSQAGVRTGI IGKKHVGPET VYPFDFAYTE ENGSVLQVGR NITRIKLLVR KFLQTQDDRP FFLYVAFHDP HRCGHSQPQY GTFCEKFGNG ESGMGRIPDW TPQAYDPLDV LVPYFVPNTP AARADLAAQY TTVGRMDQGV GLVLQELRDA GVLNDTLVIF TSDNGIPFPS GRTNLYWPGT AEPLLVSSPE HPKRWGQVSE AYVSLLDLTP TILDWFSIPY PSYAIFGSKT IHLTGRSLLP ALEAEPLWAT VFGSQSHHEV TMSYPMRSVQ HRHFRLVHNL NFKMPFPIDQ DFYVSPTFQD LLNRTTAGQP TGWYKDLRHY YYRARWELYD RSRDPHETQN LATDPRFAQL LEMLRDQLAK WQWETHDPWV CAPDGVLEEK LSPQCQPLHN ELRSHHHHHH // ID 4MIVF STANDARD; PRT; 510 AA. DT CONVERTED FROM PDB (SEQRES) 4MIV DE N-sulphoglucosamine sulphohydrolase OS Homo sapiens CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.400 CC R-Factor 0.217 FT #SUB 202 202 SER F 338 338 SER D Protein S 5 FT #SUB 202 202 SER F 339 339 LYS D Protein S 2 FT #SUB 89 89 TYR F 95 95 VAL E Protein S 4 FT #SUB 89 89 TYR F 370 370 VAL E Protein S 2 FT #SUB 94 94 ASP F 101 101 PHE E Protein B 1 FT #SUB 95 95 VAL F 89 89 TYR E Protein B 4 FT #SUB 97 97 HIS F 99 99 ASN E Protein S 2 FT #SUB 97 97 HIS F 100 100 SER E Protein S 1 FT #SUB 97 97 HIS F 101 101 PHE E Protein S 6 FT #SUB 99 99 ASN F 97 97 HIS E Protein B 2 FT #SUB 100 100 SER F 97 97 HIS E Protein B 2 FT #SUB 101 101 PHE F 94 94 ASP E Protein S 1 FT #SUB 101 101 PHE F 95 95 VAL E Protein S 2 FT #SUB 101 101 PHE F 97 97 HIS E Protein S 6 FT #SUB 336 336 PHE F 94 94 ASP E Protein S 10 FT #SUB 336 336 PHE F 95 95 VAL E Protein S 3 FT #SUB 367 367 HIS F 367 367 HIS E Protein S 4 FT #SUB 370 370 VAL F 89 89 TYR E Protein S 2 FT #SUB 370 370 VAL F 479 479 TRP E Protein B 1 FT #SUB 371 371 THR F 478 478 PRO E Protein S 3 FT #SUB 371 371 THR F 479 479 TRP E Protein A 5 FT #SUB 373 373 SER F 373 373 SER E Protein S 4 FT #SUB 390 390 LEU F 394 394 MET E Protein B 1 FT #SUB 391 391 ASN F 394 394 MET E Protein A 4 FT #SUB 393 393 LYS F 393 393 LYS E Protein A 4 FT #SUB 393 393 LYS F 394 394 MET E Protein A 3 FT #SUB 393 393 LYS F 432 432 TYR E Protein S 1 FT #SUB 394 394 MET F 390 390 LEU E Protein S 1 FT #SUB 394 394 MET F 391 391 ASN E Protein S 1 FT #SUB 394 394 MET F 393 393 LYS E Protein S 3 FT #SUB 395 395 PRO F 500 500 ASN E Protein S 4 FT #SUB 398 398 ILE F 486 486 VAL E Protein B 1 FT #SUB 398 398 ILE F 498 498 LEU E Protein S 2 FT #SUB 400 400 GLN F 486 486 VAL E Protein A 2 FT #SUB 400 400 GLN F 487 487 LEU E Protein S 5 FT #SUB 400 400 GLN F 488 488 GLU E Protein A 6 FT #SUB 403 403 TYR F 488 488 GLU E Protein S 1 FT #SUB 403 403 TYR F 496 496 GLN E Protein S 4 FT #SUB 403 403 TYR F 497 497 PRO E Protein S 6 FT #SUB 403 403 TYR F 498 498 LEU E Protein S 2 FT #SUB 412 412 LEU F 498 498 LEU E Protein S 1 FT #SUB 416 416 THR F 499 499 HIS E Protein S 1 FT #SUB 431 431 TYR F 498 498 LEU E Protein S 4 FT #SUB 431 431 TYR F 499 499 HIS E Protein S 2 FT #SUB 431 431 TYR F 500 500 ASN E Protein S 3 FT #SUB 432 432 TYR F 393 393 LYS E Protein S 3 FT #SUB 432 432 TYR F 500 500 ASN E Protein S 4 FT #SUB 432 432 TYR F 501 501 GLU E Protein S 3 FT #SUB 432 432 TYR F 502 502 LEU E Protein S 1 FT #SUB 478 478 PRO F 371 371 THR E Protein A 3 FT #SUB 479 479 TRP F 370 370 VAL E Protein S 2 FT #SUB 479 479 TRP F 371 371 THR E Protein S 7 FT #SUB 486 486 VAL F 398 398 ILE E Protein S 1 FT #SUB 486 486 VAL F 400 400 GLN E Protein S 2 FT #SUB 486 486 VAL F 403 403 TYR E Protein S 1 FT #SUB 487 487 LEU F 400 400 GLN E Protein B 4 FT #SUB 488 488 GLU F 185 185 HIS E Protein S 2 FT #SUB 488 488 GLU F 400 400 GLN E Protein S 6 FT #SUB 488 488 GLU F 403 403 TYR E Protein S 2 FT #SUB 488 488 GLU F 404 404 VAL E Protein S 3 FT #SUB 496 496 GLN F 403 403 TYR E Protein A 5 FT #SUB 497 497 PRO F 403 403 TYR E Protein A 6 FT #SUB 497 497 PRO F 412 412 LEU E Protein B 1 FT #SUB 498 498 LEU F 412 412 LEU E Protein B 1 FT #SUB 498 498 LEU F 431 431 TYR E Protein A 5 FT #SUB 499 499 HIS F 416 416 THR E Protein S 3 FT #SUB 499 499 HIS F 431 431 TYR E Protein B 1 FT #SUB 500 500 ASN F 395 395 PRO E Protein S 2 FT #SUB 500 500 ASN F 431 431 TYR E Protein S 2 FT #SUB 500 500 ASN F 432 432 TYR E Protein B 4 FT #SUB 501 501 GLU F 432 432 TYR E Protein A 2 FT #SUB 416 416 THR F 105 105 ARG G Protein B 1 FT #SUB 417 417 ALA F 102 102 ASP G Protein B 1 FT #SUB 417 417 ALA F 103 103 LYS G Protein B 1 FT #SUB 417 417 ALA F 105 105 ARG G Protein B 3 FT #SUB 418 418 GLY F 102 102 ASP G Protein B 6 FT #SUB 419 419 GLN F 103 103 LYS G Protein S 5 FT #SUB 43 43 SER F 164 164 GLN H Protein S 1 FT #SUB 218 218 LEU F 154 154 ARG H Protein S 3 FT #SUB 218 218 LEU F 161 161 LYS H Protein B 2 FT #SUB 219 219 ASP F 154 154 ARG H Protein S 1 FT #SUB 219 219 ASP F 161 161 LYS H Protein B 4 FT #SUB 220 220 VAL F 161 161 LYS H Protein B 1 FT #SUB 228 228 ASN F 137 137 ALA H Protein S 1 FT #SUB 228 228 ASN F 142 142 ASN H Protein A 4 FT #SUB 233 233 ARG F 142 142 ASN H Protein S 3 FT #HET 31 31 ASP F 51 601 CA F S 2 FT #HET 32 32 ASP F 51 601 CA F A 3 FT #HET 39 39 ALA F 52 602 NAG F B 1 FT #HET 70 70 FGP F 51 601 CA F A 5 FT #HET 154 154 ARG F 29 1 NAG W S 2 FT #HET 197 197 PHE F 29 1 NAG W S 4 FT #HET 198 198 GLY F 29 1 NAG W B 1 FT #HET 204 204 MET F 29 1 NAG W B 1 FT #HET 205 205 GLY F 29 1 NAG W B 1 FT #HET 206 206 ARG F 29 1 NAG W B 10 FT #HET 207 207 ILE F 29 1 NAG W S 2 FT #HET 273 273 ASP F 51 601 CA F S 3 FT #HET 274 274 ASN F 51 601 CA F S 3 FT #HET 299 299 PRO F 53 605 NAG F B 2 FT #HET 300 300 GLU F 53 605 NAG F S 2 FT #HET 308 308 VAL F 52 602 NAG F S 1 FT #HET 406 406 PRO F 31 1 NAG X B 1 FT #HET 409 409 GLN F 31 1 NAG X A 2 FT #HET 410 410 ASP F 31 1 NAG X A 5 FT #MOD 41 41 ASN F 52 602 NAG F S FT #MOD 151 151 ASN F 29 1 NAG W S FT #MOD 264 264 ASN F 53 605 NAG F S FT #MOD 413 413 ASN F 31 1 NAG X S FT DISORDER 1 21 FT DISORDER 185 186 FT DISORDER 502 510 CC Miss-BB 1 CC Miss-SC 1 CC SEQUENCE 478 AA (ATOM); CC PRNALLLLAD DGGFESGAYN NSAIATPHLD ALARRSLLFR NAFTSVSSXS PSRASLLTGL CC PQHQNGMYGL HQDVHHFNSF DKVRSLPLLL SQAGVRTGII GKKHVGPETV YPFDFAYTEE CC NGSVLQVGRN ITRIKLLVRK FLQTQDDRPF FLYVAFHDPH RCGQPQYGTF CEKFGNGESG CC MGRIPDWTPQ AYDPLDVLVP YFVPNTPAAR ADLAAQYTTV GRMDQGVGLV LQELRDAGVL CC NDTLVIFTSD NGIPFPSGRT NLYWPGTAEP LLVSSPEHPK RWGQVSEAYV SLLDLTPTIL CC DWFSIPYPSY AIFGSKTIHL TGRSLLPALE AEPLWATVFG SQSHHEVTMS YPMRSVQHRH CC FRLVHNLNFK MPFPIDQDFY VSPTFQDLLN RTTAGQPTGW YKDLRHYYYR ARWELYDRSR CC DPHETQNLAT DPRFAQLLEM LRDQLAKWQW ETHDPWVCAP DGVLEEKLSP QCQPLHNE CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSAIATPHL CC ATOM ---------------------PRNALLLLADDGGFESGAYNNSAIATPHL CC ***************************** CC SEQRES DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ATOM DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ************************************************** CC SEQRES FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ATOM FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ************************************************** CC SEQRES NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ATOM NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCG--QPQYGTFCEKFGNG CC ********************************** ************** CC SEQRES ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ATOM ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ************************************************** CC SEQRES GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ATOM GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ************************************************** CC SEQRES HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ATOM HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ************************************************** CC SEQRES ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ATOM ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ************************************************** CC SEQRES DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ATOM DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ************************************************** CC SEQRES LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ATOM LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ************************************************** CC SEQRES ELRSHHHHHH CC ATOM E--------- CC * SQ SEQUENCE 510 AA; MW; CN; MSCPVPACCA LLLVLGLCRA RPRNALLLLA DDGGFESGAY NNSAIATPHL DALARRSLLF RNAFTSVSSX SPSRASLLTG LPQHQNGMYG LHQDVHHFNS FDKVRSLPLL LSQAGVRTGI IGKKHVGPET VYPFDFAYTE ENGSVLQVGR NITRIKLLVR KFLQTQDDRP FFLYVAFHDP HRCGHSQPQY GTFCEKFGNG ESGMGRIPDW TPQAYDPLDV LVPYFVPNTP AARADLAAQY TTVGRMDQGV GLVLQELRDA GVLNDTLVIF TSDNGIPFPS GRTNLYWPGT AEPLLVSSPE HPKRWGQVSE AYVSLLDLTP TILDWFSIPY PSYAIFGSKT IHLTGRSLLP ALEAEPLWAT VFGSQSHHEV TMSYPMRSVQ HRHFRLVHNL NFKMPFPIDQ DFYVSPTFQD LLNRTTAGQP TGWYKDLRHY YYRARWELYD RSRDPHETQN LATDPRFAQL LEMLRDQLAK WQWETHDPWV CAPDGVLEEK LSPQCQPLHN ELRSHHHHHH // ID 4MIVG STANDARD; PRT; 510 AA. DT CONVERTED FROM PDB (SEQRES) 4MIV DE N-sulphoglucosamine sulphohydrolase OS Homo sapiens CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.400 CC R-Factor 0.217 FT #SUB 346 346 ARG G 470 470 LYS B Protein S 1 FT #SUB 354 354 ALA G 491 491 LEU B Protein S 1 FT #SUB 355 355 GLU G 492 492 SER B Protein A 4 FT #SUB 357 357 LEU G 473 473 TRP B Protein S 1 FT #SUB 381 381 HIS G 463 463 MET B Protein S 3 FT #SUB 381 381 HIS G 466 466 ASP B Protein S 7 FT #SUB 382 382 ARG G 466 466 ASP B Protein S 6 FT #SUB 384 384 PHE G 459 459 GLN B Protein S 5 FT #SUB 440 440 ASP G 459 459 GLN B Protein S 1 FT #SUB 459 459 GLN G 384 384 PHE B Protein S 3 FT #SUB 460 460 LEU G 459 459 GLN B Protein S 2 FT #SUB 460 460 LEU G 463 463 MET B Protein S 1 FT #SUB 463 463 MET G 381 381 HIS B Protein S 1 FT #SUB 463 463 MET G 460 460 LEU B Protein S 2 FT #SUB 463 463 MET G 463 463 MET B Protein S 4 FT #SUB 463 463 MET G 467 467 GLN B Protein S 1 FT #SUB 466 466 ASP G 381 381 HIS B Protein S 7 FT #SUB 466 466 ASP G 382 382 ARG B Protein S 8 FT #SUB 467 467 GLN G 463 463 MET B Protein S 2 FT #SUB 470 470 LYS G 346 346 ARG B Protein S 1 FT #SUB 473 473 TRP G 357 357 LEU B Protein S 1 FT #SUB 489 489 GLU G 354 354 ALA B Protein B 1 FT #SUB 490 490 LYS G 303 303 LYS B Protein B 3 FT #SUB 490 490 LYS G 353 353 GLU B Protein B 1 FT #SUB 491 491 LEU G 354 354 ALA B Protein B 2 FT #SUB 492 492 SER G 354 354 ALA B Protein S 2 FT #SUB 492 492 SER G 355 355 GLU B Protein S 3 FT #SUB 494 494 GLN G 354 354 ALA B Protein S 1 FT #SUB 22 22 PRO G 455 455 PRO E Protein S 6 FT #SUB 166 166 GLN G 453 453 THR E Protein B 5 FT #SUB 167 167 ASP G 452 452 ALA E Protein B 1 FT #SUB 167 167 ASP G 453 453 THR E Protein B 2 FT #SUB 168 168 ASP G 452 452 ALA E Protein A 4 FT #SUB 168 168 ASP G 453 453 THR E Protein B 1 FT #SUB 168 168 ASP G 461 461 LEU E Protein S 1 FT #SUB 102 102 ASP G 417 417 ALA F Protein B 1 FT #SUB 102 102 ASP G 418 418 GLY F Protein A 6 FT #SUB 103 103 LYS G 417 417 ALA F Protein B 1 FT #SUB 103 103 LYS G 419 419 GLN F Protein S 5 FT #SUB 105 105 ARG G 416 416 THR F Protein S 1 FT #SUB 105 105 ARG G 417 417 ALA F Protein S 3 FT #SUB 89 89 TYR G 95 95 VAL H Protein S 6 FT #SUB 94 94 ASP G 101 101 PHE H Protein B 2 FT #SUB 94 94 ASP G 336 336 PHE H Protein A 8 FT #SUB 95 95 VAL G 89 89 TYR H Protein A 6 FT #SUB 95 95 VAL G 101 101 PHE H Protein B 3 FT #SUB 95 95 VAL G 336 336 PHE H Protein B 1 FT #SUB 97 97 HIS G 99 99 ASN H Protein S 3 FT #SUB 97 97 HIS G 100 100 SER H Protein S 3 FT #SUB 97 97 HIS G 101 101 PHE H Protein S 7 FT #SUB 99 99 ASN G 97 97 HIS H Protein B 3 FT #SUB 100 100 SER G 97 97 HIS H Protein B 3 FT #SUB 101 101 PHE G 94 94 ASP H Protein S 2 FT #SUB 101 101 PHE G 95 95 VAL H Protein S 4 FT #SUB 101 101 PHE G 97 97 HIS H Protein A 8 FT #SUB 185 185 HIS G 488 488 GLU H Protein S 2 FT #SUB 336 336 PHE G 94 94 ASP H Protein S 9 FT #SUB 336 336 PHE G 95 95 VAL H Protein S 3 FT #SUB 367 367 HIS G 367 367 HIS H Protein S 4 FT #SUB 370 370 VAL G 89 89 TYR H Protein S 2 FT #SUB 370 370 VAL G 479 479 TRP H Protein A 5 FT #SUB 371 371 THR G 478 478 PRO H Protein S 3 FT #SUB 371 371 THR G 479 479 TRP H Protein A 10 FT #SUB 373 373 SER G 373 373 SER H Protein S 1 FT #SUB 390 390 LEU G 394 394 MET H Protein B 1 FT #SUB 391 391 ASN G 394 394 MET H Protein A 9 FT #SUB 393 393 LYS G 393 393 LYS H Protein A 3 FT #SUB 393 393 LYS G 394 394 MET H Protein A 4 FT #SUB 393 393 LYS G 432 432 TYR H Protein S 1 FT #SUB 394 394 MET G 390 390 LEU H Protein S 1 FT #SUB 394 394 MET G 391 391 ASN H Protein S 4 FT #SUB 394 394 MET G 393 393 LYS H Protein S 3 FT #SUB 395 395 PRO G 502 502 LEU H Protein S 1 FT #SUB 398 398 ILE G 486 486 VAL H Protein B 1 FT #SUB 398 398 ILE G 498 498 LEU H Protein S 1 FT #SUB 400 400 GLN G 486 486 VAL H Protein A 3 FT #SUB 400 400 GLN G 487 487 LEU H Protein S 5 FT #SUB 400 400 GLN G 488 488 GLU H Protein A 6 FT #SUB 403 403 TYR G 486 486 VAL H Protein S 1 FT #SUB 403 403 TYR G 488 488 GLU H Protein S 2 FT #SUB 403 403 TYR G 496 496 GLN H Protein S 7 FT #SUB 403 403 TYR G 497 497 PRO H Protein S 7 FT #SUB 403 403 TYR G 498 498 LEU H Protein S 2 FT #SUB 412 412 LEU G 499 499 HIS H Protein S 1 FT #SUB 416 416 THR G 499 499 HIS H Protein S 6 FT #SUB 431 431 TYR G 498 498 LEU H Protein S 5 FT #SUB 431 431 TYR G 499 499 HIS H Protein S 1 FT #SUB 431 431 TYR G 500 500 ASN H Protein S 6 FT #SUB 432 432 TYR G 500 500 ASN H Protein S 3 FT #SUB 432 432 TYR G 502 502 LEU H Protein S 2 FT #SUB 478 478 PRO G 371 371 THR H Protein A 3 FT #SUB 479 479 TRP G 370 370 VAL H Protein S 1 FT #SUB 479 479 TRP G 371 371 THR H Protein S 9 FT #SUB 486 486 VAL G 398 398 ILE H Protein S 1 FT #SUB 486 486 VAL G 400 400 GLN H Protein S 2 FT #SUB 486 486 VAL G 403 403 TYR H Protein S 1 FT #SUB 487 487 LEU G 400 400 GLN H Protein B 3 FT #SUB 488 488 GLU G 185 185 HIS H Protein S 2 FT #SUB 488 488 GLU G 400 400 GLN H Protein S 5 FT #SUB 488 488 GLU G 403 403 TYR H Protein S 2 FT #SUB 488 488 GLU G 404 404 VAL H Protein S 3 FT #SUB 496 496 GLN G 403 403 TYR H Protein A 8 FT #SUB 497 497 PRO G 403 403 TYR H Protein A 6 FT #SUB 497 497 PRO G 412 412 LEU H Protein B 1 FT #SUB 498 498 LEU G 398 398 ILE H Protein S 2 FT #SUB 498 498 LEU G 403 403 TYR H Protein S 2 FT #SUB 498 498 LEU G 431 431 TYR H Protein A 5 FT #SUB 499 499 HIS G 412 412 LEU H Protein S 2 FT #SUB 499 499 HIS G 416 416 THR H Protein S 3 FT #SUB 499 499 HIS G 431 431 TYR H Protein B 1 FT #SUB 500 500 ASN G 395 395 PRO H Protein S 1 FT #SUB 500 500 ASN G 431 431 TYR H Protein A 4 FT #SUB 500 500 ASN G 432 432 TYR H Protein B 4 FT #SUB 501 501 GLU G 432 432 TYR H Protein B 2 FT #HET 31 31 ASP G 54 601 CA G S 2 FT #HET 32 32 ASP G 54 601 CA G A 3 FT #HET 39 39 ALA G 33 1 NAG Y B 1 FT #HET 59 59 LEU G 33 1 NAG Y S 1 FT #HET 61 61 ARG G 33 1 NAG Y S 9 FT #HET 70 70 FGP G 54 601 CA G A 4 FT #HET 147 147 GLN G 55 604 NAG G S 5 FT #HET 153 153 THR G 55 604 NAG G S 2 FT #HET 154 154 ARG G 55 604 NAG G A 5 FT #HET 197 197 PHE G 55 604 NAG G S 2 FT #HET 204 204 MET G 55 604 NAG G B 1 FT #HET 205 205 GLY G 55 604 NAG G B 2 FT #HET 206 206 ARG G 55 604 NAG G B 9 FT #HET 273 273 ASP G 54 601 CA G S 3 FT #HET 274 274 ASN G 54 601 CA G S 3 FT #HET 299 299 PRO G 56 605 NAG G B 1 FT #MOD 41 41 ASN G 33 1 NAG Y S FT #MOD 151 151 ASN G 55 604 NAG G S FT #MOD 264 264 ASN G 56 605 NAG G S FT DISORDER 1 21 FT DISORDER 143 143 FT DISORDER 502 510 CC SEQUENCE 479 AA (ATOM); CC PRNALLLLAD DGGFESGAYN NSAIATPHLD ALARRSLLFR NAFTSVSSXS PSRASLLTGL CC PQHQNGMYGL HQDVHHFNSF DKVRSLPLLL SQAGVRTGII GKKHVGPETV YPFDFAYTEE CC NSVLQVGRNI TRIKLLVRKF LQTQDDRPFF LYVAFHDPHR CGHSQPQYGT FCEKFGNGES CC GMGRIPDWTP QAYDPLDVLV PYFVPNTPAA RADLAAQYTT VGRMDQGVGL VLQELRDAGV CC LNDTLVIFTS DNGIPFPSGR TNLYWPGTAE PLLVSSPEHP KRWGQVSEAY VSLLDLTPTI CC LDWFSIPYPS YAIFGSKTIH LTGRSLLPAL EAEPLWATVF GSQSHHEVTM SYPMRSVQHR CC HFRLVHNLNF KMPFPIDQDF YVSPTFQDLL NRTTAGQPTG WYKDLRHYYY RARWELYDRS CC RDPHETQNLA TDPRFAQLLE MLRDQLAKWQ WETHDPWVCA PDGVLEEKLS PQCQPLHNE CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSAIATPHL CC ATOM ---------------------PRNALLLLADDGGFESGAYNNSAIATPHL CC ***************************** CC SEQRES DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ATOM DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ************************************************** CC SEQRES FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ATOM FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEEN-SVLQVGR CC ****************************************** ******* CC SEQRES NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ATOM NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ************************************************** CC SEQRES ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ATOM ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ************************************************** CC SEQRES GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ATOM GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ************************************************** CC SEQRES HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ATOM HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ************************************************** CC SEQRES ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ATOM ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ************************************************** CC SEQRES DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ATOM DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ************************************************** CC SEQRES LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ATOM LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ************************************************** CC SEQRES ELRSHHHHHH CC ATOM E--------- CC * SQ SEQUENCE 510 AA; MW; CN; MSCPVPACCA LLLVLGLCRA RPRNALLLLA DDGGFESGAY NNSAIATPHL DALARRSLLF RNAFTSVSSX SPSRASLLTG LPQHQNGMYG LHQDVHHFNS FDKVRSLPLL LSQAGVRTGI IGKKHVGPET VYPFDFAYTE ENGSVLQVGR NITRIKLLVR KFLQTQDDRP FFLYVAFHDP HRCGHSQPQY GTFCEKFGNG ESGMGRIPDW TPQAYDPLDV LVPYFVPNTP AARADLAAQY TTVGRMDQGV GLVLQELRDA GVLNDTLVIF TSDNGIPFPS GRTNLYWPGT AEPLLVSSPE HPKRWGQVSE AYVSLLDLTP TILDWFSIPY PSYAIFGSKT IHLTGRSLLP ALEAEPLWAT VFGSQSHHEV TMSYPMRSVQ HRHFRLVHNL NFKMPFPIDQ DFYVSPTFQD LLNRTTAGQP TGWYKDLRHY YYRARWELYD RSRDPHETQN LATDPRFAQL LEMLRDQLAK WQWETHDPWV CAPDGVLEEK LSPQCQPLHN ELRSHHHHHH // ID 4MIVH STANDARD; PRT; 510 AA. DT CONVERTED FROM PDB (SEQRES) 4MIV DE N-sulphoglucosamine sulphohydrolase OS Homo sapiens CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.400 CC R-Factor 0.217 FT #SUB 137 137 ALA H 228 228 ASN F Protein B 1 FT #SUB 142 142 ASN H 228 228 ASN F Protein S 4 FT #SUB 142 142 ASN H 233 233 ARG F Protein B 3 FT #SUB 154 154 ARG H 218 218 LEU F Protein S 3 FT #SUB 154 154 ARG H 219 219 ASP F Protein S 1 FT #SUB 161 161 LYS H 218 218 LEU F Protein S 2 FT #SUB 161 161 LYS H 219 219 ASP F Protein S 4 FT #SUB 161 161 LYS H 220 220 VAL F Protein S 1 FT #SUB 164 164 GLN H 43 43 SER F Protein S 1 FT #SUB 89 89 TYR H 95 95 VAL G Protein S 6 FT #SUB 89 89 TYR H 370 370 VAL G Protein S 2 FT #SUB 94 94 ASP H 101 101 PHE G Protein B 2 FT #SUB 94 94 ASP H 336 336 PHE G Protein A 9 FT #SUB 95 95 VAL H 89 89 TYR G Protein A 6 FT #SUB 95 95 VAL H 101 101 PHE G Protein B 4 FT #SUB 95 95 VAL H 336 336 PHE G Protein A 3 FT #SUB 97 97 HIS H 99 99 ASN G Protein S 3 FT #SUB 97 97 HIS H 100 100 SER G Protein S 3 FT #SUB 97 97 HIS H 101 101 PHE G Protein S 8 FT #SUB 99 99 ASN H 97 97 HIS G Protein B 3 FT #SUB 100 100 SER H 97 97 HIS G Protein B 3 FT #SUB 101 101 PHE H 94 94 ASP G Protein S 2 FT #SUB 101 101 PHE H 95 95 VAL G Protein S 3 FT #SUB 101 101 PHE H 97 97 HIS G Protein S 7 FT #SUB 185 185 HIS H 488 488 GLU G Protein S 2 FT #SUB 336 336 PHE H 94 94 ASP G Protein S 8 FT #SUB 336 336 PHE H 95 95 VAL G Protein S 1 FT #SUB 367 367 HIS H 367 367 HIS G Protein S 4 FT #SUB 370 370 VAL H 479 479 TRP G Protein B 1 FT #SUB 371 371 THR H 478 478 PRO G Protein S 3 FT #SUB 371 371 THR H 479 479 TRP G Protein A 9 FT #SUB 373 373 SER H 373 373 SER G Protein S 1 FT #SUB 390 390 LEU H 394 394 MET G Protein B 1 FT #SUB 391 391 ASN H 394 394 MET G Protein A 4 FT #SUB 393 393 LYS H 393 393 LYS G Protein A 3 FT #SUB 393 393 LYS H 394 394 MET G Protein A 3 FT #SUB 394 394 MET H 390 390 LEU G Protein S 1 FT #SUB 394 394 MET H 391 391 ASN G Protein S 9 FT #SUB 394 394 MET H 393 393 LYS G Protein S 4 FT #SUB 395 395 PRO H 500 500 ASN G Protein S 1 FT #SUB 398 398 ILE H 486 486 VAL G Protein B 1 FT #SUB 398 398 ILE H 498 498 LEU G Protein S 2 FT #SUB 400 400 GLN H 486 486 VAL G Protein A 2 FT #SUB 400 400 GLN H 487 487 LEU G Protein S 3 FT #SUB 400 400 GLN H 488 488 GLU G Protein B 5 FT #SUB 403 403 TYR H 486 486 VAL G Protein S 1 FT #SUB 403 403 TYR H 488 488 GLU G Protein S 2 FT #SUB 403 403 TYR H 496 496 GLN G Protein S 8 FT #SUB 403 403 TYR H 497 497 PRO G Protein S 6 FT #SUB 403 403 TYR H 498 498 LEU G Protein S 2 FT #SUB 404 404 VAL H 488 488 GLU G Protein S 3 FT #SUB 412 412 LEU H 497 497 PRO G Protein S 1 FT #SUB 412 412 LEU H 499 499 HIS G Protein S 2 FT #SUB 416 416 THR H 499 499 HIS G Protein S 3 FT #SUB 431 431 TYR H 498 498 LEU G Protein S 5 FT #SUB 431 431 TYR H 499 499 HIS G Protein S 1 FT #SUB 431 431 TYR H 500 500 ASN G Protein S 4 FT #SUB 432 432 TYR H 393 393 LYS G Protein S 1 FT #SUB 432 432 TYR H 500 500 ASN G Protein S 4 FT #SUB 432 432 TYR H 501 501 GLU G Protein S 2 FT #SUB 478 478 PRO H 371 371 THR G Protein A 3 FT #SUB 479 479 TRP H 370 370 VAL G Protein S 5 FT #SUB 479 479 TRP H 371 371 THR G Protein A 10 FT #SUB 486 486 VAL H 398 398 ILE G Protein S 1 FT #SUB 486 486 VAL H 400 400 GLN G Protein S 3 FT #SUB 486 486 VAL H 403 403 TYR G Protein S 1 FT #SUB 487 487 LEU H 400 400 GLN G Protein A 5 FT #SUB 488 488 GLU H 185 185 HIS G Protein S 2 FT #SUB 488 488 GLU H 400 400 GLN G Protein S 6 FT #SUB 488 488 GLU H 403 403 TYR G Protein S 2 FT #SUB 496 496 GLN H 403 403 TYR G Protein A 7 FT #SUB 497 497 PRO H 403 403 TYR G Protein A 7 FT #SUB 498 498 LEU H 398 398 ILE G Protein S 1 FT #SUB 498 498 LEU H 403 403 TYR G Protein S 2 FT #SUB 498 498 LEU H 431 431 TYR G Protein A 5 FT #SUB 499 499 HIS H 412 412 LEU G Protein S 1 FT #SUB 499 499 HIS H 416 416 THR G Protein S 6 FT #SUB 499 499 HIS H 431 431 TYR G Protein B 1 FT #SUB 500 500 ASN H 431 431 TYR G Protein A 6 FT #SUB 500 500 ASN H 432 432 TYR G Protein B 3 FT #SUB 502 502 LEU H 395 395 PRO G Protein S 1 FT #SUB 502 502 LEU H 432 432 TYR G Protein S 2 FT #HET 31 31 ASP H 57 601 CA H S 2 FT #HET 32 32 ASP H 57 601 CA H S 2 FT #HET 70 70 FGP H 57 601 CA H A 5 FT #HET 147 147 GLN H 35 1 NAG Z S 6 FT #HET 154 154 ARG H 35 1 NAG Z S 3 FT #HET 197 197 PHE H 35 1 NAG Z S 6 FT #HET 198 198 GLY H 35 1 NAG Z B 1 FT #HET 204 204 MET H 35 1 NAG Z B 1 FT #HET 205 205 GLY H 35 1 NAG Z B 2 FT #HET 206 206 ARG H 35 1 NAG Z B 8 FT #HET 207 207 ILE H 35 1 NAG Z S 1 FT #HET 273 273 ASP H 57 601 CA H S 3 FT #HET 274 274 ASN H 57 601 CA H S 3 FT #HET 406 406 PRO H 58 606 NAG H B 1 FT #HET 409 409 GLN H 58 606 NAG H A 2 FT #HET 410 410 ASP H 58 606 NAG H A 3 FT #MOD 151 151 ASN H 35 1 NAG Z S FT #MOD 264 264 ASN H 37 1 NAG a S FT #MOD 413 413 ASN H 58 606 NAG H S FT DISORDER 1 21 FT DISORDER 504 510 CC Miss-BB 1 CC Miss-SC 1 CC SEQUENCE 482 AA (ATOM); CC PRNALLLLAD DGGFESGAYN NSAIATPHLD ALARRSLLFR NAFTSVSSXS PSRASLLTGL CC PQHQNGMYGL HQDVHHFNSF DKVRSLPLLL SQAGVRTGII GKKHVGPETV YPFDFAYTEE CC NGSVLQVGRN ITRIKLLVRK FLQTQDDRPF FLYVAFHDPH RCGHSQPQYG TFCEKFGNGE CC SGMGRIPDWT PQAYDPLDVL VPYFVPNTPA ARADLAAQYT TVGRMDQGVG LVLQELRDAG CC VLNDTLVIFT SDNGIPFPSG RTNLYWPGTA EPLLVSSPEH PKRWGQVSEA YVSLLDLTPT CC ILDWFSIPYP SYAIFGSKTI HLTGRSLLPA LEAEPLWATV FGSQSHHEVT MSYPMRSVQH CC RHFRLVHNLN FKMPFPIDQD FYVSPTFQDL LNRTTAGQPT GWYKDLRHYY YRARWELYDR CC SRDPHETQNL ATDPRFAQLL EMLRDQLAKW QWETHDPWVC APDGVLEEKL SPQCQPLHNE CC LR CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSAIATPHL CC ATOM ---------------------PRNALLLLADDGGFESGAYNNSAIATPHL CC ***************************** CC SEQRES DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ATOM DALARRSLLFRNAFTSVSSXSPSRASLLTGLPQHQNGMYGLHQDVHHFNS CC ************************************************** CC SEQRES FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ATOM FDKVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGR CC ************************************************** CC SEQRES NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ATOM NITRIKLLVRKFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNG CC ************************************************** CC SEQRES ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ATOM ESGMGRIPDWTPQAYDPLDVLVPYFVPNTPAARADLAAQYTTVGRMDQGV CC ************************************************** CC SEQRES GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ATOM GLVLQELRDAGVLNDTLVIFTSDNGIPFPSGRTNLYWPGTAEPLLVSSPE CC ************************************************** CC SEQRES HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ATOM HPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYAIFGSKTIHLTGRSLLP CC ************************************************** CC SEQRES ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ATOM ALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNFKMPFPIDQ CC ************************************************** CC SEQRES DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ATOM DFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN CC ************************************************** CC SEQRES LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ATOM LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHN CC ************************************************** CC SEQRES ELRSHHHHHH CC ATOM ELR------- CC *** SQ SEQUENCE 510 AA; MW; CN; MSCPVPACCA LLLVLGLCRA RPRNALLLLA DDGGFESGAY NNSAIATPHL DALARRSLLF RNAFTSVSSX SPSRASLLTG LPQHQNGMYG LHQDVHHFNS FDKVRSLPLL LSQAGVRTGI IGKKHVGPET VYPFDFAYTE ENGSVLQVGR NITRIKLLVR KFLQTQDDRP FFLYVAFHDP HRCGHSQPQY GTFCEKFGNG ESGMGRIPDW TPQAYDPLDV LVPYFVPNTP AARADLAAQY TTVGRMDQGV GLVLQELRDA GVLNDTLVIF TSDNGIPFPS GRTNLYWPGT AEPLLVSSPE HPKRWGQVSE AYVSLLDLTP TILDWFSIPY PSYAIFGSKT IHLTGRSLLP ALEAEPLWAT VFGSQSHHEV TMSYPMRSVQ HRHFRLVHNL NFKMPFPIDQ DFYVSPTFQD LLNRTTAGQP TGWYKDLRHY YYRARWELYD RSRDPHETQN LATDPRFAQL LEMLRDQLAK WQWETHDPWV CAPDGVLEEK LSPQCQPLHN ELRSHHHHHH //