ID 4YD9A STANDARD; PRT; 2000 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 1070 1070 ALA A 871 2871 PHE E Protein B 1 FT #SUB 1071 1071 ASN A 871 2871 PHE E Protein S 2 FT #SUB 1071 1071 ASN A 901 2901 LEU E Protein B 1 FT #SUB 1071 1071 ASN A 902 2902 ILE E Protein B 4 FT #SUB 1071 1071 ASN A 903 2903 TYR E Protein A 6 FT #SUB 1073 1073 ALA A 900 2900 SER E Protein B 1 FT #SUB 1073 1073 ALA A 901 2901 LEU E Protein B 1 FT #SUB 1074 1074 ILE A 870 2870 LEU E Protein S 1 FT #SUB 1074 1074 ILE A 871 2871 PHE E Protein S 2 FT #SUB 1074 1074 ILE A 901 2901 LEU E Protein A 6 FT #SUB 1075 1075 GLU A 898 2898 LYS E Protein S 1 FT #SUB 1075 1075 GLU A 899 2899 PRO E Protein S 3 FT #SUB 1075 1075 GLU A 900 2900 SER E Protein S 4 FT #SUB 1075 1075 GLU A 901 2901 LEU E Protein A 3 FT #SUB 1078 1078 ARG A 870 2870 LEU E Protein S 2 FT #SUB 1078 1078 ARG A 872 2872 GLU E Protein S 2 FT #SUB 1078 1078 ARG A 875 2875 SER E Protein S 1 FT #SUB 1078 1078 ARG A 876 2876 LYS E Protein S 1 FT #SUB 1078 1078 ARG A 877 2877 ILE E Protein S 7 FT #SUB 1103 1103 PHE A 871 2871 PHE E Protein S 7 FT #SUB 1147 1147 MET A 849 2849 ASP E Protein S 2 FT #SUB 1164 1164 ASP A 744 2744 PHE E Protein S 3 FT #SUB 1187 1187 LYS A 809 2809 LEU E Protein S 1 FT #SUB 1187 1187 LYS A 897 2897 PRO E Protein S 3 FT #SUB 1188 1188 PHE A 809 2809 LEU E Protein B 1 FT #SUB 1189 1189 ASP A 809 2809 LEU E Protein B 1 FT #SUB 1189 1189 ASP A 849 2849 ASP E Protein B 1 FT #SUB 1189 1189 ASP A 850 2850 ARG E Protein B 1 FT #SUB 1189 1189 ASP A 851 2851 ASN E Protein A 11 FT #SUB 1191 1191 PRO A 849 2849 ASP E Protein S 5 FT #SUB 1206 1206 HIS A 457 2457 LYS E Protein S 1 FT #SUB 1207 1207 TYR A 739 2739 LEU E Protein A 4 FT #SUB 1208 1208 THR A 743 2743 ALA E Protein B 2 FT #SUB 1209 1209 ASP A 743 2743 ALA E Protein B 2 FT #SUB 1210 1210 LYS A 743 2743 ALA E Protein A 6 FT #SUB 1210 1210 LYS A 744 2744 PHE E Protein S 15 FT #SUB 1211 1211 TYR A 739 2739 LEU E Protein S 9 FT #SUB 1211 1211 TYR A 740 2740 ASP E Protein S 1 FT #SUB 1212 1212 HIS A 740 2740 ASP E Protein A 3 FT #SUB 1212 1212 HIS A 744 2744 PHE E Protein S 2 FT #SUB 1213 1213 VAL A 740 2740 ASP E Protein A 2 FT #SUB 1230 1230 LEU A 847 2847 HIS E Protein S 3 FT #SUB 1232 1232 THR A 738 2738 ALA E Protein B 1 FT #SUB 1232 1232 THR A 740 2740 ASP E Protein B 1 FT #SUB 1232 1232 THR A 741 2741 GLN E Protein A 5 FT #SUB 1233 1233 SER A 738 2738 ALA E Protein A 5 FT #SUB 1233 1233 SER A 849 2849 ASP E Protein S 1 FT #SUB 1234 1234 VAL A 736 2736 TYR E Protein B 1 FT #SUB 1234 1234 VAL A 737 2737 CYS E Protein B 2 FT #SUB 1234 1234 VAL A 738 2738 ALA E Protein B 4 FT #SUB 1234 1234 VAL A 739 2739 LEU E Protein B 4 FT #SUB 1235 1235 ILE A 736 2736 TYR E Protein A 4 FT #SUB 1236 1236 TYR A 736 2736 TYR E Protein A 5 FT #SUB 1238 1238 PRO A 736 2736 TYR E Protein S 1 FT #SUB 1245 1245 GLU A 729 2729 LYS E Protein A 2 FT #SUB 1481 1481 GLU A 459 2459 ASP E Protein S 1 FT #SUB 1483 1483 ASN A 487 2487 VAL E Protein B 3 FT #SUB 1483 1483 ASN A 488 2488 ARG E Protein A 8 FT #SUB 1483 1483 ASN A 490 2490 PRO E Protein S 1 FT #SUB 1484 1484 CYS A 486 2486 ILE E Protein B 3 FT #SUB 1484 1484 CYS A 487 2487 VAL E Protein B 4 FT #SUB 1485 1485 ALA A 486 2486 ILE E Protein B 1 FT #SUB 1486 1486 LEU A 458 2458 PHE E Protein S 3 FT #SUB 1486 1486 LEU A 486 2486 ILE E Protein A 9 FT #SUB 1486 1486 LEU A 487 2487 VAL E Protein S 1 FT #SUB 1486 1486 LEU A 488 2488 ARG E Protein S 1 FT #SUB 1487 1487 PRO A 486 2486 ILE E Protein S 6 FT #SUB 1490 1490 ASN A 461 2461 HIS E Protein S 6 FT #SUB 1558 1558 LEU A 440 2440 ASP E Protein S 1 FT #SUB 1603 1603 PHE A 399 2399 LEU E Protein B 1 FT #SUB 1604 1604 ASP A 399 2399 LEU E Protein B 1 FT #SUB 1604 1604 ASP A 442 2442 LEU E Protein A 5 FT #SUB 1605 1605 ARG A 440 2440 ASP E Protein B 1 FT #SUB 1606 1606 LEU A 440 2440 ASP E Protein A 3 FT #SUB 1622 1622 GLN A 326 2326 ILE E Protein B 1 FT #SUB 1648 1648 PRO A 327 2327 GLU E Protein B 2 FT #SUB 1649 1649 THR A 325 2325 ALA E Protein S 2 FT #SUB 1649 1649 THR A 327 2327 GLU E Protein A 3 FT #SUB 1650 1650 ILE A 323 2323 ASN E Protein B 1 FT #SUB 1650 1650 ILE A 324 2324 CYS E Protein B 1 FT #SUB 1650 1650 ILE A 325 2325 ALA E Protein B 2 FT #SUB 1650 1650 ILE A 326 2326 ILE E Protein A 4 FT #SUB 1650 1650 ILE A 327 2327 GLU E Protein A 3 FT #SUB 1651 1651 ILE A 323 2323 ASN E Protein B 2 FT #SUB 1652 1652 PHE A 323 2323 ASN E Protein A 11 FT #SUB 1654 1654 PRO A 323 2323 ASN E Protein S 1 FT #SUB 416 416 ASN A 276 3196 THR F Protein S 10 FT #SUB 419 419 THR A 279 3199 GLN F Protein S 1 FT #SUB 1350 1350 GLU A 354 3274 ARG F Protein S 2 FT #SUB 1533 1533 ALA A 354 3274 ARG F Protein B 2 FT #SUB 1535 1535 LEU A 352 3272 HIS F Protein S 5 FT #SUB 1539 1539 GLN A 352 3272 HIS F Protein S 1 FT #SUB 1543 1543 ILE A 352 3272 HIS F Protein S 3 FT #SUB 388 388 GLY A 136 2136 THR Z Protein B 5 FT #SUB 389 389 THR A 136 2136 THR Z Protein A 8 FT #SUB 389 389 THR A 138 2138 LYS Z Protein S 1 FT #SUB 390 390 LEU A 136 2136 THR Z Protein B 1 FT #SUB 391 391 MET A 138 2138 LYS Z Protein S 2 FT #SUB 328 328 LYS A 1571 1571 TYR b Protein S 8 FT #SUB 328 328 LYS A 1631 1631 GLU b Protein S 4 FT #SUB 329 329 ARG A 1574 1574 VAL b Protein S 3 FT #SUB 329 329 ARG A 1582 1582 ASN b Protein A 8 FT #SUB 329 329 ARG A 1583 1583 CYS b Protein A 14 FT #SUB 329 329 ARG A 1584 1584 GLY b Protein A 3 FT #SUB 329 329 ARG A 1585 1585 ASN b Protein S 3 FT #SUB 331 331 SER A 1581 1581 GLU b Protein A 2 FT #SUB 331 331 SER A 1582 1582 ASN b Protein A 4 FT #SUB 332 332 ASP A 1581 1581 GLU b Protein B 1 FT #SUB 335 335 ASP A 1578 1578 ILE b Protein S 3 FT #SUB 335 335 ASP A 1579 1579 GLY b Protein S 4 FT #SUB 472 472 ASP A 1638 1638 SER b Protein S 2 FT #SUB 472 472 ASP A 1639 1639 LYS b Protein S 3 FT #SUB 472 472 ASP A 1640 1640 ILE b Protein S 3 FT #SUB 473 473 ALA A 1639 1639 LYS b Protein A 3 FT #SUB 1362 1362 ALA A 1378 1378 PHE b Protein S 1 FT #SUB 1365 1365 TYR A 1392 1392 ASP b Protein S 1 FT #SUB 1365 1365 TYR A 1437 1437 ARG b Protein S 17 FT #SUB 1372 1372 ILE A 1390 1390 ASP b Protein A 4 FT #SUB 1372 1372 ILE A 1391 1391 ARG b Protein S 1 FT #SUB 1372 1372 ILE A 1438 1438 ASP b Protein S 3 FT #SUB 1390 1390 ASP A 1372 1372 ILE b Protein S 6 FT #SUB 1437 1437 ARG A 1365 1365 TYR b Protein S 16 FT #SUB 1438 1438 ASP A 1372 1372 ILE b Protein S 4 FT #SUB 1571 1571 TYR A 328 328 LYS b Protein S 7 FT #SUB 1574 1574 VAL A 329 329 ARG b Protein A 4 FT #SUB 1578 1578 ILE A 335 335 ASP b Protein A 3 FT #SUB 1579 1579 GLY A 335 335 ASP b Protein B 5 FT #SUB 1581 1581 GLU A 330 330 GLN b Protein S 4 FT #SUB 1581 1581 GLU A 331 331 SER b Protein S 3 FT #SUB 1581 1581 GLU A 332 332 ASP b Protein S 1 FT #SUB 1582 1582 ASN A 329 329 ARG b Protein B 4 FT #SUB 1582 1582 ASN A 330 330 GLN b Protein A 3 FT #SUB 1582 1582 ASN A 331 331 SER b Protein B 4 FT #SUB 1583 1583 CYS A 329 329 ARG b Protein B 15 FT #SUB 1583 1583 CYS A 330 330 GLN b Protein A 2 FT #SUB 1584 1584 GLY A 329 329 ARG b Protein B 8 FT #SUB 1585 1585 ASN A 329 329 ARG b Protein S 2 FT #SUB 1631 1631 GLU A 328 328 LYS b Protein S 4 FT #SUB 1638 1638 SER A 472 472 ASP b Protein S 1 FT #SUB 1639 1639 LYS A 472 472 ASP b Protein S 2 FT #SUB 1639 1639 LYS A 474 474 LEU b Protein S 1 FT #SUB 106 106 GLN A 609 2609 ALA c Protein B 1 FT #SUB 107 107 HIS A 625 2625 LEU c Protein S 1 FT #SUB 108 108 PRO A 626 2626 ARG c Protein S 4 FT #SUB 109 109 LEU A 626 2626 ARG c Protein S 3 FT #SUB 109 109 LEU A 635 2635 TYR c Protein S 1 FT #SUB 109 109 LEU A 636 2636 THR c Protein S 1 FT #SUB 111 111 ILE A 637 2637 VAL c Protein S 1 FT #SUB 116 116 LYS A 692 2692 LYS c Protein S 3 FT #SUB 116 116 LYS A 693 2693 TYR c Protein S 2 FT #SUB 117 117 LYS A 632 2632 GLU c Protein S 3 FT #SUB 117 117 LYS A 634 2634 THR c Protein A 6 FT #SUB 117 117 LYS A 693 2693 TYR c Protein S 2 FT #SUB 124 124 TYR A 610 2610 ASP c Protein S 3 FT #SUB 136 136 ALA A 619 2619 VAL c Protein S 1 FT #SUB 137 137 ARG A 610 2610 ASP c Protein B 1 FT #SUB 138 138 ALA A 612 2612 TYR c Protein S 3 FT #SUB 138 138 ALA A 619 2619 VAL c Protein A 2 FT #SUB 189 189 ARG A 612 2612 TYR c Protein S 7 FT #SUB 189 189 ARG A 617 2617 ASP c Protein A 5 FT #SUB 190 190 PHE A 617 2617 ASP c Protein A 2 FT #SUB 190 190 PHE A 618 2618 SER c Protein S 1 FT #SUB 190 190 PHE A 619 2619 VAL c Protein S 3 FT #SUB 191 191 THR A 617 2617 ASP c Protein S 1 FT #HET 41 41 HIS A 131 5001 CUO A S 7 FT #HET 58 58 CYS A 131 5001 CUO A S 1 FT #HET 60 60 HIS A 131 5001 CUO A S 6 FT #HET 69 69 HIS A 131 5001 CUO A S 9 FT #HET 179 179 HIS A 131 5001 CUO A S 4 FT #HET 183 183 HIS A 131 5001 CUO A S 5 FT #HET 206 206 PHE A 131 5001 CUO A S 5 FT #HET 210 210 HIS A 131 5001 CUO A S 8 FT #HET 313 313 THR A 1 1 NAG e S 3 FT #HET 389 389 THR A 1 1 NAG e S 2 FT #HET 391 391 MET A 1 1 NAG e S 2 FT #HET 391 391 MET A 2 2 NAG e S 1 FT #HET 462 462 HIS A 132 5002 CUO A S 4 FT #HET 480 480 CYS A 132 5002 CUO A S 1 FT #HET 482 482 HIS A 132 5002 CUO A S 7 FT #HET 487 487 PHE A 132 5002 CUO A S 3 FT #HET 491 491 HIS A 132 5002 CUO A S 9 FT #HET 603 603 HIS A 132 5002 CUO A S 7 FT #HET 607 607 HIS A 132 5002 CUO A S 7 FT #HET 634 634 HIS A 132 5002 CUO A S 7 FT #HET 740 740 THR A 3 1 NAG f S 1 FT #HET 763 763 LEU A 132 5002 CUO A S 1 FT #HET 804 804 ASP A 3 1 NAG f S 8 FT #HET 810 810 LEU A 3 1 NAG f S 2 FT #HET 877 877 HIS A 133 5003 CUO A S 7 FT #HET 895 895 CYS A 133 5003 CUO A S 1 FT #HET 897 897 HIS A 133 5003 CUO A S 3 FT #HET 906 906 HIS A 133 5003 CUO A S 7 FT #HET 1015 1015 HIS A 133 5003 CUO A S 5 FT #HET 1019 1019 HIS A 133 5003 CUO A S 8 FT #HET 1042 1042 PHE A 133 5003 CUO A S 4 FT #HET 1046 1046 HIS A 133 5003 CUO A S 9 FT #HET 1294 1294 HIS A 134 5004 CUO A S 9 FT #HET 1298 1298 ALA A 6 2 NAG g B 1 FT #HET 1299 1299 GLN A 5 1 NAG g A 11 FT #HET 1300 1300 CYS A 5 1 NAG g B 1 FT #HET 1301 1301 PRO A 5 1 NAG g S 3 FT #HET 1308 1308 VAL A 6 2 NAG g S 1 FT #HET 1312 1312 CYS A 134 5004 CUO A S 1 FT #HET 1314 1314 HIS A 134 5004 CUO A S 6 FT #HET 1319 1319 PHE A 134 5004 CUO A S 1 FT #HET 1323 1323 HIS A 134 5004 CUO A S 9 FT #HET 1427 1427 HIS A 134 5004 CUO A S 6 FT #HET 1431 1431 HIS A 134 5004 CUO A S 7 FT #HET 1454 1454 PHE A 134 5004 CUO A S 3 FT #HET 1458 1458 HIS A 134 5004 CUO A S 8 FT #HET 1494 1494 ARG A 5 1 NAG g S 5 FT #HET 1500 1500 THR A 5 1 NAG g B 3 FT #HET 1563 1563 LYS A 10 2 NAG h S 2 FT #HET 1564 1564 ALA A 9 1 NAG h S 1 FT #HET 1634 1634 ALA A 9 1 NAG h S 1 FT #HET 1638 1638 SER A 9 1 NAG h S 3 FT #HET 1641 1641 THR A 10 2 NAG h S 1 FT #HET 1644 1644 ILE A 10 2 NAG h S 1 FT #MOD 387 387 ASN A 1 1 NAG e S FT #MOD 806 806 ASN A 3 1 NAG f S FT #MOD 1498 1498 ASN A 5 1 NAG g S FT #MOD 1636 1636 ASN A 9 1 NAG h S FT DISORDER 1657 2000 CC SEQUENCE 1656 AA (ATOM); CC NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH CC GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK CC NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN CC PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF CC HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA CC RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH CC EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS CC SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC CC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET CC KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE CC VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA CC NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV CC EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE CC ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR CC KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA CC TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF CC SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA CC LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP CC MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN CC RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM CC DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR CC KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP CC HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG CC KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH CC SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST CC ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH CC GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL CC NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSD CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ATOM NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ************************************************** CC SEQRES DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ATOM DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ************************************************** CC SEQRES LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ATOM LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ************************************************** CC SEQRES QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ATOM QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ************************************************** CC SEQRES SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ATOM SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ************************************************** CC SEQRES TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ATOM TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ************************************************** CC SEQRES RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ATOM RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ************************************************** CC SEQRES PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ATOM PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ************************************************** CC SEQRES VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ATOM VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ************************************************** CC SEQRES SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ATOM SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ************************************************** CC SEQRES ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ATOM ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ************************************************** CC SEQRES EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ATOM EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ************************************************** CC SEQRES VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ATOM VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ************************************************** CC SEQRES RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ATOM RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ************************************************** CC SEQRES YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ATOM YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ************************************************** CC SEQRES GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ATOM GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ************************************************** CC SEQRES DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ATOM DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ************************************************** CC SEQRES MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ATOM MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ************************************************** CC SEQRES TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ATOM TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ************************************************** CC SEQRES NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ATOM NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ************************************************** CC SEQRES QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ATOM QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ************************************************** CC SEQRES RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ATOM RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ************************************************** CC SEQRES VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ATOM VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ************************************************** CC SEQRES LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ATOM LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ************************************************** CC SEQRES DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ATOM DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ************************************************** CC SEQRES SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ATOM SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ************************************************** CC SEQRES PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ATOM PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ************************************************** CC SEQRES PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ATOM PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ************************************************** CC SEQRES RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ATOM RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ************************************************** CC SEQRES DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ATOM DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ************************************************** CC SEQRES ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ATOM ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ************************************************** CC SEQRES DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ATOM DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ************************************************** CC SEQRES WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ATOM WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ************************************************** CC SEQRES IFVPSDVEFEEDTWRDVVTSANRIRRNLKDLSKEDMFSLRAAFKRMTDDG CC ATOM IFVPSD-------------------------------------------- CC ****** CC SEQRES RYEEIAAFHGLPAQCPNADGSNIHTCCLHGMPTFPHWHRLYLSLVENELL CC ATOM -------------------------------------------------- CC CC SEQRES ARGSDVAVPYWDWIEPFDSLPGLISDETYKHPKTNEDIENPFHHGKISFA CC ATOM -------------------------------------------------- CC CC SEQRES DAVTVRKPRDQLFNNRYLYEHALFAFEHTDFCDFEVHFEVLHNSIHSWIG CC ATOM -------------------------------------------------- CC CC SEQRES GPNPHSMSSLDFAAYDPIFFLHHSTVDRLWAIWQDLQRYRKLDYNVANCA CC ATOM -------------------------------------------------- CC CC SEQRES LNLLNDPMRPFNNKTANQDHLTFTNSRPNDVFDYQNSLNYKFDSLSFSGL CC ATOM -------------------------------------------------- CC CC SEQRES SIPRLDDLLESRQSHDRVFAGFWLSGIKASADVNIHICVPIGVEHEDCDN CC ATOM -------------------------------------------------- CC CC SEQRES CC ATOM CC SQ SEQUENCE 2000 AA; MW; CN; NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSDVEFE EDTWRDVVTS ANRIRRNLKD LSKEDMFSLR AAFKRMTDDG RYEEIAAFHG LPAQCPNADG SNIHTCCLHG MPTFPHWHRL YLSLVENELL ARGSDVAVPY WDWIEPFDSL PGLISDETYK HPKTNEDIEN PFHHGKISFA DAVTVRKPRD QLFNNRYLYE HALFAFEHTD FCDFEVHFEV LHNSIHSWIG GPNPHSMSSL DFAAYDPIFF LHHSTVDRLW AIWQDLQRYR KLDYNVANCA LNLLNDPMRP FNNKTANQDH LTFTNSRPND VFDYQNSLNY KFDSLSFSGL SIPRLDDLLE SRQSHDRVFA GFWLSGIKAS ADVNIHICVP IGVEHEDCDN // ID 4YD9B STANDARD; PRT; 920 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 800 2800 LYS B 100 3020 SER C Protein S 1 FT #SUB 800 2800 LYS B 103 3023 ILE C Protein A 3 FT #SUB 802 2802 ASP B 12 2932 PRO C Protein S 3 FT #SUB 867 2867 PRO B 12 2932 PRO C Protein S 7 FT #SUB 868 2868 GLU B 12 2932 PRO C Protein B 1 FT #SUB 903 2903 TYR B 12 2932 PRO C Protein S 2 FT #SUB 321 2321 GLU B 1623 1623 ASN D Protein S 1 FT #SUB 323 2323 ASN B 1651 1651 ILE D Protein B 3 FT #SUB 323 2323 ASN B 1652 1652 PHE D Protein A 9 FT #SUB 323 2323 ASN B 1654 1654 PRO D Protein S 2 FT #SUB 325 2325 ALA B 1649 1649 THR D Protein B 2 FT #SUB 325 2325 ALA B 1650 1650 ILE D Protein B 1 FT #SUB 326 2326 ILE B 1650 1650 ILE D Protein A 8 FT #SUB 327 2327 GLU B 1648 1648 PRO D Protein S 2 FT #SUB 327 2327 GLU B 1649 1649 THR D Protein S 6 FT #SUB 327 2327 GLU B 1650 1650 ILE D Protein A 4 FT #SUB 399 2399 LEU B 1603 1603 PHE D Protein S 1 FT #SUB 439 2439 PHE B 1558 1558 LEU D Protein B 1 FT #SUB 440 2440 ASP B 1558 1558 LEU D Protein B 1 FT #SUB 440 2440 ASP B 1605 1605 ARG D Protein B 1 FT #SUB 440 2440 ASP B 1606 1606 LEU D Protein S 4 FT #SUB 442 2442 LEU B 1604 1604 ASP D Protein S 6 FT #SUB 457 2457 LYS B 1206 1206 HIS D Protein S 2 FT #SUB 458 2458 PHE B 1481 1481 GLU D Protein S 1 FT #SUB 458 2458 PHE B 1486 1486 LEU D Protein B 1 FT #SUB 459 2459 ASP B 1481 1481 GLU D Protein S 1 FT #SUB 461 2461 HIS B 1490 1490 ASN D Protein S 5 FT #SUB 485 2485 THR B 1485 1485 ALA D Protein S 2 FT #SUB 486 2486 ILE B 1484 1484 CYS D Protein B 2 FT #SUB 486 2486 ILE B 1485 1485 ALA D Protein B 3 FT #SUB 486 2486 ILE B 1486 1486 LEU D Protein A 7 FT #SUB 486 2486 ILE B 1487 1487 PRO D Protein A 4 FT #SUB 487 2487 VAL B 1483 1483 ASN D Protein A 3 FT #SUB 488 2488 ARG B 1483 1483 ASN D Protein A 9 FT #SUB 490 2490 PRO B 1483 1483 ASN D Protein S 1 FT #SUB 729 2729 LYS B 1244 1244 GLY D Protein S 1 FT #SUB 729 2729 LYS B 1245 1245 GLU D Protein B 3 FT #SUB 734 2734 LYS B 1207 1207 TYR D Protein S 3 FT #SUB 736 2736 TYR B 1207 1207 TYR D Protein S 2 FT #SUB 736 2736 TYR B 1234 1234 VAL D Protein B 1 FT #SUB 736 2736 TYR B 1235 1235 ILE D Protein B 4 FT #SUB 736 2736 TYR B 1236 1236 TYR D Protein A 6 FT #SUB 736 2736 TYR B 1238 1238 PRO D Protein S 1 FT #SUB 736 2736 TYR B 1245 1245 GLU D Protein S 1 FT #SUB 737 2737 CYS B 1234 1234 VAL D Protein B 2 FT #SUB 738 2738 ALA B 1233 1233 SER D Protein S 2 FT #SUB 738 2738 ALA B 1234 1234 VAL D Protein B 4 FT #SUB 739 2739 LEU B 1207 1207 TYR D Protein S 3 FT #SUB 739 2739 LEU B 1211 1211 TYR D Protein S 7 FT #SUB 739 2739 LEU B 1234 1234 VAL D Protein A 3 FT #SUB 740 2740 ASP B 1232 1232 THR D Protein S 12 FT #SUB 740 2740 ASP B 1233 1233 SER D Protein S 1 FT #SUB 741 2741 GLN B 1232 1232 THR D Protein S 2 FT #SUB 743 2743 ALA B 1208 1208 THR D Protein S 1 FT #SUB 743 2743 ALA B 1210 1210 LYS D Protein A 4 FT #SUB 744 2744 PHE B 1164 1164 ASP D Protein S 4 FT #SUB 744 2744 PHE B 1210 1210 LYS D Protein S 5 FT #SUB 744 2744 PHE B 1212 1212 HIS D Protein S 4 FT #SUB 809 2809 LEU B 1187 1187 LYS D Protein S 2 FT #SUB 809 2809 LEU B 1188 1188 PHE D Protein S 2 FT #SUB 809 2809 LEU B 1189 1189 ASP D Protein S 1 FT #SUB 847 2847 HIS B 1230 1230 LEU D Protein S 2 FT #SUB 849 2849 ASP B 1147 1147 MET D Protein B 2 FT #SUB 849 2849 ASP B 1191 1191 PRO D Protein A 3 FT #SUB 849 2849 ASP B 1233 1233 SER D Protein S 4 FT #SUB 850 2850 ARG B 1189 1189 ASP D Protein B 1 FT #SUB 851 2851 ASN B 1189 1189 ASP D Protein S 5 FT #SUB 870 2870 LEU B 1074 1074 ILE D Protein B 1 FT #SUB 870 2870 LEU B 1078 1078 ARG D Protein B 3 FT #SUB 871 2871 PHE B 1071 1071 ASN D Protein S 2 FT #SUB 871 2871 PHE B 1074 1074 ILE D Protein S 1 FT #SUB 871 2871 PHE B 1103 1103 PHE D Protein B 4 FT #SUB 872 2872 GLU B 1078 1078 ARG D Protein B 1 FT #SUB 873 2873 HIS B 1101 1101 VAL D Protein S 1 FT #SUB 875 2875 SER B 1078 1078 ARG D Protein B 3 FT #SUB 876 2876 LYS B 1078 1078 ARG D Protein B 1 FT #SUB 877 2877 ILE B 1078 1078 ARG D Protein A 9 FT #SUB 899 2899 PRO B 1075 1075 GLU D Protein B 3 FT #SUB 900 2900 SER B 1075 1075 GLU D Protein A 5 FT #SUB 901 2901 LEU B 1073 1073 ALA D Protein B 1 FT #SUB 901 2901 LEU B 1074 1074 ILE D Protein A 6 FT #SUB 901 2901 LEU B 1075 1075 GLU D Protein A 2 FT #SUB 902 2902 ILE B 1071 1071 ASN D Protein A 3 FT #SUB 903 2903 TYR B 1071 1071 ASN D Protein B 4 FT #SUB 903 2903 TYR B 1074 1074 ILE D Protein S 1 FT #SUB 90 2090 LYS B 375 2375 GLY E Protein S 1 FT #SUB 94 2094 ARG B 94 2094 ARG E Protein S 6 FT #SUB 94 2094 ARG B 182 2182 TYR E Protein A 7 FT #SUB 94 2094 ARG B 370 2370 MET E Protein S 9 FT #SUB 96 2096 SER B 374 2374 ASN E Protein S 3 FT #SUB 97 2097 LEU B 234 2234 GLN E Protein S 1 FT #SUB 97 2097 LEU B 235 2235 PRO E Protein S 2 FT #SUB 98 2098 GLN B 240 2240 GLN E Protein S 1 FT #SUB 98 2098 GLN B 374 2374 ASN E Protein S 1 FT #SUB 99 2099 GLU B 375 2375 GLY E Protein S 1 FT #SUB 109 2109 ARG B 503 2503 ASN E Protein S 2 FT #SUB 109 2109 ARG B 585 2585 GLY E Protein B 1 FT #SUB 112 2112 LYS B 583 2583 ARG E Protein S 3 FT #SUB 112 2112 LYS B 584 2584 HIS E Protein B 1 FT #SUB 113 2113 ASP B 519 2519 ASN E Protein S 3 FT #SUB 113 2113 ASP B 585 2585 GLY E Protein A 5 FT #SUB 114 2114 ARG B 526 2526 ARG E Protein S 7 FT #SUB 115 2115 SER B 519 2519 ASN E Protein S 4 FT #SUB 115 2115 SER B 522 2522 SER E Protein B 3 FT #SUB 116 2116 SER B 615 2615 TRP E Protein S 3 FT #SUB 121 2121 THR B 615 2615 TRP E Protein S 5 FT #SUB 167 2167 ASN B 502 2502 LEU E Protein A 3 FT #SUB 168 2168 ARG B 502 2502 LEU E Protein A 2 FT #SUB 168 2168 ARG B 503 2503 ASN E Protein B 4 FT #SUB 168 2168 ARG B 515 2515 ARG E Protein S 11 FT #SUB 168 2168 ARG B 587 2587 SER E Protein A 5 FT #SUB 169 2169 HIS B 503 2503 ASN E Protein B 2 FT #SUB 169 2169 HIS B 585 2585 GLY E Protein S 1 FT #SUB 169 2169 HIS B 587 2587 SER E Protein S 3 FT #SUB 170 2170 GLY B 503 2503 ASN E Protein B 2 FT #SUB 182 2182 TYR B 94 2094 ARG E Protein S 4 FT #SUB 199 2199 PRO B 235 2235 PRO E Protein A 2 FT #SUB 199 2199 PRO B 236 2236 ALA E Protein B 2 FT #SUB 199 2199 PRO B 237 2237 LEU E Protein B 2 FT #SUB 200 2200 PHE B 237 2237 LEU E Protein B 8 FT #SUB 235 2235 PRO B 97 2097 LEU E Protein S 2 FT #SUB 235 2235 PRO B 199 2199 PRO E Protein B 2 FT #SUB 236 2236 ALA B 199 2199 PRO E Protein B 2 FT #SUB 237 2237 LEU B 199 2199 PRO E Protein A 2 FT #SUB 237 2237 LEU B 200 2200 PHE E Protein A 9 FT #SUB 341 2341 PRO B 614 2614 ALA E Protein S 2 FT #SUB 341 2341 PRO B 615 2615 TRP E Protein B 3 FT #SUB 342 2342 TYR B 615 2615 TRP E Protein B 1 FT #SUB 344 2344 LEU B 515 2515 ARG E Protein B 5 FT #SUB 345 2345 ASN B 515 2515 ARG E Protein A 6 FT #SUB 346 2346 PRO B 513 2513 GLU E Protein S 1 FT #SUB 346 2346 PRO B 515 2515 ARG E Protein S 3 FT #SUB 370 2370 MET B 94 2094 ARG E Protein S 4 FT #SUB 374 2374 ASN B 96 2096 SER E Protein A 3 FT #SUB 374 2374 ASN B 98 2098 GLN E Protein S 1 FT #SUB 375 2375 GLY B 90 2090 LYS E Protein B 1 FT #SUB 375 2375 GLY B 99 2099 GLU E Protein B 1 FT #SUB 502 2502 LEU B 84 2084 ARG E Protein S 3 FT #SUB 502 2502 LEU B 167 2167 ASN E Protein S 4 FT #SUB 502 2502 LEU B 168 2168 ARG E Protein S 1 FT #SUB 503 2503 ASN B 109 2109 ARG E Protein S 1 FT #SUB 503 2503 ASN B 168 2168 ARG E Protein A 3 FT #SUB 503 2503 ASN B 169 2169 HIS E Protein S 2 FT #SUB 503 2503 ASN B 170 2170 GLY E Protein S 1 FT #SUB 513 2513 GLU B 346 2346 PRO E Protein S 1 FT #SUB 515 2515 ARG B 168 2168 ARG E Protein S 2 FT #SUB 515 2515 ARG B 344 2344 LEU E Protein S 4 FT #SUB 515 2515 ARG B 345 2345 ASN E Protein S 8 FT #SUB 515 2515 ARG B 346 2346 PRO E Protein S 3 FT #SUB 516 2516 ASP B 168 2168 ARG E Protein S 5 FT #SUB 519 2519 ASN B 113 2113 ASP E Protein S 3 FT #SUB 519 2519 ASN B 115 2115 SER E Protein A 4 FT #SUB 522 2522 SER B 115 2115 SER E Protein S 2 FT #SUB 526 2526 ARG B 114 2114 ARG E Protein S 6 FT #SUB 582 2582 ARG B 109 2109 ARG E Protein A 5 FT #SUB 583 2583 ARG B 112 2112 LYS E Protein B 1 FT #SUB 585 2585 GLY B 109 2109 ARG E Protein B 1 FT #SUB 585 2585 GLY B 113 2113 ASP E Protein B 4 FT #SUB 585 2585 GLY B 169 2169 HIS E Protein B 1 FT #SUB 587 2587 SER B 168 2168 ARG E Protein A 2 FT #SUB 587 2587 SER B 169 2169 HIS E Protein S 2 FT #SUB 614 2614 ALA B 341 2341 PRO E Protein S 2 FT #SUB 615 2615 TRP B 121 2121 THR E Protein S 3 FT #SUB 615 2615 TRP B 124 2124 SER E Protein S 2 FT #SUB 615 2615 TRP B 125 2125 PHE E Protein S 1 FT #SUB 615 2615 TRP B 131 2131 LEU E Protein S 1 FT #SUB 615 2615 TRP B 341 2341 PRO E Protein S 1 FT #SUB 615 2615 TRP B 342 2342 TYR E Protein S 4 FT #SUB 136 2136 THR B 388 388 GLY G Protein S 4 FT #SUB 136 2136 THR B 389 389 THR G Protein A 10 FT #SUB 138 2138 LYS B 389 389 THR G Protein S 1 FT #SUB 609 2609 ALA B 106 106 GLN b Protein S 1 FT #SUB 610 2610 ASP B 124 124 TYR b Protein S 3 FT #SUB 610 2610 ASP B 137 137 ARG b Protein S 1 FT #SUB 612 2612 TYR B 138 138 ALA b Protein S 3 FT #SUB 612 2612 TYR B 189 189 ARG b Protein S 5 FT #SUB 617 2617 ASP B 189 189 ARG b Protein S 5 FT #SUB 617 2617 ASP B 190 190 PHE b Protein B 5 FT #SUB 617 2617 ASP B 191 191 THR b Protein S 1 FT #SUB 619 2619 VAL B 136 136 ALA b Protein S 1 FT #SUB 619 2619 VAL B 137 137 ARG b Protein S 1 FT #SUB 619 2619 VAL B 138 138 ALA b Protein S 3 FT #SUB 619 2619 VAL B 190 190 PHE b Protein A 3 FT #SUB 625 2625 LEU B 108 108 PRO b Protein S 2 FT #SUB 626 2626 ARG B 108 108 PRO b Protein S 2 FT #SUB 626 2626 ARG B 109 109 LEU b Protein S 4 FT #SUB 632 2632 GLU B 117 117 LYS b Protein B 3 FT #SUB 634 2634 THR B 117 117 LYS b Protein S 6 FT #SUB 634 2634 THR B 118 118 ALA b Protein S 2 FT #SUB 635 2635 TYR B 109 109 LEU b Protein S 1 FT #SUB 635 2635 TYR B 119 119 LYS b Protein S 7 FT #SUB 636 2636 THR B 109 109 LEU b Protein B 1 FT #SUB 637 2637 VAL B 111 111 ILE b Protein S 3 FT #SUB 638 2638 ARG B 111 111 ILE b Protein B 1 FT #SUB 691 2691 GLN B 111 111 ILE b Protein S 3 FT #SUB 691 2691 GLN B 112 112 ASP b Protein S 1 FT #SUB 693 2693 TYR B 116 116 LYS b Protein S 2 FT #SUB 693 2693 TYR B 117 117 LYS b Protein S 3 FT #HET 126 2126 HIS B 136 3001 CUO B S 4 FT #HET 138 2138 LYS B 27 1 NAG o S 3 FT #HET 138 2138 LYS B 28 2 NAG o S 2 FT #HET 144 2144 CYS B 136 3001 CUO B S 1 FT #HET 146 2146 HIS B 136 3001 CUO B S 6 FT #HET 151 2151 PHE B 136 3001 CUO B S 1 FT #HET 155 2155 HIS B 136 3001 CUO B S 9 FT #HET 267 2267 HIS B 136 3001 CUO B S 7 FT #HET 271 2271 HIS B 136 3001 CUO B S 5 FT #HET 294 2294 PHE B 136 3001 CUO B S 1 FT #HET 298 2298 HIS B 136 3001 CUO B S 9 FT #HET 429 2429 LEU B 136 3001 CUO B S 1 FT #HET 543 2543 HIS B 137 3002 CUO B S 7 FT #HET 559 2559 CYS B 137 3002 CUO B S 1 FT #HET 561 2561 HIS B 137 3002 CUO B S 6 FT #HET 566 2566 PHE B 137 3002 CUO B S 1 FT #HET 570 2570 HIS B 137 3002 CUO B S 9 FT #HET 680 2680 HIS B 137 3002 CUO B S 6 FT #HET 684 2684 HIS B 137 3002 CUO B S 7 FT #HET 707 2707 PHE B 137 3002 CUO B S 4 FT #HET 711 2711 HIS B 137 3002 CUO B S 8 FT #MOD 472 2472 ASN B 12 1 NAG i S FT DISORDER 1 82 FT DISORDER 908 920 CC SEQUENCE 825 AA (ATOM); CC VRGNLVRKNV DRLSLQEINS LIHALKRMQK DRSSDGFETI ASFHALPPLC PNPTAKHRHA CC CCLHGMATFP QWHRLYVVQF EHSLNRHGAI VGVPYWDWTY PMTEVPGLLT SEKYTDPFTG CC IETFNPFNHG HISFISPETM TTREVSEHLF EQPALGKQTW LFNNIILALE QTDYCDFEVQ CC FEIVHNSIHS WLGGKELYSL NHLHYAAYDP AFFLHHSNVD RLWVVWQELQ KFRGLPAYES CC NCAIELMSQP LKPFSFGAPY NLNPVTTKYS KPSDVFNYKQ NFHYEYDMLE MNGMSIAQLE CC SYIRQERQKD RVFAGFLLEG FGSSAYATFQ VCPDVGDCYE GSHFSVLGGS TEMPWAFDRL CC YKMEITDILQ AMALKFDSHF TIKTKIVAHN GTELPESLLP EATIVRIPPS AQNLEVAIPL CC NRIRRNINSL ESRDVQNLMS ALKRLKEDES DFGFQTIAGY HGSLMCPTPE APEYACCLHG CC MPTFMHWHRV YLLHFEESMR RHGASVAVPY WDWTMPSDNL PSLLGDADYY DAWTDSVIEN CC PFLRGHIKYE DTYTVREIQP ELFALAEGQK ESTLFKDVML MFEQEDYCDF EVQAEVIHNS CC IHYLIGGHQK YAMSSLMFSS FDPIFYVHHS MVDRLWAIWQ ELQKHRKLPH DKAYCALDQM CC AFPMKPFIWE SNPNPTTRAV STPSKLFDYK SLGYDYDHLN FHGMSIGQLE ALIQKQKKAD CC RVFAGFLLHG IKISADVHLK ICIEADCQEA GVIFVLGGET EMPWHFDRNY KMDITDVLKK CC RNIPPEALFE HDSKIRLEVE IKSVDGAVLD PNSLPKPSLI YAPAK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES YAGTFAVLGGETEMPWAFDRLFRYEISDEMKKLQLTEDSKFRLTTNIIAS CC ATOM -------------------------------------------------- CC CC SEQRES NGSKVSNDIFPTPTVIFVPKKERTEQVSTTKSVRGNLVRKNVDRLSLQEI CC ATOM --------------------------------VRGNLVRKNVDRLSLQEI CC ****************** CC SEQRES NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ATOM NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ************************************************** CC SEQRES FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ATOM FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ************************************************** CC SEQRES TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ATOM TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ************************************************** CC SEQRES LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ATOM LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ************************************************** CC SEQRES VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ATOM VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ************************************************** CC SEQRES YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ATOM YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ************************************************** CC SEQRES EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ATOM EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ************************************************** CC SEQRES LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ATOM LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ************************************************** CC SEQRES PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ATOM PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ************************************************** CC SEQRES PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ATOM PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ************************************************** CC SEQRES NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ATOM NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ************************************************** CC SEQRES QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ATOM QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ************************************************** CC SEQRES SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ATOM SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ************************************************** CC SEQRES WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ATOM WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ************************************************** CC SEQRES ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ATOM ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ************************************************** CC SEQRES NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ATOM NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ************************************************** CC SEQRES LIYAPAKGLIIQQVGEYDAG CC ATOM LIYAPAK------------- CC ******* SQ SEQUENCE 920 AA; MW; CN; YAGTFAVLGG ETEMPWAFDR LFRYEISDEM KKLQLTEDSK FRLTTNIIAS NGSKVSNDIF PTPTVIFVPK KERTEQVSTT KSVRGNLVRK NVDRLSLQEI NSLIHALKRM QKDRSSDGFE TIASFHALPP LCPNPTAKHR HACCLHGMAT FPQWHRLYVV QFEHSLNRHG AIVGVPYWDW TYPMTEVPGL LTSEKYTDPF TGIETFNPFN HGHISFISPE TMTTREVSEH LFEQPALGKQ TWLFNNIILA LEQTDYCDFE VQFEIVHNSI HSWLGGKELY SLNHLHYAAY DPAFFLHHSN VDRLWVVWQE LQKFRGLPAY ESNCAIELMS QPLKPFSFGA PYNLNPVTTK YSKPSDVFNY KQNFHYEYDM LEMNGMSIAQ LESYIRQERQ KDRVFAGFLL EGFGSSAYAT FQVCPDVGDC YEGSHFSVLG GSTEMPWAFD RLYKMEITDI LQAMALKFDS HFTIKTKIVA HNGTELPESL LPEATIVRIP PSAQNLEVAI PLNRIRRNIN SLESRDVQNL MSALKRLKED ESDFGFQTIA GYHGSLMCPT PEAPEYACCL HGMPTFMHWH RVYLLHFEES MRRHGASVAV PYWDWTMPSD NLPSLLGDAD YYDAWTDSVI ENPFLRGHIK YEDTYTVREI QPELFALAEG QKESTLFKDV MLMFEQEDYC DFEVQAEVIH NSIHYLIGGH QKYAMSSLMF SSFDPIFYVH HSMVDRLWAI WQELQKHRKL PHDKAYCALD QMAFPMKPFI WESNPNPTTR AVSTPSKLFD YKSLGYDYDH LNFHGMSIGQ LEALIQKQKK ADRVFAGFLL HGIKISADVH LKICIEADCQ EAGVIFVLGG ETEMPWHFDR NYKMDITDVL KKRNIPPEAL FEHDSKIRLE VEIKSVDGAV LDPNSLPKPS LIYAPAKGLI IQQVGEYDAG // ID 4YD9C STANDARD; PRT; 394 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 12 2932 PRO C 802 2802 ASP B Protein S 3 FT #SUB 12 2932 PRO C 867 2867 PRO B Protein A 7 FT #SUB 12 2932 PRO C 868 2868 GLU B Protein B 1 FT #SUB 12 2932 PRO C 903 2903 TYR B Protein S 2 FT #SUB 100 3020 SER C 800 2800 LYS B Protein S 1 FT #SUB 103 3023 ILE C 800 2800 LYS B Protein S 3 FT #SUB 276 3196 THR C 416 416 ASN D Protein S 1 FT #SUB 279 3199 GLN C 419 419 THR D Protein S 1 FT #SUB 348 3268 LYS C 1534 1534 GLY D Protein S 1 FT #SUB 348 3268 LYS C 1539 1539 GLN D Protein S 1 FT #SUB 351 3271 VAL C 1533 1533 ALA D Protein A 3 FT #SUB 352 3272 HIS C 1404 1404 TYR D Protein B 1 FT #SUB 352 3272 HIS C 1535 1535 LEU D Protein S 5 FT #SUB 352 3272 HIS C 1539 1539 GLN D Protein S 1 FT #SUB 352 3272 HIS C 1543 1543 ILE D Protein S 4 FT #SUB 353 3273 LEU C 1401 1401 ARG D Protein B 1 FT #SUB 354 3274 ARG C 1350 1350 GLU D Protein S 3 FT #SUB 354 3274 ARG C 1401 1401 ARG D Protein A 2 FT #SUB 354 3274 ARG C 1533 1533 ALA D Protein S 2 FT #HET 41 2961 HIS C 139 3402 CUO C S 8 FT #HET 60 2980 HIS C 139 3402 CUO C S 3 FT #HET 69 2989 HIS C 139 3402 CUO C S 13 FT #HET 121 3041 ALA C 149 2101 CUO G B 6 FT #HET 122 3042 ASP C 149 2101 CUO G A 25 FT #HET 123 3043 THR C 149 2101 CUO G B 7 FT #HET 169 3089 HIS C 139 3402 CUO C S 7 FT #HET 173 3093 HIS C 139 3402 CUO C S 5 FT #HET 196 3116 PHE C 139 3402 CUO C S 5 FT #HET 199 3119 HIS C 139 3402 CUO C S 1 FT #HET 200 3120 HIS C 139 3402 CUO C S 10 FT DISORDER 1 7 FT DISORDER 132 141 FT DISORDER 313 319 FT DISORDER 391 394 CC SEQUENCE 366 AA (ATOM); CC NSLTPSEIEN LRNALAAVQA DKTDAGYQKI ASFHGMPLSC QYPDGTAFAC CQHGMVTFPH CC WHRLYMKQME DALKAKGAKI GIPYWDWTTA FHSLPILVTE PKNNPFHHGY IDVADTKTTR CC DPRPQSFFYR QIAFALEQRD FCDFEIQFEM GHNAIHSWVG GPSPYGMSTL HYTSYDPLFY CC VHHSNTDRIW AIWQALQKYR GLPYNSANCE INKLKKPMMP FSSEDNPNEV TKAHSTGYKS CC FDYQQLNYEY DNLNFHGMTI PQLEVHLKKI QEKDRVFAGF LLRAIGQSAD VNFDVTFGGT CC FCVLGGDYEM PWAFDRLFLY DISKSLVHLR LDAHDDFDIK VTIMGIDGKS LPPNLLPSPT CC ILFKPG CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMVRKNVNSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ATOM -------NSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ******************************************* CC SEQRES DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ATOM DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ************************************************** CC SEQRES LPILVTEPKNNPFHHGYIDVADTKTTRDPRPQLFDDPEQGDQSFFYRQIA CC ATOM LPILVTEPKNNPFHHGYIDVADTKTTRDPRP----------QSFFYRQIA CC ******************************* ********* CC SEQRES FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ATOM FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ************************************************** CC SEQRES SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ATOM SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ************************************************** CC SEQRES HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ATOM HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ************************************************** CC SEQRES AIGQSADVNFDVCRKDGECTFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ATOM AIGQSADVNFDV-------TFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ************ ******************************* CC SEQRES VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPGTGKI CC ATOM VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPG---- CC **************************************** SQ SEQUENCE 394 AA; MW; CN; SMVRKNVNSL TPSEIENLRN ALAAVQADKT DAGYQKIASF HGMPLSCQYP DGTAFACCQH GMVTFPHWHR LYMKQMEDAL KAKGAKIGIP YWDWTTAFHS LPILVTEPKN NPFHHGYIDV ADTKTTRDPR PQLFDDPEQG DQSFFYRQIA FALEQRDFCD FEIQFEMGHN AIHSWVGGPS PYGMSTLHYT SYDPLFYVHH SNTDRIWAIW QALQKYRGLP YNSANCEINK LKKPMMPFSS EDNPNEVTKA HSTGYKSFDY QQLNYEYDNL NFHGMTIPQL EVHLKKIQEK DRVFAGFLLR AIGQSADVNF DVCRKDGECT FGGTFCVLGG DYEMPWAFDR LFLYDISKSL VHLRLDAHDD FDIKVTIMGI DGKSLPPNLL PSPTILFKPG TGKI // ID 4YD9D STANDARD; PRT; 2000 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 1071 1071 ASN D 871 2871 PHE B Protein S 2 FT #SUB 1071 1071 ASN D 902 2902 ILE B Protein B 3 FT #SUB 1071 1071 ASN D 903 2903 TYR B Protein A 4 FT #SUB 1073 1073 ALA D 901 2901 LEU B Protein B 1 FT #SUB 1074 1074 ILE D 870 2870 LEU B Protein S 1 FT #SUB 1074 1074 ILE D 871 2871 PHE B Protein S 1 FT #SUB 1074 1074 ILE D 901 2901 LEU B Protein A 6 FT #SUB 1074 1074 ILE D 903 2903 TYR B Protein S 1 FT #SUB 1075 1075 GLU D 899 2899 PRO B Protein S 3 FT #SUB 1075 1075 GLU D 900 2900 SER B Protein S 5 FT #SUB 1075 1075 GLU D 901 2901 LEU B Protein S 2 FT #SUB 1078 1078 ARG D 870 2870 LEU B Protein S 3 FT #SUB 1078 1078 ARG D 872 2872 GLU B Protein S 1 FT #SUB 1078 1078 ARG D 875 2875 SER B Protein S 3 FT #SUB 1078 1078 ARG D 876 2876 LYS B Protein S 1 FT #SUB 1078 1078 ARG D 877 2877 ILE B Protein S 9 FT #SUB 1101 1101 VAL D 873 2873 HIS B Protein S 1 FT #SUB 1103 1103 PHE D 871 2871 PHE B Protein S 4 FT #SUB 1147 1147 MET D 849 2849 ASP B Protein S 2 FT #SUB 1164 1164 ASP D 744 2744 PHE B Protein S 4 FT #SUB 1187 1187 LYS D 809 2809 LEU B Protein S 2 FT #SUB 1188 1188 PHE D 809 2809 LEU B Protein B 2 FT #SUB 1189 1189 ASP D 809 2809 LEU B Protein B 1 FT #SUB 1189 1189 ASP D 850 2850 ARG B Protein B 1 FT #SUB 1189 1189 ASP D 851 2851 ASN B Protein A 5 FT #SUB 1191 1191 PRO D 849 2849 ASP B Protein S 3 FT #SUB 1206 1206 HIS D 457 2457 LYS B Protein S 2 FT #SUB 1207 1207 TYR D 734 2734 LYS B Protein S 3 FT #SUB 1207 1207 TYR D 736 2736 TYR B Protein S 2 FT #SUB 1207 1207 TYR D 739 2739 LEU B Protein A 3 FT #SUB 1208 1208 THR D 743 2743 ALA B Protein B 1 FT #SUB 1210 1210 LYS D 743 2743 ALA B Protein A 4 FT #SUB 1210 1210 LYS D 744 2744 PHE B Protein S 5 FT #SUB 1211 1211 TYR D 739 2739 LEU B Protein S 7 FT #SUB 1212 1212 HIS D 744 2744 PHE B Protein S 4 FT #SUB 1230 1230 LEU D 847 2847 HIS B Protein S 2 FT #SUB 1232 1232 THR D 740 2740 ASP B Protein A 12 FT #SUB 1232 1232 THR D 741 2741 GLN B Protein S 2 FT #SUB 1233 1233 SER D 738 2738 ALA B Protein A 2 FT #SUB 1233 1233 SER D 740 2740 ASP B Protein B 1 FT #SUB 1233 1233 SER D 849 2849 ASP B Protein S 4 FT #SUB 1234 1234 VAL D 736 2736 TYR B Protein B 1 FT #SUB 1234 1234 VAL D 737 2737 CYS B Protein B 2 FT #SUB 1234 1234 VAL D 738 2738 ALA B Protein B 4 FT #SUB 1234 1234 VAL D 739 2739 LEU B Protein B 3 FT #SUB 1235 1235 ILE D 736 2736 TYR B Protein A 4 FT #SUB 1236 1236 TYR D 736 2736 TYR B Protein A 6 FT #SUB 1238 1238 PRO D 736 2736 TYR B Protein S 1 FT #SUB 1244 1244 GLY D 729 2729 LYS B Protein B 1 FT #SUB 1245 1245 GLU D 729 2729 LYS B Protein A 3 FT #SUB 1245 1245 GLU D 736 2736 TYR B Protein S 1 FT #SUB 1481 1481 GLU D 458 2458 PHE B Protein S 1 FT #SUB 1481 1481 GLU D 459 2459 ASP B Protein S 1 FT #SUB 1483 1483 ASN D 487 2487 VAL B Protein B 3 FT #SUB 1483 1483 ASN D 488 2488 ARG B Protein A 9 FT #SUB 1483 1483 ASN D 490 2490 PRO B Protein S 1 FT #SUB 1484 1484 CYS D 486 2486 ILE B Protein B 2 FT #SUB 1485 1485 ALA D 485 2485 THR B Protein A 2 FT #SUB 1485 1485 ALA D 486 2486 ILE B Protein B 3 FT #SUB 1486 1486 LEU D 458 2458 PHE B Protein S 1 FT #SUB 1486 1486 LEU D 486 2486 ILE B Protein A 7 FT #SUB 1487 1487 PRO D 486 2486 ILE B Protein S 4 FT #SUB 1490 1490 ASN D 461 2461 HIS B Protein S 5 FT #SUB 1558 1558 LEU D 439 2439 PHE B Protein S 1 FT #SUB 1558 1558 LEU D 440 2440 ASP B Protein S 1 FT #SUB 1603 1603 PHE D 399 2399 LEU B Protein B 1 FT #SUB 1604 1604 ASP D 442 2442 LEU B Protein A 6 FT #SUB 1605 1605 ARG D 440 2440 ASP B Protein B 1 FT #SUB 1606 1606 LEU D 440 2440 ASP B Protein S 4 FT #SUB 1623 1623 ASN D 321 2321 GLU B Protein S 1 FT #SUB 1648 1648 PRO D 327 2327 GLU B Protein B 2 FT #SUB 1649 1649 THR D 325 2325 ALA B Protein S 2 FT #SUB 1649 1649 THR D 327 2327 GLU B Protein A 6 FT #SUB 1650 1650 ILE D 325 2325 ALA B Protein B 1 FT #SUB 1650 1650 ILE D 326 2326 ILE B Protein A 8 FT #SUB 1650 1650 ILE D 327 2327 GLU B Protein A 4 FT #SUB 1651 1651 ILE D 323 2323 ASN B Protein A 3 FT #SUB 1652 1652 PHE D 323 2323 ASN B Protein A 9 FT #SUB 1654 1654 PRO D 323 2323 ASN B Protein S 2 FT #SUB 416 416 ASN D 276 3196 THR C Protein S 1 FT #SUB 419 419 THR D 279 3199 GLN C Protein S 1 FT #SUB 1350 1350 GLU D 354 3274 ARG C Protein S 3 FT #SUB 1401 1401 ARG D 353 3273 LEU C Protein S 1 FT #SUB 1401 1401 ARG D 354 3274 ARG C Protein S 2 FT #SUB 1404 1404 TYR D 352 3272 HIS C Protein S 1 FT #SUB 1533 1533 ALA D 351 3271 VAL C Protein B 3 FT #SUB 1533 1533 ALA D 354 3274 ARG C Protein B 2 FT #SUB 1534 1534 GLY D 348 3268 LYS C Protein B 1 FT #SUB 1535 1535 LEU D 352 3272 HIS C Protein S 5 FT #SUB 1539 1539 GLN D 348 3268 LYS C Protein S 1 FT #SUB 1539 1539 GLN D 352 3272 HIS C Protein S 1 FT #SUB 1543 1543 ILE D 352 3272 HIS C Protein S 4 FT #SUB 328 328 LYS D 1571 1571 TYR G Protein S 3 FT #SUB 328 328 LYS D 1631 1631 GLU G Protein S 1 FT #SUB 329 329 ARG D 1574 1574 VAL G Protein S 1 FT #SUB 329 329 ARG D 1582 1582 ASN G Protein S 5 FT #SUB 329 329 ARG D 1583 1583 CYS G Protein S 16 FT #SUB 329 329 ARG D 1584 1584 GLY G Protein S 1 FT #SUB 329 329 ARG D 1585 1585 ASN G Protein S 3 FT #SUB 330 330 GLN D 1582 1582 ASN G Protein A 4 FT #SUB 331 331 SER D 1581 1581 GLU G Protein A 11 FT #SUB 331 331 SER D 1582 1582 ASN G Protein A 2 FT #SUB 335 335 ASP D 1578 1578 ILE G Protein S 1 FT #SUB 335 335 ASP D 1579 1579 GLY G Protein S 4 FT #SUB 470 470 ARG D 1640 1640 ILE G Protein S 2 FT #SUB 472 472 ASP D 1638 1638 SER G Protein S 2 FT #SUB 472 472 ASP D 1639 1639 LYS G Protein S 1 FT #SUB 472 472 ASP D 1640 1640 ILE G Protein S 1 FT #SUB 473 473 ALA D 1640 1640 ILE G Protein S 1 FT #SUB 1362 1362 ALA D 1378 1378 PHE G Protein S 1 FT #SUB 1365 1365 TYR D 1437 1437 ARG G Protein S 17 FT #SUB 1372 1372 ILE D 1390 1390 ASP G Protein A 2 FT #SUB 1372 1372 ILE D 1391 1391 ARG G Protein S 3 FT #SUB 1372 1372 ILE D 1392 1392 ASP G Protein S 3 FT #SUB 1372 1372 ILE D 1438 1438 ASP G Protein S 6 FT #SUB 1390 1390 ASP D 1372 1372 ILE G Protein S 6 FT #SUB 1391 1391 ARG D 1372 1372 ILE G Protein B 3 FT #SUB 1392 1392 ASP D 1365 1365 TYR G Protein S 1 FT #SUB 1437 1437 ARG D 1365 1365 TYR G Protein S 19 FT #SUB 1438 1438 ASP D 1372 1372 ILE G Protein S 7 FT #SUB 1571 1571 TYR D 328 328 LYS G Protein S 9 FT #SUB 1572 1572 ILE D 329 329 ARG G Protein B 3 FT #SUB 1573 1573 CYS D 329 329 ARG G Protein B 2 FT #SUB 1574 1574 VAL D 329 329 ARG G Protein B 4 FT #SUB 1578 1578 ILE D 335 335 ASP G Protein B 1 FT #SUB 1579 1579 GLY D 335 335 ASP G Protein B 5 FT #SUB 1582 1582 ASN D 329 329 ARG G Protein B 5 FT #SUB 1582 1582 ASN D 330 330 GLN G Protein B 2 FT #SUB 1583 1583 CYS D 328 328 LYS G Protein B 1 FT #SUB 1583 1583 CYS D 329 329 ARG G Protein B 11 FT #SUB 1584 1584 GLY D 329 329 ARG G Protein B 15 FT #SUB 1585 1585 ASN D 329 329 ARG G Protein A 6 FT #SUB 1631 1631 GLU D 328 328 LYS G Protein S 4 FT #SUB 1638 1638 SER D 472 472 ASP G Protein B 5 FT #SUB 1640 1640 ILE D 472 472 ASP G Protein B 1 FT #SUB 106 106 GLN D 609 2609 ALA H Protein B 1 FT #SUB 107 107 HIS D 625 2625 LEU H Protein S 2 FT #SUB 109 109 LEU D 626 2626 ARG H Protein S 3 FT #SUB 109 109 LEU D 635 2635 TYR H Protein S 1 FT #SUB 109 109 LEU D 636 2636 THR H Protein S 1 FT #SUB 109 109 LEU D 637 2637 VAL H Protein B 1 FT #SUB 111 111 ILE D 637 2637 VAL H Protein S 1 FT #SUB 111 111 ILE D 639 2639 GLU H Protein S 3 FT #SUB 112 112 ASP D 691 2691 GLN H Protein B 1 FT #SUB 116 116 LYS D 691 2691 GLN H Protein B 1 FT #SUB 117 117 LYS D 632 2632 GLU H Protein S 1 FT #SUB 117 117 LYS D 634 2634 THR H Protein S 4 FT #SUB 117 117 LYS D 637 2637 VAL H Protein B 1 FT #SUB 117 117 LYS D 691 2691 GLN H Protein B 1 FT #SUB 117 117 LYS D 693 2693 TYR H Protein S 7 FT #SUB 119 119 LYS D 633 2633 ASP H Protein S 1 FT #SUB 119 119 LYS D 634 2634 THR H Protein S 1 FT #SUB 124 124 TYR D 610 2610 ASP H Protein S 5 FT #SUB 138 138 ALA D 612 2612 TYR H Protein S 2 FT #SUB 138 138 ALA D 619 2619 VAL H Protein S 1 FT #SUB 189 189 ARG D 612 2612 TYR H Protein S 4 FT #SUB 189 189 ARG D 617 2617 ASP H Protein B 1 FT #SUB 190 190 PHE D 617 2617 ASP H Protein A 5 FT #SUB 190 190 PHE D 618 2618 SER H Protein S 1 FT #SUB 190 190 PHE D 619 2619 VAL H Protein S 2 FT #SUB 388 388 GLY D 136 2136 THR K Protein B 6 FT #SUB 389 389 THR D 136 2136 THR K Protein A 9 FT #SUB 390 390 LEU D 136 2136 THR K Protein B 1 FT #HET 41 41 HIS D 140 5001 CUO D S 7 FT #HET 60 60 HIS D 140 5001 CUO D S 5 FT #HET 65 65 PHE D 140 5001 CUO D S 1 FT #HET 69 69 HIS D 140 5001 CUO D S 9 FT #HET 179 179 HIS D 140 5001 CUO D S 6 FT #HET 183 183 HIS D 140 5001 CUO D S 7 FT #HET 206 206 PHE D 140 5001 CUO D S 3 FT #HET 210 210 HIS D 140 5001 CUO D S 9 FT #HET 313 313 THR D 14 1 NAG j S 3 FT #HET 389 389 THR D 14 1 NAG j S 4 FT #HET 391 391 MET D 14 1 NAG j S 1 FT #HET 391 391 MET D 15 2 NAG j S 1 FT #HET 462 462 HIS D 141 5002 CUO D S 6 FT #HET 482 482 HIS D 141 5002 CUO D S 4 FT #HET 491 491 HIS D 141 5002 CUO D S 7 FT #HET 603 603 HIS D 141 5002 CUO D S 5 FT #HET 607 607 HIS D 141 5002 CUO D S 6 FT #HET 630 630 PHE D 141 5002 CUO D S 3 FT #HET 634 634 HIS D 141 5002 CUO D S 7 FT #HET 763 763 LEU D 141 5002 CUO D S 1 FT #HET 804 804 ASP D 16 1 NAG k S 5 FT #HET 808 808 THR D 16 1 NAG k S 4 FT #HET 810 810 LEU D 16 1 NAG k S 2 FT #HET 877 877 HIS D 142 5003 CUO D S 7 FT #HET 895 895 CYS D 142 5003 CUO D S 1 FT #HET 897 897 HIS D 142 5003 CUO D S 4 FT #HET 906 906 HIS D 142 5003 CUO D S 10 FT #HET 1015 1015 HIS D 142 5003 CUO D S 4 FT #HET 1019 1019 HIS D 142 5003 CUO D S 5 FT #HET 1042 1042 PHE D 142 5003 CUO D S 2 FT #HET 1046 1046 HIS D 142 5003 CUO D S 9 FT #HET 1294 1294 HIS D 143 5004 CUO D S 7 FT #HET 1298 1298 ALA D 20 2 NAG l B 3 FT #HET 1299 1299 GLN D 19 1 NAG l A 10 FT #HET 1299 1299 GLN D 20 2 NAG l B 3 FT #HET 1301 1301 PRO D 19 1 NAG l S 1 FT #HET 1312 1312 CYS D 143 5004 CUO D S 1 FT #HET 1314 1314 HIS D 143 5004 CUO D S 4 FT #HET 1323 1323 HIS D 143 5004 CUO D S 9 FT #HET 1427 1427 HIS D 143 5004 CUO D S 7 FT #HET 1431 1431 HIS D 143 5004 CUO D S 7 FT #HET 1454 1454 PHE D 143 5004 CUO D S 4 FT #HET 1458 1458 HIS D 143 5004 CUO D S 9 FT #HET 1494 1494 ARG D 19 1 NAG l S 2 FT #HET 1500 1500 THR D 19 1 NAG l B 4 FT #HET 1564 1564 ALA D 23 1 NAG m S 3 FT #HET 1565 1565 SER D 23 1 NAG m B 1 FT #HET 1634 1634 ALA D 23 1 NAG m S 1 FT #HET 1635 1635 VAL D 23 1 NAG m A 3 FT #HET 1639 1639 LYS D 23 1 NAG m S 7 FT #MOD 387 387 ASN D 14 1 NAG j S FT #MOD 806 806 ASN D 16 1 NAG k S FT #MOD 1498 1498 ASN D 19 1 NAG l S FT #MOD 1636 1636 ASN D 23 1 NAG m S FT DISORDER 1657 2000 CC SEQUENCE 1656 AA (ATOM); CC NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH CC GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK CC NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN CC PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF CC HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA CC RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH CC EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS CC SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC CC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET CC KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE CC VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA CC NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV CC EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE CC ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR CC KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA CC TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF CC SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA CC LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP CC MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN CC RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM CC DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR CC KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP CC HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG CC KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH CC SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST CC ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH CC GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL CC NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSD CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ATOM NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ************************************************** CC SEQRES DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ATOM DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ************************************************** CC SEQRES LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ATOM LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ************************************************** CC SEQRES QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ATOM QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ************************************************** CC SEQRES SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ATOM SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ************************************************** CC SEQRES TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ATOM TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ************************************************** CC SEQRES RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ATOM RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ************************************************** CC SEQRES PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ATOM PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ************************************************** CC SEQRES VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ATOM VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ************************************************** CC SEQRES SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ATOM SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ************************************************** CC SEQRES ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ATOM ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ************************************************** CC SEQRES EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ATOM EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ************************************************** CC SEQRES VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ATOM VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ************************************************** CC SEQRES RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ATOM RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ************************************************** CC SEQRES YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ATOM YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ************************************************** CC SEQRES GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ATOM GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ************************************************** CC SEQRES DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ATOM DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ************************************************** CC SEQRES MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ATOM MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ************************************************** CC SEQRES TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ATOM TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ************************************************** CC SEQRES NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ATOM NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ************************************************** CC SEQRES QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ATOM QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ************************************************** CC SEQRES RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ATOM RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ************************************************** CC SEQRES VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ATOM VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ************************************************** CC SEQRES LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ATOM LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ************************************************** CC SEQRES DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ATOM DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ************************************************** CC SEQRES SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ATOM SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ************************************************** CC SEQRES PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ATOM PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ************************************************** CC SEQRES PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ATOM PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ************************************************** CC SEQRES RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ATOM RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ************************************************** CC SEQRES DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ATOM DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ************************************************** CC SEQRES ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ATOM ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ************************************************** CC SEQRES DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ATOM DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ************************************************** CC SEQRES WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ATOM WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ************************************************** CC SEQRES IFVPSDVEFEEDTWRDVVTSANRIRRNLKDLSKEDMFSLRAAFKRMTDDG CC ATOM IFVPSD-------------------------------------------- CC ****** CC SEQRES RYEEIAAFHGLPAQCPNADGSNIHTCCLHGMPTFPHWHRLYLSLVENELL CC ATOM -------------------------------------------------- CC CC SEQRES ARGSDVAVPYWDWIEPFDSLPGLISDETYKHPKTNEDIENPFHHGKISFA CC ATOM -------------------------------------------------- CC CC SEQRES DAVTVRKPRDQLFNNRYLYEHALFAFEHTDFCDFEVHFEVLHNSIHSWIG CC ATOM -------------------------------------------------- CC CC SEQRES GPNPHSMSSLDFAAYDPIFFLHHSTVDRLWAIWQDLQRYRKLDYNVANCA CC ATOM -------------------------------------------------- CC CC SEQRES LNLLNDPMRPFNNKTANQDHLTFTNSRPNDVFDYQNSLNYKFDSLSFSGL CC ATOM -------------------------------------------------- CC CC SEQRES SIPRLDDLLESRQSHDRVFAGFWLSGIKASADVNIHICVPIGVEHEDCDN CC ATOM -------------------------------------------------- CC CC SEQRES CC ATOM CC SQ SEQUENCE 2000 AA; MW; CN; NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSDVEFE EDTWRDVVTS ANRIRRNLKD LSKEDMFSLR AAFKRMTDDG RYEEIAAFHG LPAQCPNADG SNIHTCCLHG MPTFPHWHRL YLSLVENELL ARGSDVAVPY WDWIEPFDSL PGLISDETYK HPKTNEDIEN PFHHGKISFA DAVTVRKPRD QLFNNRYLYE HALFAFEHTD FCDFEVHFEV LHNSIHSWIG GPNPHSMSSL DFAAYDPIFF LHHSTVDRLW AIWQDLQRYR KLDYNVANCA LNLLNDPMRP FNNKTANQDH LTFTNSRPND VFDYQNSLNY KFDSLSFSGL SIPRLDDLLE SRQSHDRVFA GFWLSGIKAS ADVNIHICVP IGVEHEDCDN // ID 4YD9E STANDARD; PRT; 920 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 323 2323 ASN E 1650 1650 ILE A Protein B 1 FT #SUB 323 2323 ASN E 1651 1651 ILE A Protein B 2 FT #SUB 323 2323 ASN E 1652 1652 PHE A Protein A 11 FT #SUB 323 2323 ASN E 1654 1654 PRO A Protein S 1 FT #SUB 324 2324 CYS E 1650 1650 ILE A Protein B 1 FT #SUB 325 2325 ALA E 1649 1649 THR A Protein B 2 FT #SUB 325 2325 ALA E 1650 1650 ILE A Protein B 2 FT #SUB 326 2326 ILE E 1622 1622 GLN A Protein S 1 FT #SUB 326 2326 ILE E 1650 1650 ILE A Protein A 4 FT #SUB 327 2327 GLU E 1648 1648 PRO A Protein S 2 FT #SUB 327 2327 GLU E 1649 1649 THR A Protein S 3 FT #SUB 327 2327 GLU E 1650 1650 ILE A Protein S 3 FT #SUB 399 2399 LEU E 1603 1603 PHE A Protein S 1 FT #SUB 399 2399 LEU E 1604 1604 ASP A Protein S 1 FT #SUB 440 2440 ASP E 1558 1558 LEU A Protein B 1 FT #SUB 440 2440 ASP E 1605 1605 ARG A Protein B 1 FT #SUB 440 2440 ASP E 1606 1606 LEU A Protein A 3 FT #SUB 442 2442 LEU E 1604 1604 ASP A Protein A 5 FT #SUB 457 2457 LYS E 1206 1206 HIS A Protein S 1 FT #SUB 458 2458 PHE E 1486 1486 LEU A Protein S 3 FT #SUB 459 2459 ASP E 1481 1481 GLU A Protein S 1 FT #SUB 461 2461 HIS E 1490 1490 ASN A Protein S 6 FT #SUB 486 2486 ILE E 1484 1484 CYS A Protein B 3 FT #SUB 486 2486 ILE E 1485 1485 ALA A Protein B 1 FT #SUB 486 2486 ILE E 1486 1486 LEU A Protein A 9 FT #SUB 486 2486 ILE E 1487 1487 PRO A Protein A 6 FT #SUB 487 2487 VAL E 1483 1483 ASN A Protein A 3 FT #SUB 487 2487 VAL E 1484 1484 CYS A Protein A 4 FT #SUB 487 2487 VAL E 1486 1486 LEU A Protein B 1 FT #SUB 488 2488 ARG E 1483 1483 ASN A Protein A 8 FT #SUB 488 2488 ARG E 1486 1486 LEU A Protein B 1 FT #SUB 490 2490 PRO E 1483 1483 ASN A Protein S 1 FT #SUB 729 2729 LYS E 1245 1245 GLU A Protein B 2 FT #SUB 736 2736 TYR E 1234 1234 VAL A Protein B 1 FT #SUB 736 2736 TYR E 1235 1235 ILE A Protein B 4 FT #SUB 736 2736 TYR E 1236 1236 TYR A Protein A 5 FT #SUB 736 2736 TYR E 1238 1238 PRO A Protein S 1 FT #SUB 737 2737 CYS E 1234 1234 VAL A Protein B 2 FT #SUB 738 2738 ALA E 1232 1232 THR A Protein S 1 FT #SUB 738 2738 ALA E 1233 1233 SER A Protein A 5 FT #SUB 738 2738 ALA E 1234 1234 VAL A Protein B 4 FT #SUB 739 2739 LEU E 1207 1207 TYR A Protein S 4 FT #SUB 739 2739 LEU E 1211 1211 TYR A Protein A 9 FT #SUB 739 2739 LEU E 1234 1234 VAL A Protein A 4 FT #SUB 740 2740 ASP E 1211 1211 TYR A Protein B 1 FT #SUB 740 2740 ASP E 1212 1212 HIS A Protein S 3 FT #SUB 740 2740 ASP E 1213 1213 VAL A Protein S 2 FT #SUB 740 2740 ASP E 1232 1232 THR A Protein S 1 FT #SUB 741 2741 GLN E 1232 1232 THR A Protein S 5 FT #SUB 743 2743 ALA E 1208 1208 THR A Protein A 2 FT #SUB 743 2743 ALA E 1209 1209 ASP A Protein S 2 FT #SUB 743 2743 ALA E 1210 1210 LYS A Protein A 6 FT #SUB 744 2744 PHE E 1164 1164 ASP A Protein S 3 FT #SUB 744 2744 PHE E 1210 1210 LYS A Protein A 15 FT #SUB 744 2744 PHE E 1212 1212 HIS A Protein S 2 FT #SUB 809 2809 LEU E 1187 1187 LYS A Protein S 1 FT #SUB 809 2809 LEU E 1188 1188 PHE A Protein S 1 FT #SUB 809 2809 LEU E 1189 1189 ASP A Protein S 1 FT #SUB 847 2847 HIS E 1230 1230 LEU A Protein S 3 FT #SUB 849 2849 ASP E 1147 1147 MET A Protein B 2 FT #SUB 849 2849 ASP E 1189 1189 ASP A Protein B 1 FT #SUB 849 2849 ASP E 1191 1191 PRO A Protein A 5 FT #SUB 849 2849 ASP E 1233 1233 SER A Protein S 1 FT #SUB 850 2850 ARG E 1189 1189 ASP A Protein B 1 FT #SUB 851 2851 ASN E 1189 1189 ASP A Protein S 11 FT #SUB 870 2870 LEU E 1074 1074 ILE A Protein S 1 FT #SUB 870 2870 LEU E 1078 1078 ARG A Protein B 2 FT #SUB 871 2871 PHE E 1070 1070 ALA A Protein S 1 FT #SUB 871 2871 PHE E 1071 1071 ASN A Protein S 2 FT #SUB 871 2871 PHE E 1074 1074 ILE A Protein S 2 FT #SUB 871 2871 PHE E 1103 1103 PHE A Protein A 7 FT #SUB 872 2872 GLU E 1078 1078 ARG A Protein B 2 FT #SUB 875 2875 SER E 1078 1078 ARG A Protein B 1 FT #SUB 876 2876 LYS E 1078 1078 ARG A Protein B 1 FT #SUB 877 2877 ILE E 1078 1078 ARG A Protein B 7 FT #SUB 897 2897 PRO E 1187 1187 LYS A Protein S 3 FT #SUB 898 2898 LYS E 1075 1075 GLU A Protein S 1 FT #SUB 899 2899 PRO E 1075 1075 GLU A Protein B 3 FT #SUB 900 2900 SER E 1073 1073 ALA A Protein S 1 FT #SUB 900 2900 SER E 1075 1075 GLU A Protein A 4 FT #SUB 901 2901 LEU E 1071 1071 ASN A Protein B 1 FT #SUB 901 2901 LEU E 1073 1073 ALA A Protein B 1 FT #SUB 901 2901 LEU E 1074 1074 ILE A Protein A 6 FT #SUB 901 2901 LEU E 1075 1075 GLU A Protein S 3 FT #SUB 902 2902 ILE E 1071 1071 ASN A Protein A 4 FT #SUB 903 2903 TYR E 1071 1071 ASN A Protein A 6 FT #SUB 84 2084 ARG E 502 2502 LEU B Protein S 3 FT #SUB 90 2090 LYS E 375 2375 GLY B Protein S 1 FT #SUB 94 2094 ARG E 94 2094 ARG B Protein S 6 FT #SUB 94 2094 ARG E 182 2182 TYR B Protein A 4 FT #SUB 94 2094 ARG E 370 2370 MET B Protein S 4 FT #SUB 96 2096 SER E 374 2374 ASN B Protein S 3 FT #SUB 97 2097 LEU E 235 2235 PRO B Protein S 2 FT #SUB 98 2098 GLN E 374 2374 ASN B Protein S 1 FT #SUB 99 2099 GLU E 375 2375 GLY B Protein S 1 FT #SUB 109 2109 ARG E 503 2503 ASN B Protein S 1 FT #SUB 109 2109 ARG E 582 2582 ARG B Protein S 5 FT #SUB 109 2109 ARG E 585 2585 GLY B Protein B 1 FT #SUB 112 2112 LYS E 583 2583 ARG B Protein S 1 FT #SUB 113 2113 ASP E 519 2519 ASN B Protein S 3 FT #SUB 113 2113 ASP E 585 2585 GLY B Protein S 4 FT #SUB 114 2114 ARG E 526 2526 ARG B Protein S 6 FT #SUB 115 2115 SER E 519 2519 ASN B Protein S 4 FT #SUB 115 2115 SER E 522 2522 SER B Protein B 2 FT #SUB 121 2121 THR E 615 2615 TRP B Protein S 3 FT #SUB 124 2124 SER E 615 2615 TRP B Protein S 2 FT #SUB 125 2125 PHE E 615 2615 TRP B Protein S 1 FT #SUB 131 2131 LEU E 615 2615 TRP B Protein S 1 FT #SUB 167 2167 ASN E 502 2502 LEU B Protein A 4 FT #SUB 168 2168 ARG E 502 2502 LEU B Protein B 1 FT #SUB 168 2168 ARG E 503 2503 ASN B Protein B 3 FT #SUB 168 2168 ARG E 515 2515 ARG B Protein S 2 FT #SUB 168 2168 ARG E 516 2516 ASP B Protein S 5 FT #SUB 168 2168 ARG E 587 2587 SER B Protein B 2 FT #SUB 169 2169 HIS E 503 2503 ASN B Protein B 2 FT #SUB 169 2169 HIS E 585 2585 GLY B Protein S 1 FT #SUB 169 2169 HIS E 587 2587 SER B Protein S 2 FT #SUB 170 2170 GLY E 503 2503 ASN B Protein B 1 FT #SUB 182 2182 TYR E 94 2094 ARG B Protein S 7 FT #SUB 199 2199 PRO E 235 2235 PRO B Protein A 2 FT #SUB 199 2199 PRO E 236 2236 ALA B Protein B 2 FT #SUB 199 2199 PRO E 237 2237 LEU B Protein B 2 FT #SUB 200 2200 PHE E 237 2237 LEU B Protein B 9 FT #SUB 234 2234 GLN E 97 2097 LEU B Protein S 1 FT #SUB 235 2235 PRO E 97 2097 LEU B Protein S 2 FT #SUB 235 2235 PRO E 199 2199 PRO B Protein B 2 FT #SUB 236 2236 ALA E 199 2199 PRO B Protein B 2 FT #SUB 237 2237 LEU E 199 2199 PRO B Protein B 2 FT #SUB 237 2237 LEU E 200 2200 PHE B Protein A 8 FT #SUB 240 2240 GLN E 98 2098 GLN B Protein S 1 FT #SUB 341 2341 PRO E 614 2614 ALA B Protein S 2 FT #SUB 341 2341 PRO E 615 2615 TRP B Protein B 1 FT #SUB 342 2342 TYR E 615 2615 TRP B Protein B 4 FT #SUB 344 2344 LEU E 515 2515 ARG B Protein B 4 FT #SUB 345 2345 ASN E 515 2515 ARG B Protein A 8 FT #SUB 346 2346 PRO E 513 2513 GLU B Protein S 1 FT #SUB 346 2346 PRO E 515 2515 ARG B Protein S 3 FT #SUB 370 2370 MET E 94 2094 ARG B Protein S 9 FT #SUB 374 2374 ASN E 96 2096 SER B Protein A 3 FT #SUB 374 2374 ASN E 98 2098 GLN B Protein S 1 FT #SUB 375 2375 GLY E 90 2090 LYS B Protein B 1 FT #SUB 375 2375 GLY E 99 2099 GLU B Protein B 1 FT #SUB 502 2502 LEU E 167 2167 ASN B Protein S 3 FT #SUB 502 2502 LEU E 168 2168 ARG B Protein S 2 FT #SUB 503 2503 ASN E 109 2109 ARG B Protein S 2 FT #SUB 503 2503 ASN E 168 2168 ARG B Protein A 4 FT #SUB 503 2503 ASN E 169 2169 HIS B Protein S 2 FT #SUB 503 2503 ASN E 170 2170 GLY B Protein S 2 FT #SUB 513 2513 GLU E 346 2346 PRO B Protein S 1 FT #SUB 515 2515 ARG E 168 2168 ARG B Protein S 11 FT #SUB 515 2515 ARG E 344 2344 LEU B Protein A 5 FT #SUB 515 2515 ARG E 345 2345 ASN B Protein S 6 FT #SUB 515 2515 ARG E 346 2346 PRO B Protein S 3 FT #SUB 519 2519 ASN E 113 2113 ASP B Protein S 3 FT #SUB 519 2519 ASN E 115 2115 SER B Protein B 4 FT #SUB 522 2522 SER E 115 2115 SER B Protein S 3 FT #SUB 526 2526 ARG E 114 2114 ARG B Protein S 7 FT #SUB 583 2583 ARG E 112 2112 LYS B Protein B 3 FT #SUB 584 2584 HIS E 112 2112 LYS B Protein B 1 FT #SUB 585 2585 GLY E 109 2109 ARG B Protein B 1 FT #SUB 585 2585 GLY E 113 2113 ASP B Protein B 5 FT #SUB 585 2585 GLY E 169 2169 HIS B Protein B 1 FT #SUB 587 2587 SER E 168 2168 ARG B Protein A 5 FT #SUB 587 2587 SER E 169 2169 HIS B Protein S 3 FT #SUB 614 2614 ALA E 341 2341 PRO B Protein S 2 FT #SUB 615 2615 TRP E 116 2116 SER B Protein S 3 FT #SUB 615 2615 TRP E 121 2121 THR B Protein S 5 FT #SUB 615 2615 TRP E 341 2341 PRO B Protein S 3 FT #SUB 615 2615 TRP E 342 2342 TYR B Protein S 1 FT #SUB 797 2797 LYS E 108 3028 PRO F Protein S 1 FT #SUB 800 2800 LYS E 100 3020 SER F Protein S 1 FT #SUB 800 2800 LYS E 101 3021 LEU F Protein S 1 FT #SUB 800 2800 LYS E 103 3023 ILE F Protein A 5 FT #SUB 802 2802 ASP E 10 2930 LEU F Protein S 1 FT #SUB 802 2802 ASP E 12 2932 PRO F Protein S 3 FT #SUB 867 2867 PRO E 12 2932 PRO F Protein S 6 FT #SUB 868 2868 GLU E 11 2931 THR F Protein S 3 FT #SUB 868 2868 GLU E 12 2932 PRO F Protein A 15 FT #SUB 868 2868 GLU E 13 2933 SER F Protein S 7 FT #SUB 903 2903 TYR E 11 2931 THR F Protein S 1 FT #SUB 903 2903 TYR E 12 2932 PRO F Protein S 1 FT #SUB 609 2609 ALA E 106 106 GLN G Protein S 1 FT #SUB 610 2610 ASP E 124 124 TYR G Protein S 5 FT #SUB 610 2610 ASP E 137 137 ARG G Protein S 1 FT #SUB 612 2612 TYR E 138 138 ALA G Protein S 2 FT #SUB 612 2612 TYR E 189 189 ARG G Protein S 6 FT #SUB 617 2617 ASP E 189 189 ARG G Protein S 3 FT #SUB 617 2617 ASP E 190 190 PHE G Protein B 4 FT #SUB 617 2617 ASP E 191 191 THR G Protein S 1 FT #SUB 618 2618 SER E 190 190 PHE G Protein B 1 FT #SUB 619 2619 VAL E 136 136 ALA G Protein S 2 FT #SUB 619 2619 VAL E 138 138 ALA G Protein S 1 FT #SUB 619 2619 VAL E 190 190 PHE G Protein S 3 FT #SUB 625 2625 LEU E 107 107 HIS G Protein S 2 FT #SUB 626 2626 ARG E 108 108 PRO G Protein S 2 FT #SUB 626 2626 ARG E 109 109 LEU G Protein S 4 FT #SUB 632 2632 GLU E 117 117 LYS G Protein B 1 FT #SUB 634 2634 THR E 117 117 LYS G Protein S 4 FT #SUB 634 2634 THR E 118 118 ALA G Protein S 1 FT #SUB 635 2635 TYR E 109 109 LEU G Protein S 2 FT #SUB 637 2637 VAL E 111 111 ILE G Protein S 2 FT #SUB 637 2637 VAL E 118 118 ALA G Protein S 1 FT #SUB 691 2691 GLN E 112 112 ASP G Protein S 1 FT #SUB 691 2691 GLN E 116 116 LYS G Protein S 1 FT #SUB 692 2692 LYS E 116 116 LYS G Protein A 5 FT #SUB 693 2693 TYR E 116 116 LYS G Protein S 2 FT #SUB 693 2693 TYR E 117 117 LYS G Protein S 2 FT #SUB 136 2136 THR E 388 388 GLY b Protein S 4 FT #SUB 136 2136 THR E 389 389 THR b Protein A 7 FT #HET 126 2126 HIS E 145 3001 CUO E S 5 FT #HET 136 2136 THR E 118 1 NAG NA B 1 FT #HET 137 2137 ALA E 118 1 NAG NA B 1 FT #HET 138 2138 LYS E 118 1 NAG NA S 3 FT #HET 138 2138 LYS E 119 2 NAG NA S 3 FT #HET 144 2144 CYS E 145 3001 CUO E S 1 FT #HET 146 2146 HIS E 145 3001 CUO E S 7 FT #HET 151 2151 PHE E 145 3001 CUO E S 1 FT #HET 155 2155 HIS E 145 3001 CUO E S 10 FT #HET 267 2267 HIS E 145 3001 CUO E S 7 FT #HET 271 2271 HIS E 145 3001 CUO E S 4 FT #HET 294 2294 PHE E 145 3001 CUO E S 3 FT #HET 298 2298 HIS E 145 3001 CUO E S 7 FT #HET 403 2403 PHE E 26 2 NAG n A 3 FT #HET 405 2405 SER E 25 1 NAG n S 4 FT #HET 429 2429 LEU E 145 3001 CUO E S 1 FT #HET 474 2474 THR E 25 1 NAG n S 1 FT #HET 480 2480 LEU E 26 2 NAG n S 3 FT #HET 543 2543 HIS E 146 3002 CUO E S 6 FT #HET 559 2559 CYS E 146 3002 CUO E S 1 FT #HET 561 2561 HIS E 146 3002 CUO E S 6 FT #HET 566 2566 PHE E 146 3002 CUO E S 1 FT #HET 570 2570 HIS E 146 3002 CUO E S 7 FT #HET 680 2680 HIS E 146 3002 CUO E S 7 FT #HET 684 2684 HIS E 146 3002 CUO E S 6 FT #HET 707 2707 PHE E 146 3002 CUO E S 4 FT #HET 711 2711 HIS E 146 3002 CUO E S 7 FT #MOD 472 2472 ASN E 25 1 NAG n S FT DISORDER 1 82 FT DISORDER 908 920 CC SEQUENCE 825 AA (ATOM); CC VRGNLVRKNV DRLSLQEINS LIHALKRMQK DRSSDGFETI ASFHALPPLC PNPTAKHRHA CC CCLHGMATFP QWHRLYVVQF EHSLNRHGAI VGVPYWDWTY PMTEVPGLLT SEKYTDPFTG CC IETFNPFNHG HISFISPETM TTREVSEHLF EQPALGKQTW LFNNIILALE QTDYCDFEVQ CC FEIVHNSIHS WLGGKELYSL NHLHYAAYDP AFFLHHSNVD RLWVVWQELQ KFRGLPAYES CC NCAIELMSQP LKPFSFGAPY NLNPVTTKYS KPSDVFNYKQ NFHYEYDMLE MNGMSIAQLE CC SYIRQERQKD RVFAGFLLEG FGSSAYATFQ VCPDVGDCYE GSHFSVLGGS TEMPWAFDRL CC YKMEITDILQ AMALKFDSHF TIKTKIVAHN GTELPESLLP EATIVRIPPS AQNLEVAIPL CC NRIRRNINSL ESRDVQNLMS ALKRLKEDES DFGFQTIAGY HGSLMCPTPE APEYACCLHG CC MPTFMHWHRV YLLHFEESMR RHGASVAVPY WDWTMPSDNL PSLLGDADYY DAWTDSVIEN CC PFLRGHIKYE DTYTVREIQP ELFALAEGQK ESTLFKDVML MFEQEDYCDF EVQAEVIHNS CC IHYLIGGHQK YAMSSLMFSS FDPIFYVHHS MVDRLWAIWQ ELQKHRKLPH DKAYCALDQM CC AFPMKPFIWE SNPNPTTRAV STPSKLFDYK SLGYDYDHLN FHGMSIGQLE ALIQKQKKAD CC RVFAGFLLHG IKISADVHLK ICIEADCQEA GVIFVLGGET EMPWHFDRNY KMDITDVLKK CC RNIPPEALFE HDSKIRLEVE IKSVDGAVLD PNSLPKPSLI YAPAK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES YAGTFAVLGGETEMPWAFDRLFRYEISDEMKKLQLTEDSKFRLTTNIIAS CC ATOM -------------------------------------------------- CC CC SEQRES NGSKVSNDIFPTPTVIFVPKKERTEQVSTTKSVRGNLVRKNVDRLSLQEI CC ATOM --------------------------------VRGNLVRKNVDRLSLQEI CC ****************** CC SEQRES NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ATOM NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ************************************************** CC SEQRES FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ATOM FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ************************************************** CC SEQRES TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ATOM TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ************************************************** CC SEQRES LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ATOM LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ************************************************** CC SEQRES VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ATOM VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ************************************************** CC SEQRES YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ATOM YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ************************************************** CC SEQRES EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ATOM EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ************************************************** CC SEQRES LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ATOM LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ************************************************** CC SEQRES PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ATOM PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ************************************************** CC SEQRES PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ATOM PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ************************************************** CC SEQRES NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ATOM NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ************************************************** CC SEQRES QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ATOM QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ************************************************** CC SEQRES SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ATOM SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ************************************************** CC SEQRES WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ATOM WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ************************************************** CC SEQRES ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ATOM ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ************************************************** CC SEQRES NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ATOM NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ************************************************** CC SEQRES LIYAPAKGLIIQQVGEYDAG CC ATOM LIYAPAK------------- CC ******* SQ SEQUENCE 920 AA; MW; CN; YAGTFAVLGG ETEMPWAFDR LFRYEISDEM KKLQLTEDSK FRLTTNIIAS NGSKVSNDIF PTPTVIFVPK KERTEQVSTT KSVRGNLVRK NVDRLSLQEI NSLIHALKRM QKDRSSDGFE TIASFHALPP LCPNPTAKHR HACCLHGMAT FPQWHRLYVV QFEHSLNRHG AIVGVPYWDW TYPMTEVPGL LTSEKYTDPF TGIETFNPFN HGHISFISPE TMTTREVSEH LFEQPALGKQ TWLFNNIILA LEQTDYCDFE VQFEIVHNSI HSWLGGKELY SLNHLHYAAY DPAFFLHHSN VDRLWVVWQE LQKFRGLPAY ESNCAIELMS QPLKPFSFGA PYNLNPVTTK YSKPSDVFNY KQNFHYEYDM LEMNGMSIAQ LESYIRQERQ KDRVFAGFLL EGFGSSAYAT FQVCPDVGDC YEGSHFSVLG GSTEMPWAFD RLYKMEITDI LQAMALKFDS HFTIKTKIVA HNGTELPESL LPEATIVRIP PSAQNLEVAI PLNRIRRNIN SLESRDVQNL MSALKRLKED ESDFGFQTIA GYHGSLMCPT PEAPEYACCL HGMPTFMHWH RVYLLHFEES MRRHGASVAV PYWDWTMPSD NLPSLLGDAD YYDAWTDSVI ENPFLRGHIK YEDTYTVREI QPELFALAEG QKESTLFKDV MLMFEQEDYC DFEVQAEVIH NSIHYLIGGH QKYAMSSLMF SSFDPIFYVH HSMVDRLWAI WQELQKHRKL PHDKAYCALD QMAFPMKPFI WESNPNPTTR AVSTPSKLFD YKSLGYDYDH LNFHGMSIGQ LEALIQKQKK ADRVFAGFLL HGIKISADVH LKICIEADCQ EAGVIFVLGG ETEMPWHFDR NYKMDITDVL KKRNIPPEAL FEHDSKIRLE VEIKSVDGAV LDPNSLPKPS LIYAPAKGLI IQQVGEYDAG // ID 4YD9F STANDARD; PRT; 394 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 276 3196 THR F 416 416 ASN A Protein S 10 FT #SUB 279 3199 GLN F 419 419 THR A Protein S 1 FT #SUB 352 3272 HIS F 1535 1535 LEU A Protein S 5 FT #SUB 352 3272 HIS F 1539 1539 GLN A Protein S 1 FT #SUB 352 3272 HIS F 1543 1543 ILE A Protein A 3 FT #SUB 354 3274 ARG F 1350 1350 GLU A Protein S 2 FT #SUB 354 3274 ARG F 1533 1533 ALA A Protein S 2 FT #SUB 10 2930 LEU F 802 2802 ASP E Protein B 1 FT #SUB 11 2931 THR F 868 2868 GLU E Protein A 3 FT #SUB 11 2931 THR F 903 2903 TYR E Protein S 1 FT #SUB 12 2932 PRO F 802 2802 ASP E Protein S 3 FT #SUB 12 2932 PRO F 867 2867 PRO E Protein A 6 FT #SUB 12 2932 PRO F 868 2868 GLU E Protein A 15 FT #SUB 12 2932 PRO F 903 2903 TYR E Protein S 1 FT #SUB 13 2933 SER F 868 2868 GLU E Protein A 7 FT #SUB 100 3020 SER F 800 2800 LYS E Protein S 1 FT #SUB 101 3021 LEU F 800 2800 LYS E Protein B 1 FT #SUB 103 3023 ILE F 800 2800 LYS E Protein S 5 FT #SUB 108 3028 PRO F 797 2797 LYS E Protein S 1 FT #HET 41 2961 HIS F 148 3402 CUO F S 8 FT #HET 60 2980 HIS F 148 3402 CUO F S 3 FT #HET 69 2989 HIS F 148 3402 CUO F S 13 FT #HET 121 3041 ALA F 203 2101 CUO b B 7 FT #HET 122 3042 ASP F 203 2101 CUO b A 28 FT #HET 123 3043 THR F 203 2101 CUO b B 4 FT #HET 169 3089 HIS F 148 3402 CUO F S 7 FT #HET 173 3093 HIS F 148 3402 CUO F S 5 FT #HET 196 3116 PHE F 148 3402 CUO F S 5 FT #HET 199 3119 HIS F 148 3402 CUO F S 1 FT #HET 200 3120 HIS F 148 3402 CUO F S 10 FT DISORDER 1 7 FT DISORDER 132 141 FT DISORDER 313 319 FT DISORDER 391 394 CC SEQUENCE 366 AA (ATOM); CC NSLTPSEIEN LRNALAAVQA DKTDAGYQKI ASFHGMPLSC QYPDGTAFAC CQHGMVTFPH CC WHRLYMKQME DALKAKGAKI GIPYWDWTTA FHSLPILVTE PKNNPFHHGY IDVADTKTTR CC DPRPQSFFYR QIAFALEQRD FCDFEIQFEM GHNAIHSWVG GPSPYGMSTL HYTSYDPLFY CC VHHSNTDRIW AIWQALQKYR GLPYNSANCE INKLKKPMMP FSSEDNPNEV TKAHSTGYKS CC FDYQQLNYEY DNLNFHGMTI PQLEVHLKKI QEKDRVFAGF LLRAIGQSAD VNFDVTFGGT CC FCVLGGDYEM PWAFDRLFLY DISKSLVHLR LDAHDDFDIK VTIMGIDGKS LPPNLLPSPT CC ILFKPG CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMVRKNVNSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ATOM -------NSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ******************************************* CC SEQRES DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ATOM DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ************************************************** CC SEQRES LPILVTEPKNNPFHHGYIDVADTKTTRDPRPQLFDDPEQGDQSFFYRQIA CC ATOM LPILVTEPKNNPFHHGYIDVADTKTTRDPRP----------QSFFYRQIA CC ******************************* ********* CC SEQRES FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ATOM FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ************************************************** CC SEQRES SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ATOM SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ************************************************** CC SEQRES HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ATOM HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ************************************************** CC SEQRES AIGQSADVNFDVCRKDGECTFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ATOM AIGQSADVNFDV-------TFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ************ ******************************* CC SEQRES VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPGTGKI CC ATOM VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPG---- CC **************************************** SQ SEQUENCE 394 AA; MW; CN; SMVRKNVNSL TPSEIENLRN ALAAVQADKT DAGYQKIASF HGMPLSCQYP DGTAFACCQH GMVTFPHWHR LYMKQMEDAL KAKGAKIGIP YWDWTTAFHS LPILVTEPKN NPFHHGYIDV ADTKTTRDPR PQLFDDPEQG DQSFFYRQIA FALEQRDFCD FEIQFEMGHN AIHSWVGGPS PYGMSTLHYT SYDPLFYVHH SNTDRIWAIW QALQKYRGLP YNSANCEINK LKKPMMPFSS EDNPNEVTKA HSTGYKSFDY QQLNYEYDNL NFHGMTIPQL EVHLKKIQEK DRVFAGFLLR AIGQSADVNF DVCRKDGECT FGGTFCVLGG DYEMPWAFDR LFLYDISKSL VHLRLDAHDD FDIKVTIMGI DGKSLPPNLL PSPTILFKPG TGKI // ID 4YD9G STANDARD; PRT; 2000 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 388 388 GLY G 136 2136 THR B Protein B 4 FT #SUB 389 389 THR G 136 2136 THR B Protein A 10 FT #SUB 389 389 THR G 138 2138 LYS B Protein S 1 FT #SUB 328 328 LYS G 1571 1571 TYR D Protein S 9 FT #SUB 328 328 LYS G 1583 1583 CYS D Protein B 1 FT #SUB 328 328 LYS G 1631 1631 GLU D Protein S 4 FT #SUB 329 329 ARG G 1572 1572 ILE D Protein S 3 FT #SUB 329 329 ARG G 1573 1573 CYS D Protein S 2 FT #SUB 329 329 ARG G 1574 1574 VAL D Protein S 4 FT #SUB 329 329 ARG G 1582 1582 ASN D Protein A 5 FT #SUB 329 329 ARG G 1583 1583 CYS D Protein A 11 FT #SUB 329 329 ARG G 1584 1584 GLY D Protein A 15 FT #SUB 329 329 ARG G 1585 1585 ASN D Protein S 6 FT #SUB 330 330 GLN G 1582 1582 ASN D Protein A 2 FT #SUB 335 335 ASP G 1578 1578 ILE D Protein S 1 FT #SUB 335 335 ASP G 1579 1579 GLY D Protein S 5 FT #SUB 472 472 ASP G 1638 1638 SER D Protein A 5 FT #SUB 472 472 ASP G 1640 1640 ILE D Protein S 1 FT #SUB 1365 1365 TYR G 1392 1392 ASP D Protein S 1 FT #SUB 1365 1365 TYR G 1437 1437 ARG D Protein S 19 FT #SUB 1372 1372 ILE G 1390 1390 ASP D Protein A 6 FT #SUB 1372 1372 ILE G 1391 1391 ARG D Protein S 3 FT #SUB 1372 1372 ILE G 1438 1438 ASP D Protein A 7 FT #SUB 1378 1378 PHE G 1362 1362 ALA D Protein S 1 FT #SUB 1390 1390 ASP G 1372 1372 ILE D Protein S 2 FT #SUB 1391 1391 ARG G 1372 1372 ILE D Protein B 3 FT #SUB 1392 1392 ASP G 1372 1372 ILE D Protein A 3 FT #SUB 1437 1437 ARG G 1365 1365 TYR D Protein S 17 FT #SUB 1438 1438 ASP G 1372 1372 ILE D Protein S 6 FT #SUB 1571 1571 TYR G 328 328 LYS D Protein S 3 FT #SUB 1574 1574 VAL G 329 329 ARG D Protein B 1 FT #SUB 1578 1578 ILE G 335 335 ASP D Protein B 1 FT #SUB 1579 1579 GLY G 335 335 ASP D Protein B 4 FT #SUB 1581 1581 GLU G 331 331 SER D Protein S 11 FT #SUB 1582 1582 ASN G 329 329 ARG D Protein B 5 FT #SUB 1582 1582 ASN G 330 330 GLN D Protein A 4 FT #SUB 1582 1582 ASN G 331 331 SER D Protein B 2 FT #SUB 1583 1583 CYS G 329 329 ARG D Protein A 16 FT #SUB 1584 1584 GLY G 329 329 ARG D Protein B 1 FT #SUB 1585 1585 ASN G 329 329 ARG D Protein S 3 FT #SUB 1631 1631 GLU G 328 328 LYS D Protein S 1 FT #SUB 1638 1638 SER G 472 472 ASP D Protein S 2 FT #SUB 1639 1639 LYS G 472 472 ASP D Protein B 1 FT #SUB 1640 1640 ILE G 470 470 ARG D Protein S 2 FT #SUB 1640 1640 ILE G 472 472 ASP D Protein B 1 FT #SUB 1640 1640 ILE G 473 473 ALA D Protein B 1 FT #SUB 106 106 GLN G 609 2609 ALA E Protein B 1 FT #SUB 107 107 HIS G 625 2625 LEU E Protein S 2 FT #SUB 108 108 PRO G 626 2626 ARG E Protein S 2 FT #SUB 109 109 LEU G 626 2626 ARG E Protein S 4 FT #SUB 109 109 LEU G 635 2635 TYR E Protein S 2 FT #SUB 111 111 ILE G 637 2637 VAL E Protein S 2 FT #SUB 112 112 ASP G 691 2691 GLN E Protein B 1 FT #SUB 116 116 LYS G 691 2691 GLN E Protein B 1 FT #SUB 116 116 LYS G 692 2692 LYS E Protein S 5 FT #SUB 116 116 LYS G 693 2693 TYR E Protein B 2 FT #SUB 117 117 LYS G 632 2632 GLU E Protein S 1 FT #SUB 117 117 LYS G 634 2634 THR E Protein S 4 FT #SUB 117 117 LYS G 693 2693 TYR E Protein S 2 FT #SUB 118 118 ALA G 634 2634 THR E Protein B 1 FT #SUB 118 118 ALA G 637 2637 VAL E Protein S 1 FT #SUB 124 124 TYR G 610 2610 ASP E Protein S 5 FT #SUB 136 136 ALA G 619 2619 VAL E Protein S 2 FT #SUB 137 137 ARG G 610 2610 ASP E Protein B 1 FT #SUB 138 138 ALA G 612 2612 TYR E Protein S 2 FT #SUB 138 138 ALA G 619 2619 VAL E Protein S 1 FT #SUB 189 189 ARG G 612 2612 TYR E Protein S 6 FT #SUB 189 189 ARG G 617 2617 ASP E Protein B 3 FT #SUB 190 190 PHE G 617 2617 ASP E Protein A 4 FT #SUB 190 190 PHE G 618 2618 SER E Protein S 1 FT #SUB 190 190 PHE G 619 2619 VAL E Protein S 3 FT #SUB 191 191 THR G 617 2617 ASP E Protein S 1 FT #SUB 1071 1071 ASN G 871 2871 PHE K Protein S 1 FT #SUB 1071 1071 ASN G 901 2901 LEU K Protein B 1 FT #SUB 1071 1071 ASN G 902 2902 ILE K Protein B 4 FT #SUB 1071 1071 ASN G 903 2903 TYR K Protein A 5 FT #SUB 1071 1071 ASN G 905 2905 PRO K Protein S 1 FT #SUB 1072 1072 CYS G 901 2901 LEU K Protein B 1 FT #SUB 1072 1072 CYS G 902 2902 ILE K Protein A 2 FT #SUB 1073 1073 ALA G 901 2901 LEU K Protein B 3 FT #SUB 1074 1074 ILE G 870 2870 LEU K Protein S 2 FT #SUB 1074 1074 ILE G 871 2871 PHE K Protein S 1 FT #SUB 1074 1074 ILE G 901 2901 LEU K Protein A 6 FT #SUB 1075 1075 GLU G 899 2899 PRO K Protein S 3 FT #SUB 1075 1075 GLU G 900 2900 SER K Protein S 6 FT #SUB 1075 1075 GLU G 901 2901 LEU K Protein S 1 FT #SUB 1078 1078 ARG G 870 2870 LEU K Protein S 4 FT #SUB 1078 1078 ARG G 872 2872 GLU K Protein S 4 FT #SUB 1078 1078 ARG G 873 2873 HIS K Protein A 3 FT #SUB 1078 1078 ARG G 875 2875 SER K Protein S 4 FT #SUB 1078 1078 ARG G 876 2876 LYS K Protein S 6 FT #SUB 1078 1078 ARG G 877 2877 ILE K Protein S 4 FT #SUB 1101 1101 VAL G 873 2873 HIS K Protein S 2 FT #SUB 1103 1103 PHE G 871 2871 PHE K Protein S 4 FT #SUB 1104 1104 ASP G 873 2873 HIS K Protein S 1 FT #SUB 1147 1147 MET G 849 2849 ASP K Protein S 4 FT #SUB 1164 1164 ASP G 744 2744 PHE K Protein S 1 FT #SUB 1187 1187 LYS G 809 2809 LEU K Protein S 1 FT #SUB 1187 1187 LYS G 847 2847 HIS K Protein S 2 FT #SUB 1188 1188 PHE G 809 2809 LEU K Protein B 2 FT #SUB 1189 1189 ASP G 809 2809 LEU K Protein B 1 FT #SUB 1189 1189 ASP G 849 2849 ASP K Protein B 1 FT #SUB 1189 1189 ASP G 850 2850 ARG K Protein B 1 FT #SUB 1189 1189 ASP G 851 2851 ASN K Protein A 9 FT #SUB 1191 1191 PRO G 849 2849 ASP K Protein S 5 FT #SUB 1206 1206 HIS G 457 2457 LYS K Protein S 3 FT #SUB 1207 1207 TYR G 736 2736 TYR K Protein S 1 FT #SUB 1207 1207 TYR G 739 2739 LEU K Protein A 4 FT #SUB 1208 1208 THR G 743 2743 ALA K Protein B 1 FT #SUB 1210 1210 LYS G 743 2743 ALA K Protein B 3 FT #SUB 1211 1211 TYR G 739 2739 LEU K Protein S 3 FT #SUB 1232 1232 THR G 740 2740 ASP K Protein A 5 FT #SUB 1232 1232 THR G 741 2741 GLN K Protein A 2 FT #SUB 1233 1233 SER G 738 2738 ALA K Protein B 2 FT #SUB 1233 1233 SER G 740 2740 ASP K Protein B 7 FT #SUB 1234 1234 VAL G 737 2737 CYS K Protein B 3 FT #SUB 1234 1234 VAL G 738 2738 ALA K Protein B 3 FT #SUB 1234 1234 VAL G 739 2739 LEU K Protein A 5 FT #SUB 1234 1234 VAL G 740 2740 ASP K Protein B 1 FT #SUB 1236 1236 TYR G 736 2736 TYR K Protein A 7 FT #SUB 1245 1245 GLU G 729 2729 LYS K Protein S 4 FT #SUB 1245 1245 GLU G 730 2730 LEU K Protein S 1 FT #SUB 1245 1245 GLU G 731 2731 PRO K Protein S 1 FT #SUB 1481 1481 GLU G 458 2458 PHE K Protein S 1 FT #SUB 1481 1481 GLU G 459 2459 ASP K Protein S 1 FT #SUB 1483 1483 ASN G 458 2458 PHE K Protein S 1 FT #SUB 1483 1483 ASN G 487 2487 VAL K Protein B 2 FT #SUB 1483 1483 ASN G 488 2488 ARG K Protein A 10 FT #SUB 1484 1484 CYS G 486 2486 ILE K Protein B 3 FT #SUB 1484 1484 CYS G 487 2487 VAL K Protein B 4 FT #SUB 1485 1485 ALA G 486 2486 ILE K Protein B 1 FT #SUB 1486 1486 LEU G 458 2458 PHE K Protein S 1 FT #SUB 1486 1486 LEU G 486 2486 ILE K Protein A 7 FT #SUB 1487 1487 PRO G 486 2486 ILE K Protein S 5 FT #SUB 1490 1490 ASN G 459 2459 ASP K Protein S 1 FT #SUB 1490 1490 ASN G 461 2461 HIS K Protein S 9 FT #SUB 1558 1558 LEU G 440 2440 ASP K Protein S 1 FT #SUB 1602 1602 GLN G 482 2482 PRO K Protein S 1 FT #SUB 1603 1603 PHE G 399 2399 LEU K Protein B 1 FT #SUB 1604 1604 ASP G 440 2440 ASP K Protein B 1 FT #SUB 1604 1604 ASP G 441 2441 ARG K Protein B 2 FT #SUB 1604 1604 ASP G 442 2442 LEU K Protein A 6 FT #SUB 1605 1605 ARG G 440 2440 ASP K Protein B 1 FT #SUB 1606 1606 LEU G 440 2440 ASP K Protein A 6 FT #SUB 1622 1622 GLN G 326 2326 ILE K Protein B 1 FT #SUB 1648 1648 PRO G 327 2327 GLU K Protein B 3 FT #SUB 1649 1649 THR G 325 2325 ALA K Protein S 1 FT #SUB 1649 1649 THR G 327 2327 GLU K Protein B 4 FT #SUB 1650 1650 ILE G 323 2323 ASN K Protein B 1 FT #SUB 1650 1650 ILE G 325 2325 ALA K Protein B 1 FT #SUB 1650 1650 ILE G 326 2326 ILE K Protein A 5 FT #SUB 1650 1650 ILE G 327 2327 GLU K Protein A 4 FT #SUB 1651 1651 ILE G 323 2323 ASN K Protein A 3 FT #SUB 1652 1652 PHE G 323 2323 ASN K Protein A 10 FT #SUB 1654 1654 PRO G 323 2323 ASN K Protein S 2 FT #SUB 417 417 MET G 279 3199 GLN L Protein S 5 FT #SUB 1350 1350 GLU G 354 3274 ARG L Protein S 4 FT #SUB 1404 1404 TYR G 352 3272 HIS L Protein S 1 FT #SUB 1533 1533 ALA G 351 3271 VAL L Protein B 3 FT #SUB 1533 1533 ALA G 354 3274 ARG L Protein B 1 FT #SUB 1534 1534 GLY G 348 3268 LYS L Protein B 1 FT #SUB 1535 1535 LEU G 352 3272 HIS L Protein S 5 FT #SUB 1539 1539 GLN G 348 3268 LYS L Protein S 2 FT #SUB 1539 1539 GLN G 352 3272 HIS L Protein S 1 FT #SUB 1543 1543 ILE G 352 3272 HIS L Protein S 4 FT #HET 41 41 HIS G 150 2102 CUO G S 6 FT #HET 58 58 CYS G 150 2102 CUO G S 1 FT #HET 60 60 HIS G 150 2102 CUO G S 6 FT #HET 69 69 HIS G 150 2102 CUO G S 9 FT #HET 179 179 HIS G 150 2102 CUO G S 5 FT #HET 183 183 HIS G 150 2102 CUO G S 4 FT #HET 206 206 PHE G 150 2102 CUO G S 3 FT #HET 210 210 HIS G 150 2102 CUO G S 8 FT #HET 385 385 ALA G 27 1 NAG o S 1 FT #HET 389 389 THR G 27 1 NAG o S 3 FT #HET 391 391 MET G 28 2 NAG o S 1 FT #HET 462 462 HIS G 151 2103 CUO G S 8 FT #HET 480 480 CYS G 151 2103 CUO G S 1 FT #HET 482 482 HIS G 151 2103 CUO G S 6 FT #HET 491 491 HIS G 151 2103 CUO G S 9 FT #HET 603 603 HIS G 151 2103 CUO G S 4 FT #HET 607 607 HIS G 151 2103 CUO G S 7 FT #HET 630 630 PHE G 151 2103 CUO G S 2 FT #HET 634 634 HIS G 151 2103 CUO G S 7 FT #HET 740 740 THR G 29 1 NAG p S 1 FT #HET 763 763 LEU G 151 2103 CUO G S 1 FT #HET 804 804 ASP G 29 1 NAG p S 7 FT #HET 808 808 THR G 29 1 NAG p S 3 FT #HET 810 810 LEU G 29 1 NAG p S 2 FT #HET 810 810 LEU G 30 2 NAG p S 1 FT #HET 877 877 HIS G 152 2104 CUO G S 5 FT #HET 895 895 CYS G 152 2104 CUO G S 1 FT #HET 897 897 HIS G 152 2104 CUO G S 3 FT #HET 906 906 HIS G 152 2104 CUO G S 10 FT #HET 1015 1015 HIS G 152 2104 CUO G S 7 FT #HET 1019 1019 HIS G 152 2104 CUO G S 6 FT #HET 1042 1042 PHE G 152 2104 CUO G S 2 FT #HET 1046 1046 HIS G 152 2104 CUO G S 8 FT #HET 1294 1294 HIS G 153 2105 CUO G S 7 FT #HET 1298 1298 ALA G 32 2 NAG q B 1 FT #HET 1299 1299 GLN G 31 1 NAG q A 9 FT #HET 1301 1301 PRO G 31 1 NAG q S 1 FT #HET 1301 1301 PRO G 32 2 NAG q S 3 FT #HET 1308 1308 VAL G 32 2 NAG q S 1 FT #HET 1312 1312 CYS G 153 2105 CUO G S 1 FT #HET 1314 1314 HIS G 153 2105 CUO G S 6 FT #HET 1319 1319 PHE G 153 2105 CUO G S 1 FT #HET 1323 1323 HIS G 153 2105 CUO G S 9 FT #HET 1427 1427 HIS G 153 2105 CUO G S 6 FT #HET 1431 1431 HIS G 153 2105 CUO G S 8 FT #HET 1454 1454 PHE G 153 2105 CUO G S 2 FT #HET 1458 1458 HIS G 153 2105 CUO G S 9 FT #HET 1494 1494 ARG G 31 1 NAG q S 1 FT #HET 1500 1500 THR G 31 1 NAG q B 3 FT #HET 1564 1564 ALA G 33 1 NAG r S 3 FT #HET 1635 1635 VAL G 33 1 NAG r S 1 FT #HET 1639 1639 LYS G 33 1 NAG r S 3 FT #HET 1641 1641 THR G 34 2 NAG r S 1 FT #HET 1644 1644 ILE G 34 2 NAG r S 1 FT #MOD 387 387 ASN G 27 1 NAG o S FT #MOD 806 806 ASN G 29 1 NAG p S FT #MOD 1498 1498 ASN G 31 1 NAG q S FT #MOD 1636 1636 ASN G 33 1 NAG r S FT DISORDER 1657 2000 CC SEQUENCE 1656 AA (ATOM); CC NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH CC GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK CC NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN CC PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF CC HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA CC RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH CC EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS CC SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC CC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET CC KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE CC VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA CC NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV CC EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE CC ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR CC KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA CC TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF CC SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA CC LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP CC MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN CC RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM CC DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR CC KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP CC HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG CC KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH CC SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST CC ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH CC GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL CC NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSD CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ATOM NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ************************************************** CC SEQRES DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ATOM DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ************************************************** CC SEQRES LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ATOM LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ************************************************** CC SEQRES QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ATOM QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ************************************************** CC SEQRES SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ATOM SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ************************************************** CC SEQRES TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ATOM TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ************************************************** CC SEQRES RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ATOM RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ************************************************** CC SEQRES PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ATOM PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ************************************************** CC SEQRES VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ATOM VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ************************************************** CC SEQRES SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ATOM SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ************************************************** CC SEQRES ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ATOM ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ************************************************** CC SEQRES EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ATOM EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ************************************************** CC SEQRES VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ATOM VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ************************************************** CC SEQRES RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ATOM RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ************************************************** CC SEQRES YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ATOM YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ************************************************** CC SEQRES GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ATOM GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ************************************************** CC SEQRES DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ATOM DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ************************************************** CC SEQRES MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ATOM MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ************************************************** CC SEQRES TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ATOM TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ************************************************** CC SEQRES NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ATOM NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ************************************************** CC SEQRES QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ATOM QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ************************************************** CC SEQRES RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ATOM RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ************************************************** CC SEQRES VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ATOM VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ************************************************** CC SEQRES LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ATOM LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ************************************************** CC SEQRES DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ATOM DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ************************************************** CC SEQRES SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ATOM SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ************************************************** CC SEQRES PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ATOM PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ************************************************** CC SEQRES PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ATOM PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ************************************************** CC SEQRES RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ATOM RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ************************************************** CC SEQRES DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ATOM DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ************************************************** CC SEQRES ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ATOM ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ************************************************** CC SEQRES DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ATOM DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ************************************************** CC SEQRES WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ATOM WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ************************************************** CC SEQRES IFVPSDVEFEEDTWRDVVTSANRIRRNLKDLSKEDMFSLRAAFKRMTDDG CC ATOM IFVPSD-------------------------------------------- CC ****** CC SEQRES RYEEIAAFHGLPAQCPNADGSNIHTCCLHGMPTFPHWHRLYLSLVENELL CC ATOM -------------------------------------------------- CC CC SEQRES ARGSDVAVPYWDWIEPFDSLPGLISDETYKHPKTNEDIENPFHHGKISFA CC ATOM -------------------------------------------------- CC CC SEQRES DAVTVRKPRDQLFNNRYLYEHALFAFEHTDFCDFEVHFEVLHNSIHSWIG CC ATOM -------------------------------------------------- CC CC SEQRES GPNPHSMSSLDFAAYDPIFFLHHSTVDRLWAIWQDLQRYRKLDYNVANCA CC ATOM -------------------------------------------------- CC CC SEQRES LNLLNDPMRPFNNKTANQDHLTFTNSRPNDVFDYQNSLNYKFDSLSFSGL CC ATOM -------------------------------------------------- CC CC SEQRES SIPRLDDLLESRQSHDRVFAGFWLSGIKASADVNIHICVPIGVEHEDCDN CC ATOM -------------------------------------------------- CC CC SEQRES CC ATOM CC SQ SEQUENCE 2000 AA; MW; CN; NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSDVEFE EDTWRDVVTS ANRIRRNLKD LSKEDMFSLR AAFKRMTDDG RYEEIAAFHG LPAQCPNADG SNIHTCCLHG MPTFPHWHRL YLSLVENELL ARGSDVAVPY WDWIEPFDSL PGLISDETYK HPKTNEDIEN PFHHGKISFA DAVTVRKPRD QLFNNRYLYE HALFAFEHTD FCDFEVHFEV LHNSIHSWIG GPNPHSMSSL DFAAYDPIFF LHHSTVDRLW AIWQDLQRYR KLDYNVANCA LNLLNDPMRP FNNKTANQDH LTFTNSRPND VFDYQNSLNY KFDSLSFSGL SIPRLDDLLE SRQSHDRVFA GFWLSGIKAS ADVNIHICVP IGVEHEDCDN // ID 4YD9H STANDARD; PRT; 920 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 609 2609 ALA H 106 106 GLN D Protein S 1 FT #SUB 610 2610 ASP H 124 124 TYR D Protein S 5 FT #SUB 612 2612 TYR H 138 138 ALA D Protein S 2 FT #SUB 612 2612 TYR H 189 189 ARG D Protein S 4 FT #SUB 617 2617 ASP H 189 189 ARG D Protein S 1 FT #SUB 617 2617 ASP H 190 190 PHE D Protein B 5 FT #SUB 618 2618 SER H 190 190 PHE D Protein B 1 FT #SUB 619 2619 VAL H 138 138 ALA D Protein S 1 FT #SUB 619 2619 VAL H 190 190 PHE D Protein S 2 FT #SUB 625 2625 LEU H 107 107 HIS D Protein S 2 FT #SUB 626 2626 ARG H 109 109 LEU D Protein S 3 FT #SUB 632 2632 GLU H 117 117 LYS D Protein B 1 FT #SUB 633 2633 ASP H 119 119 LYS D Protein B 1 FT #SUB 634 2634 THR H 117 117 LYS D Protein S 4 FT #SUB 634 2634 THR H 119 119 LYS D Protein B 1 FT #SUB 635 2635 TYR H 109 109 LEU D Protein S 1 FT #SUB 636 2636 THR H 109 109 LEU D Protein B 1 FT #SUB 637 2637 VAL H 109 109 LEU D Protein S 1 FT #SUB 637 2637 VAL H 111 111 ILE D Protein S 1 FT #SUB 637 2637 VAL H 117 117 LYS D Protein S 1 FT #SUB 639 2639 GLU H 111 111 ILE D Protein S 3 FT #SUB 691 2691 GLN H 112 112 ASP D Protein S 1 FT #SUB 691 2691 GLN H 116 116 LYS D Protein S 1 FT #SUB 691 2691 GLN H 117 117 LYS D Protein S 1 FT #SUB 693 2693 TYR H 117 117 LYS D Protein S 7 FT #SUB 797 2797 LYS H 107 3027 GLU I Protein S 4 FT #SUB 797 2797 LYS H 108 3028 PRO I Protein S 4 FT #SUB 800 2800 LYS H 100 3020 SER I Protein S 2 FT #SUB 800 2800 LYS H 101 3021 LEU I Protein S 3 FT #SUB 800 2800 LYS H 103 3023 ILE I Protein A 6 FT #SUB 801 2801 ALA H 103 3023 ILE I Protein B 2 FT #SUB 802 2802 ASP H 12 2932 PRO I Protein S 4 FT #SUB 867 2867 PRO H 12 2932 PRO I Protein S 2 FT #SUB 868 2868 GLU H 11 2931 THR I Protein S 4 FT #SUB 868 2868 GLU H 12 2932 PRO I Protein A 13 FT #SUB 868 2868 GLU H 13 2933 SER I Protein S 8 FT #SUB 868 2868 GLU H 14 2934 GLU I Protein S 1 FT #SUB 903 2903 TYR H 12 2932 PRO I Protein S 3 FT #SUB 321 2321 GLU H 1623 1623 ASN J Protein S 1 FT #SUB 323 2323 ASN H 1650 1650 ILE J Protein B 1 FT #SUB 323 2323 ASN H 1651 1651 ILE J Protein B 2 FT #SUB 323 2323 ASN H 1652 1652 PHE J Protein A 10 FT #SUB 323 2323 ASN H 1654 1654 PRO J Protein S 2 FT #SUB 324 2324 CYS H 1650 1650 ILE J Protein B 2 FT #SUB 325 2325 ALA H 1649 1649 THR J Protein B 2 FT #SUB 325 2325 ALA H 1650 1650 ILE J Protein B 3 FT #SUB 326 2326 ILE H 1622 1622 GLN J Protein S 1 FT #SUB 326 2326 ILE H 1650 1650 ILE J Protein A 4 FT #SUB 326 2326 ILE H 1652 1652 PHE J Protein S 1 FT #SUB 327 2327 GLU H 1648 1648 PRO J Protein S 3 FT #SUB 327 2327 GLU H 1649 1649 THR J Protein S 6 FT #SUB 327 2327 GLU H 1650 1650 ILE J Protein S 3 FT #SUB 353 2353 LYS H 1625 1625 HIS J Protein S 2 FT #SUB 399 2399 LEU H 1603 1603 PHE J Protein S 1 FT #SUB 401 2401 GLU H 1602 1602 GLN J Protein S 1 FT #SUB 439 2439 PHE H 1558 1558 LEU J Protein B 1 FT #SUB 440 2440 ASP H 1558 1558 LEU J Protein B 2 FT #SUB 440 2440 ASP H 1605 1605 ARG J Protein B 2 FT #SUB 440 2440 ASP H 1606 1606 LEU J Protein A 5 FT #SUB 442 2442 LEU H 1604 1604 ASP J Protein A 8 FT #SUB 442 2442 LEU H 1605 1605 ARG J Protein S 1 FT #SUB 457 2457 LYS H 1206 1206 HIS J Protein S 4 FT #SUB 458 2458 PHE H 1481 1481 GLU J Protein S 1 FT #SUB 458 2458 PHE H 1486 1486 LEU J Protein A 2 FT #SUB 459 2459 ASP H 1481 1481 GLU J Protein S 1 FT #SUB 461 2461 HIS H 1490 1490 ASN J Protein S 7 FT #SUB 485 2485 THR H 1485 1485 ALA J Protein S 2 FT #SUB 486 2486 ILE H 1483 1483 ASN J Protein B 1 FT #SUB 486 2486 ILE H 1485 1485 ALA J Protein B 3 FT #SUB 486 2486 ILE H 1486 1486 LEU J Protein B 4 FT #SUB 486 2486 ILE H 1487 1487 PRO J Protein A 3 FT #SUB 487 2487 VAL H 1483 1483 ASN J Protein B 1 FT #SUB 488 2488 ARG H 1483 1483 ASN J Protein A 7 FT #SUB 490 2490 PRO H 1483 1483 ASN J Protein S 1 FT #SUB 729 2729 LYS H 1245 1245 GLU J Protein A 4 FT #SUB 736 2736 TYR H 1235 1235 ILE J Protein B 2 FT #SUB 736 2736 TYR H 1236 1236 TYR J Protein A 5 FT #SUB 736 2736 TYR H 1238 1238 PRO J Protein S 1 FT #SUB 736 2736 TYR H 1245 1245 GLU J Protein S 3 FT #SUB 738 2738 ALA H 1232 1232 THR J Protein S 1 FT #SUB 738 2738 ALA H 1233 1233 SER J Protein S 2 FT #SUB 738 2738 ALA H 1234 1234 VAL J Protein A 5 FT #SUB 739 2739 LEU H 1207 1207 TYR J Protein S 3 FT #SUB 739 2739 LEU H 1211 1211 TYR J Protein S 3 FT #SUB 739 2739 LEU H 1234 1234 VAL J Protein A 3 FT #SUB 740 2740 ASP H 1211 1211 TYR J Protein S 1 FT #SUB 740 2740 ASP H 1212 1212 HIS J Protein S 2 FT #SUB 740 2740 ASP H 1213 1213 VAL J Protein S 1 FT #SUB 740 2740 ASP H 1232 1232 THR J Protein S 3 FT #SUB 741 2741 GLN H 1232 1232 THR J Protein S 1 FT #SUB 743 2743 ALA H 1208 1208 THR J Protein A 2 FT #SUB 743 2743 ALA H 1209 1209 ASP J Protein S 1 FT #SUB 743 2743 ALA H 1210 1210 LYS J Protein A 3 FT #SUB 744 2744 PHE H 1210 1210 LYS J Protein S 4 FT #SUB 744 2744 PHE H 1212 1212 HIS J Protein S 2 FT #SUB 809 2809 LEU H 1187 1187 LYS J Protein S 1 FT #SUB 809 2809 LEU H 1188 1188 PHE J Protein S 2 FT #SUB 809 2809 LEU H 1189 1189 ASP J Protein S 1 FT #SUB 847 2847 HIS H 1187 1187 LYS J Protein S 3 FT #SUB 847 2847 HIS H 1230 1230 LEU J Protein S 2 FT #SUB 849 2849 ASP H 1147 1147 MET J Protein B 2 FT #SUB 849 2849 ASP H 1191 1191 PRO J Protein B 2 FT #SUB 849 2849 ASP H 1233 1233 SER J Protein S 3 FT #SUB 851 2851 ASN H 1189 1189 ASP J Protein S 2 FT #SUB 870 2870 LEU H 1074 1074 ILE J Protein S 1 FT #SUB 870 2870 LEU H 1078 1078 ARG J Protein B 6 FT #SUB 871 2871 PHE H 1070 1070 ALA J Protein S 2 FT #SUB 871 2871 PHE H 1071 1071 ASN J Protein S 1 FT #SUB 871 2871 PHE H 1074 1074 ILE J Protein S 1 FT #SUB 871 2871 PHE H 1103 1103 PHE J Protein B 2 FT #SUB 872 2872 GLU H 1078 1078 ARG J Protein B 4 FT #SUB 873 2873 HIS H 1078 1078 ARG J Protein B 2 FT #SUB 875 2875 SER H 1078 1078 ARG J Protein A 4 FT #SUB 877 2877 ILE H 1078 1078 ARG J Protein B 3 FT #SUB 899 2899 PRO H 1075 1075 GLU J Protein B 2 FT #SUB 900 2900 SER H 1075 1075 GLU J Protein A 6 FT #SUB 901 2901 LEU H 1073 1073 ALA J Protein B 3 FT #SUB 901 2901 LEU H 1074 1074 ILE J Protein A 5 FT #SUB 901 2901 LEU H 1075 1075 GLU J Protein B 1 FT #SUB 902 2902 ILE H 1071 1071 ASN J Protein A 3 FT #SUB 903 2903 TYR H 1071 1071 ASN J Protein A 5 FT #SUB 905 2905 PRO H 1071 1071 ASN J Protein S 2 FT #SUB 84 2084 ARG H 502 2502 LEU K Protein S 4 FT #SUB 90 2090 LYS H 375 2375 GLY K Protein S 1 FT #SUB 94 2094 ARG H 91 2091 ASN K Protein S 1 FT #SUB 94 2094 ARG H 94 2094 ARG K Protein S 8 FT #SUB 94 2094 ARG H 182 2182 TYR K Protein A 4 FT #SUB 94 2094 ARG H 370 2370 MET K Protein S 4 FT #SUB 96 2096 SER H 374 2374 ASN K Protein S 3 FT #SUB 97 2097 LEU H 235 2235 PRO K Protein S 2 FT #SUB 98 2098 GLN H 244 2244 PHE K Protein S 1 FT #SUB 98 2098 GLN H 374 2374 ASN K Protein S 2 FT #SUB 99 2099 GLU H 375 2375 GLY K Protein S 1 FT #SUB 109 2109 ARG H 503 2503 ASN K Protein S 1 FT #SUB 109 2109 ARG H 582 2582 ARG K Protein S 1 FT #SUB 109 2109 ARG H 585 2585 GLY K Protein B 1 FT #SUB 112 2112 LYS H 583 2583 ARG K Protein S 1 FT #SUB 112 2112 LYS H 584 2584 HIS K Protein B 1 FT #SUB 113 2113 ASP H 519 2519 ASN K Protein S 3 FT #SUB 113 2113 ASP H 585 2585 GLY K Protein A 7 FT #SUB 114 2114 ARG H 526 2526 ARG K Protein S 6 FT #SUB 115 2115 SER H 519 2519 ASN K Protein S 4 FT #SUB 115 2115 SER H 522 2522 SER K Protein A 2 FT #SUB 116 2116 SER H 615 2615 TRP K Protein S 3 FT #SUB 121 2121 THR H 615 2615 TRP K Protein S 3 FT #SUB 125 2125 PHE H 615 2615 TRP K Protein S 1 FT #SUB 167 2167 ASN H 502 2502 LEU K Protein A 3 FT #SUB 168 2168 ARG H 503 2503 ASN K Protein B 4 FT #SUB 168 2168 ARG H 515 2515 ARG K Protein S 6 FT #SUB 168 2168 ARG H 587 2587 SER K Protein A 7 FT #SUB 169 2169 HIS H 503 2503 ASN K Protein B 2 FT #SUB 169 2169 HIS H 585 2585 GLY K Protein S 2 FT #SUB 169 2169 HIS H 587 2587 SER K Protein S 4 FT #SUB 170 2170 GLY H 503 2503 ASN K Protein B 2 FT #SUB 182 2182 TYR H 94 2094 ARG K Protein S 5 FT #SUB 199 2199 PRO H 236 2236 ALA K Protein B 1 FT #SUB 199 2199 PRO H 237 2237 LEU K Protein B 2 FT #SUB 200 2200 PHE H 237 2237 LEU K Protein B 7 FT #SUB 235 2235 PRO H 97 2097 LEU K Protein S 2 FT #SUB 235 2235 PRO H 199 2199 PRO K Protein B 1 FT #SUB 236 2236 ALA H 199 2199 PRO K Protein B 2 FT #SUB 237 2237 LEU H 199 2199 PRO K Protein A 4 FT #SUB 237 2237 LEU H 200 2200 PHE K Protein A 8 FT #SUB 237 2237 LEU H 202 2202 GLY K Protein S 1 FT #SUB 341 2341 PRO H 614 2614 ALA K Protein S 1 FT #SUB 342 2342 TYR H 614 2614 ALA K Protein S 1 FT #SUB 342 2342 TYR H 615 2615 TRP K Protein A 6 FT #SUB 344 2344 LEU H 515 2515 ARG K Protein B 3 FT #SUB 345 2345 ASN H 515 2515 ARG K Protein B 1 FT #SUB 346 2346 PRO H 515 2515 ARG K Protein S 6 FT #SUB 370 2370 MET H 94 2094 ARG K Protein S 5 FT #SUB 370 2370 MET H 370 2370 MET K Protein S 1 FT #SUB 374 2374 ASN H 96 2096 SER K Protein A 3 FT #SUB 374 2374 ASN H 98 2098 GLN K Protein S 1 FT #SUB 375 2375 GLY H 90 2090 LYS K Protein B 1 FT #SUB 375 2375 GLY H 99 2099 GLU K Protein B 1 FT #SUB 502 2502 LEU H 84 2084 ARG K Protein S 3 FT #SUB 502 2502 LEU H 167 2167 ASN K Protein S 2 FT #SUB 503 2503 ASN H 168 2168 ARG K Protein A 3 FT #SUB 513 2513 GLU H 346 2346 PRO K Protein S 1 FT #SUB 515 2515 ARG H 168 2168 ARG K Protein S 10 FT #SUB 515 2515 ARG H 344 2344 LEU K Protein S 4 FT #SUB 515 2515 ARG H 345 2345 ASN K Protein S 5 FT #SUB 515 2515 ARG H 346 2346 PRO K Protein S 4 FT #SUB 519 2519 ASN H 113 2113 ASP K Protein S 4 FT #SUB 519 2519 ASN H 115 2115 SER K Protein A 4 FT #SUB 522 2522 SER H 115 2115 SER K Protein S 4 FT #SUB 526 2526 ARG H 114 2114 ARG K Protein S 6 FT #SUB 583 2583 ARG H 112 2112 LYS K Protein B 3 FT #SUB 584 2584 HIS H 112 2112 LYS K Protein B 1 FT #SUB 585 2585 GLY H 109 2109 ARG K Protein B 1 FT #SUB 585 2585 GLY H 113 2113 ASP K Protein B 3 FT #SUB 585 2585 GLY H 169 2169 HIS K Protein B 3 FT #SUB 587 2587 SER H 168 2168 ARG K Protein A 6 FT #SUB 587 2587 SER H 169 2169 HIS K Protein S 2 FT #SUB 614 2614 ALA H 341 2341 PRO K Protein S 1 FT #SUB 615 2615 TRP H 121 2121 THR K Protein S 8 FT #SUB 615 2615 TRP H 124 2124 SER K Protein S 1 FT #SUB 136 2136 THR H 388 388 GLY M Protein S 5 FT #SUB 136 2136 THR H 389 389 THR M Protein A 7 FT #HET 126 2126 HIS H 154 3001 CUO H S 4 FT #HET 138 2138 LYS H 49 1 NAG y S 3 FT #HET 138 2138 LYS H 50 2 NAG y S 4 FT #HET 144 2144 CYS H 154 3001 CUO H S 1 FT #HET 146 2146 HIS H 154 3001 CUO H S 7 FT #HET 151 2151 PHE H 154 3001 CUO H S 1 FT #HET 155 2155 HIS H 154 3001 CUO H S 9 FT #HET 267 2267 HIS H 154 3001 CUO H S 8 FT #HET 271 2271 HIS H 154 3001 CUO H S 6 FT #HET 294 2294 PHE H 154 3001 CUO H S 2 FT #HET 298 2298 HIS H 154 3001 CUO H S 6 FT #HET 405 2405 SER H 36 1 NAG s A 4 FT #HET 429 2429 LEU H 154 3001 CUO H S 1 FT #HET 474 2474 THR H 36 1 NAG s S 1 FT #HET 543 2543 HIS H 155 3002 CUO H S 6 FT #HET 559 2559 CYS H 155 3002 CUO H S 1 FT #HET 561 2561 HIS H 155 3002 CUO H S 6 FT #HET 566 2566 PHE H 155 3002 CUO H S 1 FT #HET 570 2570 HIS H 155 3002 CUO H S 10 FT #HET 680 2680 HIS H 155 3002 CUO H S 6 FT #HET 684 2684 HIS H 155 3002 CUO H S 6 FT #HET 707 2707 PHE H 155 3002 CUO H S 2 FT #HET 711 2711 HIS H 155 3002 CUO H S 6 FT #MOD 472 2472 ASN H 36 1 NAG s S FT DISORDER 1 82 FT DISORDER 908 920 CC SEQUENCE 825 AA (ATOM); CC VRGNLVRKNV DRLSLQEINS LIHALKRMQK DRSSDGFETI ASFHALPPLC PNPTAKHRHA CC CCLHGMATFP QWHRLYVVQF EHSLNRHGAI VGVPYWDWTY PMTEVPGLLT SEKYTDPFTG CC IETFNPFNHG HISFISPETM TTREVSEHLF EQPALGKQTW LFNNIILALE QTDYCDFEVQ CC FEIVHNSIHS WLGGKELYSL NHLHYAAYDP AFFLHHSNVD RLWVVWQELQ KFRGLPAYES CC NCAIELMSQP LKPFSFGAPY NLNPVTTKYS KPSDVFNYKQ NFHYEYDMLE MNGMSIAQLE CC SYIRQERQKD RVFAGFLLEG FGSSAYATFQ VCPDVGDCYE GSHFSVLGGS TEMPWAFDRL CC YKMEITDILQ AMALKFDSHF TIKTKIVAHN GTELPESLLP EATIVRIPPS AQNLEVAIPL CC NRIRRNINSL ESRDVQNLMS ALKRLKEDES DFGFQTIAGY HGSLMCPTPE APEYACCLHG CC MPTFMHWHRV YLLHFEESMR RHGASVAVPY WDWTMPSDNL PSLLGDADYY DAWTDSVIEN CC PFLRGHIKYE DTYTVREIQP ELFALAEGQK ESTLFKDVML MFEQEDYCDF EVQAEVIHNS CC IHYLIGGHQK YAMSSLMFSS FDPIFYVHHS MVDRLWAIWQ ELQKHRKLPH DKAYCALDQM CC AFPMKPFIWE SNPNPTTRAV STPSKLFDYK SLGYDYDHLN FHGMSIGQLE ALIQKQKKAD CC RVFAGFLLHG IKISADVHLK ICIEADCQEA GVIFVLGGET EMPWHFDRNY KMDITDVLKK CC RNIPPEALFE HDSKIRLEVE IKSVDGAVLD PNSLPKPSLI YAPAK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES YAGTFAVLGGETEMPWAFDRLFRYEISDEMKKLQLTEDSKFRLTTNIIAS CC ATOM -------------------------------------------------- CC CC SEQRES NGSKVSNDIFPTPTVIFVPKKERTEQVSTTKSVRGNLVRKNVDRLSLQEI CC ATOM --------------------------------VRGNLVRKNVDRLSLQEI CC ****************** CC SEQRES NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ATOM NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ************************************************** CC SEQRES FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ATOM FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ************************************************** CC SEQRES TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ATOM TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ************************************************** CC SEQRES LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ATOM LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ************************************************** CC SEQRES VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ATOM VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ************************************************** CC SEQRES YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ATOM YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ************************************************** CC SEQRES EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ATOM EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ************************************************** CC SEQRES LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ATOM LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ************************************************** CC SEQRES PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ATOM PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ************************************************** CC SEQRES PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ATOM PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ************************************************** CC SEQRES NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ATOM NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ************************************************** CC SEQRES QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ATOM QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ************************************************** CC SEQRES SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ATOM SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ************************************************** CC SEQRES WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ATOM WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ************************************************** CC SEQRES ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ATOM ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ************************************************** CC SEQRES NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ATOM NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ************************************************** CC SEQRES LIYAPAKGLIIQQVGEYDAG CC ATOM LIYAPAK------------- CC ******* SQ SEQUENCE 920 AA; MW; CN; YAGTFAVLGG ETEMPWAFDR LFRYEISDEM KKLQLTEDSK FRLTTNIIAS NGSKVSNDIF PTPTVIFVPK KERTEQVSTT KSVRGNLVRK NVDRLSLQEI NSLIHALKRM QKDRSSDGFE TIASFHALPP LCPNPTAKHR HACCLHGMAT FPQWHRLYVV QFEHSLNRHG AIVGVPYWDW TYPMTEVPGL LTSEKYTDPF TGIETFNPFN HGHISFISPE TMTTREVSEH LFEQPALGKQ TWLFNNIILA LEQTDYCDFE VQFEIVHNSI HSWLGGKELY SLNHLHYAAY DPAFFLHHSN VDRLWVVWQE LQKFRGLPAY ESNCAIELMS QPLKPFSFGA PYNLNPVTTK YSKPSDVFNY KQNFHYEYDM LEMNGMSIAQ LESYIRQERQ KDRVFAGFLL EGFGSSAYAT FQVCPDVGDC YEGSHFSVLG GSTEMPWAFD RLYKMEITDI LQAMALKFDS HFTIKTKIVA HNGTELPESL LPEATIVRIP PSAQNLEVAI PLNRIRRNIN SLESRDVQNL MSALKRLKED ESDFGFQTIA GYHGSLMCPT PEAPEYACCL HGMPTFMHWH RVYLLHFEES MRRHGASVAV PYWDWTMPSD NLPSLLGDAD YYDAWTDSVI ENPFLRGHIK YEDTYTVREI QPELFALAEG QKESTLFKDV MLMFEQEDYC DFEVQAEVIH NSIHYLIGGH QKYAMSSLMF SSFDPIFYVH HSMVDRLWAI WQELQKHRKL PHDKAYCALD QMAFPMKPFI WESNPNPTTR AVSTPSKLFD YKSLGYDYDH LNFHGMSIGQ LEALIQKQKK ADRVFAGFLL HGIKISADVH LKICIEADCQ EAGVIFVLGG ETEMPWHFDR NYKMDITDVL KKRNIPPEAL FEHDSKIRLE VEIKSVDGAV LDPNSLPKPS LIYAPAKGLI IQQVGEYDAG // ID 4YD9I STANDARD; PRT; 394 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 11 2931 THR I 868 2868 GLU H Protein A 4 FT #SUB 12 2932 PRO I 802 2802 ASP H Protein S 4 FT #SUB 12 2932 PRO I 867 2867 PRO H Protein S 2 FT #SUB 12 2932 PRO I 868 2868 GLU H Protein B 13 FT #SUB 12 2932 PRO I 903 2903 TYR H Protein A 3 FT #SUB 13 2933 SER I 868 2868 GLU H Protein A 8 FT #SUB 14 2934 GLU I 868 2868 GLU H Protein B 1 FT #SUB 100 3020 SER I 800 2800 LYS H Protein S 2 FT #SUB 101 3021 LEU I 800 2800 LYS H Protein B 3 FT #SUB 103 3023 ILE I 800 2800 LYS H Protein S 6 FT #SUB 103 3023 ILE I 801 2801 ALA H Protein S 2 FT #SUB 107 3027 GLU I 797 2797 LYS H Protein A 4 FT #SUB 108 3028 PRO I 797 2797 LYS H Protein A 4 FT #SUB 276 3196 THR I 416 416 ASN J Protein A 11 FT #SUB 278 3198 PRO I 416 416 ASN J Protein S 1 FT #SUB 279 3199 GLN I 416 416 ASN J Protein S 1 FT #SUB 279 3199 GLN I 417 417 MET J Protein S 1 FT #SUB 348 3268 LYS I 1539 1539 GLN J Protein S 2 FT #SUB 351 3271 VAL I 1533 1533 ALA J Protein S 1 FT #SUB 352 3272 HIS I 1404 1404 TYR J Protein B 1 FT #SUB 352 3272 HIS I 1535 1535 LEU J Protein S 3 FT #SUB 352 3272 HIS I 1539 1539 GLN J Protein S 1 FT #SUB 352 3272 HIS I 1543 1543 ILE J Protein S 3 FT #SUB 354 3274 ARG I 1533 1533 ALA J Protein S 2 FT #HET 41 2961 HIS I 157 3402 CUO I S 8 FT #HET 60 2980 HIS I 157 3402 CUO I S 3 FT #HET 69 2989 HIS I 157 3402 CUO I S 13 FT #HET 121 3041 ALA I 165 2101 CUO M B 10 FT #HET 122 3042 ASP I 165 2101 CUO M A 30 FT #HET 123 3043 THR I 165 2101 CUO M B 4 FT #HET 169 3089 HIS I 157 3402 CUO I S 7 FT #HET 173 3093 HIS I 157 3402 CUO I S 5 FT #HET 196 3116 PHE I 157 3402 CUO I S 5 FT #HET 199 3119 HIS I 157 3402 CUO I S 1 FT #HET 200 3120 HIS I 157 3402 CUO I S 10 FT DISORDER 1 7 FT DISORDER 132 141 FT DISORDER 313 319 FT DISORDER 391 394 CC SEQUENCE 366 AA (ATOM); CC NSLTPSEIEN LRNALAAVQA DKTDAGYQKI ASFHGMPLSC QYPDGTAFAC CQHGMVTFPH CC WHRLYMKQME DALKAKGAKI GIPYWDWTTA FHSLPILVTE PKNNPFHHGY IDVADTKTTR CC DPRPQSFFYR QIAFALEQRD FCDFEIQFEM GHNAIHSWVG GPSPYGMSTL HYTSYDPLFY CC VHHSNTDRIW AIWQALQKYR GLPYNSANCE INKLKKPMMP FSSEDNPNEV TKAHSTGYKS CC FDYQQLNYEY DNLNFHGMTI PQLEVHLKKI QEKDRVFAGF LLRAIGQSAD VNFDVTFGGT CC FCVLGGDYEM PWAFDRLFLY DISKSLVHLR LDAHDDFDIK VTIMGIDGKS LPPNLLPSPT CC ILFKPG CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMVRKNVNSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ATOM -------NSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ******************************************* CC SEQRES DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ATOM DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ************************************************** CC SEQRES LPILVTEPKNNPFHHGYIDVADTKTTRDPRPQLFDDPEQGDQSFFYRQIA CC ATOM LPILVTEPKNNPFHHGYIDVADTKTTRDPRP----------QSFFYRQIA CC ******************************* ********* CC SEQRES FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ATOM FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ************************************************** CC SEQRES SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ATOM SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ************************************************** CC SEQRES HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ATOM HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ************************************************** CC SEQRES AIGQSADVNFDVCRKDGECTFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ATOM AIGQSADVNFDV-------TFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ************ ******************************* CC SEQRES VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPGTGKI CC ATOM VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPG---- CC **************************************** SQ SEQUENCE 394 AA; MW; CN; SMVRKNVNSL TPSEIENLRN ALAAVQADKT DAGYQKIASF HGMPLSCQYP DGTAFACCQH GMVTFPHWHR LYMKQMEDAL KAKGAKIGIP YWDWTTAFHS LPILVTEPKN NPFHHGYIDV ADTKTTRDPR PQLFDDPEQG DQSFFYRQIA FALEQRDFCD FEIQFEMGHN AIHSWVGGPS PYGMSTLHYT SYDPLFYVHH SNTDRIWAIW QALQKYRGLP YNSANCEINK LKKPMMPFSS EDNPNEVTKA HSTGYKSFDY QQLNYEYDNL NFHGMTIPQL EVHLKKIQEK DRVFAGFLLR AIGQSADVNF DVCRKDGECT FGGTFCVLGG DYEMPWAFDR LFLYDISKSL VHLRLDAHDD FDIKVTIMGI DGKSLPPNLL PSPTILFKPG TGKI // ID 4YD9J STANDARD; PRT; 2000 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 1070 1070 ALA J 871 2871 PHE H Protein B 2 FT #SUB 1071 1071 ASN J 871 2871 PHE H Protein S 1 FT #SUB 1071 1071 ASN J 902 2902 ILE H Protein B 3 FT #SUB 1071 1071 ASN J 903 2903 TYR H Protein A 5 FT #SUB 1071 1071 ASN J 905 2905 PRO H Protein S 2 FT #SUB 1073 1073 ALA J 901 2901 LEU H Protein B 3 FT #SUB 1074 1074 ILE J 870 2870 LEU H Protein S 1 FT #SUB 1074 1074 ILE J 871 2871 PHE H Protein S 1 FT #SUB 1074 1074 ILE J 901 2901 LEU H Protein A 5 FT #SUB 1075 1075 GLU J 899 2899 PRO H Protein S 2 FT #SUB 1075 1075 GLU J 900 2900 SER H Protein S 6 FT #SUB 1075 1075 GLU J 901 2901 LEU H Protein S 1 FT #SUB 1078 1078 ARG J 870 2870 LEU H Protein S 6 FT #SUB 1078 1078 ARG J 872 2872 GLU H Protein S 4 FT #SUB 1078 1078 ARG J 873 2873 HIS H Protein S 2 FT #SUB 1078 1078 ARG J 875 2875 SER H Protein S 4 FT #SUB 1078 1078 ARG J 877 2877 ILE H Protein S 3 FT #SUB 1103 1103 PHE J 871 2871 PHE H Protein S 2 FT #SUB 1147 1147 MET J 849 2849 ASP H Protein S 2 FT #SUB 1187 1187 LYS J 809 2809 LEU H Protein S 1 FT #SUB 1187 1187 LYS J 847 2847 HIS H Protein S 3 FT #SUB 1188 1188 PHE J 809 2809 LEU H Protein B 2 FT #SUB 1189 1189 ASP J 809 2809 LEU H Protein B 1 FT #SUB 1189 1189 ASP J 851 2851 ASN H Protein A 2 FT #SUB 1191 1191 PRO J 849 2849 ASP H Protein S 2 FT #SUB 1206 1206 HIS J 457 2457 LYS H Protein S 4 FT #SUB 1207 1207 TYR J 739 2739 LEU H Protein A 3 FT #SUB 1208 1208 THR J 743 2743 ALA H Protein B 2 FT #SUB 1209 1209 ASP J 743 2743 ALA H Protein B 1 FT #SUB 1210 1210 LYS J 743 2743 ALA H Protein A 3 FT #SUB 1210 1210 LYS J 744 2744 PHE H Protein S 4 FT #SUB 1211 1211 TYR J 739 2739 LEU H Protein S 3 FT #SUB 1211 1211 TYR J 740 2740 ASP H Protein B 1 FT #SUB 1212 1212 HIS J 740 2740 ASP H Protein A 2 FT #SUB 1212 1212 HIS J 744 2744 PHE H Protein S 2 FT #SUB 1213 1213 VAL J 740 2740 ASP H Protein S 1 FT #SUB 1230 1230 LEU J 847 2847 HIS H Protein S 2 FT #SUB 1232 1232 THR J 738 2738 ALA H Protein B 1 FT #SUB 1232 1232 THR J 740 2740 ASP H Protein B 3 FT #SUB 1232 1232 THR J 741 2741 GLN H Protein S 1 FT #SUB 1233 1233 SER J 738 2738 ALA H Protein A 2 FT #SUB 1233 1233 SER J 849 2849 ASP H Protein S 3 FT #SUB 1234 1234 VAL J 738 2738 ALA H Protein B 5 FT #SUB 1234 1234 VAL J 739 2739 LEU H Protein B 3 FT #SUB 1235 1235 ILE J 736 2736 TYR H Protein A 2 FT #SUB 1236 1236 TYR J 736 2736 TYR H Protein A 5 FT #SUB 1238 1238 PRO J 736 2736 TYR H Protein S 1 FT #SUB 1245 1245 GLU J 729 2729 LYS H Protein A 4 FT #SUB 1245 1245 GLU J 736 2736 TYR H Protein S 3 FT #SUB 1481 1481 GLU J 458 2458 PHE H Protein S 1 FT #SUB 1481 1481 GLU J 459 2459 ASP H Protein S 1 FT #SUB 1483 1483 ASN J 486 2486 ILE H Protein B 1 FT #SUB 1483 1483 ASN J 487 2487 VAL H Protein B 1 FT #SUB 1483 1483 ASN J 488 2488 ARG H Protein A 7 FT #SUB 1483 1483 ASN J 490 2490 PRO H Protein S 1 FT #SUB 1485 1485 ALA J 485 2485 THR H Protein B 2 FT #SUB 1485 1485 ALA J 486 2486 ILE H Protein B 3 FT #SUB 1486 1486 LEU J 458 2458 PHE H Protein S 2 FT #SUB 1486 1486 LEU J 486 2486 ILE H Protein A 4 FT #SUB 1487 1487 PRO J 486 2486 ILE H Protein S 3 FT #SUB 1490 1490 ASN J 461 2461 HIS H Protein S 7 FT #SUB 1558 1558 LEU J 439 2439 PHE H Protein S 1 FT #SUB 1558 1558 LEU J 440 2440 ASP H Protein S 2 FT #SUB 1602 1602 GLN J 401 2401 GLU H Protein S 1 FT #SUB 1603 1603 PHE J 399 2399 LEU H Protein B 1 FT #SUB 1604 1604 ASP J 442 2442 LEU H Protein A 8 FT #SUB 1605 1605 ARG J 440 2440 ASP H Protein B 2 FT #SUB 1605 1605 ARG J 442 2442 LEU H Protein S 1 FT #SUB 1606 1606 LEU J 440 2440 ASP H Protein A 5 FT #SUB 1622 1622 GLN J 326 2326 ILE H Protein B 1 FT #SUB 1623 1623 ASN J 321 2321 GLU H Protein S 1 FT #SUB 1625 1625 HIS J 353 2353 LYS H Protein S 2 FT #SUB 1648 1648 PRO J 327 2327 GLU H Protein B 3 FT #SUB 1649 1649 THR J 325 2325 ALA H Protein S 2 FT #SUB 1649 1649 THR J 327 2327 GLU H Protein A 6 FT #SUB 1650 1650 ILE J 323 2323 ASN H Protein B 1 FT #SUB 1650 1650 ILE J 324 2324 CYS H Protein B 2 FT #SUB 1650 1650 ILE J 325 2325 ALA H Protein B 3 FT #SUB 1650 1650 ILE J 326 2326 ILE H Protein B 4 FT #SUB 1650 1650 ILE J 327 2327 GLU H Protein A 3 FT #SUB 1651 1651 ILE J 323 2323 ASN H Protein B 2 FT #SUB 1652 1652 PHE J 323 2323 ASN H Protein A 10 FT #SUB 1652 1652 PHE J 326 2326 ILE H Protein S 1 FT #SUB 1654 1654 PRO J 323 2323 ASN H Protein S 2 FT #SUB 416 416 ASN J 276 3196 THR I Protein S 11 FT #SUB 416 416 ASN J 278 3198 PRO I Protein S 1 FT #SUB 416 416 ASN J 279 3199 GLN I Protein S 1 FT #SUB 417 417 MET J 279 3199 GLN I Protein S 1 FT #SUB 1404 1404 TYR J 352 3272 HIS I Protein S 1 FT #SUB 1533 1533 ALA J 351 3271 VAL I Protein B 1 FT #SUB 1533 1533 ALA J 354 3274 ARG I Protein B 2 FT #SUB 1535 1535 LEU J 352 3272 HIS I Protein S 3 FT #SUB 1539 1539 GLN J 348 3268 LYS I Protein S 2 FT #SUB 1539 1539 GLN J 352 3272 HIS I Protein S 1 FT #SUB 1543 1543 ILE J 352 3272 HIS I Protein S 3 FT #SUB 328 328 LYS J 1571 1571 TYR M Protein S 6 FT #SUB 328 328 LYS J 1631 1631 GLU M Protein S 4 FT #SUB 329 329 ARG J 1574 1574 VAL M Protein S 1 FT #SUB 329 329 ARG J 1582 1582 ASN M Protein S 2 FT #SUB 329 329 ARG J 1583 1583 CYS M Protein A 14 FT #SUB 329 329 ARG J 1584 1584 GLY M Protein A 3 FT #SUB 329 329 ARG J 1585 1585 ASN M Protein S 4 FT #SUB 330 330 GLN J 1581 1581 GLU M Protein B 3 FT #SUB 330 330 GLN J 1582 1582 ASN M Protein B 1 FT #SUB 331 331 SER J 1581 1581 GLU M Protein B 9 FT #SUB 331 331 SER J 1582 1582 ASN M Protein A 4 FT #SUB 472 472 ASP J 1638 1638 SER M Protein S 3 FT #SUB 472 472 ASP J 1639 1639 LYS M Protein S 3 FT #SUB 472 472 ASP J 1640 1640 ILE M Protein S 2 FT #SUB 473 473 ALA J 1639 1639 LYS M Protein S 3 FT #SUB 474 474 LEU J 1639 1639 LYS M Protein S 1 FT #SUB 1365 1365 TYR J 1437 1437 ARG M Protein S 17 FT #SUB 1371 1371 GLN J 1439 1439 VAL M Protein S 1 FT #SUB 1372 1372 ILE J 1392 1392 ASP M Protein S 1 FT #SUB 1372 1372 ILE J 1438 1438 ASP M Protein S 2 FT #SUB 1378 1378 PHE J 1362 1362 ALA M Protein S 1 FT #SUB 1390 1390 ASP J 1372 1372 ILE M Protein S 1 FT #SUB 1392 1392 ASP J 1372 1372 ILE M Protein S 1 FT #SUB 1437 1437 ARG J 1365 1365 TYR M Protein S 14 FT #SUB 1438 1438 ASP J 1372 1372 ILE M Protein S 2 FT #SUB 1571 1571 TYR J 328 328 LYS M Protein S 7 FT #SUB 1574 1574 VAL J 329 329 ARG M Protein A 7 FT #SUB 1578 1578 ILE J 335 335 ASP M Protein A 4 FT #SUB 1579 1579 GLY J 335 335 ASP M Protein B 6 FT #SUB 1581 1581 GLU J 330 330 GLN M Protein S 1 FT #SUB 1581 1581 GLU J 331 331 SER M Protein S 2 FT #SUB 1581 1581 GLU J 332 332 ASP M Protein S 2 FT #SUB 1582 1582 ASN J 329 329 ARG M Protein B 2 FT #SUB 1582 1582 ASN J 330 330 GLN M Protein B 5 FT #SUB 1582 1582 ASN J 331 331 SER M Protein B 3 FT #SUB 1583 1583 CYS J 329 329 ARG M Protein B 14 FT #SUB 1583 1583 CYS J 330 330 GLN M Protein A 4 FT #SUB 1584 1584 GLY J 329 329 ARG M Protein B 6 FT #SUB 1631 1631 GLU J 328 328 LYS M Protein S 4 FT #SUB 1638 1638 SER J 472 472 ASP M Protein S 3 FT #SUB 1640 1640 ILE J 472 472 ASP M Protein S 6 FT #SUB 106 106 GLN J 609 2609 ALA N Protein B 1 FT #SUB 107 107 HIS J 625 2625 LEU N Protein S 3 FT #SUB 109 109 LEU J 626 2626 ARG N Protein S 2 FT #SUB 109 109 LEU J 635 2635 TYR N Protein S 2 FT #SUB 109 109 LEU J 636 2636 THR N Protein S 1 FT #SUB 111 111 ILE J 637 2637 VAL N Protein S 3 FT #SUB 111 111 ILE J 691 2691 GLN N Protein S 3 FT #SUB 112 112 ASP J 691 2691 GLN N Protein B 1 FT #SUB 115 115 GLY J 691 2691 GLN N Protein B 2 FT #SUB 115 115 GLY J 692 2692 LYS N Protein B 1 FT #SUB 116 116 LYS J 693 2693 TYR N Protein B 2 FT #SUB 117 117 LYS J 632 2632 GLU N Protein S 2 FT #SUB 117 117 LYS J 634 2634 THR N Protein S 3 FT #SUB 118 118 ALA J 634 2634 THR N Protein A 2 FT #SUB 119 119 LYS J 635 2635 TYR N Protein S 2 FT #SUB 124 124 TYR J 610 2610 ASP N Protein S 2 FT #SUB 136 136 ALA J 619 2619 VAL N Protein S 2 FT #SUB 137 137 ARG J 610 2610 ASP N Protein B 1 FT #SUB 137 137 ARG J 619 2619 VAL N Protein B 1 FT #SUB 138 138 ALA J 612 2612 TYR N Protein S 1 FT #SUB 138 138 ALA J 619 2619 VAL N Protein A 2 FT #SUB 189 189 ARG J 612 2612 TYR N Protein S 3 FT #SUB 189 189 ARG J 617 2617 ASP N Protein A 5 FT #SUB 190 190 PHE J 617 2617 ASP N Protein A 4 FT #SUB 190 190 PHE J 619 2619 VAL N Protein S 3 FT #SUB 191 191 THR J 617 2617 ASP N Protein S 1 FT #SUB 189 189 ARG J 136 2136 THR Q Protein S 1 FT #SUB 388 388 GLY J 136 2136 THR Q Protein B 6 FT #SUB 389 389 THR J 136 2136 THR Q Protein A 9 FT #SUB 389 389 THR J 138 2138 LYS Q Protein S 1 FT #HET 41 41 HIS J 158 5001 CUO J S 5 FT #HET 58 58 CYS J 158 5001 CUO J S 1 FT #HET 60 60 HIS J 158 5001 CUO J S 6 FT #HET 65 65 PHE J 158 5001 CUO J S 1 FT #HET 69 69 HIS J 158 5001 CUO J S 9 FT #HET 179 179 HIS J 158 5001 CUO J S 7 FT #HET 183 183 HIS J 158 5001 CUO J S 6 FT #HET 206 206 PHE J 158 5001 CUO J S 2 FT #HET 210 210 HIS J 158 5001 CUO J S 8 FT #HET 313 313 THR J 38 1 NAG t S 3 FT #HET 389 389 THR J 38 1 NAG t S 3 FT #HET 462 462 HIS J 159 5002 CUO J S 6 FT #HET 480 480 CYS J 159 5002 CUO J S 1 FT #HET 482 482 HIS J 159 5002 CUO J S 3 FT #HET 491 491 HIS J 159 5002 CUO J S 9 FT #HET 603 603 HIS J 159 5002 CUO J S 4 FT #HET 607 607 HIS J 159 5002 CUO J S 6 FT #HET 630 630 PHE J 159 5002 CUO J S 2 FT #HET 633 633 HIS J 159 5002 CUO J S 1 FT #HET 634 634 HIS J 159 5002 CUO J S 7 FT #HET 740 740 THR J 40 1 NAG u S 1 FT #HET 763 763 LEU J 159 5002 CUO J S 1 FT #HET 804 804 ASP J 40 1 NAG u S 8 FT #HET 808 808 THR J 40 1 NAG u S 3 FT #HET 810 810 LEU J 40 1 NAG u S 3 FT #HET 877 877 HIS J 160 5003 CUO J S 5 FT #HET 895 895 CYS J 160 5003 CUO J S 1 FT #HET 897 897 HIS J 160 5003 CUO J S 3 FT #HET 906 906 HIS J 160 5003 CUO J S 6 FT #HET 1015 1015 HIS J 160 5003 CUO J S 6 FT #HET 1019 1019 HIS J 160 5003 CUO J S 6 FT #HET 1042 1042 PHE J 160 5003 CUO J S 3 FT #HET 1046 1046 HIS J 160 5003 CUO J S 7 FT #HET 1178 1178 LEU J 160 5003 CUO J S 1 FT #HET 1294 1294 HIS J 161 5004 CUO J S 7 FT #HET 1298 1298 ALA J 43 2 NAG v B 2 FT #HET 1299 1299 GLN J 42 1 NAG v A 5 FT #HET 1301 1301 PRO J 42 1 NAG v S 2 FT #HET 1308 1308 VAL J 43 2 NAG v S 2 FT #HET 1312 1312 CYS J 161 5004 CUO J S 2 FT #HET 1314 1314 HIS J 161 5004 CUO J S 6 FT #HET 1319 1319 PHE J 161 5004 CUO J S 1 FT #HET 1323 1323 HIS J 161 5004 CUO J S 9 FT #HET 1427 1427 HIS J 161 5004 CUO J S 5 FT #HET 1431 1431 HIS J 161 5004 CUO J S 7 FT #HET 1454 1454 PHE J 161 5004 CUO J S 3 FT #HET 1458 1458 HIS J 161 5004 CUO J S 7 FT #HET 1494 1494 ARG J 42 1 NAG v S 2 FT #HET 1500 1500 THR J 42 1 NAG v B 3 FT #HET 1563 1563 LYS J 45 2 NAG w S 2 FT #HET 1564 1564 ALA J 44 1 NAG w S 1 FT #HET 1565 1565 SER J 44 1 NAG w B 1 FT #HET 1593 1593 LEU J 161 5004 CUO J S 1 FT #HET 1635 1635 VAL J 44 1 NAG w S 1 FT #HET 1639 1639 LYS J 44 1 NAG w S 12 FT #HET 1639 1639 LYS J 45 2 NAG w S 7 FT #HET 1641 1641 THR J 45 2 NAG w S 1 FT #MOD 387 387 ASN J 38 1 NAG t S FT #MOD 806 806 ASN J 40 1 NAG u S FT #MOD 1498 1498 ASN J 42 1 NAG v S FT #MOD 1636 1636 ASN J 44 1 NAG w S FT DISORDER 1657 2000 CC SEQUENCE 1656 AA (ATOM); CC NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH CC GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK CC NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN CC PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF CC HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA CC RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH CC EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS CC SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC CC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET CC KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE CC VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA CC NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV CC EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE CC ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR CC KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA CC TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF CC SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA CC LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP CC MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN CC RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM CC DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR CC KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP CC HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG CC KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH CC SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST CC ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH CC GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL CC NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSD CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ATOM NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ************************************************** CC SEQRES DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ATOM DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ************************************************** CC SEQRES LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ATOM LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ************************************************** CC SEQRES QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ATOM QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ************************************************** CC SEQRES SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ATOM SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ************************************************** CC SEQRES TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ATOM TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ************************************************** CC SEQRES RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ATOM RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ************************************************** CC SEQRES PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ATOM PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ************************************************** CC SEQRES VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ATOM VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ************************************************** CC SEQRES SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ATOM SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ************************************************** CC SEQRES ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ATOM ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ************************************************** CC SEQRES EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ATOM EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ************************************************** CC SEQRES VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ATOM VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ************************************************** CC SEQRES RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ATOM RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ************************************************** CC SEQRES YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ATOM YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ************************************************** CC SEQRES GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ATOM GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ************************************************** CC SEQRES DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ATOM DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ************************************************** CC SEQRES MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ATOM MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ************************************************** CC SEQRES TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ATOM TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ************************************************** CC SEQRES NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ATOM NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ************************************************** CC SEQRES QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ATOM QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ************************************************** CC SEQRES RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ATOM RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ************************************************** CC SEQRES VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ATOM VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ************************************************** CC SEQRES LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ATOM LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ************************************************** CC SEQRES DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ATOM DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ************************************************** CC SEQRES SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ATOM SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ************************************************** CC SEQRES PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ATOM PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ************************************************** CC SEQRES PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ATOM PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ************************************************** CC SEQRES RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ATOM RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ************************************************** CC SEQRES DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ATOM DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ************************************************** CC SEQRES ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ATOM ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ************************************************** CC SEQRES DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ATOM DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ************************************************** CC SEQRES WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ATOM WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ************************************************** CC SEQRES IFVPSDVEFEEDTWRDVVTSANRIRRNLKDLSKEDMFSLRAAFKRMTDDG CC ATOM IFVPSD-------------------------------------------- CC ****** CC SEQRES RYEEIAAFHGLPAQCPNADGSNIHTCCLHGMPTFPHWHRLYLSLVENELL CC ATOM -------------------------------------------------- CC CC SEQRES ARGSDVAVPYWDWIEPFDSLPGLISDETYKHPKTNEDIENPFHHGKISFA CC ATOM -------------------------------------------------- CC CC SEQRES DAVTVRKPRDQLFNNRYLYEHALFAFEHTDFCDFEVHFEVLHNSIHSWIG CC ATOM -------------------------------------------------- CC CC SEQRES GPNPHSMSSLDFAAYDPIFFLHHSTVDRLWAIWQDLQRYRKLDYNVANCA CC ATOM -------------------------------------------------- CC CC SEQRES LNLLNDPMRPFNNKTANQDHLTFTNSRPNDVFDYQNSLNYKFDSLSFSGL CC ATOM -------------------------------------------------- CC CC SEQRES SIPRLDDLLESRQSHDRVFAGFWLSGIKASADVNIHICVPIGVEHEDCDN CC ATOM -------------------------------------------------- CC CC SEQRES CC ATOM CC SQ SEQUENCE 2000 AA; MW; CN; NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSDVEFE EDTWRDVVTS ANRIRRNLKD LSKEDMFSLR AAFKRMTDDG RYEEIAAFHG LPAQCPNADG SNIHTCCLHG MPTFPHWHRL YLSLVENELL ARGSDVAVPY WDWIEPFDSL PGLISDETYK HPKTNEDIEN PFHHGKISFA DAVTVRKPRD QLFNNRYLYE HALFAFEHTD FCDFEVHFEV LHNSIHSWIG GPNPHSMSSL DFAAYDPIFF LHHSTVDRLW AIWQDLQRYR KLDYNVANCA LNLLNDPMRP FNNKTANQDH LTFTNSRPND VFDYQNSLNY KFDSLSFSGL SIPRLDDLLE SRQSHDRVFA GFWLSGIKAS ADVNIHICVP IGVEHEDCDN // ID 4YD9K STANDARD; PRT; 920 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 136 2136 THR K 388 388 GLY D Protein S 6 FT #SUB 136 2136 THR K 389 389 THR D Protein A 9 FT #SUB 136 2136 THR K 390 390 LEU D Protein S 1 FT #SUB 323 2323 ASN K 1650 1650 ILE G Protein B 1 FT #SUB 323 2323 ASN K 1651 1651 ILE G Protein B 3 FT #SUB 323 2323 ASN K 1652 1652 PHE G Protein A 10 FT #SUB 323 2323 ASN K 1654 1654 PRO G Protein S 2 FT #SUB 325 2325 ALA K 1649 1649 THR G Protein B 1 FT #SUB 325 2325 ALA K 1650 1650 ILE G Protein B 1 FT #SUB 326 2326 ILE K 1622 1622 GLN G Protein S 1 FT #SUB 326 2326 ILE K 1650 1650 ILE G Protein A 5 FT #SUB 327 2327 GLU K 1648 1648 PRO G Protein S 3 FT #SUB 327 2327 GLU K 1649 1649 THR G Protein S 4 FT #SUB 327 2327 GLU K 1650 1650 ILE G Protein S 4 FT #SUB 399 2399 LEU K 1603 1603 PHE G Protein S 1 FT #SUB 440 2440 ASP K 1558 1558 LEU G Protein B 1 FT #SUB 440 2440 ASP K 1604 1604 ASP G Protein B 1 FT #SUB 440 2440 ASP K 1605 1605 ARG G Protein B 1 FT #SUB 440 2440 ASP K 1606 1606 LEU G Protein A 6 FT #SUB 441 2441 ARG K 1604 1604 ASP G Protein B 2 FT #SUB 442 2442 LEU K 1604 1604 ASP G Protein A 6 FT #SUB 457 2457 LYS K 1206 1206 HIS G Protein S 3 FT #SUB 458 2458 PHE K 1481 1481 GLU G Protein S 1 FT #SUB 458 2458 PHE K 1483 1483 ASN G Protein S 1 FT #SUB 458 2458 PHE K 1486 1486 LEU G Protein B 1 FT #SUB 459 2459 ASP K 1481 1481 GLU G Protein S 1 FT #SUB 459 2459 ASP K 1490 1490 ASN G Protein B 1 FT #SUB 461 2461 HIS K 1490 1490 ASN G Protein S 9 FT #SUB 482 2482 PRO K 1602 1602 GLN G Protein S 1 FT #SUB 486 2486 ILE K 1484 1484 CYS G Protein B 3 FT #SUB 486 2486 ILE K 1485 1485 ALA G Protein B 1 FT #SUB 486 2486 ILE K 1486 1486 LEU G Protein A 7 FT #SUB 486 2486 ILE K 1487 1487 PRO G Protein A 5 FT #SUB 487 2487 VAL K 1483 1483 ASN G Protein B 2 FT #SUB 487 2487 VAL K 1484 1484 CYS G Protein A 4 FT #SUB 488 2488 ARG K 1483 1483 ASN G Protein A 10 FT #SUB 729 2729 LYS K 1245 1245 GLU G Protein B 4 FT #SUB 730 2730 LEU K 1245 1245 GLU G Protein S 1 FT #SUB 731 2731 PRO K 1245 1245 GLU G Protein S 1 FT #SUB 736 2736 TYR K 1207 1207 TYR G Protein S 1 FT #SUB 736 2736 TYR K 1236 1236 TYR G Protein S 7 FT #SUB 737 2737 CYS K 1234 1234 VAL G Protein B 3 FT #SUB 738 2738 ALA K 1233 1233 SER G Protein A 2 FT #SUB 738 2738 ALA K 1234 1234 VAL G Protein B 3 FT #SUB 739 2739 LEU K 1207 1207 TYR G Protein S 4 FT #SUB 739 2739 LEU K 1211 1211 TYR G Protein S 3 FT #SUB 739 2739 LEU K 1234 1234 VAL G Protein A 5 FT #SUB 740 2740 ASP K 1232 1232 THR G Protein S 5 FT #SUB 740 2740 ASP K 1233 1233 SER G Protein S 7 FT #SUB 740 2740 ASP K 1234 1234 VAL G Protein S 1 FT #SUB 741 2741 GLN K 1232 1232 THR G Protein S 2 FT #SUB 743 2743 ALA K 1208 1208 THR G Protein S 1 FT #SUB 743 2743 ALA K 1210 1210 LYS G Protein S 3 FT #SUB 744 2744 PHE K 1164 1164 ASP G Protein S 1 FT #SUB 809 2809 LEU K 1187 1187 LYS G Protein S 1 FT #SUB 809 2809 LEU K 1188 1188 PHE G Protein S 2 FT #SUB 809 2809 LEU K 1189 1189 ASP G Protein S 1 FT #SUB 847 2847 HIS K 1187 1187 LYS G Protein S 2 FT #SUB 849 2849 ASP K 1147 1147 MET G Protein A 4 FT #SUB 849 2849 ASP K 1189 1189 ASP G Protein B 1 FT #SUB 849 2849 ASP K 1191 1191 PRO G Protein A 5 FT #SUB 850 2850 ARG K 1189 1189 ASP G Protein B 1 FT #SUB 851 2851 ASN K 1189 1189 ASP G Protein S 9 FT #SUB 870 2870 LEU K 1074 1074 ILE G Protein A 2 FT #SUB 870 2870 LEU K 1078 1078 ARG G Protein B 4 FT #SUB 871 2871 PHE K 1071 1071 ASN G Protein S 1 FT #SUB 871 2871 PHE K 1074 1074 ILE G Protein S 1 FT #SUB 871 2871 PHE K 1103 1103 PHE G Protein A 4 FT #SUB 872 2872 GLU K 1078 1078 ARG G Protein B 4 FT #SUB 873 2873 HIS K 1078 1078 ARG G Protein A 3 FT #SUB 873 2873 HIS K 1101 1101 VAL G Protein S 2 FT #SUB 873 2873 HIS K 1104 1104 ASP G Protein S 1 FT #SUB 875 2875 SER K 1078 1078 ARG G Protein B 4 FT #SUB 876 2876 LYS K 1078 1078 ARG G Protein B 6 FT #SUB 877 2877 ILE K 1078 1078 ARG G Protein A 4 FT #SUB 899 2899 PRO K 1075 1075 GLU G Protein B 3 FT #SUB 900 2900 SER K 1075 1075 GLU G Protein A 6 FT #SUB 901 2901 LEU K 1071 1071 ASN G Protein B 1 FT #SUB 901 2901 LEU K 1072 1072 CYS G Protein B 1 FT #SUB 901 2901 LEU K 1073 1073 ALA G Protein B 3 FT #SUB 901 2901 LEU K 1074 1074 ILE G Protein A 6 FT #SUB 901 2901 LEU K 1075 1075 GLU G Protein B 1 FT #SUB 902 2902 ILE K 1071 1071 ASN G Protein A 4 FT #SUB 902 2902 ILE K 1072 1072 CYS G Protein S 2 FT #SUB 903 2903 TYR K 1071 1071 ASN G Protein B 5 FT #SUB 905 2905 PRO K 1071 1071 ASN G Protein S 1 FT #SUB 84 2084 ARG K 502 2502 LEU H Protein S 3 FT #SUB 90 2090 LYS K 375 2375 GLY H Protein S 1 FT #SUB 91 2091 ASN K 94 2094 ARG H Protein S 1 FT #SUB 94 2094 ARG K 94 2094 ARG H Protein S 8 FT #SUB 94 2094 ARG K 182 2182 TYR H Protein A 5 FT #SUB 94 2094 ARG K 370 2370 MET H Protein S 5 FT #SUB 96 2096 SER K 374 2374 ASN H Protein S 3 FT #SUB 97 2097 LEU K 235 2235 PRO H Protein S 2 FT #SUB 98 2098 GLN K 374 2374 ASN H Protein S 1 FT #SUB 99 2099 GLU K 375 2375 GLY H Protein S 1 FT #SUB 109 2109 ARG K 585 2585 GLY H Protein B 1 FT #SUB 112 2112 LYS K 583 2583 ARG H Protein S 3 FT #SUB 112 2112 LYS K 584 2584 HIS H Protein B 1 FT #SUB 113 2113 ASP K 519 2519 ASN H Protein S 4 FT #SUB 113 2113 ASP K 585 2585 GLY H Protein S 3 FT #SUB 114 2114 ARG K 526 2526 ARG H Protein S 6 FT #SUB 115 2115 SER K 519 2519 ASN H Protein S 4 FT #SUB 115 2115 SER K 522 2522 SER H Protein A 4 FT #SUB 121 2121 THR K 615 2615 TRP H Protein A 8 FT #SUB 124 2124 SER K 615 2615 TRP H Protein S 1 FT #SUB 167 2167 ASN K 502 2502 LEU H Protein A 2 FT #SUB 168 2168 ARG K 503 2503 ASN H Protein B 3 FT #SUB 168 2168 ARG K 515 2515 ARG H Protein S 10 FT #SUB 168 2168 ARG K 587 2587 SER H Protein A 6 FT #SUB 169 2169 HIS K 585 2585 GLY H Protein S 3 FT #SUB 169 2169 HIS K 587 2587 SER H Protein S 2 FT #SUB 182 2182 TYR K 94 2094 ARG H Protein S 4 FT #SUB 199 2199 PRO K 235 2235 PRO H Protein S 1 FT #SUB 199 2199 PRO K 236 2236 ALA H Protein B 2 FT #SUB 199 2199 PRO K 237 2237 LEU H Protein B 4 FT #SUB 200 2200 PHE K 237 2237 LEU H Protein B 8 FT #SUB 202 2202 GLY K 237 2237 LEU H Protein B 1 FT #SUB 235 2235 PRO K 97 2097 LEU H Protein S 2 FT #SUB 236 2236 ALA K 199 2199 PRO H Protein B 1 FT #SUB 237 2237 LEU K 199 2199 PRO H Protein B 2 FT #SUB 237 2237 LEU K 200 2200 PHE H Protein A 7 FT #SUB 244 2244 PHE K 98 2098 GLN H Protein S 1 FT #SUB 341 2341 PRO K 614 2614 ALA H Protein S 1 FT #SUB 344 2344 LEU K 515 2515 ARG H Protein B 4 FT #SUB 345 2345 ASN K 515 2515 ARG H Protein B 5 FT #SUB 346 2346 PRO K 513 2513 GLU H Protein S 1 FT #SUB 346 2346 PRO K 515 2515 ARG H Protein S 4 FT #SUB 370 2370 MET K 94 2094 ARG H Protein S 4 FT #SUB 370 2370 MET K 370 2370 MET H Protein S 1 FT #SUB 374 2374 ASN K 96 2096 SER H Protein A 3 FT #SUB 374 2374 ASN K 98 2098 GLN H Protein S 2 FT #SUB 375 2375 GLY K 90 2090 LYS H Protein B 1 FT #SUB 375 2375 GLY K 99 2099 GLU H Protein B 1 FT #SUB 502 2502 LEU K 84 2084 ARG H Protein S 4 FT #SUB 502 2502 LEU K 167 2167 ASN H Protein S 3 FT #SUB 503 2503 ASN K 109 2109 ARG H Protein S 1 FT #SUB 503 2503 ASN K 168 2168 ARG H Protein A 4 FT #SUB 503 2503 ASN K 169 2169 HIS H Protein S 2 FT #SUB 503 2503 ASN K 170 2170 GLY H Protein S 2 FT #SUB 515 2515 ARG K 168 2168 ARG H Protein S 6 FT #SUB 515 2515 ARG K 344 2344 LEU H Protein S 3 FT #SUB 515 2515 ARG K 345 2345 ASN H Protein S 1 FT #SUB 515 2515 ARG K 346 2346 PRO H Protein S 6 FT #SUB 519 2519 ASN K 113 2113 ASP H Protein S 3 FT #SUB 519 2519 ASN K 115 2115 SER H Protein A 4 FT #SUB 522 2522 SER K 115 2115 SER H Protein S 2 FT #SUB 526 2526 ARG K 114 2114 ARG H Protein S 6 FT #SUB 582 2582 ARG K 109 2109 ARG H Protein B 1 FT #SUB 583 2583 ARG K 112 2112 LYS H Protein B 1 FT #SUB 584 2584 HIS K 112 2112 LYS H Protein B 1 FT #SUB 585 2585 GLY K 109 2109 ARG H Protein B 1 FT #SUB 585 2585 GLY K 113 2113 ASP H Protein B 7 FT #SUB 585 2585 GLY K 169 2169 HIS H Protein B 2 FT #SUB 587 2587 SER K 168 2168 ARG H Protein A 7 FT #SUB 587 2587 SER K 169 2169 HIS H Protein S 4 FT #SUB 614 2614 ALA K 341 2341 PRO H Protein S 1 FT #SUB 614 2614 ALA K 342 2342 TYR H Protein B 1 FT #SUB 615 2615 TRP K 116 2116 SER H Protein S 3 FT #SUB 615 2615 TRP K 121 2121 THR H Protein S 3 FT #SUB 615 2615 TRP K 125 2125 PHE H Protein S 1 FT #SUB 615 2615 TRP K 342 2342 TYR H Protein S 6 FT #SUB 800 2800 LYS K 100 3020 SER L Protein S 1 FT #SUB 800 2800 LYS K 101 3021 LEU L Protein S 1 FT #SUB 800 2800 LYS K 103 3023 ILE L Protein A 5 FT #SUB 802 2802 ASP K 10 2930 LEU L Protein S 1 FT #SUB 802 2802 ASP K 12 2932 PRO L Protein S 4 FT #SUB 867 2867 PRO K 12 2932 PRO L Protein S 2 FT #SUB 868 2868 GLU K 11 2931 THR L Protein S 2 FT #SUB 868 2868 GLU K 12 2932 PRO L Protein A 18 FT #SUB 868 2868 GLU K 13 2933 SER L Protein S 6 FT #SUB 903 2903 TYR K 12 2932 PRO L Protein S 2 FT #SUB 609 2609 ALA K 106 106 GLN M Protein S 1 FT #SUB 610 2610 ASP K 124 124 TYR M Protein S 4 FT #SUB 612 2612 TYR K 138 138 ALA M Protein S 3 FT #SUB 612 2612 TYR K 189 189 ARG M Protein S 7 FT #SUB 617 2617 ASP K 189 189 ARG M Protein S 5 FT #SUB 617 2617 ASP K 190 190 PHE M Protein B 4 FT #SUB 617 2617 ASP K 191 191 THR M Protein S 1 FT #SUB 619 2619 VAL K 136 136 ALA M Protein S 3 FT #SUB 619 2619 VAL K 138 138 ALA M Protein S 2 FT #SUB 619 2619 VAL K 190 190 PHE M Protein S 3 FT #SUB 625 2625 LEU K 107 107 HIS M Protein S 2 FT #SUB 626 2626 ARG K 108 108 PRO M Protein S 3 FT #SUB 626 2626 ARG K 109 109 LEU M Protein S 2 FT #SUB 632 2632 GLU K 117 117 LYS M Protein B 2 FT #SUB 634 2634 THR K 117 117 LYS M Protein S 6 FT #SUB 635 2635 TYR K 109 109 LEU M Protein S 1 FT #SUB 636 2636 THR K 109 109 LEU M Protein B 1 FT #SUB 637 2637 VAL K 111 111 ILE M Protein S 1 FT #SUB 637 2637 VAL K 118 118 ALA M Protein S 1 FT #SUB 691 2691 GLN K 115 115 GLY M Protein S 1 FT #SUB 692 2692 LYS K 116 116 LYS M Protein A 2 FT #SUB 693 2693 TYR K 116 116 LYS M Protein S 2 FT #SUB 693 2693 TYR K 117 117 LYS M Protein S 3 FT #HET 126 2126 HIS K 163 3001 CUO K S 4 FT #HET 136 2136 THR K 14 1 NAG j B 3 FT #HET 137 2137 ALA K 14 1 NAG j B 1 FT #HET 138 2138 LYS K 14 1 NAG j S 2 FT #HET 138 2138 LYS K 15 2 NAG j S 3 FT #HET 144 2144 CYS K 163 3001 CUO K S 1 FT #HET 146 2146 HIS K 163 3001 CUO K S 7 FT #HET 151 2151 PHE K 163 3001 CUO K S 1 FT #HET 155 2155 HIS K 163 3001 CUO K S 8 FT #HET 267 2267 HIS K 163 3001 CUO K S 7 FT #HET 271 2271 HIS K 163 3001 CUO K S 6 FT #HET 294 2294 PHE K 163 3001 CUO K S 1 FT #HET 298 2298 HIS K 163 3001 CUO K S 8 FT #HET 405 2405 SER K 47 1 NAG x S 3 FT #HET 429 2429 LEU K 163 3001 CUO K S 1 FT #HET 543 2543 HIS K 164 3002 CUO K S 7 FT #HET 559 2559 CYS K 164 3002 CUO K S 1 FT #HET 561 2561 HIS K 164 3002 CUO K S 6 FT #HET 566 2566 PHE K 164 3002 CUO K S 1 FT #HET 570 2570 HIS K 164 3002 CUO K S 7 FT #HET 680 2680 HIS K 164 3002 CUO K S 6 FT #HET 684 2684 HIS K 164 3002 CUO K S 6 FT #HET 707 2707 PHE K 164 3002 CUO K S 3 FT #HET 711 2711 HIS K 164 3002 CUO K S 6 FT #MOD 472 2472 ASN K 47 1 NAG x S FT DISORDER 1 82 FT DISORDER 908 920 CC SEQUENCE 825 AA (ATOM); CC VRGNLVRKNV DRLSLQEINS LIHALKRMQK DRSSDGFETI ASFHALPPLC PNPTAKHRHA CC CCLHGMATFP QWHRLYVVQF EHSLNRHGAI VGVPYWDWTY PMTEVPGLLT SEKYTDPFTG CC IETFNPFNHG HISFISPETM TTREVSEHLF EQPALGKQTW LFNNIILALE QTDYCDFEVQ CC FEIVHNSIHS WLGGKELYSL NHLHYAAYDP AFFLHHSNVD RLWVVWQELQ KFRGLPAYES CC NCAIELMSQP LKPFSFGAPY NLNPVTTKYS KPSDVFNYKQ NFHYEYDMLE MNGMSIAQLE CC SYIRQERQKD RVFAGFLLEG FGSSAYATFQ VCPDVGDCYE GSHFSVLGGS TEMPWAFDRL CC YKMEITDILQ AMALKFDSHF TIKTKIVAHN GTELPESLLP EATIVRIPPS AQNLEVAIPL CC NRIRRNINSL ESRDVQNLMS ALKRLKEDES DFGFQTIAGY HGSLMCPTPE APEYACCLHG CC MPTFMHWHRV YLLHFEESMR RHGASVAVPY WDWTMPSDNL PSLLGDADYY DAWTDSVIEN CC PFLRGHIKYE DTYTVREIQP ELFALAEGQK ESTLFKDVML MFEQEDYCDF EVQAEVIHNS CC IHYLIGGHQK YAMSSLMFSS FDPIFYVHHS MVDRLWAIWQ ELQKHRKLPH DKAYCALDQM CC AFPMKPFIWE SNPNPTTRAV STPSKLFDYK SLGYDYDHLN FHGMSIGQLE ALIQKQKKAD CC RVFAGFLLHG IKISADVHLK ICIEADCQEA GVIFVLGGET EMPWHFDRNY KMDITDVLKK CC RNIPPEALFE HDSKIRLEVE IKSVDGAVLD PNSLPKPSLI YAPAK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES YAGTFAVLGGETEMPWAFDRLFRYEISDEMKKLQLTEDSKFRLTTNIIAS CC ATOM -------------------------------------------------- CC CC SEQRES NGSKVSNDIFPTPTVIFVPKKERTEQVSTTKSVRGNLVRKNVDRLSLQEI CC ATOM --------------------------------VRGNLVRKNVDRLSLQEI CC ****************** CC SEQRES NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ATOM NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ************************************************** CC SEQRES FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ATOM FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ************************************************** CC SEQRES TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ATOM TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ************************************************** CC SEQRES LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ATOM LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ************************************************** CC SEQRES VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ATOM VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ************************************************** CC SEQRES YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ATOM YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ************************************************** CC SEQRES EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ATOM EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ************************************************** CC SEQRES LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ATOM LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ************************************************** CC SEQRES PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ATOM PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ************************************************** CC SEQRES PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ATOM PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ************************************************** CC SEQRES NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ATOM NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ************************************************** CC SEQRES QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ATOM QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ************************************************** CC SEQRES SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ATOM SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ************************************************** CC SEQRES WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ATOM WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ************************************************** CC SEQRES ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ATOM ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ************************************************** CC SEQRES NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ATOM NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ************************************************** CC SEQRES LIYAPAKGLIIQQVGEYDAG CC ATOM LIYAPAK------------- CC ******* SQ SEQUENCE 920 AA; MW; CN; YAGTFAVLGG ETEMPWAFDR LFRYEISDEM KKLQLTEDSK FRLTTNIIAS NGSKVSNDIF PTPTVIFVPK KERTEQVSTT KSVRGNLVRK NVDRLSLQEI NSLIHALKRM QKDRSSDGFE TIASFHALPP LCPNPTAKHR HACCLHGMAT FPQWHRLYVV QFEHSLNRHG AIVGVPYWDW TYPMTEVPGL LTSEKYTDPF TGIETFNPFN HGHISFISPE TMTTREVSEH LFEQPALGKQ TWLFNNIILA LEQTDYCDFE VQFEIVHNSI HSWLGGKELY SLNHLHYAAY DPAFFLHHSN VDRLWVVWQE LQKFRGLPAY ESNCAIELMS QPLKPFSFGA PYNLNPVTTK YSKPSDVFNY KQNFHYEYDM LEMNGMSIAQ LESYIRQERQ KDRVFAGFLL EGFGSSAYAT FQVCPDVGDC YEGSHFSVLG GSTEMPWAFD RLYKMEITDI LQAMALKFDS HFTIKTKIVA HNGTELPESL LPEATIVRIP PSAQNLEVAI PLNRIRRNIN SLESRDVQNL MSALKRLKED ESDFGFQTIA GYHGSLMCPT PEAPEYACCL HGMPTFMHWH RVYLLHFEES MRRHGASVAV PYWDWTMPSD NLPSLLGDAD YYDAWTDSVI ENPFLRGHIK YEDTYTVREI QPELFALAEG QKESTLFKDV MLMFEQEDYC DFEVQAEVIH NSIHYLIGGH QKYAMSSLMF SSFDPIFYVH HSMVDRLWAI WQELQKHRKL PHDKAYCALD QMAFPMKPFI WESNPNPTTR AVSTPSKLFD YKSLGYDYDH LNFHGMSIGQ LEALIQKQKK ADRVFAGFLL HGIKISADVH LKICIEADCQ EAGVIFVLGG ETEMPWHFDR NYKMDITDVL KKRNIPPEAL FEHDSKIRLE VEIKSVDGAV LDPNSLPKPS LIYAPAKGLI IQQVGEYDAG // ID 4YD9L STANDARD; PRT; 394 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 279 3199 GLN L 417 417 MET G Protein S 5 FT #SUB 348 3268 LYS L 1534 1534 GLY G Protein S 1 FT #SUB 348 3268 LYS L 1539 1539 GLN G Protein S 2 FT #SUB 351 3271 VAL L 1533 1533 ALA G Protein A 3 FT #SUB 352 3272 HIS L 1404 1404 TYR G Protein B 1 FT #SUB 352 3272 HIS L 1535 1535 LEU G Protein S 5 FT #SUB 352 3272 HIS L 1539 1539 GLN G Protein S 1 FT #SUB 352 3272 HIS L 1543 1543 ILE G Protein S 4 FT #SUB 354 3274 ARG L 1350 1350 GLU G Protein S 4 FT #SUB 354 3274 ARG L 1533 1533 ALA G Protein S 1 FT #SUB 10 2930 LEU L 802 2802 ASP K Protein B 1 FT #SUB 11 2931 THR L 868 2868 GLU K Protein B 2 FT #SUB 12 2932 PRO L 802 2802 ASP K Protein S 4 FT #SUB 12 2932 PRO L 867 2867 PRO K Protein S 2 FT #SUB 12 2932 PRO L 868 2868 GLU K Protein A 18 FT #SUB 12 2932 PRO L 903 2903 TYR K Protein S 2 FT #SUB 13 2933 SER L 868 2868 GLU K Protein A 6 FT #SUB 100 3020 SER L 800 2800 LYS K Protein S 1 FT #SUB 101 3021 LEU L 800 2800 LYS K Protein B 1 FT #SUB 103 3023 ILE L 800 2800 LYS K Protein S 5 FT #HET 41 2961 HIS L 138 3401 CUO C S 8 FT #HET 60 2980 HIS L 138 3401 CUO C S 3 FT #HET 69 2989 HIS L 138 3401 CUO C S 13 FT #HET 118 3038 ILE L 144 5016 CUO D B 1 FT #HET 121 3041 ALA L 144 5016 CUO D B 6 FT #HET 122 3042 ASP L 144 5016 CUO D A 29 FT #HET 123 3043 THR L 144 5016 CUO D B 6 FT #HET 169 3089 HIS L 138 3401 CUO C S 7 FT #HET 173 3093 HIS L 138 3401 CUO C S 5 FT #HET 196 3116 PHE L 138 3401 CUO C S 5 FT #HET 199 3119 HIS L 138 3401 CUO C S 1 FT #HET 200 3120 HIS L 138 3401 CUO C S 10 FT DISORDER 1 7 FT DISORDER 132 141 FT DISORDER 313 319 FT DISORDER 391 394 CC SEQUENCE 366 AA (ATOM); CC NSLTPSEIEN LRNALAAVQA DKTDAGYQKI ASFHGMPLSC QYPDGTAFAC CQHGMVTFPH CC WHRLYMKQME DALKAKGAKI GIPYWDWTTA FHSLPILVTE PKNNPFHHGY IDVADTKTTR CC DPRPQSFFYR QIAFALEQRD FCDFEIQFEM GHNAIHSWVG GPSPYGMSTL HYTSYDPLFY CC VHHSNTDRIW AIWQALQKYR GLPYNSANCE INKLKKPMMP FSSEDNPNEV TKAHSTGYKS CC FDYQQLNYEY DNLNFHGMTI PQLEVHLKKI QEKDRVFAGF LLRAIGQSAD VNFDVTFGGT CC FCVLGGDYEM PWAFDRLFLY DISKSLVHLR LDAHDDFDIK VTIMGIDGKS LPPNLLPSPT CC ILFKPG CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMVRKNVNSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ATOM -------NSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ******************************************* CC SEQRES DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ATOM DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ************************************************** CC SEQRES LPILVTEPKNNPFHHGYIDVADTKTTRDPRPQLFDDPEQGDQSFFYRQIA CC ATOM LPILVTEPKNNPFHHGYIDVADTKTTRDPRP----------QSFFYRQIA CC ******************************* ********* CC SEQRES FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ATOM FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ************************************************** CC SEQRES SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ATOM SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ************************************************** CC SEQRES HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ATOM HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ************************************************** CC SEQRES AIGQSADVNFDVCRKDGECTFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ATOM AIGQSADVNFDV-------TFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ************ ******************************* CC SEQRES VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPGTGKI CC ATOM VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPG---- CC **************************************** SQ SEQUENCE 394 AA; MW; CN; SMVRKNVNSL TPSEIENLRN ALAAVQADKT DAGYQKIASF HGMPLSCQYP DGTAFACCQH GMVTFPHWHR LYMKQMEDAL KAKGAKIGIP YWDWTTAFHS LPILVTEPKN NPFHHGYIDV ADTKTTRDPR PQLFDDPEQG DQSFFYRQIA FALEQRDFCD FEIQFEMGHN AIHSWVGGPS PYGMSTLHYT SYDPLFYVHH SNTDRIWAIW QALQKYRGLP YNSANCEINK LKKPMMPFSS EDNPNEVTKA HSTGYKSFDY QQLNYEYDNL NFHGMTIPQL EVHLKKIQEK DRVFAGFLLR AIGQSADVNF DVCRKDGECT FGGTFCVLGG DYEMPWAFDR LFLYDISKSL VHLRLDAHDD FDIKVTIMGI DGKSLPPNLL PSPTILFKPG TGKI // ID 4YD9M STANDARD; PRT; 2000 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 388 388 GLY M 136 2136 THR H Protein B 5 FT #SUB 389 389 THR M 136 2136 THR H Protein A 7 FT #SUB 328 328 LYS M 1571 1571 TYR J Protein S 7 FT #SUB 328 328 LYS M 1631 1631 GLU J Protein S 4 FT #SUB 329 329 ARG M 1574 1574 VAL J Protein S 7 FT #SUB 329 329 ARG M 1582 1582 ASN J Protein S 2 FT #SUB 329 329 ARG M 1583 1583 CYS J Protein A 14 FT #SUB 329 329 ARG M 1584 1584 GLY J Protein S 6 FT #SUB 330 330 GLN M 1581 1581 GLU J Protein B 1 FT #SUB 330 330 GLN M 1582 1582 ASN J Protein B 5 FT #SUB 330 330 GLN M 1583 1583 CYS J Protein A 4 FT #SUB 331 331 SER M 1581 1581 GLU J Protein A 2 FT #SUB 331 331 SER M 1582 1582 ASN J Protein A 3 FT #SUB 332 332 ASP M 1581 1581 GLU J Protein B 2 FT #SUB 335 335 ASP M 1578 1578 ILE J Protein S 4 FT #SUB 335 335 ASP M 1579 1579 GLY J Protein S 6 FT #SUB 472 472 ASP M 1638 1638 SER J Protein S 3 FT #SUB 472 472 ASP M 1640 1640 ILE J Protein S 6 FT #SUB 1362 1362 ALA M 1378 1378 PHE J Protein S 1 FT #SUB 1365 1365 TYR M 1437 1437 ARG J Protein S 14 FT #SUB 1372 1372 ILE M 1390 1390 ASP J Protein S 1 FT #SUB 1372 1372 ILE M 1392 1392 ASP J Protein S 1 FT #SUB 1372 1372 ILE M 1438 1438 ASP J Protein S 2 FT #SUB 1392 1392 ASP M 1372 1372 ILE J Protein S 1 FT #SUB 1437 1437 ARG M 1365 1365 TYR J Protein S 17 FT #SUB 1438 1438 ASP M 1372 1372 ILE J Protein S 2 FT #SUB 1439 1439 VAL M 1371 1371 GLN J Protein S 1 FT #SUB 1571 1571 TYR M 328 328 LYS J Protein S 6 FT #SUB 1574 1574 VAL M 329 329 ARG J Protein B 1 FT #SUB 1581 1581 GLU M 330 330 GLN J Protein S 3 FT #SUB 1581 1581 GLU M 331 331 SER J Protein S 9 FT #SUB 1582 1582 ASN M 329 329 ARG J Protein B 2 FT #SUB 1582 1582 ASN M 330 330 GLN J Protein B 1 FT #SUB 1582 1582 ASN M 331 331 SER J Protein B 4 FT #SUB 1583 1583 CYS M 329 329 ARG J Protein B 14 FT #SUB 1584 1584 GLY M 329 329 ARG J Protein B 3 FT #SUB 1585 1585 ASN M 329 329 ARG J Protein S 4 FT #SUB 1631 1631 GLU M 328 328 LYS J Protein S 4 FT #SUB 1638 1638 SER M 472 472 ASP J Protein S 3 FT #SUB 1639 1639 LYS M 472 472 ASP J Protein A 3 FT #SUB 1639 1639 LYS M 473 473 ALA J Protein S 3 FT #SUB 1639 1639 LYS M 474 474 LEU J Protein S 1 FT #SUB 1640 1640 ILE M 472 472 ASP J Protein S 2 FT #SUB 106 106 GLN M 609 2609 ALA K Protein B 1 FT #SUB 107 107 HIS M 625 2625 LEU K Protein S 2 FT #SUB 108 108 PRO M 626 2626 ARG K Protein S 3 FT #SUB 109 109 LEU M 626 2626 ARG K Protein S 2 FT #SUB 109 109 LEU M 635 2635 TYR K Protein S 1 FT #SUB 109 109 LEU M 636 2636 THR K Protein S 1 FT #SUB 111 111 ILE M 637 2637 VAL K Protein S 1 FT #SUB 115 115 GLY M 691 2691 GLN K Protein B 1 FT #SUB 116 116 LYS M 692 2692 LYS K Protein A 2 FT #SUB 116 116 LYS M 693 2693 TYR K Protein B 2 FT #SUB 117 117 LYS M 632 2632 GLU K Protein S 2 FT #SUB 117 117 LYS M 634 2634 THR K Protein S 6 FT #SUB 117 117 LYS M 693 2693 TYR K Protein S 3 FT #SUB 118 118 ALA M 637 2637 VAL K Protein S 1 FT #SUB 124 124 TYR M 610 2610 ASP K Protein S 4 FT #SUB 136 136 ALA M 619 2619 VAL K Protein S 3 FT #SUB 138 138 ALA M 612 2612 TYR K Protein S 3 FT #SUB 138 138 ALA M 619 2619 VAL K Protein A 2 FT #SUB 189 189 ARG M 612 2612 TYR K Protein S 7 FT #SUB 189 189 ARG M 617 2617 ASP K Protein A 5 FT #SUB 190 190 PHE M 617 2617 ASP K Protein A 4 FT #SUB 190 190 PHE M 619 2619 VAL K Protein S 3 FT #SUB 191 191 THR M 617 2617 ASP K Protein S 1 FT #SUB 1071 1071 ASN M 871 2871 PHE Q Protein S 1 FT #SUB 1071 1071 ASN M 901 2901 LEU Q Protein B 1 FT #SUB 1071 1071 ASN M 902 2902 ILE Q Protein B 4 FT #SUB 1071 1071 ASN M 903 2903 TYR Q Protein A 6 FT #SUB 1073 1073 ALA M 900 2900 SER Q Protein B 1 FT #SUB 1073 1073 ALA M 901 2901 LEU Q Protein B 1 FT #SUB 1074 1074 ILE M 870 2870 LEU Q Protein S 1 FT #SUB 1074 1074 ILE M 871 2871 PHE Q Protein S 1 FT #SUB 1074 1074 ILE M 901 2901 LEU Q Protein A 7 FT #SUB 1074 1074 ILE M 903 2903 TYR Q Protein S 1 FT #SUB 1075 1075 GLU M 898 2898 LYS Q Protein S 1 FT #SUB 1075 1075 GLU M 899 2899 PRO Q Protein S 3 FT #SUB 1075 1075 GLU M 900 2900 SER Q Protein S 4 FT #SUB 1078 1078 ARG M 869 2869 ALA Q Protein S 1 FT #SUB 1078 1078 ARG M 870 2870 LEU Q Protein S 6 FT #SUB 1078 1078 ARG M 871 2871 PHE Q Protein S 1 FT #SUB 1078 1078 ARG M 872 2872 GLU Q Protein S 5 FT #SUB 1078 1078 ARG M 875 2875 SER Q Protein S 3 FT #SUB 1078 1078 ARG M 877 2877 ILE Q Protein S 8 FT #SUB 1079 1079 LYS M 898 2898 LYS Q Protein S 1 FT #SUB 1101 1101 VAL M 873 2873 HIS Q Protein S 1 FT #SUB 1103 1103 PHE M 871 2871 PHE Q Protein S 7 FT #SUB 1147 1147 MET M 848 2848 PHE Q Protein S 1 FT #SUB 1147 1147 MET M 849 2849 ASP Q Protein S 3 FT #SUB 1187 1187 LYS M 809 2809 LEU Q Protein S 1 FT #SUB 1187 1187 LYS M 847 2847 HIS Q Protein S 5 FT #SUB 1188 1188 PHE M 809 2809 LEU Q Protein B 1 FT #SUB 1189 1189 ASP M 809 2809 LEU Q Protein B 1 FT #SUB 1189 1189 ASP M 851 2851 ASN Q Protein A 2 FT #SUB 1191 1191 PRO M 849 2849 ASP Q Protein S 3 FT #SUB 1207 1207 TYR M 736 2736 TYR Q Protein S 1 FT #SUB 1207 1207 TYR M 739 2739 LEU Q Protein A 6 FT #SUB 1208 1208 THR M 743 2743 ALA Q Protein B 1 FT #SUB 1210 1210 LYS M 744 2744 PHE Q Protein S 4 FT #SUB 1211 1211 TYR M 739 2739 LEU Q Protein S 4 FT #SUB 1212 1212 HIS M 744 2744 PHE Q Protein S 7 FT #SUB 1230 1230 LEU M 847 2847 HIS Q Protein S 1 FT #SUB 1232 1232 THR M 740 2740 ASP Q Protein A 12 FT #SUB 1233 1233 SER M 738 2738 ALA Q Protein A 4 FT #SUB 1233 1233 SER M 740 2740 ASP Q Protein B 2 FT #SUB 1233 1233 SER M 849 2849 ASP Q Protein S 1 FT #SUB 1234 1234 VAL M 737 2737 CYS Q Protein B 2 FT #SUB 1234 1234 VAL M 738 2738 ALA Q Protein B 3 FT #SUB 1234 1234 VAL M 739 2739 LEU Q Protein B 4 FT #SUB 1235 1235 ILE M 736 2736 TYR Q Protein A 4 FT #SUB 1236 1236 TYR M 736 2736 TYR Q Protein A 6 FT #SUB 1238 1238 PRO M 736 2736 TYR Q Protein S 1 FT #SUB 1245 1245 GLU M 729 2729 LYS Q Protein A 3 FT #SUB 1245 1245 GLU M 731 2731 PRO Q Protein S 1 FT #SUB 1245 1245 GLU M 736 2736 TYR Q Protein S 4 FT #SUB 1481 1481 GLU M 459 2459 ASP Q Protein S 1 FT #SUB 1483 1483 ASN M 458 2458 PHE Q Protein S 1 FT #SUB 1483 1483 ASN M 487 2487 VAL Q Protein B 2 FT #SUB 1483 1483 ASN M 488 2488 ARG Q Protein A 8 FT #SUB 1484 1484 CYS M 486 2486 ILE Q Protein B 2 FT #SUB 1484 1484 CYS M 487 2487 VAL Q Protein B 3 FT #SUB 1486 1486 LEU M 458 2458 PHE Q Protein S 1 FT #SUB 1486 1486 LEU M 486 2486 ILE Q Protein A 5 FT #SUB 1486 1486 LEU M 488 2488 ARG Q Protein S 1 FT #SUB 1487 1487 PRO M 486 2486 ILE Q Protein S 3 FT #SUB 1490 1490 ASN M 459 2459 ASP Q Protein S 1 FT #SUB 1490 1490 ASN M 461 2461 HIS Q Protein S 3 FT #SUB 1558 1558 LEU M 440 2440 ASP Q Protein S 1 FT #SUB 1603 1603 PHE M 399 2399 LEU Q Protein B 1 FT #SUB 1604 1604 ASP M 440 2440 ASP Q Protein B 1 FT #SUB 1604 1604 ASP M 441 2441 ARG Q Protein B 2 FT #SUB 1604 1604 ASP M 442 2442 LEU Q Protein A 7 FT #SUB 1605 1605 ARG M 440 2440 ASP Q Protein B 2 FT #SUB 1606 1606 LEU M 440 2440 ASP Q Protein A 6 FT #SUB 1622 1622 GLN M 326 2326 ILE Q Protein B 1 FT #SUB 1623 1623 ASN M 321 2321 GLU Q Protein S 1 FT #SUB 1648 1648 PRO M 327 2327 GLU Q Protein B 2 FT #SUB 1649 1649 THR M 327 2327 GLU Q Protein A 8 FT #SUB 1650 1650 ILE M 324 2324 CYS Q Protein B 1 FT #SUB 1650 1650 ILE M 325 2325 ALA Q Protein B 3 FT #SUB 1650 1650 ILE M 326 2326 ILE Q Protein A 4 FT #SUB 1650 1650 ILE M 327 2327 GLU Q Protein B 1 FT #SUB 1651 1651 ILE M 323 2323 ASN Q Protein B 2 FT #SUB 1652 1652 PHE M 323 2323 ASN Q Protein A 9 FT #SUB 1654 1654 PRO M 323 2323 ASN Q Protein S 2 FT #SUB 416 416 ASN M 276 3196 THR R Protein S 12 FT #SUB 416 416 ASN M 279 3199 GLN R Protein S 3 FT #SUB 1533 1533 ALA M 354 3274 ARG R Protein B 2 FT #SUB 1539 1539 GLN M 348 3268 LYS R Protein S 1 FT #SUB 1539 1539 GLN M 352 3272 HIS R Protein S 1 FT #SUB 1543 1543 ILE M 352 3272 HIS R Protein S 3 FT #HET 41 41 HIS M 166 2102 CUO M S 8 FT #HET 58 58 CYS M 166 2102 CUO M S 1 FT #HET 60 60 HIS M 166 2102 CUO M S 6 FT #HET 65 65 PHE M 166 2102 CUO M S 1 FT #HET 69 69 HIS M 166 2102 CUO M S 9 FT #HET 179 179 HIS M 166 2102 CUO M S 4 FT #HET 183 183 HIS M 166 2102 CUO M S 6 FT #HET 206 206 PHE M 166 2102 CUO M S 2 FT #HET 210 210 HIS M 166 2102 CUO M S 9 FT #HET 311 311 ILE M 50 2 NAG y B 1 FT #HET 313 313 THR M 49 1 NAG y S 3 FT #HET 389 389 THR M 49 1 NAG y S 2 FT #HET 391 391 MET M 50 2 NAG y S 4 FT #HET 462 462 HIS M 167 2103 CUO M S 6 FT #HET 480 480 CYS M 167 2103 CUO M S 1 FT #HET 482 482 HIS M 167 2103 CUO M S 6 FT #HET 487 487 PHE M 167 2103 CUO M S 2 FT #HET 491 491 HIS M 167 2103 CUO M S 7 FT #HET 603 603 HIS M 167 2103 CUO M S 8 FT #HET 607 607 HIS M 167 2103 CUO M S 6 FT #HET 634 634 HIS M 167 2103 CUO M S 6 FT #HET 763 763 LEU M 167 2103 CUO M S 1 FT #HET 804 804 ASP M 52 1 NAG z S 7 FT #HET 808 808 THR M 52 1 NAG z S 3 FT #HET 810 810 LEU M 52 1 NAG z S 2 FT #HET 877 877 HIS M 168 2104 CUO M S 5 FT #HET 895 895 CYS M 168 2104 CUO M S 1 FT #HET 897 897 HIS M 168 2104 CUO M S 3 FT #HET 902 902 PHE M 168 2104 CUO M S 3 FT #HET 906 906 HIS M 168 2104 CUO M S 11 FT #HET 1015 1015 HIS M 168 2104 CUO M S 6 FT #HET 1019 1019 HIS M 168 2104 CUO M S 5 FT #HET 1042 1042 PHE M 168 2104 CUO M S 3 FT #HET 1046 1046 HIS M 168 2104 CUO M S 8 FT #HET 1294 1294 HIS M 169 2105 CUO M S 7 FT #HET 1298 1298 ALA M 58 2 NAG 0 B 2 FT #HET 1299 1299 GLN M 57 1 NAG 0 B 5 FT #HET 1301 1301 PRO M 57 1 NAG 0 S 2 FT #HET 1308 1308 VAL M 58 2 NAG 0 S 2 FT #HET 1312 1312 CYS M 169 2105 CUO M S 1 FT #HET 1314 1314 HIS M 169 2105 CUO M S 5 FT #HET 1323 1323 HIS M 169 2105 CUO M S 9 FT #HET 1427 1427 HIS M 169 2105 CUO M S 6 FT #HET 1431 1431 HIS M 169 2105 CUO M S 8 FT #HET 1454 1454 PHE M 169 2105 CUO M S 5 FT #HET 1458 1458 HIS M 169 2105 CUO M S 9 FT #HET 1494 1494 ARG M 57 1 NAG 0 S 3 FT #HET 1500 1500 THR M 57 1 NAG 0 B 6 FT #HET 1501 1501 ALA M 57 1 NAG 0 B 1 FT #HET 1562 1562 ILE M 62 2 NAG 1 S 2 FT #HET 1564 1564 ALA M 61 1 NAG 1 S 2 FT #HET 1565 1565 SER M 61 1 NAG 1 B 1 FT #HET 1634 1634 ALA M 61 1 NAG 1 S 1 FT #HET 1635 1635 VAL M 61 1 NAG 1 S 1 FT #MOD 387 387 ASN M 49 1 NAG y S FT #MOD 806 806 ASN M 52 1 NAG z S FT #MOD 1498 1498 ASN M 57 1 NAG 0 S FT #MOD 1636 1636 ASN M 61 1 NAG 1 S FT DISORDER 1657 2000 CC SEQUENCE 1656 AA (ATOM); CC NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH CC GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK CC NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN CC PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF CC HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA CC RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH CC EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS CC SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC CC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET CC KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE CC VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA CC NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV CC EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE CC ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR CC KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA CC TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF CC SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA CC LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP CC MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN CC RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM CC DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR CC KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP CC HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG CC KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH CC SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST CC ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH CC GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL CC NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSD CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ATOM NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ************************************************** CC SEQRES DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ATOM DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ************************************************** CC SEQRES LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ATOM LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ************************************************** CC SEQRES QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ATOM QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ************************************************** CC SEQRES SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ATOM SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ************************************************** CC SEQRES TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ATOM TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ************************************************** CC SEQRES RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ATOM RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ************************************************** CC SEQRES PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ATOM PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ************************************************** CC SEQRES VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ATOM VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ************************************************** CC SEQRES SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ATOM SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ************************************************** CC SEQRES ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ATOM ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ************************************************** CC SEQRES EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ATOM EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ************************************************** CC SEQRES VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ATOM VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ************************************************** CC SEQRES RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ATOM RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ************************************************** CC SEQRES YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ATOM YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ************************************************** CC SEQRES GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ATOM GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ************************************************** CC SEQRES DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ATOM DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ************************************************** CC SEQRES MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ATOM MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ************************************************** CC SEQRES TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ATOM TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ************************************************** CC SEQRES NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ATOM NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ************************************************** CC SEQRES QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ATOM QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ************************************************** CC SEQRES RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ATOM RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ************************************************** CC SEQRES VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ATOM VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ************************************************** CC SEQRES LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ATOM LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ************************************************** CC SEQRES DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ATOM DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ************************************************** CC SEQRES SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ATOM SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ************************************************** CC SEQRES PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ATOM PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ************************************************** CC SEQRES PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ATOM PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ************************************************** CC SEQRES RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ATOM RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ************************************************** CC SEQRES DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ATOM DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ************************************************** CC SEQRES ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ATOM ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ************************************************** CC SEQRES DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ATOM DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ************************************************** CC SEQRES WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ATOM WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ************************************************** CC SEQRES IFVPSDVEFEEDTWRDVVTSANRIRRNLKDLSKEDMFSLRAAFKRMTDDG CC ATOM IFVPSD-------------------------------------------- CC ****** CC SEQRES RYEEIAAFHGLPAQCPNADGSNIHTCCLHGMPTFPHWHRLYLSLVENELL CC ATOM -------------------------------------------------- CC CC SEQRES ARGSDVAVPYWDWIEPFDSLPGLISDETYKHPKTNEDIENPFHHGKISFA CC ATOM -------------------------------------------------- CC CC SEQRES DAVTVRKPRDQLFNNRYLYEHALFAFEHTDFCDFEVHFEVLHNSIHSWIG CC ATOM -------------------------------------------------- CC CC SEQRES GPNPHSMSSLDFAAYDPIFFLHHSTVDRLWAIWQDLQRYRKLDYNVANCA CC ATOM -------------------------------------------------- CC CC SEQRES LNLLNDPMRPFNNKTANQDHLTFTNSRPNDVFDYQNSLNYKFDSLSFSGL CC ATOM -------------------------------------------------- CC CC SEQRES SIPRLDDLLESRQSHDRVFAGFWLSGIKASADVNIHICVPIGVEHEDCDN CC ATOM -------------------------------------------------- CC CC SEQRES CC ATOM CC SQ SEQUENCE 2000 AA; MW; CN; NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSDVEFE EDTWRDVVTS ANRIRRNLKD LSKEDMFSLR AAFKRMTDDG RYEEIAAFHG LPAQCPNADG SNIHTCCLHG MPTFPHWHRL YLSLVENELL ARGSDVAVPY WDWIEPFDSL PGLISDETYK HPKTNEDIEN PFHHGKISFA DAVTVRKPRD QLFNNRYLYE HALFAFEHTD FCDFEVHFEV LHNSIHSWIG GPNPHSMSSL DFAAYDPIFF LHHSTVDRLW AIWQDLQRYR KLDYNVANCA LNLLNDPMRP FNNKTANQDH LTFTNSRPND VFDYQNSLNY KFDSLSFSGL SIPRLDDLLE SRQSHDRVFA GFWLSGIKAS ADVNIHICVP IGVEHEDCDN // ID 4YD9N STANDARD; PRT; 920 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 609 2609 ALA N 106 106 GLN J Protein S 1 FT #SUB 610 2610 ASP N 124 124 TYR J Protein S 2 FT #SUB 610 2610 ASP N 137 137 ARG J Protein S 1 FT #SUB 612 2612 TYR N 138 138 ALA J Protein S 1 FT #SUB 612 2612 TYR N 189 189 ARG J Protein S 3 FT #SUB 617 2617 ASP N 189 189 ARG J Protein S 5 FT #SUB 617 2617 ASP N 190 190 PHE J Protein B 4 FT #SUB 617 2617 ASP N 191 191 THR J Protein S 1 FT #SUB 619 2619 VAL N 136 136 ALA J Protein S 2 FT #SUB 619 2619 VAL N 137 137 ARG J Protein S 1 FT #SUB 619 2619 VAL N 138 138 ALA J Protein S 2 FT #SUB 619 2619 VAL N 190 190 PHE J Protein S 3 FT #SUB 625 2625 LEU N 107 107 HIS J Protein S 3 FT #SUB 626 2626 ARG N 109 109 LEU J Protein S 2 FT #SUB 632 2632 GLU N 117 117 LYS J Protein B 2 FT #SUB 634 2634 THR N 117 117 LYS J Protein S 3 FT #SUB 634 2634 THR N 118 118 ALA J Protein S 2 FT #SUB 635 2635 TYR N 109 109 LEU J Protein A 2 FT #SUB 635 2635 TYR N 119 119 LYS J Protein S 2 FT #SUB 636 2636 THR N 109 109 LEU J Protein B 1 FT #SUB 637 2637 VAL N 111 111 ILE J Protein S 3 FT #SUB 691 2691 GLN N 111 111 ILE J Protein S 3 FT #SUB 691 2691 GLN N 112 112 ASP J Protein S 1 FT #SUB 691 2691 GLN N 115 115 GLY J Protein S 2 FT #SUB 692 2692 LYS N 115 115 GLY J Protein S 1 FT #SUB 693 2693 TYR N 116 116 LYS J Protein S 2 FT #SUB 800 2800 LYS N 100 3020 SER O Protein S 2 FT #SUB 800 2800 LYS N 101 3021 LEU O Protein S 1 FT #SUB 800 2800 LYS N 103 3023 ILE O Protein A 5 FT #SUB 802 2802 ASP N 12 2932 PRO O Protein S 1 FT #SUB 861 2861 LYS N 16 2936 GLU O Protein S 4 FT #SUB 867 2867 PRO N 12 2932 PRO O Protein S 2 FT #SUB 868 2868 GLU N 12 2932 PRO O Protein A 15 FT #SUB 868 2868 GLU N 13 2933 SER O Protein S 7 FT #SUB 903 2903 TYR N 12 2932 PRO O Protein S 2 FT #SUB 321 2321 GLU N 1623 1623 ASN P Protein S 2 FT #SUB 323 2323 ASN N 1650 1650 ILE P Protein B 1 FT #SUB 323 2323 ASN N 1651 1651 ILE P Protein B 2 FT #SUB 323 2323 ASN N 1652 1652 PHE P Protein A 11 FT #SUB 323 2323 ASN N 1654 1654 PRO P Protein S 2 FT #SUB 325 2325 ALA N 1649 1649 THR P Protein B 2 FT #SUB 325 2325 ALA N 1650 1650 ILE P Protein B 1 FT #SUB 326 2326 ILE N 1622 1622 GLN P Protein S 1 FT #SUB 326 2326 ILE N 1650 1650 ILE P Protein A 4 FT #SUB 326 2326 ILE N 1652 1652 PHE P Protein S 2 FT #SUB 327 2327 GLU N 1648 1648 PRO P Protein S 3 FT #SUB 327 2327 GLU N 1649 1649 THR P Protein S 8 FT #SUB 327 2327 GLU N 1650 1650 ILE P Protein S 4 FT #SUB 399 2399 LEU N 1603 1603 PHE P Protein S 1 FT #SUB 399 2399 LEU N 1604 1604 ASP P Protein S 1 FT #SUB 439 2439 PHE N 1558 1558 LEU P Protein B 1 FT #SUB 440 2440 ASP N 1558 1558 LEU P Protein B 1 FT #SUB 440 2440 ASP N 1605 1605 ARG P Protein B 1 FT #SUB 440 2440 ASP N 1606 1606 LEU P Protein A 4 FT #SUB 441 2441 ARG N 1604 1604 ASP P Protein B 1 FT #SUB 442 2442 LEU N 1604 1604 ASP P Protein A 6 FT #SUB 442 2442 LEU N 1605 1605 ARG P Protein S 1 FT #SUB 458 2458 PHE N 1486 1486 LEU P Protein A 2 FT #SUB 459 2459 ASP N 1481 1481 GLU P Protein S 1 FT #SUB 461 2461 HIS N 1490 1490 ASN P Protein A 8 FT #SUB 485 2485 THR N 1485 1485 ALA P Protein S 2 FT #SUB 486 2486 ILE N 1484 1484 CYS P Protein B 2 FT #SUB 486 2486 ILE N 1485 1485 ALA P Protein B 2 FT #SUB 486 2486 ILE N 1486 1486 LEU P Protein B 3 FT #SUB 486 2486 ILE N 1487 1487 PRO P Protein A 3 FT #SUB 487 2487 VAL N 1483 1483 ASN P Protein B 1 FT #SUB 488 2488 ARG N 1483 1483 ASN P Protein A 7 FT #SUB 490 2490 PRO N 1483 1483 ASN P Protein S 3 FT #SUB 729 2729 LYS N 1245 1245 GLU P Protein B 4 FT #SUB 730 2730 LEU N 1245 1245 GLU P Protein B 1 FT #SUB 731 2731 PRO N 1245 1245 GLU P Protein S 1 FT #SUB 736 2736 TYR N 1235 1235 ILE P Protein B 1 FT #SUB 736 2736 TYR N 1236 1236 TYR P Protein S 4 FT #SUB 736 2736 TYR N 1238 1238 PRO P Protein S 4 FT #SUB 736 2736 TYR N 1245 1245 GLU P Protein S 1 FT #SUB 738 2738 ALA N 1232 1232 THR P Protein B 1 FT #SUB 738 2738 ALA N 1233 1233 SER P Protein B 1 FT #SUB 739 2739 LEU N 1207 1207 TYR P Protein S 2 FT #SUB 739 2739 LEU N 1211 1211 TYR P Protein S 4 FT #SUB 739 2739 LEU N 1234 1234 VAL P Protein A 4 FT #SUB 739 2739 LEU N 1236 1236 TYR P Protein S 2 FT #SUB 740 2740 ASP N 1232 1232 THR P Protein S 9 FT #SUB 740 2740 ASP N 1234 1234 VAL P Protein S 2 FT #SUB 741 2741 GLN N 1232 1232 THR P Protein S 5 FT #SUB 743 2743 ALA N 1208 1208 THR P Protein A 2 FT #SUB 743 2743 ALA N 1210 1210 LYS P Protein S 2 FT #SUB 744 2744 PHE N 1164 1164 ASP P Protein S 3 FT #SUB 744 2744 PHE N 1210 1210 LYS P Protein S 7 FT #SUB 744 2744 PHE N 1212 1212 HIS P Protein S 5 FT #SUB 809 2809 LEU N 1187 1187 LYS P Protein S 1 FT #SUB 809 2809 LEU N 1188 1188 PHE P Protein S 2 FT #SUB 809 2809 LEU N 1189 1189 ASP P Protein S 1 FT #SUB 847 2847 HIS N 1187 1187 LYS P Protein S 5 FT #SUB 848 2848 PHE N 1147 1147 MET P Protein B 1 FT #SUB 849 2849 ASP N 1147 1147 MET P Protein A 4 FT #SUB 849 2849 ASP N 1191 1191 PRO P Protein A 5 FT #SUB 849 2849 ASP N 1233 1233 SER P Protein S 3 FT #SUB 850 2850 ARG N 1189 1189 ASP P Protein B 1 FT #SUB 851 2851 ASN N 1189 1189 ASP P Protein S 2 FT #SUB 869 2869 ALA N 1078 1078 ARG P Protein B 1 FT #SUB 870 2870 LEU N 1074 1074 ILE P Protein B 1 FT #SUB 870 2870 LEU N 1078 1078 ARG P Protein B 7 FT #SUB 871 2871 PHE N 1070 1070 ALA P Protein S 1 FT #SUB 871 2871 PHE N 1071 1071 ASN P Protein S 2 FT #SUB 871 2871 PHE N 1074 1074 ILE P Protein S 1 FT #SUB 871 2871 PHE N 1103 1103 PHE P Protein A 4 FT #SUB 872 2872 GLU N 1078 1078 ARG P Protein B 4 FT #SUB 873 2873 HIS N 1078 1078 ARG P Protein B 2 FT #SUB 875 2875 SER N 1078 1078 ARG P Protein B 3 FT #SUB 877 2877 ILE N 1078 1078 ARG P Protein A 3 FT #SUB 899 2899 PRO N 1075 1075 GLU P Protein B 3 FT #SUB 900 2900 SER N 1075 1075 GLU P Protein A 7 FT #SUB 901 2901 LEU N 1071 1071 ASN P Protein B 1 FT #SUB 901 2901 LEU N 1073 1073 ALA P Protein B 1 FT #SUB 901 2901 LEU N 1074 1074 ILE P Protein A 5 FT #SUB 901 2901 LEU N 1075 1075 GLU P Protein A 4 FT #SUB 902 2902 ILE N 1071 1071 ASN P Protein B 1 FT #SUB 903 2903 TYR N 1071 1071 ASN P Protein B 3 FT #SUB 903 2903 TYR N 1074 1074 ILE P Protein A 2 FT #SUB 905 2905 PRO N 1071 1071 ASN P Protein S 2 FT #SUB 84 2084 ARG N 502 2502 LEU Q Protein S 3 FT #SUB 90 2090 LYS N 375 2375 GLY Q Protein S 1 FT #SUB 94 2094 ARG N 94 2094 ARG Q Protein S 5 FT #SUB 94 2094 ARG N 182 2182 TYR Q Protein A 6 FT #SUB 94 2094 ARG N 370 2370 MET Q Protein S 8 FT #SUB 96 2096 SER N 374 2374 ASN Q Protein S 2 FT #SUB 97 2097 LEU N 235 2235 PRO Q Protein S 2 FT #SUB 98 2098 GLN N 374 2374 ASN Q Protein S 3 FT #SUB 99 2099 GLU N 375 2375 GLY Q Protein S 1 FT #SUB 109 2109 ARG N 582 2582 ARG Q Protein S 1 FT #SUB 109 2109 ARG N 585 2585 GLY Q Protein B 1 FT #SUB 112 2112 LYS N 583 2583 ARG Q Protein S 2 FT #SUB 112 2112 LYS N 584 2584 HIS Q Protein B 1 FT #SUB 113 2113 ASP N 519 2519 ASN Q Protein S 3 FT #SUB 113 2113 ASP N 585 2585 GLY Q Protein A 6 FT #SUB 114 2114 ARG N 526 2526 ARG Q Protein S 4 FT #SUB 115 2115 SER N 519 2519 ASN Q Protein S 1 FT #SUB 116 2116 SER N 615 2615 TRP Q Protein S 2 FT #SUB 117 2117 ASP N 587 2587 SER Q Protein S 1 FT #SUB 121 2121 THR N 615 2615 TRP Q Protein S 3 FT #SUB 125 2125 PHE N 615 2615 TRP Q Protein S 1 FT #SUB 167 2167 ASN N 502 2502 LEU Q Protein A 2 FT #SUB 168 2168 ARG N 502 2502 LEU Q Protein B 1 FT #SUB 168 2168 ARG N 503 2503 ASN Q Protein B 5 FT #SUB 168 2168 ARG N 515 2515 ARG Q Protein S 13 FT #SUB 168 2168 ARG N 587 2587 SER Q Protein B 2 FT #SUB 169 2169 HIS N 503 2503 ASN Q Protein B 2 FT #SUB 169 2169 HIS N 519 2519 ASN Q Protein S 1 FT #SUB 169 2169 HIS N 585 2585 GLY Q Protein S 2 FT #SUB 169 2169 HIS N 587 2587 SER Q Protein S 5 FT #SUB 170 2170 GLY N 503 2503 ASN Q Protein B 2 FT #SUB 182 2182 TYR N 94 2094 ARG Q Protein S 4 FT #SUB 199 2199 PRO N 235 2235 PRO Q Protein A 2 FT #SUB 199 2199 PRO N 236 2236 ALA Q Protein B 2 FT #SUB 199 2199 PRO N 237 2237 LEU Q Protein B 2 FT #SUB 200 2200 PHE N 237 2237 LEU Q Protein A 8 FT #SUB 235 2235 PRO N 97 2097 LEU Q Protein S 2 FT #SUB 235 2235 PRO N 199 2199 PRO Q Protein B 1 FT #SUB 236 2236 ALA N 199 2199 PRO Q Protein B 2 FT #SUB 237 2237 LEU N 199 2199 PRO Q Protein B 1 FT #SUB 237 2237 LEU N 200 2200 PHE Q Protein A 7 FT #SUB 341 2341 PRO N 614 2614 ALA Q Protein S 2 FT #SUB 341 2341 PRO N 615 2615 TRP Q Protein B 1 FT #SUB 342 2342 TYR N 615 2615 TRP Q Protein B 3 FT #SUB 344 2344 LEU N 515 2515 ARG Q Protein A 7 FT #SUB 344 2344 LEU N 518 2518 GLN Q Protein S 1 FT #SUB 345 2345 ASN N 515 2515 ARG Q Protein S 2 FT #SUB 346 2346 PRO N 515 2515 ARG Q Protein S 6 FT #SUB 370 2370 MET N 94 2094 ARG Q Protein S 8 FT #SUB 374 2374 ASN N 96 2096 SER Q Protein A 3 FT #SUB 374 2374 ASN N 98 2098 GLN Q Protein S 4 FT #SUB 375 2375 GLY N 90 2090 LYS Q Protein B 1 FT #SUB 375 2375 GLY N 99 2099 GLU Q Protein B 1 FT #SUB 502 2502 LEU N 84 2084 ARG Q Protein S 3 FT #SUB 502 2502 LEU N 167 2167 ASN Q Protein S 5 FT #SUB 502 2502 LEU N 168 2168 ARG Q Protein S 5 FT #SUB 503 2503 ASN N 109 2109 ARG Q Protein S 2 FT #SUB 503 2503 ASN N 168 2168 ARG Q Protein A 3 FT #SUB 503 2503 ASN N 169 2169 HIS Q Protein S 2 FT #SUB 503 2503 ASN N 170 2170 GLY Q Protein S 2 FT #SUB 504 2504 ARG N 109 2109 ARG Q Protein S 1 FT #SUB 513 2513 GLU N 346 2346 PRO Q Protein S 1 FT #SUB 514 2514 SER N 344 2344 LEU Q Protein S 2 FT #SUB 515 2515 ARG N 168 2168 ARG Q Protein S 11 FT #SUB 515 2515 ARG N 344 2344 LEU Q Protein A 6 FT #SUB 515 2515 ARG N 345 2345 ASN Q Protein S 1 FT #SUB 515 2515 ARG N 346 2346 PRO Q Protein S 6 FT #SUB 518 2518 GLN N 115 2115 SER Q Protein A 3 FT #SUB 518 2518 GLN N 344 2344 LEU Q Protein S 1 FT #SUB 519 2519 ASN N 113 2113 ASP Q Protein S 3 FT #SUB 519 2519 ASN N 115 2115 SER Q Protein B 3 FT #SUB 522 2522 SER N 115 2115 SER Q Protein S 5 FT #SUB 526 2526 ARG N 114 2114 ARG Q Protein S 6 FT #SUB 582 2582 ARG N 109 2109 ARG Q Protein B 1 FT #SUB 583 2583 ARG N 112 2112 LYS Q Protein B 1 FT #SUB 584 2584 HIS N 112 2112 LYS Q Protein B 1 FT #SUB 585 2585 GLY N 109 2109 ARG Q Protein B 1 FT #SUB 585 2585 GLY N 113 2113 ASP Q Protein B 4 FT #SUB 585 2585 GLY N 169 2169 HIS Q Protein B 2 FT #SUB 587 2587 SER N 168 2168 ARG Q Protein A 5 FT #SUB 587 2587 SER N 169 2169 HIS Q Protein S 5 FT #SUB 614 2614 ALA N 341 2341 PRO Q Protein S 2 FT #SUB 615 2615 TRP N 116 2116 SER Q Protein S 1 FT #SUB 615 2615 TRP N 121 2121 THR Q Protein S 3 FT #SUB 615 2615 TRP N 124 2124 SER Q Protein S 1 FT #SUB 615 2615 TRP N 125 2125 PHE Q Protein S 1 FT #SUB 615 2615 TRP N 131 2131 LEU Q Protein S 2 FT #SUB 615 2615 TRP N 341 2341 PRO Q Protein S 1 FT #SUB 615 2615 TRP N 342 2342 TYR Q Protein S 2 FT #SUB 136 2136 THR N 388 388 GLY S Protein S 5 FT #SUB 136 2136 THR N 389 389 THR S Protein A 8 FT #SUB 138 2138 LYS N 389 389 THR S Protein S 1 FT #HET 126 2126 HIS N 170 3001 CUO N S 7 FT #HET 138 2138 LYS N 77 1 NAG 8 S 2 FT #HET 138 2138 LYS N 78 2 NAG 8 S 4 FT #HET 144 2144 CYS N 170 3001 CUO N S 1 FT #HET 146 2146 HIS N 170 3001 CUO N S 5 FT #HET 155 2155 HIS N 170 3001 CUO N S 6 FT #HET 267 2267 HIS N 170 3001 CUO N S 6 FT #HET 271 2271 HIS N 170 3001 CUO N S 7 FT #HET 294 2294 PHE N 170 3001 CUO N S 3 FT #HET 298 2298 HIS N 170 3001 CUO N S 8 FT #HET 405 2405 SER N 63 1 NAG 2 S 2 FT #HET 429 2429 LEU N 170 3001 CUO N S 1 FT #HET 476 2476 LEU N 64 2 NAG 2 B 1 FT #HET 480 2480 LEU N 64 2 NAG 2 S 1 FT #HET 543 2543 HIS N 171 3002 CUO N S 7 FT #HET 559 2559 CYS N 171 3002 CUO N S 1 FT #HET 561 2561 HIS N 171 3002 CUO N S 6 FT #HET 566 2566 PHE N 171 3002 CUO N S 1 FT #HET 570 2570 HIS N 171 3002 CUO N S 9 FT #HET 680 2680 HIS N 171 3002 CUO N S 6 FT #HET 684 2684 HIS N 171 3002 CUO N S 6 FT #HET 707 2707 PHE N 171 3002 CUO N S 3 FT #HET 711 2711 HIS N 171 3002 CUO N S 8 FT #MOD 472 2472 ASN N 63 1 NAG 2 S FT DISORDER 1 82 FT DISORDER 908 920 CC SEQUENCE 825 AA (ATOM); CC VRGNLVRKNV DRLSLQEINS LIHALKRMQK DRSSDGFETI ASFHALPPLC PNPTAKHRHA CC CCLHGMATFP QWHRLYVVQF EHSLNRHGAI VGVPYWDWTY PMTEVPGLLT SEKYTDPFTG CC IETFNPFNHG HISFISPETM TTREVSEHLF EQPALGKQTW LFNNIILALE QTDYCDFEVQ CC FEIVHNSIHS WLGGKELYSL NHLHYAAYDP AFFLHHSNVD RLWVVWQELQ KFRGLPAYES CC NCAIELMSQP LKPFSFGAPY NLNPVTTKYS KPSDVFNYKQ NFHYEYDMLE MNGMSIAQLE CC SYIRQERQKD RVFAGFLLEG FGSSAYATFQ VCPDVGDCYE GSHFSVLGGS TEMPWAFDRL CC YKMEITDILQ AMALKFDSHF TIKTKIVAHN GTELPESLLP EATIVRIPPS AQNLEVAIPL CC NRIRRNINSL ESRDVQNLMS ALKRLKEDES DFGFQTIAGY HGSLMCPTPE APEYACCLHG CC MPTFMHWHRV YLLHFEESMR RHGASVAVPY WDWTMPSDNL PSLLGDADYY DAWTDSVIEN CC PFLRGHIKYE DTYTVREIQP ELFALAEGQK ESTLFKDVML MFEQEDYCDF EVQAEVIHNS CC IHYLIGGHQK YAMSSLMFSS FDPIFYVHHS MVDRLWAIWQ ELQKHRKLPH DKAYCALDQM CC AFPMKPFIWE SNPNPTTRAV STPSKLFDYK SLGYDYDHLN FHGMSIGQLE ALIQKQKKAD CC RVFAGFLLHG IKISADVHLK ICIEADCQEA GVIFVLGGET EMPWHFDRNY KMDITDVLKK CC RNIPPEALFE HDSKIRLEVE IKSVDGAVLD PNSLPKPSLI YAPAK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES YAGTFAVLGGETEMPWAFDRLFRYEISDEMKKLQLTEDSKFRLTTNIIAS CC ATOM -------------------------------------------------- CC CC SEQRES NGSKVSNDIFPTPTVIFVPKKERTEQVSTTKSVRGNLVRKNVDRLSLQEI CC ATOM --------------------------------VRGNLVRKNVDRLSLQEI CC ****************** CC SEQRES NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ATOM NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ************************************************** CC SEQRES FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ATOM FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ************************************************** CC SEQRES TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ATOM TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ************************************************** CC SEQRES LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ATOM LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ************************************************** CC SEQRES VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ATOM VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ************************************************** CC SEQRES YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ATOM YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ************************************************** CC SEQRES EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ATOM EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ************************************************** CC SEQRES LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ATOM LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ************************************************** CC SEQRES PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ATOM PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ************************************************** CC SEQRES PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ATOM PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ************************************************** CC SEQRES NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ATOM NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ************************************************** CC SEQRES QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ATOM QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ************************************************** CC SEQRES SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ATOM SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ************************************************** CC SEQRES WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ATOM WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ************************************************** CC SEQRES ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ATOM ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ************************************************** CC SEQRES NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ATOM NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ************************************************** CC SEQRES LIYAPAKGLIIQQVGEYDAG CC ATOM LIYAPAK------------- CC ******* SQ SEQUENCE 920 AA; MW; CN; YAGTFAVLGG ETEMPWAFDR LFRYEISDEM KKLQLTEDSK FRLTTNIIAS NGSKVSNDIF PTPTVIFVPK KERTEQVSTT KSVRGNLVRK NVDRLSLQEI NSLIHALKRM QKDRSSDGFE TIASFHALPP LCPNPTAKHR HACCLHGMAT FPQWHRLYVV QFEHSLNRHG AIVGVPYWDW TYPMTEVPGL LTSEKYTDPF TGIETFNPFN HGHISFISPE TMTTREVSEH LFEQPALGKQ TWLFNNIILA LEQTDYCDFE VQFEIVHNSI HSWLGGKELY SLNHLHYAAY DPAFFLHHSN VDRLWVVWQE LQKFRGLPAY ESNCAIELMS QPLKPFSFGA PYNLNPVTTK YSKPSDVFNY KQNFHYEYDM LEMNGMSIAQ LESYIRQERQ KDRVFAGFLL EGFGSSAYAT FQVCPDVGDC YEGSHFSVLG GSTEMPWAFD RLYKMEITDI LQAMALKFDS HFTIKTKIVA HNGTELPESL LPEATIVRIP PSAQNLEVAI PLNRIRRNIN SLESRDVQNL MSALKRLKED ESDFGFQTIA GYHGSLMCPT PEAPEYACCL HGMPTFMHWH RVYLLHFEES MRRHGASVAV PYWDWTMPSD NLPSLLGDAD YYDAWTDSVI ENPFLRGHIK YEDTYTVREI QPELFALAEG QKESTLFKDV MLMFEQEDYC DFEVQAEVIH NSIHYLIGGH QKYAMSSLMF SSFDPIFYVH HSMVDRLWAI WQELQKHRKL PHDKAYCALD QMAFPMKPFI WESNPNPTTR AVSTPSKLFD YKSLGYDYDH LNFHGMSIGQ LEALIQKQKK ADRVFAGFLL HGIKISADVH LKICIEADCQ EAGVIFVLGG ETEMPWHFDR NYKMDITDVL KKRNIPPEAL FEHDSKIRLE VEIKSVDGAV LDPNSLPKPS LIYAPAKGLI IQQVGEYDAG // ID 4YD9O STANDARD; PRT; 394 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 12 2932 PRO O 802 2802 ASP N Protein S 1 FT #SUB 12 2932 PRO O 867 2867 PRO N Protein S 2 FT #SUB 12 2932 PRO O 868 2868 GLU N Protein A 15 FT #SUB 12 2932 PRO O 903 2903 TYR N Protein S 2 FT #SUB 13 2933 SER O 868 2868 GLU N Protein A 7 FT #SUB 16 2936 GLU O 861 2861 LYS N Protein S 4 FT #SUB 100 3020 SER O 800 2800 LYS N Protein S 2 FT #SUB 101 3021 LEU O 800 2800 LYS N Protein B 1 FT #SUB 103 3023 ILE O 800 2800 LYS N Protein S 5 FT #SUB 276 3196 THR O 416 416 ASN P Protein S 8 FT #SUB 348 3268 LYS O 1539 1539 GLN P Protein S 1 FT #SUB 352 3272 HIS O 1542 1542 ARG P Protein S 9 FT #SUB 352 3272 HIS O 1543 1543 ILE P Protein A 3 FT #SUB 354 3274 ARG O 1533 1533 ALA P Protein S 2 FT #HET 41 2961 HIS O 172 3401 CUO O S 8 FT #HET 60 2980 HIS O 172 3401 CUO O S 3 FT #HET 69 2989 HIS O 172 3401 CUO O S 13 FT #HET 121 3041 ALA O 180 2101 CUO S B 6 FT #HET 122 3042 ASP O 180 2101 CUO S A 29 FT #HET 123 3043 THR O 180 2101 CUO S B 5 FT #HET 169 3089 HIS O 172 3401 CUO O S 7 FT #HET 173 3093 HIS O 172 3401 CUO O S 5 FT #HET 196 3116 PHE O 172 3401 CUO O S 5 FT #HET 199 3119 HIS O 172 3401 CUO O S 1 FT #HET 200 3120 HIS O 172 3401 CUO O S 10 FT DISORDER 1 7 FT DISORDER 132 141 FT DISORDER 313 319 FT DISORDER 391 394 CC SEQUENCE 366 AA (ATOM); CC NSLTPSEIEN LRNALAAVQA DKTDAGYQKI ASFHGMPLSC QYPDGTAFAC CQHGMVTFPH CC WHRLYMKQME DALKAKGAKI GIPYWDWTTA FHSLPILVTE PKNNPFHHGY IDVADTKTTR CC DPRPQSFFYR QIAFALEQRD FCDFEIQFEM GHNAIHSWVG GPSPYGMSTL HYTSYDPLFY CC VHHSNTDRIW AIWQALQKYR GLPYNSANCE INKLKKPMMP FSSEDNPNEV TKAHSTGYKS CC FDYQQLNYEY DNLNFHGMTI PQLEVHLKKI QEKDRVFAGF LLRAIGQSAD VNFDVTFGGT CC FCVLGGDYEM PWAFDRLFLY DISKSLVHLR LDAHDDFDIK VTIMGIDGKS LPPNLLPSPT CC ILFKPG CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMVRKNVNSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ATOM -------NSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ******************************************* CC SEQRES DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ATOM DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ************************************************** CC SEQRES LPILVTEPKNNPFHHGYIDVADTKTTRDPRPQLFDDPEQGDQSFFYRQIA CC ATOM LPILVTEPKNNPFHHGYIDVADTKTTRDPRP----------QSFFYRQIA CC ******************************* ********* CC SEQRES FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ATOM FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ************************************************** CC SEQRES SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ATOM SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ************************************************** CC SEQRES HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ATOM HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ************************************************** CC SEQRES AIGQSADVNFDVCRKDGECTFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ATOM AIGQSADVNFDV-------TFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ************ ******************************* CC SEQRES VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPGTGKI CC ATOM VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPG---- CC **************************************** SQ SEQUENCE 394 AA; MW; CN; SMVRKNVNSL TPSEIENLRN ALAAVQADKT DAGYQKIASF HGMPLSCQYP DGTAFACCQH GMVTFPHWHR LYMKQMEDAL KAKGAKIGIP YWDWTTAFHS LPILVTEPKN NPFHHGYIDV ADTKTTRDPR PQLFDDPEQG DQSFFYRQIA FALEQRDFCD FEIQFEMGHN AIHSWVGGPS PYGMSTLHYT SYDPLFYVHH SNTDRIWAIW QALQKYRGLP YNSANCEINK LKKPMMPFSS EDNPNEVTKA HSTGYKSFDY QQLNYEYDNL NFHGMTIPQL EVHLKKIQEK DRVFAGFLLR AIGQSADVNF DVCRKDGECT FGGTFCVLGG DYEMPWAFDR LFLYDISKSL VHLRLDAHDD FDIKVTIMGI DGKSLPPNLL PSPTILFKPG TGKI // ID 4YD9P STANDARD; PRT; 2000 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 1070 1070 ALA P 871 2871 PHE N Protein B 1 FT #SUB 1071 1071 ASN P 871 2871 PHE N Protein S 2 FT #SUB 1071 1071 ASN P 901 2901 LEU N Protein B 1 FT #SUB 1071 1071 ASN P 902 2902 ILE N Protein B 1 FT #SUB 1071 1071 ASN P 903 2903 TYR N Protein A 3 FT #SUB 1071 1071 ASN P 905 2905 PRO N Protein S 2 FT #SUB 1073 1073 ALA P 901 2901 LEU N Protein B 1 FT #SUB 1074 1074 ILE P 870 2870 LEU N Protein S 1 FT #SUB 1074 1074 ILE P 871 2871 PHE N Protein S 1 FT #SUB 1074 1074 ILE P 901 2901 LEU N Protein A 5 FT #SUB 1074 1074 ILE P 903 2903 TYR N Protein S 2 FT #SUB 1075 1075 GLU P 899 2899 PRO N Protein S 3 FT #SUB 1075 1075 GLU P 900 2900 SER N Protein S 7 FT #SUB 1075 1075 GLU P 901 2901 LEU N Protein A 4 FT #SUB 1078 1078 ARG P 869 2869 ALA N Protein S 1 FT #SUB 1078 1078 ARG P 870 2870 LEU N Protein S 7 FT #SUB 1078 1078 ARG P 872 2872 GLU N Protein S 4 FT #SUB 1078 1078 ARG P 873 2873 HIS N Protein S 2 FT #SUB 1078 1078 ARG P 875 2875 SER N Protein S 3 FT #SUB 1078 1078 ARG P 877 2877 ILE N Protein S 3 FT #SUB 1103 1103 PHE P 871 2871 PHE N Protein S 4 FT #SUB 1147 1147 MET P 848 2848 PHE N Protein S 1 FT #SUB 1147 1147 MET P 849 2849 ASP N Protein S 4 FT #SUB 1164 1164 ASP P 744 2744 PHE N Protein S 3 FT #SUB 1187 1187 LYS P 809 2809 LEU N Protein S 1 FT #SUB 1187 1187 LYS P 847 2847 HIS N Protein S 5 FT #SUB 1188 1188 PHE P 809 2809 LEU N Protein B 2 FT #SUB 1189 1189 ASP P 809 2809 LEU N Protein B 1 FT #SUB 1189 1189 ASP P 850 2850 ARG N Protein B 1 FT #SUB 1189 1189 ASP P 851 2851 ASN N Protein A 2 FT #SUB 1191 1191 PRO P 849 2849 ASP N Protein S 5 FT #SUB 1207 1207 TYR P 739 2739 LEU N Protein A 2 FT #SUB 1208 1208 THR P 743 2743 ALA N Protein B 2 FT #SUB 1210 1210 LYS P 743 2743 ALA N Protein S 2 FT #SUB 1210 1210 LYS P 744 2744 PHE N Protein S 7 FT #SUB 1211 1211 TYR P 739 2739 LEU N Protein S 4 FT #SUB 1212 1212 HIS P 744 2744 PHE N Protein S 5 FT #SUB 1232 1232 THR P 738 2738 ALA N Protein B 1 FT #SUB 1232 1232 THR P 740 2740 ASP N Protein A 9 FT #SUB 1232 1232 THR P 741 2741 GLN N Protein A 5 FT #SUB 1233 1233 SER P 738 2738 ALA N Protein S 1 FT #SUB 1233 1233 SER P 849 2849 ASP N Protein S 3 FT #SUB 1234 1234 VAL P 739 2739 LEU N Protein B 4 FT #SUB 1234 1234 VAL P 740 2740 ASP N Protein A 2 FT #SUB 1235 1235 ILE P 736 2736 TYR N Protein S 1 FT #SUB 1236 1236 TYR P 736 2736 TYR N Protein A 4 FT #SUB 1236 1236 TYR P 739 2739 LEU N Protein A 2 FT #SUB 1238 1238 PRO P 736 2736 TYR N Protein S 4 FT #SUB 1245 1245 GLU P 729 2729 LYS N Protein S 4 FT #SUB 1245 1245 GLU P 730 2730 LEU N Protein S 1 FT #SUB 1245 1245 GLU P 731 2731 PRO N Protein S 1 FT #SUB 1245 1245 GLU P 736 2736 TYR N Protein S 1 FT #SUB 1481 1481 GLU P 459 2459 ASP N Protein S 1 FT #SUB 1483 1483 ASN P 487 2487 VAL N Protein B 1 FT #SUB 1483 1483 ASN P 488 2488 ARG N Protein A 7 FT #SUB 1483 1483 ASN P 490 2490 PRO N Protein S 3 FT #SUB 1484 1484 CYS P 486 2486 ILE N Protein B 2 FT #SUB 1485 1485 ALA P 485 2485 THR N Protein A 2 FT #SUB 1485 1485 ALA P 486 2486 ILE N Protein B 2 FT #SUB 1486 1486 LEU P 458 2458 PHE N Protein S 2 FT #SUB 1486 1486 LEU P 486 2486 ILE N Protein A 3 FT #SUB 1487 1487 PRO P 486 2486 ILE N Protein S 3 FT #SUB 1490 1490 ASN P 461 2461 HIS N Protein S 8 FT #SUB 1558 1558 LEU P 439 2439 PHE N Protein S 1 FT #SUB 1558 1558 LEU P 440 2440 ASP N Protein S 1 FT #SUB 1603 1603 PHE P 399 2399 LEU N Protein B 1 FT #SUB 1604 1604 ASP P 399 2399 LEU N Protein B 1 FT #SUB 1604 1604 ASP P 441 2441 ARG N Protein B 1 FT #SUB 1604 1604 ASP P 442 2442 LEU N Protein A 6 FT #SUB 1605 1605 ARG P 440 2440 ASP N Protein B 1 FT #SUB 1605 1605 ARG P 442 2442 LEU N Protein S 1 FT #SUB 1606 1606 LEU P 440 2440 ASP N Protein A 4 FT #SUB 1622 1622 GLN P 326 2326 ILE N Protein B 1 FT #SUB 1623 1623 ASN P 321 2321 GLU N Protein S 2 FT #SUB 1648 1648 PRO P 327 2327 GLU N Protein B 3 FT #SUB 1649 1649 THR P 325 2325 ALA N Protein S 2 FT #SUB 1649 1649 THR P 327 2327 GLU N Protein A 8 FT #SUB 1650 1650 ILE P 323 2323 ASN N Protein B 1 FT #SUB 1650 1650 ILE P 325 2325 ALA N Protein B 1 FT #SUB 1650 1650 ILE P 326 2326 ILE N Protein B 4 FT #SUB 1650 1650 ILE P 327 2327 GLU N Protein A 4 FT #SUB 1651 1651 ILE P 323 2323 ASN N Protein B 2 FT #SUB 1652 1652 PHE P 323 2323 ASN N Protein A 11 FT #SUB 1652 1652 PHE P 326 2326 ILE N Protein A 2 FT #SUB 1654 1654 PRO P 323 2323 ASN N Protein S 2 FT #SUB 416 416 ASN P 276 3196 THR O Protein S 8 FT #SUB 1533 1533 ALA P 354 3274 ARG O Protein B 2 FT #SUB 1539 1539 GLN P 348 3268 LYS O Protein S 1 FT #SUB 1542 1542 ARG P 352 3272 HIS O Protein S 9 FT #SUB 1543 1543 ILE P 352 3272 HIS O Protein S 3 FT #SUB 328 328 LYS P 1571 1571 TYR S Protein S 5 FT #SUB 328 328 LYS P 1583 1583 CYS S Protein B 1 FT #SUB 328 328 LYS P 1631 1631 GLU S Protein S 3 FT #SUB 329 329 ARG P 1573 1573 CYS S Protein S 1 FT #SUB 329 329 ARG P 1574 1574 VAL S Protein S 7 FT #SUB 329 329 ARG P 1576 1576 THR S Protein S 2 FT #SUB 329 329 ARG P 1582 1582 ASN S Protein A 7 FT #SUB 329 329 ARG P 1583 1583 CYS S Protein A 20 FT #SUB 329 329 ARG P 1584 1584 GLY S Protein S 4 FT #SUB 329 329 ARG P 1585 1585 ASN S Protein S 2 FT #SUB 330 330 GLN P 1582 1582 ASN S Protein B 3 FT #SUB 331 331 SER P 1576 1576 THR S Protein S 1 FT #SUB 331 331 SER P 1581 1581 GLU S Protein A 3 FT #SUB 331 331 SER P 1582 1582 ASN S Protein B 2 FT #SUB 333 333 ASN P 1581 1581 GLU S Protein A 6 FT #SUB 334 334 CYS P 1581 1581 GLU S Protein B 1 FT #SUB 335 335 ASP P 1578 1578 ILE S Protein S 7 FT #SUB 335 335 ASP P 1579 1579 GLY S Protein S 3 FT #SUB 335 335 ASP P 1581 1581 GLU S Protein B 1 FT #SUB 472 472 ASP P 1638 1638 SER S Protein S 8 FT #SUB 472 472 ASP P 1639 1639 LYS S Protein A 2 FT #SUB 1362 1362 ALA P 1378 1378 PHE S Protein S 1 FT #SUB 1365 1365 TYR P 1437 1437 ARG S Protein S 14 FT #SUB 1371 1371 GLN P 1439 1439 VAL S Protein S 1 FT #SUB 1372 1372 ILE P 1390 1390 ASP S Protein S 1 FT #SUB 1372 1372 ILE P 1392 1392 ASP S Protein S 1 FT #SUB 1372 1372 ILE P 1438 1438 ASP S Protein S 1 FT #SUB 1390 1390 ASP P 1363 1363 THR S Protein S 2 FT #SUB 1390 1390 ASP P 1372 1372 ILE S Protein S 3 FT #SUB 1437 1437 ARG P 1365 1365 TYR S Protein S 10 FT #SUB 1438 1438 ASP P 1372 1372 ILE S Protein S 3 FT #SUB 1439 1439 VAL P 1371 1371 GLN S Protein A 4 FT #SUB 1571 1571 TYR P 329 329 ARG S Protein S 7 FT #SUB 1579 1579 GLY P 335 335 ASP S Protein B 5 FT #SUB 1581 1581 GLU P 332 332 ASP S Protein S 3 FT #SUB 1581 1581 GLU P 333 333 ASN S Protein S 2 FT #SUB 1582 1582 ASN P 330 330 GLN S Protein B 6 FT #SUB 1582 1582 ASN P 331 331 SER S Protein B 4 FT #SUB 1583 1583 CYS P 329 329 ARG S Protein A 3 FT #SUB 1583 1583 CYS P 330 330 GLN S Protein B 1 FT #SUB 1584 1584 GLY P 329 329 ARG S Protein B 5 FT #SUB 1631 1631 GLU P 329 329 ARG S Protein S 5 FT #SUB 1638 1638 SER P 472 472 ASP S Protein A 5 FT #SUB 1639 1639 LYS P 471 471 PRO S Protein S 4 FT #SUB 1639 1639 LYS P 472 472 ASP S Protein A 7 FT #SUB 1639 1639 LYS P 473 473 ALA S Protein S 6 FT #SUB 106 106 GLN P 609 2609 ALA T Protein B 1 FT #SUB 107 107 HIS P 625 2625 LEU T Protein S 2 FT #SUB 108 108 PRO P 621 2621 GLU T Protein S 1 FT #SUB 108 108 PRO P 626 2626 ARG T Protein S 1 FT #SUB 109 109 LEU P 626 2626 ARG T Protein S 1 FT #SUB 109 109 LEU P 635 2635 TYR T Protein S 1 FT #SUB 109 109 LEU P 636 2636 THR T Protein S 1 FT #SUB 111 111 ILE P 637 2637 VAL T Protein S 3 FT #SUB 111 111 ILE P 638 2638 ARG T Protein S 1 FT #SUB 111 111 ILE P 691 2691 GLN T Protein S 2 FT #SUB 112 112 ASP P 691 2691 GLN T Protein B 1 FT #SUB 116 116 LYS P 693 2693 TYR T Protein B 2 FT #SUB 117 117 LYS P 632 2632 GLU T Protein S 2 FT #SUB 117 117 LYS P 634 2634 THR T Protein S 5 FT #SUB 117 117 LYS P 693 2693 TYR T Protein S 3 FT #SUB 118 118 ALA P 634 2634 THR T Protein B 1 FT #SUB 118 118 ALA P 637 2637 VAL T Protein S 1 FT #SUB 136 136 ALA P 619 2619 VAL T Protein S 2 FT #SUB 137 137 ARG P 610 2610 ASP T Protein B 1 FT #SUB 138 138 ALA P 612 2612 TYR T Protein S 2 FT #SUB 138 138 ALA P 619 2619 VAL T Protein A 3 FT #SUB 189 189 ARG P 612 2612 TYR T Protein S 5 FT #SUB 189 189 ARG P 617 2617 ASP T Protein B 2 FT #SUB 190 190 PHE P 617 2617 ASP T Protein B 1 FT #SUB 190 190 PHE P 619 2619 VAL T Protein S 4 FT #SUB 388 388 GLY P 136 2136 THR W Protein B 5 FT #SUB 389 389 THR P 136 2136 THR W Protein A 10 FT #HET 41 41 HIS P 173 5001 CUO P S 5 FT #HET 58 58 CYS P 173 5001 CUO P S 1 FT #HET 60 60 HIS P 173 5001 CUO P S 6 FT #HET 65 65 PHE P 173 5001 CUO P S 1 FT #HET 69 69 HIS P 173 5001 CUO P S 9 FT #HET 179 179 HIS P 173 5001 CUO P S 6 FT #HET 183 183 HIS P 173 5001 CUO P S 6 FT #HET 206 206 PHE P 173 5001 CUO P S 2 FT #HET 210 210 HIS P 173 5001 CUO P S 9 FT #HET 313 313 THR P 65 1 NAG 3 S 3 FT #HET 462 462 HIS P 174 5002 CUO P S 8 FT #HET 480 480 CYS P 174 5002 CUO P S 1 FT #HET 482 482 HIS P 174 5002 CUO P S 6 FT #HET 491 491 HIS P 174 5002 CUO P S 8 FT #HET 603 603 HIS P 174 5002 CUO P S 4 FT #HET 607 607 HIS P 174 5002 CUO P S 7 FT #HET 630 630 PHE P 174 5002 CUO P S 2 FT #HET 634 634 HIS P 174 5002 CUO P S 10 FT #HET 740 740 THR P 68 1 NAG 4 S 1 FT #HET 763 763 LEU P 174 5002 CUO P S 1 FT #HET 804 804 ASP P 68 1 NAG 4 S 8 FT #HET 808 808 THR P 68 1 NAG 4 S 4 FT #HET 808 808 THR P 69 2 NAG 4 S 1 FT #HET 809 809 ALA P 69 2 NAG 4 B 1 FT #HET 810 810 LEU P 68 1 NAG 4 S 2 FT #HET 877 877 HIS P 175 5003 CUO P S 7 FT #HET 895 895 CYS P 175 5003 CUO P S 1 FT #HET 897 897 HIS P 175 5003 CUO P S 4 FT #HET 906 906 HIS P 175 5003 CUO P S 7 FT #HET 1015 1015 HIS P 175 5003 CUO P S 6 FT #HET 1019 1019 HIS P 175 5003 CUO P S 7 FT #HET 1042 1042 PHE P 175 5003 CUO P S 4 FT #HET 1046 1046 HIS P 175 5003 CUO P S 10 FT #HET 1294 1294 HIS P 176 5004 CUO P S 7 FT #HET 1298 1298 ALA P 71 2 NAG 5 B 3 FT #HET 1299 1299 GLN P 70 1 NAG 5 A 8 FT #HET 1301 1301 PRO P 70 1 NAG 5 S 3 FT #HET 1308 1308 VAL P 71 2 NAG 5 S 2 FT #HET 1312 1312 CYS P 176 5004 CUO P S 1 FT #HET 1314 1314 HIS P 176 5004 CUO P S 4 FT #HET 1323 1323 HIS P 176 5004 CUO P S 9 FT #HET 1427 1427 HIS P 176 5004 CUO P S 6 FT #HET 1431 1431 HIS P 176 5004 CUO P S 8 FT #HET 1454 1454 PHE P 176 5004 CUO P S 4 FT #HET 1458 1458 HIS P 176 5004 CUO P S 9 FT #HET 1494 1494 ARG P 70 1 NAG 5 S 1 FT #HET 1500 1500 THR P 70 1 NAG 5 B 6 FT #HET 1501 1501 ALA P 70 1 NAG 5 B 1 FT #HET 1563 1563 LYS P 74 2 NAG 6 S 4 FT #HET 1638 1638 SER P 73 1 NAG 6 A 7 FT #HET 1639 1639 LYS P 73 1 NAG 6 A 9 FT #HET 1641 1641 THR P 74 2 NAG 6 S 1 FT #HET 1644 1644 ILE P 74 2 NAG 6 S 1 FT #MOD 387 387 ASN P 65 1 NAG 3 S FT #MOD 806 806 ASN P 68 1 NAG 4 S FT #MOD 1498 1498 ASN P 70 1 NAG 5 S FT #MOD 1636 1636 ASN P 73 1 NAG 6 S FT DISORDER 1657 2000 CC SEQUENCE 1656 AA (ATOM); CC NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH CC GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK CC NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN CC PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF CC HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA CC RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH CC EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS CC SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC CC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET CC KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE CC VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA CC NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV CC EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE CC ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR CC KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA CC TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF CC SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA CC LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP CC MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN CC RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM CC DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR CC KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP CC HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG CC KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH CC SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST CC ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH CC GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL CC NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSD CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ATOM NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ************************************************** CC SEQRES DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ATOM DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ************************************************** CC SEQRES LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ATOM LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ************************************************** CC SEQRES QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ATOM QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ************************************************** CC SEQRES SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ATOM SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ************************************************** CC SEQRES TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ATOM TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ************************************************** CC SEQRES RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ATOM RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ************************************************** CC SEQRES PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ATOM PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ************************************************** CC SEQRES VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ATOM VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ************************************************** CC SEQRES SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ATOM SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ************************************************** CC SEQRES ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ATOM ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ************************************************** CC SEQRES EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ATOM EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ************************************************** CC SEQRES VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ATOM VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ************************************************** CC SEQRES RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ATOM RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ************************************************** CC SEQRES YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ATOM YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ************************************************** CC SEQRES GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ATOM GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ************************************************** CC SEQRES DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ATOM DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ************************************************** CC SEQRES MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ATOM MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ************************************************** CC SEQRES TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ATOM TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ************************************************** CC SEQRES NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ATOM NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ************************************************** CC SEQRES QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ATOM QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ************************************************** CC SEQRES RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ATOM RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ************************************************** CC SEQRES VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ATOM VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ************************************************** CC SEQRES LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ATOM LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ************************************************** CC SEQRES DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ATOM DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ************************************************** CC SEQRES SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ATOM SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ************************************************** CC SEQRES PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ATOM PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ************************************************** CC SEQRES PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ATOM PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ************************************************** CC SEQRES RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ATOM RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ************************************************** CC SEQRES DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ATOM DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ************************************************** CC SEQRES ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ATOM ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ************************************************** CC SEQRES DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ATOM DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ************************************************** CC SEQRES WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ATOM WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ************************************************** CC SEQRES IFVPSDVEFEEDTWRDVVTSANRIRRNLKDLSKEDMFSLRAAFKRMTDDG CC ATOM IFVPSD-------------------------------------------- CC ****** CC SEQRES RYEEIAAFHGLPAQCPNADGSNIHTCCLHGMPTFPHWHRLYLSLVENELL CC ATOM -------------------------------------------------- CC CC SEQRES ARGSDVAVPYWDWIEPFDSLPGLISDETYKHPKTNEDIENPFHHGKISFA CC ATOM -------------------------------------------------- CC CC SEQRES DAVTVRKPRDQLFNNRYLYEHALFAFEHTDFCDFEVHFEVLHNSIHSWIG CC ATOM -------------------------------------------------- CC CC SEQRES GPNPHSMSSLDFAAYDPIFFLHHSTVDRLWAIWQDLQRYRKLDYNVANCA CC ATOM -------------------------------------------------- CC CC SEQRES LNLLNDPMRPFNNKTANQDHLTFTNSRPNDVFDYQNSLNYKFDSLSFSGL CC ATOM -------------------------------------------------- CC CC SEQRES SIPRLDDLLESRQSHDRVFAGFWLSGIKASADVNIHICVPIGVEHEDCDN CC ATOM -------------------------------------------------- CC CC SEQRES CC ATOM CC SQ SEQUENCE 2000 AA; MW; CN; NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSDVEFE EDTWRDVVTS ANRIRRNLKD LSKEDMFSLR AAFKRMTDDG RYEEIAAFHG LPAQCPNADG SNIHTCCLHG MPTFPHWHRL YLSLVENELL ARGSDVAVPY WDWIEPFDSL PGLISDETYK HPKTNEDIEN PFHHGKISFA DAVTVRKPRD QLFNNRYLYE HALFAFEHTD FCDFEVHFEV LHNSIHSWIG GPNPHSMSSL DFAAYDPIFF LHHSTVDRLW AIWQDLQRYR KLDYNVANCA LNLLNDPMRP FNNKTANQDH LTFTNSRPND VFDYQNSLNY KFDSLSFSGL SIPRLDDLLE SRQSHDRVFA GFWLSGIKAS ADVNIHICVP IGVEHEDCDN // ID 4YD9Q STANDARD; PRT; 920 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 136 2136 THR Q 189 189 ARG J Protein S 1 FT #SUB 136 2136 THR Q 388 388 GLY J Protein S 6 FT #SUB 136 2136 THR Q 389 389 THR J Protein A 9 FT #SUB 138 2138 LYS Q 389 389 THR J Protein S 1 FT #SUB 321 2321 GLU Q 1623 1623 ASN M Protein S 1 FT #SUB 323 2323 ASN Q 1651 1651 ILE M Protein B 2 FT #SUB 323 2323 ASN Q 1652 1652 PHE M Protein A 9 FT #SUB 323 2323 ASN Q 1654 1654 PRO M Protein S 2 FT #SUB 324 2324 CYS Q 1650 1650 ILE M Protein B 1 FT #SUB 325 2325 ALA Q 1650 1650 ILE M Protein B 3 FT #SUB 326 2326 ILE Q 1622 1622 GLN M Protein S 1 FT #SUB 326 2326 ILE Q 1650 1650 ILE M Protein A 4 FT #SUB 327 2327 GLU Q 1648 1648 PRO M Protein S 2 FT #SUB 327 2327 GLU Q 1649 1649 THR M Protein S 8 FT #SUB 327 2327 GLU Q 1650 1650 ILE M Protein S 1 FT #SUB 399 2399 LEU Q 1603 1603 PHE M Protein S 1 FT #SUB 440 2440 ASP Q 1558 1558 LEU M Protein B 1 FT #SUB 440 2440 ASP Q 1604 1604 ASP M Protein B 1 FT #SUB 440 2440 ASP Q 1605 1605 ARG M Protein B 2 FT #SUB 440 2440 ASP Q 1606 1606 LEU M Protein A 6 FT #SUB 441 2441 ARG Q 1604 1604 ASP M Protein B 2 FT #SUB 442 2442 LEU Q 1604 1604 ASP M Protein A 7 FT #SUB 458 2458 PHE Q 1483 1483 ASN M Protein S 1 FT #SUB 458 2458 PHE Q 1486 1486 LEU M Protein B 1 FT #SUB 459 2459 ASP Q 1481 1481 GLU M Protein S 1 FT #SUB 459 2459 ASP Q 1490 1490 ASN M Protein B 1 FT #SUB 461 2461 HIS Q 1490 1490 ASN M Protein A 3 FT #SUB 486 2486 ILE Q 1484 1484 CYS M Protein B 2 FT #SUB 486 2486 ILE Q 1486 1486 LEU M Protein A 5 FT #SUB 486 2486 ILE Q 1487 1487 PRO M Protein A 3 FT #SUB 487 2487 VAL Q 1483 1483 ASN M Protein B 2 FT #SUB 487 2487 VAL Q 1484 1484 CYS M Protein A 3 FT #SUB 488 2488 ARG Q 1483 1483 ASN M Protein A 8 FT #SUB 488 2488 ARG Q 1486 1486 LEU M Protein B 1 FT #SUB 729 2729 LYS Q 1245 1245 GLU M Protein B 3 FT #SUB 731 2731 PRO Q 1245 1245 GLU M Protein S 1 FT #SUB 736 2736 TYR Q 1207 1207 TYR M Protein S 1 FT #SUB 736 2736 TYR Q 1235 1235 ILE M Protein B 4 FT #SUB 736 2736 TYR Q 1236 1236 TYR M Protein A 6 FT #SUB 736 2736 TYR Q 1238 1238 PRO M Protein S 1 FT #SUB 736 2736 TYR Q 1245 1245 GLU M Protein S 4 FT #SUB 737 2737 CYS Q 1234 1234 VAL M Protein B 2 FT #SUB 738 2738 ALA Q 1233 1233 SER M Protein A 4 FT #SUB 738 2738 ALA Q 1234 1234 VAL M Protein B 3 FT #SUB 739 2739 LEU Q 1207 1207 TYR M Protein S 6 FT #SUB 739 2739 LEU Q 1211 1211 TYR M Protein S 4 FT #SUB 739 2739 LEU Q 1234 1234 VAL M Protein A 4 FT #SUB 740 2740 ASP Q 1232 1232 THR M Protein S 12 FT #SUB 740 2740 ASP Q 1233 1233 SER M Protein S 2 FT #SUB 743 2743 ALA Q 1208 1208 THR M Protein S 1 FT #SUB 744 2744 PHE Q 1210 1210 LYS M Protein S 4 FT #SUB 744 2744 PHE Q 1212 1212 HIS M Protein S 7 FT #SUB 809 2809 LEU Q 1187 1187 LYS M Protein S 1 FT #SUB 809 2809 LEU Q 1188 1188 PHE M Protein S 1 FT #SUB 809 2809 LEU Q 1189 1189 ASP M Protein S 1 FT #SUB 847 2847 HIS Q 1187 1187 LYS M Protein S 5 FT #SUB 847 2847 HIS Q 1230 1230 LEU M Protein S 1 FT #SUB 848 2848 PHE Q 1147 1147 MET M Protein B 1 FT #SUB 849 2849 ASP Q 1147 1147 MET M Protein A 3 FT #SUB 849 2849 ASP Q 1191 1191 PRO M Protein A 3 FT #SUB 849 2849 ASP Q 1233 1233 SER M Protein S 1 FT #SUB 851 2851 ASN Q 1189 1189 ASP M Protein S 2 FT #SUB 869 2869 ALA Q 1078 1078 ARG M Protein B 1 FT #SUB 870 2870 LEU Q 1074 1074 ILE M Protein B 1 FT #SUB 870 2870 LEU Q 1078 1078 ARG M Protein B 6 FT #SUB 871 2871 PHE Q 1071 1071 ASN M Protein S 1 FT #SUB 871 2871 PHE Q 1074 1074 ILE M Protein S 1 FT #SUB 871 2871 PHE Q 1078 1078 ARG M Protein B 1 FT #SUB 871 2871 PHE Q 1103 1103 PHE M Protein A 7 FT #SUB 872 2872 GLU Q 1078 1078 ARG M Protein B 5 FT #SUB 873 2873 HIS Q 1101 1101 VAL M Protein S 1 FT #SUB 875 2875 SER Q 1078 1078 ARG M Protein B 3 FT #SUB 877 2877 ILE Q 1078 1078 ARG M Protein A 8 FT #SUB 898 2898 LYS Q 1075 1075 GLU M Protein S 1 FT #SUB 898 2898 LYS Q 1079 1079 LYS M Protein S 1 FT #SUB 899 2899 PRO Q 1075 1075 GLU M Protein B 3 FT #SUB 900 2900 SER Q 1073 1073 ALA M Protein S 1 FT #SUB 900 2900 SER Q 1075 1075 GLU M Protein A 4 FT #SUB 901 2901 LEU Q 1071 1071 ASN M Protein B 1 FT #SUB 901 2901 LEU Q 1073 1073 ALA M Protein B 1 FT #SUB 901 2901 LEU Q 1074 1074 ILE M Protein A 7 FT #SUB 902 2902 ILE Q 1071 1071 ASN M Protein A 4 FT #SUB 903 2903 TYR Q 1071 1071 ASN M Protein A 6 FT #SUB 903 2903 TYR Q 1074 1074 ILE M Protein B 1 FT #SUB 84 2084 ARG Q 502 2502 LEU N Protein S 3 FT #SUB 90 2090 LYS Q 375 2375 GLY N Protein S 1 FT #SUB 94 2094 ARG Q 94 2094 ARG N Protein S 5 FT #SUB 94 2094 ARG Q 182 2182 TYR N Protein A 4 FT #SUB 94 2094 ARG Q 370 2370 MET N Protein S 8 FT #SUB 96 2096 SER Q 374 2374 ASN N Protein S 3 FT #SUB 97 2097 LEU Q 235 2235 PRO N Protein S 2 FT #SUB 98 2098 GLN Q 374 2374 ASN N Protein S 4 FT #SUB 99 2099 GLU Q 375 2375 GLY N Protein S 1 FT #SUB 109 2109 ARG Q 503 2503 ASN N Protein S 2 FT #SUB 109 2109 ARG Q 504 2504 ARG N Protein S 1 FT #SUB 109 2109 ARG Q 582 2582 ARG N Protein S 1 FT #SUB 109 2109 ARG Q 585 2585 GLY N Protein B 1 FT #SUB 112 2112 LYS Q 583 2583 ARG N Protein S 1 FT #SUB 112 2112 LYS Q 584 2584 HIS N Protein B 1 FT #SUB 113 2113 ASP Q 519 2519 ASN N Protein S 3 FT #SUB 113 2113 ASP Q 585 2585 GLY N Protein A 4 FT #SUB 114 2114 ARG Q 526 2526 ARG N Protein S 6 FT #SUB 115 2115 SER Q 518 2518 GLN N Protein S 3 FT #SUB 115 2115 SER Q 519 2519 ASN N Protein S 3 FT #SUB 115 2115 SER Q 522 2522 SER N Protein A 5 FT #SUB 116 2116 SER Q 615 2615 TRP N Protein S 1 FT #SUB 121 2121 THR Q 615 2615 TRP N Protein S 3 FT #SUB 124 2124 SER Q 615 2615 TRP N Protein S 1 FT #SUB 125 2125 PHE Q 615 2615 TRP N Protein S 1 FT #SUB 131 2131 LEU Q 615 2615 TRP N Protein S 2 FT #SUB 167 2167 ASN Q 502 2502 LEU N Protein A 5 FT #SUB 168 2168 ARG Q 502 2502 LEU N Protein A 5 FT #SUB 168 2168 ARG Q 503 2503 ASN N Protein B 3 FT #SUB 168 2168 ARG Q 515 2515 ARG N Protein S 11 FT #SUB 168 2168 ARG Q 587 2587 SER N Protein A 5 FT #SUB 169 2169 HIS Q 503 2503 ASN N Protein B 2 FT #SUB 169 2169 HIS Q 585 2585 GLY N Protein S 2 FT #SUB 169 2169 HIS Q 587 2587 SER N Protein S 5 FT #SUB 170 2170 GLY Q 503 2503 ASN N Protein B 2 FT #SUB 182 2182 TYR Q 94 2094 ARG N Protein S 6 FT #SUB 199 2199 PRO Q 235 2235 PRO N Protein B 1 FT #SUB 199 2199 PRO Q 236 2236 ALA N Protein B 2 FT #SUB 199 2199 PRO Q 237 2237 LEU N Protein B 1 FT #SUB 200 2200 PHE Q 237 2237 LEU N Protein B 7 FT #SUB 235 2235 PRO Q 97 2097 LEU N Protein S 2 FT #SUB 235 2235 PRO Q 199 2199 PRO N Protein B 2 FT #SUB 236 2236 ALA Q 199 2199 PRO N Protein B 2 FT #SUB 237 2237 LEU Q 199 2199 PRO N Protein B 2 FT #SUB 237 2237 LEU Q 200 2200 PHE N Protein A 8 FT #SUB 341 2341 PRO Q 614 2614 ALA N Protein S 2 FT #SUB 341 2341 PRO Q 615 2615 TRP N Protein B 1 FT #SUB 342 2342 TYR Q 615 2615 TRP N Protein B 2 FT #SUB 344 2344 LEU Q 514 2514 SER N Protein S 2 FT #SUB 344 2344 LEU Q 515 2515 ARG N Protein A 6 FT #SUB 344 2344 LEU Q 518 2518 GLN N Protein S 1 FT #SUB 345 2345 ASN Q 515 2515 ARG N Protein S 1 FT #SUB 346 2346 PRO Q 513 2513 GLU N Protein S 1 FT #SUB 346 2346 PRO Q 515 2515 ARG N Protein S 6 FT #SUB 370 2370 MET Q 94 2094 ARG N Protein S 8 FT #SUB 374 2374 ASN Q 96 2096 SER N Protein A 2 FT #SUB 374 2374 ASN Q 98 2098 GLN N Protein S 3 FT #SUB 375 2375 GLY Q 90 2090 LYS N Protein B 1 FT #SUB 375 2375 GLY Q 99 2099 GLU N Protein B 1 FT #SUB 502 2502 LEU Q 84 2084 ARG N Protein S 3 FT #SUB 502 2502 LEU Q 167 2167 ASN N Protein S 2 FT #SUB 502 2502 LEU Q 168 2168 ARG N Protein S 1 FT #SUB 503 2503 ASN Q 168 2168 ARG N Protein A 5 FT #SUB 503 2503 ASN Q 169 2169 HIS N Protein S 2 FT #SUB 503 2503 ASN Q 170 2170 GLY N Protein S 2 FT #SUB 515 2515 ARG Q 168 2168 ARG N Protein S 13 FT #SUB 515 2515 ARG Q 344 2344 LEU N Protein A 7 FT #SUB 515 2515 ARG Q 345 2345 ASN N Protein S 2 FT #SUB 515 2515 ARG Q 346 2346 PRO N Protein S 6 FT #SUB 518 2518 GLN Q 344 2344 LEU N Protein S 1 FT #SUB 519 2519 ASN Q 113 2113 ASP N Protein S 3 FT #SUB 519 2519 ASN Q 115 2115 SER N Protein B 1 FT #SUB 519 2519 ASN Q 169 2169 HIS N Protein S 1 FT #SUB 526 2526 ARG Q 114 2114 ARG N Protein S 4 FT #SUB 582 2582 ARG Q 109 2109 ARG N Protein B 1 FT #SUB 583 2583 ARG Q 112 2112 LYS N Protein B 2 FT #SUB 584 2584 HIS Q 112 2112 LYS N Protein B 1 FT #SUB 585 2585 GLY Q 109 2109 ARG N Protein B 1 FT #SUB 585 2585 GLY Q 113 2113 ASP N Protein B 6 FT #SUB 585 2585 GLY Q 169 2169 HIS N Protein B 2 FT #SUB 587 2587 SER Q 117 2117 ASP N Protein S 1 FT #SUB 587 2587 SER Q 168 2168 ARG N Protein A 2 FT #SUB 587 2587 SER Q 169 2169 HIS N Protein S 5 FT #SUB 614 2614 ALA Q 341 2341 PRO N Protein S 2 FT #SUB 615 2615 TRP Q 116 2116 SER N Protein S 2 FT #SUB 615 2615 TRP Q 121 2121 THR N Protein S 3 FT #SUB 615 2615 TRP Q 125 2125 PHE N Protein S 1 FT #SUB 615 2615 TRP Q 341 2341 PRO N Protein S 1 FT #SUB 615 2615 TRP Q 342 2342 TYR N Protein S 3 FT #SUB 797 2797 LYS Q 108 3028 PRO R Protein S 1 FT #SUB 800 2800 LYS Q 100 3020 SER R Protein S 1 FT #SUB 800 2800 LYS Q 101 3021 LEU R Protein S 1 FT #SUB 800 2800 LYS Q 103 3023 ILE R Protein A 5 FT #SUB 802 2802 ASP Q 12 2932 PRO R Protein S 3 FT #SUB 861 2861 LYS Q 16 2936 GLU R Protein S 6 FT #SUB 867 2867 PRO Q 12 2932 PRO R Protein S 10 FT #SUB 868 2868 GLU Q 11 2931 THR R Protein S 2 FT #SUB 868 2868 GLU Q 12 2932 PRO R Protein S 11 FT #SUB 868 2868 GLU Q 13 2933 SER R Protein S 7 FT #SUB 868 2868 GLU Q 14 2934 GLU R Protein S 1 FT #SUB 903 2903 TYR Q 12 2932 PRO R Protein S 2 FT #SUB 609 2609 ALA Q 106 106 GLN S Protein S 1 FT #SUB 610 2610 ASP Q 124 124 TYR S Protein S 2 FT #SUB 610 2610 ASP Q 137 137 ARG S Protein S 1 FT #SUB 612 2612 TYR Q 138 138 ALA S Protein S 1 FT #SUB 612 2612 TYR Q 189 189 ARG S Protein S 7 FT #SUB 617 2617 ASP Q 189 189 ARG S Protein S 4 FT #SUB 617 2617 ASP Q 190 190 PHE S Protein B 2 FT #SUB 619 2619 VAL Q 136 136 ALA S Protein S 2 FT #SUB 619 2619 VAL Q 138 138 ALA S Protein S 2 FT #SUB 619 2619 VAL Q 190 190 PHE S Protein S 3 FT #SUB 625 2625 LEU Q 107 107 HIS S Protein S 1 FT #SUB 626 2626 ARG Q 108 108 PRO S Protein S 3 FT #SUB 626 2626 ARG Q 109 109 LEU S Protein S 4 FT #SUB 632 2632 GLU Q 117 117 LYS S Protein B 1 FT #SUB 634 2634 THR Q 117 117 LYS S Protein S 3 FT #SUB 634 2634 THR Q 118 118 ALA S Protein S 2 FT #SUB 635 2635 TYR Q 109 109 LEU S Protein S 1 FT #SUB 635 2635 TYR Q 118 118 ALA S Protein B 2 FT #SUB 636 2636 THR Q 109 109 LEU S Protein B 1 FT #SUB 637 2637 VAL Q 111 111 ILE S Protein S 3 FT #SUB 691 2691 GLN Q 111 111 ILE S Protein S 3 FT #SUB 691 2691 GLN Q 112 112 ASP S Protein S 1 FT #SUB 693 2693 TYR Q 116 116 LYS S Protein S 1 FT #SUB 693 2693 TYR Q 117 117 LYS S Protein S 2 FT #HET 126 2126 HIS Q 178 3001 CUO Q S 5 FT #HET 138 2138 LYS Q 38 1 NAG t S 2 FT #HET 138 2138 LYS Q 39 2 NAG t S 2 FT #HET 144 2144 CYS Q 178 3001 CUO Q S 1 FT #HET 146 2146 HIS Q 178 3001 CUO Q S 7 FT #HET 151 2151 PHE Q 178 3001 CUO Q S 1 FT #HET 155 2155 HIS Q 178 3001 CUO Q S 9 FT #HET 267 2267 HIS Q 178 3001 CUO Q S 7 FT #HET 271 2271 HIS Q 178 3001 CUO Q S 6 FT #HET 294 2294 PHE Q 178 3001 CUO Q S 1 FT #HET 298 2298 HIS Q 178 3001 CUO Q S 6 FT #HET 405 2405 SER Q 75 1 NAG 7 A 3 FT #HET 429 2429 LEU Q 178 3001 CUO Q S 1 FT #HET 543 2543 HIS Q 179 3002 CUO Q S 6 FT #HET 559 2559 CYS Q 179 3002 CUO Q S 1 FT #HET 561 2561 HIS Q 179 3002 CUO Q S 6 FT #HET 566 2566 PHE Q 179 3002 CUO Q S 1 FT #HET 570 2570 HIS Q 179 3002 CUO Q S 7 FT #HET 680 2680 HIS Q 179 3002 CUO Q S 8 FT #HET 684 2684 HIS Q 179 3002 CUO Q S 8 FT #HET 707 2707 PHE Q 179 3002 CUO Q S 3 FT #HET 711 2711 HIS Q 179 3002 CUO Q S 6 FT #MOD 472 2472 ASN Q 75 1 NAG 7 S FT DISORDER 1 82 FT DISORDER 908 920 CC SEQUENCE 825 AA (ATOM); CC VRGNLVRKNV DRLSLQEINS LIHALKRMQK DRSSDGFETI ASFHALPPLC PNPTAKHRHA CC CCLHGMATFP QWHRLYVVQF EHSLNRHGAI VGVPYWDWTY PMTEVPGLLT SEKYTDPFTG CC IETFNPFNHG HISFISPETM TTREVSEHLF EQPALGKQTW LFNNIILALE QTDYCDFEVQ CC FEIVHNSIHS WLGGKELYSL NHLHYAAYDP AFFLHHSNVD RLWVVWQELQ KFRGLPAYES CC NCAIELMSQP LKPFSFGAPY NLNPVTTKYS KPSDVFNYKQ NFHYEYDMLE MNGMSIAQLE CC SYIRQERQKD RVFAGFLLEG FGSSAYATFQ VCPDVGDCYE GSHFSVLGGS TEMPWAFDRL CC YKMEITDILQ AMALKFDSHF TIKTKIVAHN GTELPESLLP EATIVRIPPS AQNLEVAIPL CC NRIRRNINSL ESRDVQNLMS ALKRLKEDES DFGFQTIAGY HGSLMCPTPE APEYACCLHG CC MPTFMHWHRV YLLHFEESMR RHGASVAVPY WDWTMPSDNL PSLLGDADYY DAWTDSVIEN CC PFLRGHIKYE DTYTVREIQP ELFALAEGQK ESTLFKDVML MFEQEDYCDF EVQAEVIHNS CC IHYLIGGHQK YAMSSLMFSS FDPIFYVHHS MVDRLWAIWQ ELQKHRKLPH DKAYCALDQM CC AFPMKPFIWE SNPNPTTRAV STPSKLFDYK SLGYDYDHLN FHGMSIGQLE ALIQKQKKAD CC RVFAGFLLHG IKISADVHLK ICIEADCQEA GVIFVLGGET EMPWHFDRNY KMDITDVLKK CC RNIPPEALFE HDSKIRLEVE IKSVDGAVLD PNSLPKPSLI YAPAK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES YAGTFAVLGGETEMPWAFDRLFRYEISDEMKKLQLTEDSKFRLTTNIIAS CC ATOM -------------------------------------------------- CC CC SEQRES NGSKVSNDIFPTPTVIFVPKKERTEQVSTTKSVRGNLVRKNVDRLSLQEI CC ATOM --------------------------------VRGNLVRKNVDRLSLQEI CC ****************** CC SEQRES NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ATOM NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ************************************************** CC SEQRES FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ATOM FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ************************************************** CC SEQRES TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ATOM TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ************************************************** CC SEQRES LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ATOM LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ************************************************** CC SEQRES VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ATOM VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ************************************************** CC SEQRES YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ATOM YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ************************************************** CC SEQRES EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ATOM EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ************************************************** CC SEQRES LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ATOM LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ************************************************** CC SEQRES PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ATOM PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ************************************************** CC SEQRES PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ATOM PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ************************************************** CC SEQRES NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ATOM NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ************************************************** CC SEQRES QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ATOM QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ************************************************** CC SEQRES SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ATOM SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ************************************************** CC SEQRES WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ATOM WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ************************************************** CC SEQRES ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ATOM ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ************************************************** CC SEQRES NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ATOM NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ************************************************** CC SEQRES LIYAPAKGLIIQQVGEYDAG CC ATOM LIYAPAK------------- CC ******* SQ SEQUENCE 920 AA; MW; CN; YAGTFAVLGG ETEMPWAFDR LFRYEISDEM KKLQLTEDSK FRLTTNIIAS NGSKVSNDIF PTPTVIFVPK KERTEQVSTT KSVRGNLVRK NVDRLSLQEI NSLIHALKRM QKDRSSDGFE TIASFHALPP LCPNPTAKHR HACCLHGMAT FPQWHRLYVV QFEHSLNRHG AIVGVPYWDW TYPMTEVPGL LTSEKYTDPF TGIETFNPFN HGHISFISPE TMTTREVSEH LFEQPALGKQ TWLFNNIILA LEQTDYCDFE VQFEIVHNSI HSWLGGKELY SLNHLHYAAY DPAFFLHHSN VDRLWVVWQE LQKFRGLPAY ESNCAIELMS QPLKPFSFGA PYNLNPVTTK YSKPSDVFNY KQNFHYEYDM LEMNGMSIAQ LESYIRQERQ KDRVFAGFLL EGFGSSAYAT FQVCPDVGDC YEGSHFSVLG GSTEMPWAFD RLYKMEITDI LQAMALKFDS HFTIKTKIVA HNGTELPESL LPEATIVRIP PSAQNLEVAI PLNRIRRNIN SLESRDVQNL MSALKRLKED ESDFGFQTIA GYHGSLMCPT PEAPEYACCL HGMPTFMHWH RVYLLHFEES MRRHGASVAV PYWDWTMPSD NLPSLLGDAD YYDAWTDSVI ENPFLRGHIK YEDTYTVREI QPELFALAEG QKESTLFKDV MLMFEQEDYC DFEVQAEVIH NSIHYLIGGH QKYAMSSLMF SSFDPIFYVH HSMVDRLWAI WQELQKHRKL PHDKAYCALD QMAFPMKPFI WESNPNPTTR AVSTPSKLFD YKSLGYDYDH LNFHGMSIGQ LEALIQKQKK ADRVFAGFLL HGIKISADVH LKICIEADCQ EAGVIFVLGG ETEMPWHFDR NYKMDITDVL KKRNIPPEAL FEHDSKIRLE VEIKSVDGAV LDPNSLPKPS LIYAPAKGLI IQQVGEYDAG // ID 4YD9R STANDARD; PRT; 394 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 276 3196 THR R 416 416 ASN M Protein A 12 FT #SUB 279 3199 GLN R 416 416 ASN M Protein S 3 FT #SUB 348 3268 LYS R 1539 1539 GLN M Protein S 1 FT #SUB 352 3272 HIS R 1539 1539 GLN M Protein S 1 FT #SUB 352 3272 HIS R 1543 1543 ILE M Protein A 3 FT #SUB 354 3274 ARG R 1533 1533 ALA M Protein S 2 FT #SUB 11 2931 THR R 868 2868 GLU Q Protein B 2 FT #SUB 12 2932 PRO R 802 2802 ASP Q Protein S 3 FT #SUB 12 2932 PRO R 867 2867 PRO Q Protein A 10 FT #SUB 12 2932 PRO R 868 2868 GLU Q Protein B 11 FT #SUB 12 2932 PRO R 903 2903 TYR Q Protein S 2 FT #SUB 13 2933 SER R 868 2868 GLU Q Protein A 7 FT #SUB 14 2934 GLU R 868 2868 GLU Q Protein B 1 FT #SUB 16 2936 GLU R 861 2861 LYS Q Protein S 6 FT #SUB 100 3020 SER R 800 2800 LYS Q Protein S 1 FT #SUB 101 3021 LEU R 800 2800 LYS Q Protein B 1 FT #SUB 103 3023 ILE R 800 2800 LYS Q Protein S 5 FT #SUB 108 3028 PRO R 797 2797 LYS Q Protein S 1 FT #HET 41 2961 HIS R 156 3401 CUO I S 8 FT #HET 60 2980 HIS R 156 3401 CUO I S 3 FT #HET 69 2989 HIS R 156 3401 CUO I S 13 FT #HET 118 3038 ILE R 162 5014 CUO J B 1 FT #HET 121 3041 ALA R 162 5014 CUO J B 10 FT #HET 122 3042 ASP R 162 5014 CUO J A 30 FT #HET 123 3043 THR R 162 5014 CUO J A 11 FT #HET 169 3089 HIS R 156 3401 CUO I S 7 FT #HET 173 3093 HIS R 156 3401 CUO I S 5 FT #HET 196 3116 PHE R 156 3401 CUO I S 5 FT #HET 199 3119 HIS R 156 3401 CUO I S 1 FT #HET 200 3120 HIS R 156 3401 CUO I S 10 FT DISORDER 1 7 FT DISORDER 132 141 FT DISORDER 313 319 FT DISORDER 391 394 CC SEQUENCE 366 AA (ATOM); CC NSLTPSEIEN LRNALAAVQA DKTDAGYQKI ASFHGMPLSC QYPDGTAFAC CQHGMVTFPH CC WHRLYMKQME DALKAKGAKI GIPYWDWTTA FHSLPILVTE PKNNPFHHGY IDVADTKTTR CC DPRPQSFFYR QIAFALEQRD FCDFEIQFEM GHNAIHSWVG GPSPYGMSTL HYTSYDPLFY CC VHHSNTDRIW AIWQALQKYR GLPYNSANCE INKLKKPMMP FSSEDNPNEV TKAHSTGYKS CC FDYQQLNYEY DNLNFHGMTI PQLEVHLKKI QEKDRVFAGF LLRAIGQSAD VNFDVTFGGT CC FCVLGGDYEM PWAFDRLFLY DISKSLVHLR LDAHDDFDIK VTIMGIDGKS LPPNLLPSPT CC ILFKPG CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMVRKNVNSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ATOM -------NSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ******************************************* CC SEQRES DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ATOM DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ************************************************** CC SEQRES LPILVTEPKNNPFHHGYIDVADTKTTRDPRPQLFDDPEQGDQSFFYRQIA CC ATOM LPILVTEPKNNPFHHGYIDVADTKTTRDPRP----------QSFFYRQIA CC ******************************* ********* CC SEQRES FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ATOM FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ************************************************** CC SEQRES SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ATOM SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ************************************************** CC SEQRES HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ATOM HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ************************************************** CC SEQRES AIGQSADVNFDVCRKDGECTFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ATOM AIGQSADVNFDV-------TFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ************ ******************************* CC SEQRES VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPGTGKI CC ATOM VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPG---- CC **************************************** SQ SEQUENCE 394 AA; MW; CN; SMVRKNVNSL TPSEIENLRN ALAAVQADKT DAGYQKIASF HGMPLSCQYP DGTAFACCQH GMVTFPHWHR LYMKQMEDAL KAKGAKIGIP YWDWTTAFHS LPILVTEPKN NPFHHGYIDV ADTKTTRDPR PQLFDDPEQG DQSFFYRQIA FALEQRDFCD FEIQFEMGHN AIHSWVGGPS PYGMSTLHYT SYDPLFYVHH SNTDRIWAIW QALQKYRGLP YNSANCEINK LKKPMMPFSS EDNPNEVTKA HSTGYKSFDY QQLNYEYDNL NFHGMTIPQL EVHLKKIQEK DRVFAGFLLR AIGQSADVNF DVCRKDGECT FGGTFCVLGG DYEMPWAFDR LFLYDISKSL VHLRLDAHDD FDIKVTIMGI DGKSLPPNLL PSPTILFKPG TGKI // ID 4YD9S STANDARD; PRT; 2000 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 388 388 GLY S 136 2136 THR N Protein B 5 FT #SUB 389 389 THR S 136 2136 THR N Protein A 8 FT #SUB 389 389 THR S 138 2138 LYS N Protein S 1 FT #SUB 329 329 ARG S 1571 1571 TYR P Protein S 7 FT #SUB 329 329 ARG S 1583 1583 CYS P Protein B 3 FT #SUB 329 329 ARG S 1584 1584 GLY P Protein A 5 FT #SUB 329 329 ARG S 1631 1631 GLU P Protein S 5 FT #SUB 330 330 GLN S 1582 1582 ASN P Protein A 6 FT #SUB 330 330 GLN S 1583 1583 CYS P Protein B 1 FT #SUB 331 331 SER S 1582 1582 ASN P Protein A 4 FT #SUB 332 332 ASP S 1581 1581 GLU P Protein B 3 FT #SUB 333 333 ASN S 1581 1581 GLU P Protein A 2 FT #SUB 335 335 ASP S 1579 1579 GLY P Protein S 5 FT #SUB 471 471 PRO S 1639 1639 LYS P Protein B 4 FT #SUB 472 472 ASP S 1638 1638 SER P Protein S 5 FT #SUB 472 472 ASP S 1639 1639 LYS P Protein A 7 FT #SUB 473 473 ALA S 1639 1639 LYS P Protein A 6 FT #SUB 1363 1363 THR S 1390 1390 ASP P Protein S 2 FT #SUB 1365 1365 TYR S 1437 1437 ARG P Protein S 10 FT #SUB 1371 1371 GLN S 1439 1439 VAL P Protein S 4 FT #SUB 1372 1372 ILE S 1390 1390 ASP P Protein S 3 FT #SUB 1372 1372 ILE S 1438 1438 ASP P Protein S 3 FT #SUB 1378 1378 PHE S 1362 1362 ALA P Protein S 1 FT #SUB 1390 1390 ASP S 1372 1372 ILE P Protein S 1 FT #SUB 1392 1392 ASP S 1372 1372 ILE P Protein S 1 FT #SUB 1437 1437 ARG S 1365 1365 TYR P Protein S 14 FT #SUB 1438 1438 ASP S 1372 1372 ILE P Protein S 1 FT #SUB 1439 1439 VAL S 1371 1371 GLN P Protein S 1 FT #SUB 1571 1571 TYR S 328 328 LYS P Protein S 5 FT #SUB 1573 1573 CYS S 329 329 ARG P Protein B 1 FT #SUB 1574 1574 VAL S 329 329 ARG P Protein A 7 FT #SUB 1576 1576 THR S 329 329 ARG P Protein S 2 FT #SUB 1576 1576 THR S 331 331 SER P Protein S 1 FT #SUB 1578 1578 ILE S 335 335 ASP P Protein A 7 FT #SUB 1579 1579 GLY S 335 335 ASP P Protein B 3 FT #SUB 1581 1581 GLU S 331 331 SER P Protein S 3 FT #SUB 1581 1581 GLU S 333 333 ASN P Protein S 6 FT #SUB 1581 1581 GLU S 334 334 CYS P Protein S 1 FT #SUB 1581 1581 GLU S 335 335 ASP P Protein S 1 FT #SUB 1582 1582 ASN S 329 329 ARG P Protein B 7 FT #SUB 1582 1582 ASN S 330 330 GLN P Protein B 3 FT #SUB 1582 1582 ASN S 331 331 SER P Protein B 2 FT #SUB 1583 1583 CYS S 328 328 LYS P Protein B 1 FT #SUB 1583 1583 CYS S 329 329 ARG P Protein B 20 FT #SUB 1584 1584 GLY S 329 329 ARG P Protein B 4 FT #SUB 1585 1585 ASN S 329 329 ARG P Protein S 2 FT #SUB 1631 1631 GLU S 328 328 LYS P Protein S 3 FT #SUB 1638 1638 SER S 472 472 ASP P Protein A 8 FT #SUB 1639 1639 LYS S 472 472 ASP P Protein B 2 FT #SUB 106 106 GLN S 609 2609 ALA Q Protein B 1 FT #SUB 107 107 HIS S 625 2625 LEU Q Protein S 1 FT #SUB 108 108 PRO S 626 2626 ARG Q Protein S 3 FT #SUB 109 109 LEU S 626 2626 ARG Q Protein S 4 FT #SUB 109 109 LEU S 635 2635 TYR Q Protein S 1 FT #SUB 109 109 LEU S 636 2636 THR Q Protein S 1 FT #SUB 111 111 ILE S 637 2637 VAL Q Protein S 3 FT #SUB 111 111 ILE S 691 2691 GLN Q Protein S 3 FT #SUB 112 112 ASP S 691 2691 GLN Q Protein B 1 FT #SUB 116 116 LYS S 693 2693 TYR Q Protein B 1 FT #SUB 117 117 LYS S 632 2632 GLU Q Protein S 1 FT #SUB 117 117 LYS S 634 2634 THR Q Protein S 3 FT #SUB 117 117 LYS S 693 2693 TYR Q Protein S 2 FT #SUB 118 118 ALA S 634 2634 THR Q Protein A 2 FT #SUB 118 118 ALA S 635 2635 TYR Q Protein S 2 FT #SUB 124 124 TYR S 610 2610 ASP Q Protein S 2 FT #SUB 136 136 ALA S 619 2619 VAL Q Protein S 2 FT #SUB 137 137 ARG S 610 2610 ASP Q Protein B 1 FT #SUB 138 138 ALA S 612 2612 TYR Q Protein S 1 FT #SUB 138 138 ALA S 619 2619 VAL Q Protein A 2 FT #SUB 189 189 ARG S 612 2612 TYR Q Protein S 7 FT #SUB 189 189 ARG S 617 2617 ASP Q Protein A 4 FT #SUB 190 190 PHE S 617 2617 ASP Q Protein A 2 FT #SUB 190 190 PHE S 619 2619 VAL Q Protein S 3 FT #SUB 1070 1070 ALA S 871 2871 PHE W Protein B 1 FT #SUB 1071 1071 ASN S 871 2871 PHE W Protein S 3 FT #SUB 1071 1071 ASN S 901 2901 LEU W Protein B 1 FT #SUB 1071 1071 ASN S 902 2902 ILE W Protein B 1 FT #SUB 1071 1071 ASN S 903 2903 TYR W Protein A 7 FT #SUB 1071 1071 ASN S 905 2905 PRO W Protein S 1 FT #SUB 1073 1073 ALA S 901 2901 LEU W Protein B 1 FT #SUB 1074 1074 ILE S 870 2870 LEU W Protein S 1 FT #SUB 1074 1074 ILE S 871 2871 PHE W Protein S 1 FT #SUB 1074 1074 ILE S 901 2901 LEU W Protein A 4 FT #SUB 1075 1075 GLU S 899 2899 PRO W Protein S 4 FT #SUB 1075 1075 GLU S 900 2900 SER W Protein S 3 FT #SUB 1078 1078 ARG S 869 2869 ALA W Protein S 1 FT #SUB 1078 1078 ARG S 870 2870 LEU W Protein S 2 FT #SUB 1078 1078 ARG S 872 2872 GLU W Protein S 5 FT #SUB 1078 1078 ARG S 873 2873 HIS W Protein S 2 FT #SUB 1078 1078 ARG S 875 2875 SER W Protein S 4 FT #SUB 1078 1078 ARG S 876 2876 LYS W Protein S 2 FT #SUB 1078 1078 ARG S 877 2877 ILE W Protein S 9 FT #SUB 1101 1101 VAL S 873 2873 HIS W Protein S 4 FT #SUB 1103 1103 PHE S 871 2871 PHE W Protein S 8 FT #SUB 1104 1104 ASP S 873 2873 HIS W Protein S 4 FT #SUB 1147 1147 MET S 848 2848 PHE W Protein S 1 FT #SUB 1147 1147 MET S 849 2849 ASP W Protein S 3 FT #SUB 1164 1164 ASP S 744 2744 PHE W Protein S 3 FT #SUB 1187 1187 LYS S 809 2809 LEU W Protein S 1 FT #SUB 1187 1187 LYS S 811 2811 HIS W Protein S 2 FT #SUB 1187 1187 LYS S 897 2897 PRO W Protein S 2 FT #SUB 1188 1188 PHE S 809 2809 LEU W Protein B 2 FT #SUB 1189 1189 ASP S 809 2809 LEU W Protein B 1 FT #SUB 1189 1189 ASP S 850 2850 ARG W Protein B 1 FT #SUB 1189 1189 ASP S 851 2851 ASN W Protein A 4 FT #SUB 1191 1191 PRO S 849 2849 ASP W Protein S 3 FT #SUB 1207 1207 TYR S 734 2734 LYS W Protein S 2 FT #SUB 1207 1207 TYR S 739 2739 LEU W Protein A 7 FT #SUB 1208 1208 THR S 734 2734 LYS W Protein S 1 FT #SUB 1209 1209 ASP S 743 2743 ALA W Protein B 1 FT #SUB 1210 1210 LYS S 743 2743 ALA W Protein A 2 FT #SUB 1210 1210 LYS S 745 2745 PRO W Protein S 3 FT #SUB 1211 1211 TYR S 739 2739 LEU W Protein S 3 FT #SUB 1211 1211 TYR S 744 2744 PHE W Protein B 3 FT #SUB 1212 1212 HIS S 744 2744 PHE W Protein S 1 FT #SUB 1230 1230 LEU S 847 2847 HIS W Protein S 1 FT #SUB 1232 1232 THR S 740 2740 ASP W Protein A 5 FT #SUB 1233 1233 SER S 737 2737 CYS W Protein S 1 FT #SUB 1233 1233 SER S 740 2740 ASP W Protein B 6 FT #SUB 1233 1233 SER S 849 2849 ASP W Protein S 3 FT #SUB 1234 1234 VAL S 736 2736 TYR W Protein B 1 FT #SUB 1234 1234 VAL S 737 2737 CYS W Protein B 2 FT #SUB 1234 1234 VAL S 739 2739 LEU W Protein B 4 FT #SUB 1234 1234 VAL S 740 2740 ASP W Protein B 1 FT #SUB 1235 1235 ILE S 736 2736 TYR W Protein A 3 FT #SUB 1235 1235 ILE S 737 2737 CYS W Protein B 1 FT #SUB 1236 1236 TYR S 736 2736 TYR W Protein A 6 FT #SUB 1245 1245 GLU S 729 2729 LYS W Protein B 2 FT #SUB 1245 1245 GLU S 730 2730 LEU W Protein B 1 FT #SUB 1245 1245 GLU S 736 2736 TYR W Protein S 3 FT #SUB 1483 1483 ASN S 458 2458 PHE W Protein S 1 FT #SUB 1483 1483 ASN S 487 2487 VAL W Protein B 1 FT #SUB 1483 1483 ASN S 488 2488 ARG W Protein A 9 FT #SUB 1483 1483 ASN S 490 2490 PRO W Protein S 2 FT #SUB 1484 1484 CYS S 486 2486 ILE W Protein B 3 FT #SUB 1484 1484 CYS S 487 2487 VAL W Protein B 3 FT #SUB 1485 1485 ALA S 486 2486 ILE W Protein B 1 FT #SUB 1486 1486 LEU S 458 2458 PHE W Protein S 3 FT #SUB 1486 1486 LEU S 486 2486 ILE W Protein A 4 FT #SUB 1486 1486 LEU S 488 2488 ARG W Protein S 2 FT #SUB 1487 1487 PRO S 486 2486 ILE W Protein S 5 FT #SUB 1490 1490 ASN S 461 2461 HIS W Protein S 9 FT #SUB 1558 1558 LEU S 439 2439 PHE W Protein S 1 FT #SUB 1558 1558 LEU S 440 2440 ASP W Protein S 1 FT #SUB 1602 1602 GLN S 482 2482 PRO W Protein S 2 FT #SUB 1603 1603 PHE S 399 2399 LEU W Protein B 1 FT #SUB 1604 1604 ASP S 399 2399 LEU W Protein B 1 FT #SUB 1604 1604 ASP S 442 2442 LEU W Protein A 8 FT #SUB 1605 1605 ARG S 440 2440 ASP W Protein B 2 FT #SUB 1606 1606 LEU S 440 2440 ASP W Protein A 6 FT #SUB 1606 1606 LEU S 441 2441 ARG W Protein S 2 FT #SUB 1622 1622 GLN S 326 2326 ILE W Protein B 1 FT #SUB 1623 1623 ASN S 321 2321 GLU W Protein S 1 FT #SUB 1625 1625 HIS S 330 2330 SER W Protein S 1 FT #SUB 1625 1625 HIS S 353 2353 LYS W Protein S 2 FT #SUB 1648 1648 PRO S 327 2327 GLU W Protein B 3 FT #SUB 1649 1649 THR S 325 2325 ALA W Protein S 2 FT #SUB 1649 1649 THR S 327 2327 GLU W Protein A 5 FT #SUB 1650 1650 ILE S 323 2323 ASN W Protein B 1 FT #SUB 1650 1650 ILE S 324 2324 CYS W Protein B 1 FT #SUB 1650 1650 ILE S 325 2325 ALA W Protein B 3 FT #SUB 1650 1650 ILE S 326 2326 ILE W Protein B 5 FT #SUB 1650 1650 ILE S 327 2327 GLU W Protein A 3 FT #SUB 1651 1651 ILE S 323 2323 ASN W Protein A 3 FT #SUB 1652 1652 PHE S 323 2323 ASN W Protein A 11 FT #SUB 1652 1652 PHE S 326 2326 ILE W Protein A 2 FT #SUB 1654 1654 PRO S 323 2323 ASN W Protein S 2 FT #SUB 416 416 ASN S 276 3196 THR X Protein S 12 FT #SUB 416 416 ASN S 279 3199 GLN X Protein S 4 FT #SUB 419 419 THR S 279 3199 GLN X Protein S 1 FT #SUB 1401 1401 ARG S 354 3274 ARG X Protein S 1 FT #SUB 1401 1401 ARG S 359 3279 ASP X Protein S 2 FT #SUB 1404 1404 TYR S 352 3272 HIS X Protein S 1 FT #SUB 1533 1533 ALA S 351 3271 VAL X Protein B 2 FT #SUB 1533 1533 ALA S 354 3274 ARG X Protein B 2 FT #SUB 1535 1535 LEU S 352 3272 HIS X Protein S 6 FT #SUB 1542 1542 ARG S 352 3272 HIS X Protein S 1 FT #SUB 1543 1543 ILE S 352 3272 HIS X Protein S 3 FT #HET 41 41 HIS S 181 2102 CUO S S 6 FT #HET 58 58 CYS S 181 2102 CUO S S 1 FT #HET 60 60 HIS S 181 2102 CUO S S 7 FT #HET 65 65 PHE S 181 2102 CUO S S 1 FT #HET 69 69 HIS S 181 2102 CUO S S 8 FT #HET 179 179 HIS S 181 2102 CUO S S 7 FT #HET 183 183 HIS S 181 2102 CUO S S 7 FT #HET 206 206 PHE S 181 2102 CUO S S 1 FT #HET 210 210 HIS S 181 2102 CUO S S 7 FT #HET 311 311 ILE S 78 2 NAG 8 B 1 FT #HET 313 313 THR S 77 1 NAG 8 S 4 FT #HET 344 344 LEU S 181 2102 CUO S S 1 FT #HET 389 389 THR S 77 1 NAG 8 S 2 FT #HET 391 391 MET S 77 1 NAG 8 S 1 FT #HET 391 391 MET S 78 2 NAG 8 S 2 FT #HET 462 462 HIS S 182 2103 CUO S S 6 FT #HET 480 480 CYS S 182 2103 CUO S S 1 FT #HET 482 482 HIS S 182 2103 CUO S S 6 FT #HET 487 487 PHE S 182 2103 CUO S S 1 FT #HET 491 491 HIS S 182 2103 CUO S S 7 FT #HET 603 603 HIS S 182 2103 CUO S S 6 FT #HET 607 607 HIS S 182 2103 CUO S S 7 FT #HET 630 630 PHE S 182 2103 CUO S S 1 FT #HET 634 634 HIS S 182 2103 CUO S S 6 FT #HET 763 763 LEU S 182 2103 CUO S S 1 FT #HET 804 804 ASP S 79 1 NAG 9 S 4 FT #HET 808 808 THR S 79 1 NAG 9 S 1 FT #HET 810 810 LEU S 79 1 NAG 9 S 1 FT #HET 877 877 HIS S 183 2104 CUO S S 7 FT #HET 897 897 HIS S 183 2104 CUO S S 3 FT #HET 906 906 HIS S 183 2104 CUO S S 10 FT #HET 1015 1015 HIS S 183 2104 CUO S S 5 FT #HET 1019 1019 HIS S 183 2104 CUO S S 5 FT #HET 1042 1042 PHE S 183 2104 CUO S S 3 FT #HET 1046 1046 HIS S 183 2104 CUO S S 7 FT #HET 1294 1294 HIS S 184 2105 CUO S S 7 FT #HET 1298 1298 ALA S 82 2 NAG AA B 1 FT #HET 1299 1299 GLN S 81 1 NAG AA A 4 FT #HET 1301 1301 PRO S 81 1 NAG AA S 3 FT #HET 1308 1308 VAL S 82 2 NAG AA S 1 FT #HET 1312 1312 CYS S 184 2105 CUO S S 1 FT #HET 1314 1314 HIS S 184 2105 CUO S S 5 FT #HET 1319 1319 PHE S 184 2105 CUO S S 1 FT #HET 1323 1323 HIS S 184 2105 CUO S S 10 FT #HET 1427 1427 HIS S 184 2105 CUO S S 6 FT #HET 1431 1431 HIS S 184 2105 CUO S S 7 FT #HET 1454 1454 PHE S 184 2105 CUO S S 3 FT #HET 1458 1458 HIS S 184 2105 CUO S S 9 FT #HET 1494 1494 ARG S 81 1 NAG AA S 3 FT #HET 1500 1500 THR S 81 1 NAG AA B 6 FT #HET 1501 1501 ALA S 81 1 NAG AA A 2 FT #HET 1564 1564 ALA S 85 1 NAG BA S 1 FT #HET 1639 1639 LYS S 85 1 NAG BA S 8 FT #HET 1639 1639 LYS S 86 2 NAG BA S 2 FT #MOD 387 387 ASN S 77 1 NAG 8 S FT #MOD 806 806 ASN S 79 1 NAG 9 S FT #MOD 1498 1498 ASN S 81 1 NAG AA S FT #MOD 1636 1636 ASN S 85 1 NAG BA S FT DISORDER 1657 2000 CC SEQUENCE 1656 AA (ATOM); CC NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH CC GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK CC NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN CC PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF CC HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA CC RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH CC EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS CC SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC CC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET CC KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE CC VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA CC NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV CC EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE CC ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR CC KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA CC TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF CC SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA CC LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP CC MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN CC RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM CC DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR CC KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP CC HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG CC KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH CC SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST CC ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH CC GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL CC NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSD CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ATOM NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ************************************************** CC SEQRES DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ATOM DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ************************************************** CC SEQRES LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ATOM LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ************************************************** CC SEQRES QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ATOM QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ************************************************** CC SEQRES SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ATOM SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ************************************************** CC SEQRES TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ATOM TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ************************************************** CC SEQRES RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ATOM RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ************************************************** CC SEQRES PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ATOM PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ************************************************** CC SEQRES VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ATOM VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ************************************************** CC SEQRES SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ATOM SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ************************************************** CC SEQRES ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ATOM ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ************************************************** CC SEQRES EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ATOM EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ************************************************** CC SEQRES VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ATOM VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ************************************************** CC SEQRES RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ATOM RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ************************************************** CC SEQRES YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ATOM YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ************************************************** CC SEQRES GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ATOM GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ************************************************** CC SEQRES DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ATOM DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ************************************************** CC SEQRES MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ATOM MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ************************************************** CC SEQRES TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ATOM TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ************************************************** CC SEQRES NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ATOM NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ************************************************** CC SEQRES QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ATOM QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ************************************************** CC SEQRES RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ATOM RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ************************************************** CC SEQRES VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ATOM VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ************************************************** CC SEQRES LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ATOM LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ************************************************** CC SEQRES DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ATOM DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ************************************************** CC SEQRES SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ATOM SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ************************************************** CC SEQRES PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ATOM PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ************************************************** CC SEQRES PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ATOM PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ************************************************** CC SEQRES RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ATOM RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ************************************************** CC SEQRES DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ATOM DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ************************************************** CC SEQRES ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ATOM ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ************************************************** CC SEQRES DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ATOM DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ************************************************** CC SEQRES WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ATOM WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ************************************************** CC SEQRES IFVPSDVEFEEDTWRDVVTSANRIRRNLKDLSKEDMFSLRAAFKRMTDDG CC ATOM IFVPSD-------------------------------------------- CC ****** CC SEQRES RYEEIAAFHGLPAQCPNADGSNIHTCCLHGMPTFPHWHRLYLSLVENELL CC ATOM -------------------------------------------------- CC CC SEQRES ARGSDVAVPYWDWIEPFDSLPGLISDETYKHPKTNEDIENPFHHGKISFA CC ATOM -------------------------------------------------- CC CC SEQRES DAVTVRKPRDQLFNNRYLYEHALFAFEHTDFCDFEVHFEVLHNSIHSWIG CC ATOM -------------------------------------------------- CC CC SEQRES GPNPHSMSSLDFAAYDPIFFLHHSTVDRLWAIWQDLQRYRKLDYNVANCA CC ATOM -------------------------------------------------- CC CC SEQRES LNLLNDPMRPFNNKTANQDHLTFTNSRPNDVFDYQNSLNYKFDSLSFSGL CC ATOM -------------------------------------------------- CC CC SEQRES SIPRLDDLLESRQSHDRVFAGFWLSGIKASADVNIHICVPIGVEHEDCDN CC ATOM -------------------------------------------------- CC CC SEQRES CC ATOM CC SQ SEQUENCE 2000 AA; MW; CN; NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSDVEFE EDTWRDVVTS ANRIRRNLKD LSKEDMFSLR AAFKRMTDDG RYEEIAAFHG LPAQCPNADG SNIHTCCLHG MPTFPHWHRL YLSLVENELL ARGSDVAVPY WDWIEPFDSL PGLISDETYK HPKTNEDIEN PFHHGKISFA DAVTVRKPRD QLFNNRYLYE HALFAFEHTD FCDFEVHFEV LHNSIHSWIG GPNPHSMSSL DFAAYDPIFF LHHSTVDRLW AIWQDLQRYR KLDYNVANCA LNLLNDPMRP FNNKTANQDH LTFTNSRPND VFDYQNSLNY KFDSLSFSGL SIPRLDDLLE SRQSHDRVFA GFWLSGIKAS ADVNIHICVP IGVEHEDCDN // ID 4YD9T STANDARD; PRT; 920 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 609 2609 ALA T 106 106 GLN P Protein S 1 FT #SUB 610 2610 ASP T 137 137 ARG P Protein S 1 FT #SUB 612 2612 TYR T 138 138 ALA P Protein S 2 FT #SUB 612 2612 TYR T 189 189 ARG P Protein S 5 FT #SUB 617 2617 ASP T 189 189 ARG P Protein S 2 FT #SUB 617 2617 ASP T 190 190 PHE P Protein B 1 FT #SUB 619 2619 VAL T 136 136 ALA P Protein S 2 FT #SUB 619 2619 VAL T 138 138 ALA P Protein S 3 FT #SUB 619 2619 VAL T 190 190 PHE P Protein A 4 FT #SUB 621 2621 GLU T 108 108 PRO P Protein S 1 FT #SUB 625 2625 LEU T 107 107 HIS P Protein S 2 FT #SUB 626 2626 ARG T 108 108 PRO P Protein S 1 FT #SUB 626 2626 ARG T 109 109 LEU P Protein S 1 FT #SUB 632 2632 GLU T 117 117 LYS P Protein B 2 FT #SUB 634 2634 THR T 117 117 LYS P Protein S 5 FT #SUB 634 2634 THR T 118 118 ALA P Protein S 1 FT #SUB 635 2635 TYR T 109 109 LEU P Protein S 1 FT #SUB 636 2636 THR T 109 109 LEU P Protein B 1 FT #SUB 637 2637 VAL T 111 111 ILE P Protein S 3 FT #SUB 637 2637 VAL T 118 118 ALA P Protein S 1 FT #SUB 638 2638 ARG T 111 111 ILE P Protein B 1 FT #SUB 691 2691 GLN T 111 111 ILE P Protein S 2 FT #SUB 691 2691 GLN T 112 112 ASP P Protein S 1 FT #SUB 693 2693 TYR T 116 116 LYS P Protein S 2 FT #SUB 693 2693 TYR T 117 117 LYS P Protein S 3 FT #SUB 800 2800 LYS T 100 3020 SER U Protein S 2 FT #SUB 800 2800 LYS T 101 3021 LEU U Protein S 1 FT #SUB 800 2800 LYS T 103 3023 ILE U Protein A 7 FT #SUB 801 2801 ALA T 103 3023 ILE U Protein B 1 FT #SUB 802 2802 ASP T 10 2930 LEU U Protein S 1 FT #SUB 802 2802 ASP T 12 2932 PRO U Protein S 4 FT #SUB 861 2861 LYS T 16 2936 GLU U Protein S 2 FT #SUB 867 2867 PRO T 12 2932 PRO U Protein S 5 FT #SUB 868 2868 GLU T 12 2932 PRO U Protein B 1 FT #SUB 903 2903 TYR T 11 2931 THR U Protein S 1 FT #SUB 903 2903 TYR T 12 2932 PRO U Protein S 1 FT #SUB 321 2321 GLU T 1623 1623 ASN V Protein S 2 FT #SUB 323 2323 ASN T 1651 1651 ILE V Protein B 3 FT #SUB 323 2323 ASN T 1652 1652 PHE V Protein A 10 FT #SUB 323 2323 ASN T 1654 1654 PRO V Protein S 2 FT #SUB 325 2325 ALA T 1650 1650 ILE V Protein B 3 FT #SUB 326 2326 ILE T 1622 1622 GLN V Protein S 1 FT #SUB 326 2326 ILE T 1650 1650 ILE V Protein A 5 FT #SUB 326 2326 ILE T 1652 1652 PHE V Protein S 2 FT #SUB 327 2327 GLU T 1648 1648 PRO V Protein S 2 FT #SUB 327 2327 GLU T 1649 1649 THR V Protein S 8 FT #SUB 327 2327 GLU T 1650 1650 ILE V Protein S 3 FT #SUB 330 2330 SER T 1625 1625 HIS V Protein S 2 FT #SUB 353 2353 LYS T 1625 1625 HIS V Protein S 1 FT #SUB 399 2399 LEU T 1603 1603 PHE V Protein S 1 FT #SUB 401 2401 GLU T 1602 1602 GLN V Protein S 1 FT #SUB 439 2439 PHE T 1558 1558 LEU V Protein B 1 FT #SUB 440 2440 ASP T 1558 1558 LEU V Protein A 3 FT #SUB 440 2440 ASP T 1605 1605 ARG V Protein B 2 FT #SUB 440 2440 ASP T 1606 1606 LEU V Protein A 7 FT #SUB 441 2441 ARG T 1604 1604 ASP V Protein B 1 FT #SUB 442 2442 LEU T 1604 1604 ASP V Protein A 7 FT #SUB 458 2458 PHE T 1481 1481 GLU V Protein S 1 FT #SUB 458 2458 PHE T 1486 1486 LEU V Protein A 4 FT #SUB 461 2461 HIS T 1490 1490 ASN V Protein S 5 FT #SUB 485 2485 THR T 1485 1485 ALA V Protein S 2 FT #SUB 486 2486 ILE T 1484 1484 CYS V Protein B 1 FT #SUB 486 2486 ILE T 1485 1485 ALA V Protein B 3 FT #SUB 486 2486 ILE T 1486 1486 LEU V Protein B 6 FT #SUB 486 2486 ILE T 1487 1487 PRO V Protein A 4 FT #SUB 487 2487 VAL T 1483 1483 ASN V Protein A 3 FT #SUB 487 2487 VAL T 1485 1485 ALA V Protein S 1 FT #SUB 488 2488 ARG T 1483 1483 ASN V Protein A 7 FT #SUB 490 2490 PRO T 1483 1483 ASN V Protein S 3 FT #SUB 729 2729 LYS T 1245 1245 GLU V Protein B 5 FT #SUB 730 2730 LEU T 1245 1245 GLU V Protein A 6 FT #SUB 730 2730 LEU T 1246 1246 GLY V Protein S 2 FT #SUB 731 2731 PRO T 1245 1245 GLU V Protein A 4 FT #SUB 734 2734 LYS T 1245 1245 GLU V Protein S 3 FT #SUB 736 2736 TYR T 1235 1235 ILE V Protein B 1 FT #SUB 736 2736 TYR T 1236 1236 TYR V Protein A 4 FT #SUB 736 2736 TYR T 1238 1238 PRO V Protein S 3 FT #SUB 736 2736 TYR T 1245 1245 GLU V Protein S 3 FT #SUB 737 2737 CYS T 1234 1234 VAL V Protein B 1 FT #SUB 738 2738 ALA T 1232 1232 THR V Protein S 1 FT #SUB 738 2738 ALA T 1233 1233 SER V Protein S 1 FT #SUB 738 2738 ALA T 1234 1234 VAL V Protein A 6 FT #SUB 739 2739 LEU T 1207 1207 TYR V Protein S 3 FT #SUB 739 2739 LEU T 1211 1211 TYR V Protein S 2 FT #SUB 739 2739 LEU T 1234 1234 VAL V Protein A 3 FT #SUB 740 2740 ASP T 1211 1211 TYR V Protein S 1 FT #SUB 740 2740 ASP T 1212 1212 HIS V Protein S 2 FT #SUB 740 2740 ASP T 1231 1231 VAL V Protein S 1 FT #SUB 740 2740 ASP T 1232 1232 THR V Protein S 6 FT #SUB 740 2740 ASP T 1234 1234 VAL V Protein S 1 FT #SUB 741 2741 GLN T 1232 1232 THR V Protein S 5 FT #SUB 743 2743 ALA T 1208 1208 THR V Protein S 1 FT #SUB 743 2743 ALA T 1210 1210 LYS V Protein S 1 FT #SUB 744 2744 PHE T 1210 1210 LYS V Protein S 4 FT #SUB 744 2744 PHE T 1211 1211 TYR V Protein S 6 FT #SUB 744 2744 PHE T 1212 1212 HIS V Protein S 3 FT #SUB 745 2745 PRO T 1210 1210 LYS V Protein S 1 FT #SUB 809 2809 LEU T 1187 1187 LYS V Protein S 2 FT #SUB 809 2809 LEU T 1188 1188 PHE V Protein S 2 FT #SUB 809 2809 LEU T 1189 1189 ASP V Protein S 1 FT #SUB 847 2847 HIS T 1230 1230 LEU V Protein S 6 FT #SUB 848 2848 PHE T 1147 1147 MET V Protein B 1 FT #SUB 849 2849 ASP T 1147 1147 MET V Protein A 4 FT #SUB 849 2849 ASP T 1191 1191 PRO V Protein A 3 FT #SUB 850 2850 ARG T 1189 1189 ASP V Protein B 1 FT #SUB 851 2851 ASN T 1189 1189 ASP V Protein S 4 FT #SUB 869 2869 ALA T 1078 1078 ARG V Protein B 1 FT #SUB 870 2870 LEU T 1074 1074 ILE V Protein B 1 FT #SUB 870 2870 LEU T 1078 1078 ARG V Protein B 3 FT #SUB 871 2871 PHE T 1070 1070 ALA V Protein S 2 FT #SUB 871 2871 PHE T 1071 1071 ASN V Protein S 2 FT #SUB 871 2871 PHE T 1103 1103 PHE V Protein A 6 FT #SUB 872 2872 GLU T 1078 1078 ARG V Protein B 3 FT #SUB 873 2873 HIS T 1078 1078 ARG V Protein A 5 FT #SUB 873 2873 HIS T 1101 1101 VAL V Protein S 2 FT #SUB 873 2873 HIS T 1104 1104 ASP V Protein S 1 FT #SUB 875 2875 SER T 1078 1078 ARG V Protein B 4 FT #SUB 876 2876 LYS T 1078 1078 ARG V Protein B 1 FT #SUB 877 2877 ILE T 1078 1078 ARG V Protein A 9 FT #SUB 899 2899 PRO T 1075 1075 GLU V Protein B 3 FT #SUB 900 2900 SER T 1075 1075 GLU V Protein A 6 FT #SUB 901 2901 LEU T 1071 1071 ASN V Protein B 1 FT #SUB 901 2901 LEU T 1073 1073 ALA V Protein B 2 FT #SUB 901 2901 LEU T 1074 1074 ILE V Protein A 4 FT #SUB 901 2901 LEU T 1075 1075 GLU V Protein A 4 FT #SUB 902 2902 ILE T 1071 1071 ASN V Protein B 2 FT #SUB 903 2903 TYR T 1071 1071 ASN V Protein A 5 FT #SUB 903 2903 TYR T 1074 1074 ILE V Protein S 1 FT #SUB 84 2084 ARG T 500 2500 ILE W Protein S 1 FT #SUB 84 2084 ARG T 502 2502 LEU W Protein S 1 FT #SUB 90 2090 LYS T 375 2375 GLY W Protein S 1 FT #SUB 94 2094 ARG T 94 2094 ARG W Protein S 5 FT #SUB 94 2094 ARG T 182 2182 TYR W Protein A 7 FT #SUB 94 2094 ARG T 370 2370 MET W Protein S 7 FT #SUB 96 2096 SER T 182 2182 TYR W Protein S 1 FT #SUB 96 2096 SER T 374 2374 ASN W Protein S 3 FT #SUB 97 2097 LEU T 235 2235 PRO W Protein S 2 FT #SUB 98 2098 GLN T 244 2244 PHE W Protein S 1 FT #SUB 98 2098 GLN T 374 2374 ASN W Protein S 3 FT #SUB 99 2099 GLU T 375 2375 GLY W Protein S 3 FT #SUB 109 2109 ARG T 503 2503 ASN W Protein S 1 FT #SUB 109 2109 ARG T 582 2582 ARG W Protein S 2 FT #SUB 109 2109 ARG T 585 2585 GLY W Protein B 1 FT #SUB 112 2112 LYS T 584 2584 HIS W Protein B 1 FT #SUB 113 2113 ASP T 519 2519 ASN W Protein S 3 FT #SUB 113 2113 ASP T 585 2585 GLY W Protein A 7 FT #SUB 114 2114 ARG T 526 2526 ARG W Protein S 8 FT #SUB 115 2115 SER T 522 2522 SER W Protein A 3 FT #SUB 116 2116 SER T 615 2615 TRP W Protein S 3 FT #SUB 121 2121 THR T 615 2615 TRP W Protein S 6 FT #SUB 125 2125 PHE T 615 2615 TRP W Protein S 1 FT #SUB 167 2167 ASN T 502 2502 LEU W Protein A 2 FT #SUB 168 2168 ARG T 502 2502 LEU W Protein A 3 FT #SUB 168 2168 ARG T 503 2503 ASN W Protein B 4 FT #SUB 168 2168 ARG T 515 2515 ARG W Protein S 10 FT #SUB 168 2168 ARG T 516 2516 ASP W Protein S 1 FT #SUB 168 2168 ARG T 587 2587 SER W Protein A 5 FT #SUB 169 2169 HIS T 503 2503 ASN W Protein B 2 FT #SUB 169 2169 HIS T 585 2585 GLY W Protein S 2 FT #SUB 169 2169 HIS T 587 2587 SER W Protein S 3 FT #SUB 170 2170 GLY T 503 2503 ASN W Protein B 2 FT #SUB 182 2182 TYR T 94 2094 ARG W Protein S 4 FT #SUB 199 2199 PRO T 235 2235 PRO W Protein A 2 FT #SUB 199 2199 PRO T 236 2236 ALA W Protein B 3 FT #SUB 199 2199 PRO T 237 2237 LEU W Protein B 2 FT #SUB 200 2200 PHE T 237 2237 LEU W Protein B 7 FT #SUB 235 2235 PRO T 97 2097 LEU W Protein S 2 FT #SUB 235 2235 PRO T 199 2199 PRO W Protein B 1 FT #SUB 236 2236 ALA T 199 2199 PRO W Protein B 2 FT #SUB 237 2237 LEU T 199 2199 PRO W Protein B 2 FT #SUB 237 2237 LEU T 200 2200 PHE W Protein A 8 FT #SUB 341 2341 PRO T 615 2615 TRP W Protein B 1 FT #SUB 342 2342 TYR T 615 2615 TRP W Protein A 9 FT #SUB 344 2344 LEU T 514 2514 SER W Protein S 3 FT #SUB 344 2344 LEU T 515 2515 ARG W Protein A 6 FT #SUB 345 2345 ASN T 515 2515 ARG W Protein S 1 FT #SUB 346 2346 PRO T 513 2513 GLU W Protein S 1 FT #SUB 346 2346 PRO T 515 2515 ARG W Protein S 6 FT #SUB 370 2370 MET T 94 2094 ARG W Protein S 5 FT #SUB 370 2370 MET T 370 2370 MET W Protein S 1 FT #SUB 374 2374 ASN T 96 2096 SER W Protein A 2 FT #SUB 374 2374 ASN T 98 2098 GLN W Protein S 1 FT #SUB 375 2375 GLY T 90 2090 LYS W Protein B 1 FT #SUB 375 2375 GLY T 99 2099 GLU W Protein B 1 FT #SUB 500 2500 ILE T 84 2084 ARG W Protein B 1 FT #SUB 502 2502 LEU T 84 2084 ARG W Protein S 3 FT #SUB 502 2502 LEU T 167 2167 ASN W Protein S 1 FT #SUB 502 2502 LEU T 168 2168 ARG W Protein S 2 FT #SUB 503 2503 ASN T 109 2109 ARG W Protein S 1 FT #SUB 503 2503 ASN T 168 2168 ARG W Protein A 3 FT #SUB 503 2503 ASN T 169 2169 HIS W Protein S 2 FT #SUB 503 2503 ASN T 170 2170 GLY W Protein S 2 FT #SUB 504 2504 ARG T 109 2109 ARG W Protein S 1 FT #SUB 515 2515 ARG T 168 2168 ARG W Protein S 13 FT #SUB 515 2515 ARG T 345 2345 ASN W Protein S 2 FT #SUB 515 2515 ARG T 346 2346 PRO W Protein S 1 FT #SUB 519 2519 ASN T 113 2113 ASP W Protein S 3 FT #SUB 519 2519 ASN T 115 2115 SER W Protein B 2 FT #SUB 522 2522 SER T 115 2115 SER W Protein S 3 FT #SUB 526 2526 ARG T 114 2114 ARG W Protein S 5 FT #SUB 583 2583 ARG T 112 2112 LYS W Protein B 3 FT #SUB 584 2584 HIS T 112 2112 LYS W Protein B 1 FT #SUB 585 2585 GLY T 109 2109 ARG W Protein B 1 FT #SUB 585 2585 GLY T 113 2113 ASP W Protein B 4 FT #SUB 585 2585 GLY T 169 2169 HIS W Protein B 1 FT #SUB 587 2587 SER T 168 2168 ARG W Protein A 4 FT #SUB 587 2587 SER T 169 2169 HIS W Protein S 5 FT #SUB 614 2614 ALA T 341 2341 PRO W Protein S 2 FT #SUB 614 2614 ALA T 342 2342 TYR W Protein S 1 FT #SUB 615 2615 TRP T 116 2116 SER W Protein S 1 FT #SUB 615 2615 TRP T 121 2121 THR W Protein S 3 FT #SUB 615 2615 TRP T 124 2124 SER W Protein S 1 FT #SUB 615 2615 TRP T 125 2125 PHE W Protein S 1 FT #SUB 615 2615 TRP T 131 2131 LEU W Protein S 1 FT #SUB 615 2615 TRP T 342 2342 TYR W Protein S 2 FT #SUB 136 2136 THR T 388 388 GLY Y Protein S 5 FT #SUB 136 2136 THR T 389 389 THR Y Protein A 9 FT #SUB 138 2138 LYS T 391 391 MET Y Protein S 2 FT #HET 126 2126 HIS T 185 3001 CUO T S 7 FT #HET 138 2138 LYS T 105 1 NAG IA S 3 FT #HET 138 2138 LYS T 106 2 NAG IA S 2 FT #HET 144 2144 CYS T 185 3001 CUO T S 1 FT #HET 146 2146 HIS T 185 3001 CUO T S 6 FT #HET 155 2155 HIS T 185 3001 CUO T S 7 FT #HET 267 2267 HIS T 185 3001 CUO T S 6 FT #HET 271 2271 HIS T 185 3001 CUO T S 6 FT #HET 294 2294 PHE T 185 3001 CUO T S 2 FT #HET 298 2298 HIS T 185 3001 CUO T S 7 FT #HET 404 2404 GLY T 87 1 NAG CA B 1 FT #HET 405 2405 SER T 87 1 NAG CA S 2 FT #HET 429 2429 LEU T 185 3001 CUO T S 1 FT #HET 476 2476 LEU T 88 2 NAG CA S 2 FT #HET 480 2480 LEU T 88 2 NAG CA S 1 FT #HET 543 2543 HIS T 186 3002 CUO T S 6 FT #HET 559 2559 CYS T 186 3002 CUO T S 1 FT #HET 561 2561 HIS T 186 3002 CUO T S 5 FT #HET 570 2570 HIS T 186 3002 CUO T S 10 FT #HET 680 2680 HIS T 186 3002 CUO T S 8 FT #HET 684 2684 HIS T 186 3002 CUO T S 6 FT #HET 707 2707 PHE T 186 3002 CUO T S 4 FT #HET 711 2711 HIS T 186 3002 CUO T S 10 FT #MOD 472 2472 ASN T 87 1 NAG CA S FT DISORDER 1 82 FT DISORDER 908 920 CC SEQUENCE 825 AA (ATOM); CC VRGNLVRKNV DRLSLQEINS LIHALKRMQK DRSSDGFETI ASFHALPPLC PNPTAKHRHA CC CCLHGMATFP QWHRLYVVQF EHSLNRHGAI VGVPYWDWTY PMTEVPGLLT SEKYTDPFTG CC IETFNPFNHG HISFISPETM TTREVSEHLF EQPALGKQTW LFNNIILALE QTDYCDFEVQ CC FEIVHNSIHS WLGGKELYSL NHLHYAAYDP AFFLHHSNVD RLWVVWQELQ KFRGLPAYES CC NCAIELMSQP LKPFSFGAPY NLNPVTTKYS KPSDVFNYKQ NFHYEYDMLE MNGMSIAQLE CC SYIRQERQKD RVFAGFLLEG FGSSAYATFQ VCPDVGDCYE GSHFSVLGGS TEMPWAFDRL CC YKMEITDILQ AMALKFDSHF TIKTKIVAHN GTELPESLLP EATIVRIPPS AQNLEVAIPL CC NRIRRNINSL ESRDVQNLMS ALKRLKEDES DFGFQTIAGY HGSLMCPTPE APEYACCLHG CC MPTFMHWHRV YLLHFEESMR RHGASVAVPY WDWTMPSDNL PSLLGDADYY DAWTDSVIEN CC PFLRGHIKYE DTYTVREIQP ELFALAEGQK ESTLFKDVML MFEQEDYCDF EVQAEVIHNS CC IHYLIGGHQK YAMSSLMFSS FDPIFYVHHS MVDRLWAIWQ ELQKHRKLPH DKAYCALDQM CC AFPMKPFIWE SNPNPTTRAV STPSKLFDYK SLGYDYDHLN FHGMSIGQLE ALIQKQKKAD CC RVFAGFLLHG IKISADVHLK ICIEADCQEA GVIFVLGGET EMPWHFDRNY KMDITDVLKK CC RNIPPEALFE HDSKIRLEVE IKSVDGAVLD PNSLPKPSLI YAPAK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES YAGTFAVLGGETEMPWAFDRLFRYEISDEMKKLQLTEDSKFRLTTNIIAS CC ATOM -------------------------------------------------- CC CC SEQRES NGSKVSNDIFPTPTVIFVPKKERTEQVSTTKSVRGNLVRKNVDRLSLQEI CC ATOM --------------------------------VRGNLVRKNVDRLSLQEI CC ****************** CC SEQRES NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ATOM NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ************************************************** CC SEQRES FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ATOM FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ************************************************** CC SEQRES TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ATOM TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ************************************************** CC SEQRES LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ATOM LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ************************************************** CC SEQRES VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ATOM VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ************************************************** CC SEQRES YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ATOM YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ************************************************** CC SEQRES EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ATOM EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ************************************************** CC SEQRES LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ATOM LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ************************************************** CC SEQRES PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ATOM PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ************************************************** CC SEQRES PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ATOM PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ************************************************** CC SEQRES NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ATOM NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ************************************************** CC SEQRES QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ATOM QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ************************************************** CC SEQRES SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ATOM SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ************************************************** CC SEQRES WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ATOM WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ************************************************** CC SEQRES ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ATOM ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ************************************************** CC SEQRES NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ATOM NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ************************************************** CC SEQRES LIYAPAKGLIIQQVGEYDAG CC ATOM LIYAPAK------------- CC ******* SQ SEQUENCE 920 AA; MW; CN; YAGTFAVLGG ETEMPWAFDR LFRYEISDEM KKLQLTEDSK FRLTTNIIAS NGSKVSNDIF PTPTVIFVPK KERTEQVSTT KSVRGNLVRK NVDRLSLQEI NSLIHALKRM QKDRSSDGFE TIASFHALPP LCPNPTAKHR HACCLHGMAT FPQWHRLYVV QFEHSLNRHG AIVGVPYWDW TYPMTEVPGL LTSEKYTDPF TGIETFNPFN HGHISFISPE TMTTREVSEH LFEQPALGKQ TWLFNNIILA LEQTDYCDFE VQFEIVHNSI HSWLGGKELY SLNHLHYAAY DPAFFLHHSN VDRLWVVWQE LQKFRGLPAY ESNCAIELMS QPLKPFSFGA PYNLNPVTTK YSKPSDVFNY KQNFHYEYDM LEMNGMSIAQ LESYIRQERQ KDRVFAGFLL EGFGSSAYAT FQVCPDVGDC YEGSHFSVLG GSTEMPWAFD RLYKMEITDI LQAMALKFDS HFTIKTKIVA HNGTELPESL LPEATIVRIP PSAQNLEVAI PLNRIRRNIN SLESRDVQNL MSALKRLKED ESDFGFQTIA GYHGSLMCPT PEAPEYACCL HGMPTFMHWH RVYLLHFEES MRRHGASVAV PYWDWTMPSD NLPSLLGDAD YYDAWTDSVI ENPFLRGHIK YEDTYTVREI QPELFALAEG QKESTLFKDV MLMFEQEDYC DFEVQAEVIH NSIHYLIGGH QKYAMSSLMF SSFDPIFYVH HSMVDRLWAI WQELQKHRKL PHDKAYCALD QMAFPMKPFI WESNPNPTTR AVSTPSKLFD YKSLGYDYDH LNFHGMSIGQ LEALIQKQKK ADRVFAGFLL HGIKISADVH LKICIEADCQ EAGVIFVLGG ETEMPWHFDR NYKMDITDVL KKRNIPPEAL FEHDSKIRLE VEIKSVDGAV LDPNSLPKPS LIYAPAKGLI IQQVGEYDAG // ID 4YD9U STANDARD; PRT; 394 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 10 2930 LEU U 802 2802 ASP T Protein B 1 FT #SUB 11 2931 THR U 903 2903 TYR T Protein S 1 FT #SUB 12 2932 PRO U 802 2802 ASP T Protein S 4 FT #SUB 12 2932 PRO U 867 2867 PRO T Protein A 5 FT #SUB 12 2932 PRO U 868 2868 GLU T Protein B 1 FT #SUB 12 2932 PRO U 903 2903 TYR T Protein S 1 FT #SUB 16 2936 GLU U 861 2861 LYS T Protein S 2 FT #SUB 100 3020 SER U 800 2800 LYS T Protein S 2 FT #SUB 101 3021 LEU U 800 2800 LYS T Protein B 1 FT #SUB 103 3023 ILE U 800 2800 LYS T Protein S 7 FT #SUB 103 3023 ILE U 801 2801 ALA T Protein S 1 FT #SUB 279 3199 GLN U 416 416 ASN V Protein S 1 FT #SUB 348 3268 LYS U 1539 1539 GLN V Protein S 3 FT #SUB 352 3272 HIS U 1535 1535 LEU V Protein S 3 FT #SUB 352 3272 HIS U 1542 1542 ARG V Protein S 2 FT #SUB 352 3272 HIS U 1543 1543 ILE V Protein S 2 FT #SUB 354 3274 ARG U 1533 1533 ALA V Protein S 2 FT #HET 41 2961 HIS U 187 3401 CUO U S 8 FT #HET 60 2980 HIS U 187 3401 CUO U S 3 FT #HET 69 2989 HIS U 187 3401 CUO U S 13 FT #HET 118 3038 ILE U 196 2101 CUO Y B 1 FT #HET 119 3039 ASP U 196 2101 CUO Y B 1 FT #HET 121 3041 ALA U 196 2101 CUO Y B 11 FT #HET 122 3042 ASP U 196 2101 CUO Y A 30 FT #HET 123 3043 THR U 196 2101 CUO Y B 6 FT #HET 169 3089 HIS U 187 3401 CUO U S 7 FT #HET 173 3093 HIS U 187 3401 CUO U S 5 FT #HET 196 3116 PHE U 187 3401 CUO U S 5 FT #HET 199 3119 HIS U 187 3401 CUO U S 1 FT #HET 200 3120 HIS U 187 3401 CUO U S 10 FT DISORDER 1 7 FT DISORDER 132 141 FT DISORDER 313 319 FT DISORDER 391 394 CC SEQUENCE 366 AA (ATOM); CC NSLTPSEIEN LRNALAAVQA DKTDAGYQKI ASFHGMPLSC QYPDGTAFAC CQHGMVTFPH CC WHRLYMKQME DALKAKGAKI GIPYWDWTTA FHSLPILVTE PKNNPFHHGY IDVADTKTTR CC DPRPQSFFYR QIAFALEQRD FCDFEIQFEM GHNAIHSWVG GPSPYGMSTL HYTSYDPLFY CC VHHSNTDRIW AIWQALQKYR GLPYNSANCE INKLKKPMMP FSSEDNPNEV TKAHSTGYKS CC FDYQQLNYEY DNLNFHGMTI PQLEVHLKKI QEKDRVFAGF LLRAIGQSAD VNFDVTFGGT CC FCVLGGDYEM PWAFDRLFLY DISKSLVHLR LDAHDDFDIK VTIMGIDGKS LPPNLLPSPT CC ILFKPG CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMVRKNVNSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ATOM -------NSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ******************************************* CC SEQRES DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ATOM DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ************************************************** CC SEQRES LPILVTEPKNNPFHHGYIDVADTKTTRDPRPQLFDDPEQGDQSFFYRQIA CC ATOM LPILVTEPKNNPFHHGYIDVADTKTTRDPRP----------QSFFYRQIA CC ******************************* ********* CC SEQRES FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ATOM FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ************************************************** CC SEQRES SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ATOM SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ************************************************** CC SEQRES HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ATOM HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ************************************************** CC SEQRES AIGQSADVNFDVCRKDGECTFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ATOM AIGQSADVNFDV-------TFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ************ ******************************* CC SEQRES VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPGTGKI CC ATOM VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPG---- CC **************************************** SQ SEQUENCE 394 AA; MW; CN; SMVRKNVNSL TPSEIENLRN ALAAVQADKT DAGYQKIASF HGMPLSCQYP DGTAFACCQH GMVTFPHWHR LYMKQMEDAL KAKGAKIGIP YWDWTTAFHS LPILVTEPKN NPFHHGYIDV ADTKTTRDPR PQLFDDPEQG DQSFFYRQIA FALEQRDFCD FEIQFEMGHN AIHSWVGGPS PYGMSTLHYT SYDPLFYVHH SNTDRIWAIW QALQKYRGLP YNSANCEINK LKKPMMPFSS EDNPNEVTKA HSTGYKSFDY QQLNYEYDNL NFHGMTIPQL EVHLKKIQEK DRVFAGFLLR AIGQSADVNF DVCRKDGECT FGGTFCVLGG DYEMPWAFDR LFLYDISKSL VHLRLDAHDD FDIKVTIMGI DGKSLPPNLL PSPTILFKPG TGKI // ID 4YD9V STANDARD; PRT; 2000 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 1070 1070 ALA V 871 2871 PHE T Protein B 2 FT #SUB 1071 1071 ASN V 871 2871 PHE T Protein S 2 FT #SUB 1071 1071 ASN V 901 2901 LEU T Protein B 1 FT #SUB 1071 1071 ASN V 902 2902 ILE T Protein B 2 FT #SUB 1071 1071 ASN V 903 2903 TYR T Protein A 5 FT #SUB 1073 1073 ALA V 901 2901 LEU T Protein B 2 FT #SUB 1074 1074 ILE V 870 2870 LEU T Protein S 1 FT #SUB 1074 1074 ILE V 901 2901 LEU T Protein A 4 FT #SUB 1074 1074 ILE V 903 2903 TYR T Protein S 1 FT #SUB 1075 1075 GLU V 899 2899 PRO T Protein S 3 FT #SUB 1075 1075 GLU V 900 2900 SER T Protein S 6 FT #SUB 1075 1075 GLU V 901 2901 LEU T Protein A 4 FT #SUB 1078 1078 ARG V 869 2869 ALA T Protein S 1 FT #SUB 1078 1078 ARG V 870 2870 LEU T Protein S 3 FT #SUB 1078 1078 ARG V 872 2872 GLU T Protein S 3 FT #SUB 1078 1078 ARG V 873 2873 HIS T Protein A 5 FT #SUB 1078 1078 ARG V 875 2875 SER T Protein S 4 FT #SUB 1078 1078 ARG V 876 2876 LYS T Protein S 1 FT #SUB 1078 1078 ARG V 877 2877 ILE T Protein S 9 FT #SUB 1101 1101 VAL V 873 2873 HIS T Protein S 2 FT #SUB 1103 1103 PHE V 871 2871 PHE T Protein S 6 FT #SUB 1104 1104 ASP V 873 2873 HIS T Protein S 1 FT #SUB 1147 1147 MET V 848 2848 PHE T Protein S 1 FT #SUB 1147 1147 MET V 849 2849 ASP T Protein S 4 FT #SUB 1187 1187 LYS V 809 2809 LEU T Protein S 2 FT #SUB 1188 1188 PHE V 809 2809 LEU T Protein B 2 FT #SUB 1189 1189 ASP V 809 2809 LEU T Protein B 1 FT #SUB 1189 1189 ASP V 850 2850 ARG T Protein B 1 FT #SUB 1189 1189 ASP V 851 2851 ASN T Protein A 4 FT #SUB 1191 1191 PRO V 849 2849 ASP T Protein S 3 FT #SUB 1207 1207 TYR V 739 2739 LEU T Protein S 3 FT #SUB 1208 1208 THR V 743 2743 ALA T Protein B 1 FT #SUB 1210 1210 LYS V 743 2743 ALA T Protein B 1 FT #SUB 1210 1210 LYS V 744 2744 PHE T Protein A 4 FT #SUB 1210 1210 LYS V 745 2745 PRO T Protein S 1 FT #SUB 1211 1211 TYR V 739 2739 LEU T Protein S 2 FT #SUB 1211 1211 TYR V 740 2740 ASP T Protein B 1 FT #SUB 1211 1211 TYR V 744 2744 PHE T Protein B 6 FT #SUB 1212 1212 HIS V 740 2740 ASP T Protein A 2 FT #SUB 1212 1212 HIS V 744 2744 PHE T Protein A 3 FT #SUB 1230 1230 LEU V 847 2847 HIS T Protein S 6 FT #SUB 1231 1231 VAL V 740 2740 ASP T Protein B 1 FT #SUB 1232 1232 THR V 738 2738 ALA T Protein B 1 FT #SUB 1232 1232 THR V 740 2740 ASP T Protein A 6 FT #SUB 1232 1232 THR V 741 2741 GLN T Protein A 5 FT #SUB 1233 1233 SER V 738 2738 ALA T Protein B 1 FT #SUB 1234 1234 VAL V 737 2737 CYS T Protein B 1 FT #SUB 1234 1234 VAL V 738 2738 ALA T Protein B 6 FT #SUB 1234 1234 VAL V 739 2739 LEU T Protein B 3 FT #SUB 1234 1234 VAL V 740 2740 ASP T Protein S 1 FT #SUB 1235 1235 ILE V 736 2736 TYR T Protein B 1 FT #SUB 1236 1236 TYR V 736 2736 TYR T Protein A 4 FT #SUB 1238 1238 PRO V 736 2736 TYR T Protein S 3 FT #SUB 1245 1245 GLU V 729 2729 LYS T Protein A 5 FT #SUB 1245 1245 GLU V 730 2730 LEU T Protein A 6 FT #SUB 1245 1245 GLU V 731 2731 PRO T Protein S 4 FT #SUB 1245 1245 GLU V 734 2734 LYS T Protein S 3 FT #SUB 1245 1245 GLU V 736 2736 TYR T Protein S 3 FT #SUB 1246 1246 GLY V 730 2730 LEU T Protein B 2 FT #SUB 1481 1481 GLU V 458 2458 PHE T Protein S 1 FT #SUB 1483 1483 ASN V 487 2487 VAL T Protein B 3 FT #SUB 1483 1483 ASN V 488 2488 ARG T Protein A 7 FT #SUB 1483 1483 ASN V 490 2490 PRO T Protein S 3 FT #SUB 1484 1484 CYS V 486 2486 ILE T Protein B 1 FT #SUB 1485 1485 ALA V 485 2485 THR T Protein A 2 FT #SUB 1485 1485 ALA V 486 2486 ILE T Protein B 3 FT #SUB 1485 1485 ALA V 487 2487 VAL T Protein B 1 FT #SUB 1486 1486 LEU V 458 2458 PHE T Protein S 4 FT #SUB 1486 1486 LEU V 486 2486 ILE T Protein A 6 FT #SUB 1487 1487 PRO V 486 2486 ILE T Protein S 4 FT #SUB 1490 1490 ASN V 461 2461 HIS T Protein S 5 FT #SUB 1558 1558 LEU V 439 2439 PHE T Protein S 1 FT #SUB 1558 1558 LEU V 440 2440 ASP T Protein S 3 FT #SUB 1602 1602 GLN V 401 2401 GLU T Protein S 1 FT #SUB 1603 1603 PHE V 399 2399 LEU T Protein B 1 FT #SUB 1604 1604 ASP V 441 2441 ARG T Protein B 1 FT #SUB 1604 1604 ASP V 442 2442 LEU T Protein A 7 FT #SUB 1605 1605 ARG V 440 2440 ASP T Protein B 2 FT #SUB 1606 1606 LEU V 440 2440 ASP T Protein A 7 FT #SUB 1622 1622 GLN V 326 2326 ILE T Protein B 1 FT #SUB 1623 1623 ASN V 321 2321 GLU T Protein S 2 FT #SUB 1625 1625 HIS V 330 2330 SER T Protein S 2 FT #SUB 1625 1625 HIS V 353 2353 LYS T Protein S 1 FT #SUB 1648 1648 PRO V 327 2327 GLU T Protein B 2 FT #SUB 1649 1649 THR V 327 2327 GLU T Protein A 8 FT #SUB 1650 1650 ILE V 325 2325 ALA T Protein B 3 FT #SUB 1650 1650 ILE V 326 2326 ILE T Protein B 5 FT #SUB 1650 1650 ILE V 327 2327 GLU T Protein A 3 FT #SUB 1651 1651 ILE V 323 2323 ASN T Protein A 3 FT #SUB 1652 1652 PHE V 323 2323 ASN T Protein A 10 FT #SUB 1652 1652 PHE V 326 2326 ILE T Protein A 2 FT #SUB 1654 1654 PRO V 323 2323 ASN T Protein S 2 FT #SUB 416 416 ASN V 279 3199 GLN U Protein B 1 FT #SUB 1533 1533 ALA V 354 3274 ARG U Protein B 2 FT #SUB 1535 1535 LEU V 352 3272 HIS U Protein S 3 FT #SUB 1539 1539 GLN V 348 3268 LYS U Protein S 3 FT #SUB 1542 1542 ARG V 352 3272 HIS U Protein S 2 FT #SUB 1543 1543 ILE V 352 3272 HIS U Protein S 2 FT #SUB 328 328 LYS V 1571 1571 TYR Y Protein S 2 FT #SUB 328 328 LYS V 1583 1583 CYS Y Protein B 1 FT #SUB 328 328 LYS V 1631 1631 GLU Y Protein S 1 FT #SUB 329 329 ARG V 1572 1572 ILE Y Protein S 3 FT #SUB 329 329 ARG V 1573 1573 CYS Y Protein S 3 FT #SUB 329 329 ARG V 1574 1574 VAL Y Protein S 5 FT #SUB 329 329 ARG V 1582 1582 ASN Y Protein S 7 FT #SUB 329 329 ARG V 1583 1583 CYS Y Protein A 14 FT #SUB 329 329 ARG V 1584 1584 GLY Y Protein A 14 FT #SUB 329 329 ARG V 1585 1585 ASN Y Protein S 5 FT #SUB 330 330 GLN V 1582 1582 ASN Y Protein B 1 FT #SUB 330 330 GLN V 1583 1583 CYS Y Protein B 1 FT #SUB 331 331 SER V 1581 1581 GLU Y Protein A 2 FT #SUB 335 335 ASP V 1579 1579 GLY Y Protein S 7 FT #SUB 471 471 PRO V 1638 1638 SER Y Protein B 2 FT #SUB 472 472 ASP V 1638 1638 SER Y Protein A 9 FT #SUB 472 472 ASP V 1640 1640 ILE Y Protein S 2 FT #SUB 1365 1365 TYR V 1437 1437 ARG Y Protein S 14 FT #SUB 1372 1372 ILE V 1390 1390 ASP Y Protein S 2 FT #SUB 1372 1372 ILE V 1392 1392 ASP Y Protein S 1 FT #SUB 1372 1372 ILE V 1438 1438 ASP Y Protein S 3 FT #SUB 1390 1390 ASP V 1372 1372 ILE Y Protein S 4 FT #SUB 1437 1437 ARG V 1365 1365 TYR Y Protein S 14 FT #SUB 1438 1438 ASP V 1372 1372 ILE Y Protein S 4 FT #SUB 1571 1571 TYR V 328 328 LYS Y Protein S 8 FT #SUB 1578 1578 ILE V 335 335 ASP Y Protein A 4 FT #SUB 1579 1579 GLY V 335 335 ASP Y Protein B 3 FT #SUB 1581 1581 GLU V 330 330 GLN Y Protein S 3 FT #SUB 1582 1582 ASN V 329 329 ARG Y Protein B 2 FT #SUB 1582 1582 ASN V 330 330 GLN Y Protein S 1 FT #SUB 1582 1582 ASN V 331 331 SER Y Protein B 1 FT #SUB 1583 1583 CYS V 329 329 ARG Y Protein A 14 FT #SUB 1583 1583 CYS V 330 330 GLN Y Protein S 1 FT #SUB 1584 1584 GLY V 329 329 ARG Y Protein B 2 FT #SUB 1585 1585 ASN V 329 329 ARG Y Protein S 2 FT #SUB 1631 1631 GLU V 328 328 LYS Y Protein S 5 FT #SUB 1638 1638 SER V 470 470 ARG Y Protein S 1 FT #SUB 1638 1638 SER V 472 472 ASP Y Protein A 3 FT #SUB 1639 1639 LYS V 472 472 ASP Y Protein B 5 FT #SUB 1640 1640 ILE V 472 472 ASP Y Protein B 3 FT #SUB 1640 1640 ILE V 473 473 ALA Y Protein B 1 FT #SUB 106 106 GLN V 609 2609 ALA Z Protein B 1 FT #SUB 107 107 HIS V 625 2625 LEU Z Protein S 4 FT #SUB 108 108 PRO V 625 2625 LEU Z Protein S 2 FT #SUB 108 108 PRO V 626 2626 ARG Z Protein S 1 FT #SUB 109 109 LEU V 626 2626 ARG Z Protein S 2 FT #SUB 109 109 LEU V 635 2635 TYR Z Protein S 1 FT #SUB 109 109 LEU V 636 2636 THR Z Protein S 1 FT #SUB 111 111 ILE V 637 2637 VAL Z Protein S 1 FT #SUB 111 111 ILE V 691 2691 GLN Z Protein S 3 FT #SUB 112 112 ASP V 691 2691 GLN Z Protein B 1 FT #SUB 115 115 GLY V 691 2691 GLN Z Protein B 3 FT #SUB 115 115 GLY V 887 2887 ASP Z Protein B 1 FT #SUB 116 116 LYS V 693 2693 TYR Z Protein B 2 FT #SUB 117 117 LYS V 632 2632 GLU Z Protein S 3 FT #SUB 117 117 LYS V 634 2634 THR Z Protein S 5 FT #SUB 117 117 LYS V 693 2693 TYR Z Protein S 4 FT #SUB 118 118 ALA V 634 2634 THR Z Protein B 1 FT #SUB 118 118 ALA V 637 2637 VAL Z Protein S 1 FT #SUB 124 124 TYR V 610 2610 ASP Z Protein S 5 FT #SUB 136 136 ALA V 619 2619 VAL Z Protein S 2 FT #SUB 138 138 ALA V 612 2612 TYR Z Protein S 3 FT #SUB 138 138 ALA V 619 2619 VAL Z Protein A 2 FT #SUB 189 189 ARG V 612 2612 TYR Z Protein S 8 FT #SUB 189 189 ARG V 617 2617 ASP Z Protein B 1 FT #SUB 190 190 PHE V 617 2617 ASP Z Protein S 1 FT #SUB 190 190 PHE V 619 2619 VAL Z Protein S 2 FT #SUB 191 191 THR V 617 2617 ASP Z Protein S 1 FT #SUB 387 387 ASN V 136 2136 THR c Protein B 1 FT #SUB 388 388 GLY V 136 2136 THR c Protein B 6 FT #SUB 389 389 THR V 136 2136 THR c Protein A 10 FT #SUB 389 389 THR V 138 2138 LYS c Protein S 1 FT #HET 41 41 HIS V 188 5001 CUO V S 7 FT #HET 58 58 CYS V 188 5001 CUO V S 1 FT #HET 60 60 HIS V 188 5001 CUO V S 6 FT #HET 65 65 PHE V 188 5001 CUO V S 1 FT #HET 69 69 HIS V 188 5001 CUO V S 8 FT #HET 179 179 HIS V 188 5001 CUO V S 5 FT #HET 183 183 HIS V 188 5001 CUO V S 7 FT #HET 206 206 PHE V 188 5001 CUO V S 3 FT #HET 210 210 HIS V 188 5001 CUO V S 8 FT #HET 311 311 ILE V 91 2 NAG DA B 1 FT #HET 313 313 THR V 90 1 NAG DA S 5 FT #HET 313 313 THR V 91 2 NAG DA S 1 FT #HET 389 389 THR V 90 1 NAG DA S 3 FT #HET 391 391 MET V 91 2 NAG DA S 2 FT #HET 462 462 HIS V 189 5002 CUO V S 7 FT #HET 480 480 CYS V 189 5002 CUO V S 1 FT #HET 482 482 HIS V 189 5002 CUO V S 6 FT #HET 487 487 PHE V 189 5002 CUO V S 1 FT #HET 491 491 HIS V 189 5002 CUO V S 10 FT #HET 603 603 HIS V 189 5002 CUO V S 5 FT #HET 607 607 HIS V 189 5002 CUO V S 7 FT #HET 630 630 PHE V 189 5002 CUO V S 2 FT #HET 634 634 HIS V 189 5002 CUO V S 8 FT #HET 740 740 THR V 93 1 NAG EA S 3 FT #HET 763 763 LEU V 189 5002 CUO V S 1 FT #HET 804 804 ASP V 93 1 NAG EA S 8 FT #HET 808 808 THR V 93 1 NAG EA S 3 FT #HET 810 810 LEU V 93 1 NAG EA S 2 FT #HET 810 810 LEU V 94 2 NAG EA S 1 FT #HET 877 877 HIS V 190 5003 CUO V S 7 FT #HET 895 895 CYS V 190 5003 CUO V S 1 FT #HET 897 897 HIS V 190 5003 CUO V S 3 FT #HET 902 902 PHE V 190 5003 CUO V S 1 FT #HET 906 906 HIS V 190 5003 CUO V S 11 FT #HET 1015 1015 HIS V 190 5003 CUO V S 6 FT #HET 1019 1019 HIS V 190 5003 CUO V S 6 FT #HET 1042 1042 PHE V 190 5003 CUO V S 1 FT #HET 1046 1046 HIS V 190 5003 CUO V S 7 FT #HET 1294 1294 HIS V 191 5004 CUO V S 6 FT #HET 1298 1298 ALA V 97 2 NAG FA B 3 FT #HET 1299 1299 GLN V 96 1 NAG FA A 10 FT #HET 1299 1299 GLN V 97 2 NAG FA B 1 FT #HET 1301 1301 PRO V 96 1 NAG FA S 4 FT #HET 1312 1312 CYS V 191 5004 CUO V S 1 FT #HET 1314 1314 HIS V 191 5004 CUO V S 4 FT #HET 1323 1323 HIS V 191 5004 CUO V S 9 FT #HET 1427 1427 HIS V 191 5004 CUO V S 5 FT #HET 1431 1431 HIS V 191 5004 CUO V S 6 FT #HET 1454 1454 PHE V 191 5004 CUO V S 7 FT #HET 1458 1458 HIS V 191 5004 CUO V S 8 FT #HET 1494 1494 ARG V 96 1 NAG FA S 2 FT #HET 1500 1500 THR V 96 1 NAG FA B 4 FT #HET 1563 1563 LYS V 101 2 NAG GA S 3 FT #HET 1564 1564 ALA V 100 1 NAG GA S 1 FT #HET 1639 1639 LYS V 100 1 NAG GA S 3 FT #HET 1641 1641 THR V 101 2 NAG GA S 2 FT #MOD 387 387 ASN V 90 1 NAG DA S FT #MOD 806 806 ASN V 93 1 NAG EA S FT #MOD 1498 1498 ASN V 96 1 NAG FA S FT #MOD 1636 1636 ASN V 100 1 NAG GA S FT DISORDER 1657 2000 CC SEQUENCE 1656 AA (ATOM); CC NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH CC GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK CC NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN CC PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF CC HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA CC RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH CC EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS CC SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC CC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET CC KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE CC VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA CC NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV CC EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE CC ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR CC KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA CC TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF CC SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA CC LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP CC MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN CC RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM CC DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR CC KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP CC HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG CC KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH CC SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST CC ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH CC GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL CC NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSD CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ATOM NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ************************************************** CC SEQRES DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ATOM DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ************************************************** CC SEQRES LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ATOM LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ************************************************** CC SEQRES QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ATOM QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ************************************************** CC SEQRES SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ATOM SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ************************************************** CC SEQRES TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ATOM TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ************************************************** CC SEQRES RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ATOM RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ************************************************** CC SEQRES PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ATOM PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ************************************************** CC SEQRES VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ATOM VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ************************************************** CC SEQRES SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ATOM SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ************************************************** CC SEQRES ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ATOM ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ************************************************** CC SEQRES EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ATOM EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ************************************************** CC SEQRES VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ATOM VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ************************************************** CC SEQRES RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ATOM RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ************************************************** CC SEQRES YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ATOM YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ************************************************** CC SEQRES GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ATOM GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ************************************************** CC SEQRES DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ATOM DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ************************************************** CC SEQRES MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ATOM MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ************************************************** CC SEQRES TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ATOM TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ************************************************** CC SEQRES NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ATOM NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ************************************************** CC SEQRES QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ATOM QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ************************************************** CC SEQRES RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ATOM RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ************************************************** CC SEQRES VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ATOM VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ************************************************** CC SEQRES LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ATOM LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ************************************************** CC SEQRES DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ATOM DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ************************************************** CC SEQRES SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ATOM SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ************************************************** CC SEQRES PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ATOM PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ************************************************** CC SEQRES PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ATOM PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ************************************************** CC SEQRES RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ATOM RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ************************************************** CC SEQRES DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ATOM DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ************************************************** CC SEQRES ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ATOM ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ************************************************** CC SEQRES DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ATOM DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ************************************************** CC SEQRES WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ATOM WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ************************************************** CC SEQRES IFVPSDVEFEEDTWRDVVTSANRIRRNLKDLSKEDMFSLRAAFKRMTDDG CC ATOM IFVPSD-------------------------------------------- CC ****** CC SEQRES RYEEIAAFHGLPAQCPNADGSNIHTCCLHGMPTFPHWHRLYLSLVENELL CC ATOM -------------------------------------------------- CC CC SEQRES ARGSDVAVPYWDWIEPFDSLPGLISDETYKHPKTNEDIENPFHHGKISFA CC ATOM -------------------------------------------------- CC CC SEQRES DAVTVRKPRDQLFNNRYLYEHALFAFEHTDFCDFEVHFEVLHNSIHSWIG CC ATOM -------------------------------------------------- CC CC SEQRES GPNPHSMSSLDFAAYDPIFFLHHSTVDRLWAIWQDLQRYRKLDYNVANCA CC ATOM -------------------------------------------------- CC CC SEQRES LNLLNDPMRPFNNKTANQDHLTFTNSRPNDVFDYQNSLNYKFDSLSFSGL CC ATOM -------------------------------------------------- CC CC SEQRES SIPRLDDLLESRQSHDRVFAGFWLSGIKASADVNIHICVPIGVEHEDCDN CC ATOM -------------------------------------------------- CC CC SEQRES CC ATOM CC SQ SEQUENCE 2000 AA; MW; CN; NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSDVEFE EDTWRDVVTS ANRIRRNLKD LSKEDMFSLR AAFKRMTDDG RYEEIAAFHG LPAQCPNADG SNIHTCCLHG MPTFPHWHRL YLSLVENELL ARGSDVAVPY WDWIEPFDSL PGLISDETYK HPKTNEDIEN PFHHGKISFA DAVTVRKPRD QLFNNRYLYE HALFAFEHTD FCDFEVHFEV LHNSIHSWIG GPNPHSMSSL DFAAYDPIFF LHHSTVDRLW AIWQDLQRYR KLDYNVANCA LNLLNDPMRP FNNKTANQDH LTFTNSRPND VFDYQNSLNY KFDSLSFSGL SIPRLDDLLE SRQSHDRVFA GFWLSGIKAS ADVNIHICVP IGVEHEDCDN // ID 4YD9W STANDARD; PRT; 920 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 136 2136 THR W 388 388 GLY P Protein S 5 FT #SUB 136 2136 THR W 389 389 THR P Protein A 10 FT #SUB 321 2321 GLU W 1623 1623 ASN S Protein S 1 FT #SUB 323 2323 ASN W 1650 1650 ILE S Protein B 1 FT #SUB 323 2323 ASN W 1651 1651 ILE S Protein B 3 FT #SUB 323 2323 ASN W 1652 1652 PHE S Protein A 11 FT #SUB 323 2323 ASN W 1654 1654 PRO S Protein S 2 FT #SUB 324 2324 CYS W 1650 1650 ILE S Protein B 1 FT #SUB 325 2325 ALA W 1649 1649 THR S Protein B 2 FT #SUB 325 2325 ALA W 1650 1650 ILE S Protein B 3 FT #SUB 326 2326 ILE W 1622 1622 GLN S Protein S 1 FT #SUB 326 2326 ILE W 1650 1650 ILE S Protein A 5 FT #SUB 326 2326 ILE W 1652 1652 PHE S Protein S 2 FT #SUB 327 2327 GLU W 1648 1648 PRO S Protein S 3 FT #SUB 327 2327 GLU W 1649 1649 THR S Protein S 5 FT #SUB 327 2327 GLU W 1650 1650 ILE S Protein S 3 FT #SUB 330 2330 SER W 1625 1625 HIS S Protein S 1 FT #SUB 353 2353 LYS W 1625 1625 HIS S Protein S 2 FT #SUB 399 2399 LEU W 1603 1603 PHE S Protein S 1 FT #SUB 399 2399 LEU W 1604 1604 ASP S Protein S 1 FT #SUB 439 2439 PHE W 1558 1558 LEU S Protein B 1 FT #SUB 440 2440 ASP W 1558 1558 LEU S Protein B 1 FT #SUB 440 2440 ASP W 1605 1605 ARG S Protein B 2 FT #SUB 440 2440 ASP W 1606 1606 LEU S Protein A 6 FT #SUB 441 2441 ARG W 1606 1606 LEU S Protein S 2 FT #SUB 442 2442 LEU W 1604 1604 ASP S Protein A 8 FT #SUB 458 2458 PHE W 1483 1483 ASN S Protein S 1 FT #SUB 458 2458 PHE W 1486 1486 LEU S Protein A 3 FT #SUB 461 2461 HIS W 1490 1490 ASN S Protein A 9 FT #SUB 482 2482 PRO W 1602 1602 GLN S Protein S 2 FT #SUB 486 2486 ILE W 1484 1484 CYS S Protein B 3 FT #SUB 486 2486 ILE W 1485 1485 ALA S Protein B 1 FT #SUB 486 2486 ILE W 1486 1486 LEU S Protein A 4 FT #SUB 486 2486 ILE W 1487 1487 PRO S Protein A 5 FT #SUB 487 2487 VAL W 1483 1483 ASN S Protein B 1 FT #SUB 487 2487 VAL W 1484 1484 CYS S Protein A 3 FT #SUB 488 2488 ARG W 1483 1483 ASN S Protein A 9 FT #SUB 488 2488 ARG W 1486 1486 LEU S Protein A 2 FT #SUB 490 2490 PRO W 1483 1483 ASN S Protein S 2 FT #SUB 729 2729 LYS W 1245 1245 GLU S Protein B 2 FT #SUB 730 2730 LEU W 1245 1245 GLU S Protein S 1 FT #SUB 734 2734 LYS W 1207 1207 TYR S Protein S 2 FT #SUB 734 2734 LYS W 1208 1208 THR S Protein S 1 FT #SUB 736 2736 TYR W 1234 1234 VAL S Protein B 1 FT #SUB 736 2736 TYR W 1235 1235 ILE S Protein B 3 FT #SUB 736 2736 TYR W 1236 1236 TYR S Protein A 6 FT #SUB 736 2736 TYR W 1245 1245 GLU S Protein S 3 FT #SUB 737 2737 CYS W 1233 1233 SER S Protein B 1 FT #SUB 737 2737 CYS W 1234 1234 VAL S Protein B 2 FT #SUB 737 2737 CYS W 1235 1235 ILE S Protein B 1 FT #SUB 739 2739 LEU W 1207 1207 TYR S Protein S 7 FT #SUB 739 2739 LEU W 1211 1211 TYR S Protein S 3 FT #SUB 739 2739 LEU W 1234 1234 VAL S Protein A 4 FT #SUB 740 2740 ASP W 1232 1232 THR S Protein S 5 FT #SUB 740 2740 ASP W 1233 1233 SER S Protein S 6 FT #SUB 740 2740 ASP W 1234 1234 VAL S Protein S 1 FT #SUB 743 2743 ALA W 1209 1209 ASP S Protein S 1 FT #SUB 743 2743 ALA W 1210 1210 LYS S Protein A 2 FT #SUB 744 2744 PHE W 1164 1164 ASP S Protein S 3 FT #SUB 744 2744 PHE W 1211 1211 TYR S Protein S 3 FT #SUB 744 2744 PHE W 1212 1212 HIS S Protein S 1 FT #SUB 745 2745 PRO W 1210 1210 LYS S Protein S 3 FT #SUB 809 2809 LEU W 1187 1187 LYS S Protein S 1 FT #SUB 809 2809 LEU W 1188 1188 PHE S Protein S 2 FT #SUB 809 2809 LEU W 1189 1189 ASP S Protein S 1 FT #SUB 811 2811 HIS W 1187 1187 LYS S Protein S 2 FT #SUB 847 2847 HIS W 1230 1230 LEU S Protein S 1 FT #SUB 848 2848 PHE W 1147 1147 MET S Protein B 1 FT #SUB 849 2849 ASP W 1147 1147 MET S Protein B 3 FT #SUB 849 2849 ASP W 1191 1191 PRO S Protein A 3 FT #SUB 849 2849 ASP W 1233 1233 SER S Protein S 3 FT #SUB 850 2850 ARG W 1189 1189 ASP S Protein B 1 FT #SUB 851 2851 ASN W 1189 1189 ASP S Protein S 4 FT #SUB 869 2869 ALA W 1078 1078 ARG S Protein B 1 FT #SUB 870 2870 LEU W 1074 1074 ILE S Protein B 1 FT #SUB 870 2870 LEU W 1078 1078 ARG S Protein B 2 FT #SUB 871 2871 PHE W 1070 1070 ALA S Protein S 1 FT #SUB 871 2871 PHE W 1071 1071 ASN S Protein S 3 FT #SUB 871 2871 PHE W 1074 1074 ILE S Protein S 1 FT #SUB 871 2871 PHE W 1103 1103 PHE S Protein A 8 FT #SUB 872 2872 GLU W 1078 1078 ARG S Protein B 5 FT #SUB 873 2873 HIS W 1078 1078 ARG S Protein B 2 FT #SUB 873 2873 HIS W 1101 1101 VAL S Protein S 4 FT #SUB 873 2873 HIS W 1104 1104 ASP S Protein S 4 FT #SUB 875 2875 SER W 1078 1078 ARG S Protein B 4 FT #SUB 876 2876 LYS W 1078 1078 ARG S Protein B 2 FT #SUB 877 2877 ILE W 1078 1078 ARG S Protein A 9 FT #SUB 897 2897 PRO W 1187 1187 LYS S Protein S 2 FT #SUB 899 2899 PRO W 1075 1075 GLU S Protein B 4 FT #SUB 900 2900 SER W 1075 1075 GLU S Protein A 3 FT #SUB 901 2901 LEU W 1071 1071 ASN S Protein B 1 FT #SUB 901 2901 LEU W 1073 1073 ALA S Protein B 1 FT #SUB 901 2901 LEU W 1074 1074 ILE S Protein A 4 FT #SUB 902 2902 ILE W 1071 1071 ASN S Protein B 1 FT #SUB 903 2903 TYR W 1071 1071 ASN S Protein A 7 FT #SUB 905 2905 PRO W 1071 1071 ASN S Protein S 1 FT #SUB 84 2084 ARG W 500 2500 ILE T Protein S 1 FT #SUB 84 2084 ARG W 502 2502 LEU T Protein S 3 FT #SUB 90 2090 LYS W 375 2375 GLY T Protein S 1 FT #SUB 94 2094 ARG W 94 2094 ARG T Protein S 5 FT #SUB 94 2094 ARG W 182 2182 TYR T Protein A 4 FT #SUB 94 2094 ARG W 370 2370 MET T Protein S 5 FT #SUB 96 2096 SER W 374 2374 ASN T Protein S 2 FT #SUB 97 2097 LEU W 235 2235 PRO T Protein S 2 FT #SUB 98 2098 GLN W 374 2374 ASN T Protein S 1 FT #SUB 99 2099 GLU W 375 2375 GLY T Protein S 1 FT #SUB 109 2109 ARG W 503 2503 ASN T Protein S 1 FT #SUB 109 2109 ARG W 504 2504 ARG T Protein S 1 FT #SUB 109 2109 ARG W 585 2585 GLY T Protein B 1 FT #SUB 112 2112 LYS W 583 2583 ARG T Protein S 3 FT #SUB 112 2112 LYS W 584 2584 HIS T Protein B 1 FT #SUB 113 2113 ASP W 519 2519 ASN T Protein S 3 FT #SUB 113 2113 ASP W 585 2585 GLY T Protein S 4 FT #SUB 114 2114 ARG W 526 2526 ARG T Protein S 5 FT #SUB 115 2115 SER W 519 2519 ASN T Protein S 2 FT #SUB 115 2115 SER W 522 2522 SER T Protein A 3 FT #SUB 116 2116 SER W 615 2615 TRP T Protein S 1 FT #SUB 121 2121 THR W 615 2615 TRP T Protein S 3 FT #SUB 124 2124 SER W 615 2615 TRP T Protein S 1 FT #SUB 125 2125 PHE W 615 2615 TRP T Protein S 1 FT #SUB 131 2131 LEU W 615 2615 TRP T Protein S 1 FT #SUB 167 2167 ASN W 502 2502 LEU T Protein B 1 FT #SUB 168 2168 ARG W 502 2502 LEU T Protein A 2 FT #SUB 168 2168 ARG W 503 2503 ASN T Protein B 3 FT #SUB 168 2168 ARG W 515 2515 ARG T Protein S 13 FT #SUB 168 2168 ARG W 587 2587 SER T Protein B 4 FT #SUB 169 2169 HIS W 503 2503 ASN T Protein B 2 FT #SUB 169 2169 HIS W 585 2585 GLY T Protein S 1 FT #SUB 169 2169 HIS W 587 2587 SER T Protein S 5 FT #SUB 170 2170 GLY W 503 2503 ASN T Protein B 2 FT #SUB 182 2182 TYR W 94 2094 ARG T Protein S 7 FT #SUB 182 2182 TYR W 96 2096 SER T Protein S 1 FT #SUB 199 2199 PRO W 235 2235 PRO T Protein B 1 FT #SUB 199 2199 PRO W 236 2236 ALA T Protein B 2 FT #SUB 199 2199 PRO W 237 2237 LEU T Protein B 2 FT #SUB 200 2200 PHE W 237 2237 LEU T Protein B 8 FT #SUB 235 2235 PRO W 97 2097 LEU T Protein S 2 FT #SUB 235 2235 PRO W 199 2199 PRO T Protein B 2 FT #SUB 236 2236 ALA W 199 2199 PRO T Protein B 3 FT #SUB 237 2237 LEU W 199 2199 PRO T Protein B 2 FT #SUB 237 2237 LEU W 200 2200 PHE T Protein A 7 FT #SUB 244 2244 PHE W 98 2098 GLN T Protein S 1 FT #SUB 341 2341 PRO W 614 2614 ALA T Protein S 2 FT #SUB 342 2342 TYR W 614 2614 ALA T Protein S 1 FT #SUB 342 2342 TYR W 615 2615 TRP T Protein B 2 FT #SUB 345 2345 ASN W 515 2515 ARG T Protein A 2 FT #SUB 346 2346 PRO W 515 2515 ARG T Protein S 1 FT #SUB 370 2370 MET W 94 2094 ARG T Protein S 7 FT #SUB 370 2370 MET W 370 2370 MET T Protein S 1 FT #SUB 374 2374 ASN W 96 2096 SER T Protein A 3 FT #SUB 374 2374 ASN W 98 2098 GLN T Protein S 3 FT #SUB 375 2375 GLY W 90 2090 LYS T Protein B 1 FT #SUB 375 2375 GLY W 99 2099 GLU T Protein B 3 FT #SUB 500 2500 ILE W 84 2084 ARG T Protein B 1 FT #SUB 502 2502 LEU W 84 2084 ARG T Protein S 1 FT #SUB 502 2502 LEU W 167 2167 ASN T Protein S 2 FT #SUB 502 2502 LEU W 168 2168 ARG T Protein S 3 FT #SUB 503 2503 ASN W 109 2109 ARG T Protein S 1 FT #SUB 503 2503 ASN W 168 2168 ARG T Protein A 4 FT #SUB 503 2503 ASN W 169 2169 HIS T Protein S 2 FT #SUB 503 2503 ASN W 170 2170 GLY T Protein S 2 FT #SUB 513 2513 GLU W 346 2346 PRO T Protein S 1 FT #SUB 514 2514 SER W 344 2344 LEU T Protein A 3 FT #SUB 515 2515 ARG W 168 2168 ARG T Protein S 10 FT #SUB 515 2515 ARG W 344 2344 LEU T Protein A 6 FT #SUB 515 2515 ARG W 345 2345 ASN T Protein S 1 FT #SUB 515 2515 ARG W 346 2346 PRO T Protein S 6 FT #SUB 516 2516 ASP W 168 2168 ARG T Protein S 1 FT #SUB 519 2519 ASN W 113 2113 ASP T Protein S 3 FT #SUB 522 2522 SER W 115 2115 SER T Protein S 3 FT #SUB 526 2526 ARG W 114 2114 ARG T Protein S 8 FT #SUB 582 2582 ARG W 109 2109 ARG T Protein B 2 FT #SUB 584 2584 HIS W 112 2112 LYS T Protein B 1 FT #SUB 585 2585 GLY W 109 2109 ARG T Protein B 1 FT #SUB 585 2585 GLY W 113 2113 ASP T Protein B 7 FT #SUB 585 2585 GLY W 169 2169 HIS T Protein B 2 FT #SUB 587 2587 SER W 168 2168 ARG T Protein A 5 FT #SUB 587 2587 SER W 169 2169 HIS T Protein S 3 FT #SUB 615 2615 TRP W 116 2116 SER T Protein S 3 FT #SUB 615 2615 TRP W 121 2121 THR T Protein S 6 FT #SUB 615 2615 TRP W 125 2125 PHE T Protein S 1 FT #SUB 615 2615 TRP W 341 2341 PRO T Protein S 1 FT #SUB 615 2615 TRP W 342 2342 TYR T Protein S 9 FT #SUB 800 2800 LYS W 100 3020 SER X Protein S 1 FT #SUB 800 2800 LYS W 101 3021 LEU X Protein S 1 FT #SUB 800 2800 LYS W 103 3023 ILE X Protein A 6 FT #SUB 802 2802 ASP W 12 2932 PRO X Protein S 6 FT #SUB 867 2867 PRO W 12 2932 PRO X Protein S 5 FT #SUB 868 2868 GLU W 11 2931 THR X Protein S 1 FT #SUB 868 2868 GLU W 12 2932 PRO X Protein A 11 FT #SUB 868 2868 GLU W 13 2933 SER X Protein S 8 FT #SUB 868 2868 GLU W 14 2934 GLU X Protein S 1 FT #SUB 903 2903 TYR W 12 2932 PRO X Protein S 2 FT #SUB 609 2609 ALA W 106 106 GLN Y Protein S 1 FT #SUB 610 2610 ASP W 124 124 TYR Y Protein S 3 FT #SUB 610 2610 ASP W 137 137 ARG Y Protein S 1 FT #SUB 612 2612 TYR W 138 138 ALA Y Protein S 3 FT #SUB 612 2612 TYR W 189 189 ARG Y Protein S 7 FT #SUB 617 2617 ASP W 189 189 ARG Y Protein S 2 FT #SUB 617 2617 ASP W 190 190 PHE Y Protein B 2 FT #SUB 618 2618 SER W 190 190 PHE Y Protein B 1 FT #SUB 619 2619 VAL W 136 136 ALA Y Protein S 2 FT #SUB 619 2619 VAL W 138 138 ALA Y Protein S 2 FT #SUB 619 2619 VAL W 190 190 PHE Y Protein S 3 FT #SUB 625 2625 LEU W 107 107 HIS Y Protein S 2 FT #SUB 626 2626 ARG W 108 108 PRO Y Protein S 3 FT #SUB 626 2626 ARG W 109 109 LEU Y Protein S 1 FT #SUB 632 2632 GLU W 117 117 LYS Y Protein B 1 FT #SUB 634 2634 THR W 117 117 LYS Y Protein S 3 FT #SUB 635 2635 TYR W 109 109 LEU Y Protein S 1 FT #SUB 636 2636 THR W 109 109 LEU Y Protein B 1 FT #SUB 637 2637 VAL W 111 111 ILE Y Protein S 1 FT #SUB 637 2637 VAL W 118 118 ALA Y Protein S 1 FT #SUB 691 2691 GLN W 113 113 PRO Y Protein S 1 FT #SUB 693 2693 TYR W 116 116 LYS Y Protein S 2 FT #SUB 693 2693 TYR W 117 117 LYS Y Protein S 1 FT #HET 126 2126 HIS W 193 3001 CUO W S 5 FT #HET 138 2138 LYS W 65 1 NAG 3 S 2 FT #HET 138 2138 LYS W 66 2 NAG 3 S 1 FT #HET 144 2144 CYS W 193 3001 CUO W S 1 FT #HET 146 2146 HIS W 193 3001 CUO W S 7 FT #HET 151 2151 PHE W 193 3001 CUO W S 1 FT #HET 155 2155 HIS W 193 3001 CUO W S 8 FT #HET 267 2267 HIS W 193 3001 CUO W S 7 FT #HET 271 2271 HIS W 193 3001 CUO W S 6 FT #HET 294 2294 PHE W 193 3001 CUO W S 1 FT #HET 298 2298 HIS W 193 3001 CUO W S 7 FT #HET 405 2405 SER W 103 1 NAG HA S 4 FT #HET 429 2429 LEU W 193 3001 CUO W S 1 FT #HET 470 2470 ALA W 103 1 NAG HA S 1 FT #HET 474 2474 THR W 103 1 NAG HA S 2 FT #HET 476 2476 LEU W 104 2 NAG HA S 3 FT #HET 480 2480 LEU W 104 2 NAG HA S 1 FT #HET 543 2543 HIS W 194 3002 CUO W S 7 FT #HET 559 2559 CYS W 194 3002 CUO W S 1 FT #HET 561 2561 HIS W 194 3002 CUO W S 5 FT #HET 570 2570 HIS W 194 3002 CUO W S 11 FT #HET 680 2680 HIS W 194 3002 CUO W S 7 FT #HET 684 2684 HIS W 194 3002 CUO W S 7 FT #HET 707 2707 PHE W 194 3002 CUO W S 4 FT #HET 711 2711 HIS W 194 3002 CUO W S 9 FT #MOD 472 2472 ASN W 103 1 NAG HA S FT DISORDER 1 82 FT DISORDER 908 920 CC SEQUENCE 825 AA (ATOM); CC VRGNLVRKNV DRLSLQEINS LIHALKRMQK DRSSDGFETI ASFHALPPLC PNPTAKHRHA CC CCLHGMATFP QWHRLYVVQF EHSLNRHGAI VGVPYWDWTY PMTEVPGLLT SEKYTDPFTG CC IETFNPFNHG HISFISPETM TTREVSEHLF EQPALGKQTW LFNNIILALE QTDYCDFEVQ CC FEIVHNSIHS WLGGKELYSL NHLHYAAYDP AFFLHHSNVD RLWVVWQELQ KFRGLPAYES CC NCAIELMSQP LKPFSFGAPY NLNPVTTKYS KPSDVFNYKQ NFHYEYDMLE MNGMSIAQLE CC SYIRQERQKD RVFAGFLLEG FGSSAYATFQ VCPDVGDCYE GSHFSVLGGS TEMPWAFDRL CC YKMEITDILQ AMALKFDSHF TIKTKIVAHN GTELPESLLP EATIVRIPPS AQNLEVAIPL CC NRIRRNINSL ESRDVQNLMS ALKRLKEDES DFGFQTIAGY HGSLMCPTPE APEYACCLHG CC MPTFMHWHRV YLLHFEESMR RHGASVAVPY WDWTMPSDNL PSLLGDADYY DAWTDSVIEN CC PFLRGHIKYE DTYTVREIQP ELFALAEGQK ESTLFKDVML MFEQEDYCDF EVQAEVIHNS CC IHYLIGGHQK YAMSSLMFSS FDPIFYVHHS MVDRLWAIWQ ELQKHRKLPH DKAYCALDQM CC AFPMKPFIWE SNPNPTTRAV STPSKLFDYK SLGYDYDHLN FHGMSIGQLE ALIQKQKKAD CC RVFAGFLLHG IKISADVHLK ICIEADCQEA GVIFVLGGET EMPWHFDRNY KMDITDVLKK CC RNIPPEALFE HDSKIRLEVE IKSVDGAVLD PNSLPKPSLI YAPAK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES YAGTFAVLGGETEMPWAFDRLFRYEISDEMKKLQLTEDSKFRLTTNIIAS CC ATOM -------------------------------------------------- CC CC SEQRES NGSKVSNDIFPTPTVIFVPKKERTEQVSTTKSVRGNLVRKNVDRLSLQEI CC ATOM --------------------------------VRGNLVRKNVDRLSLQEI CC ****************** CC SEQRES NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ATOM NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ************************************************** CC SEQRES FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ATOM FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ************************************************** CC SEQRES TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ATOM TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ************************************************** CC SEQRES LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ATOM LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ************************************************** CC SEQRES VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ATOM VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ************************************************** CC SEQRES YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ATOM YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ************************************************** CC SEQRES EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ATOM EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ************************************************** CC SEQRES LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ATOM LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ************************************************** CC SEQRES PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ATOM PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ************************************************** CC SEQRES PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ATOM PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ************************************************** CC SEQRES NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ATOM NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ************************************************** CC SEQRES QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ATOM QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ************************************************** CC SEQRES SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ATOM SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ************************************************** CC SEQRES WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ATOM WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ************************************************** CC SEQRES ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ATOM ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ************************************************** CC SEQRES NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ATOM NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ************************************************** CC SEQRES LIYAPAKGLIIQQVGEYDAG CC ATOM LIYAPAK------------- CC ******* SQ SEQUENCE 920 AA; MW; CN; YAGTFAVLGG ETEMPWAFDR LFRYEISDEM KKLQLTEDSK FRLTTNIIAS NGSKVSNDIF PTPTVIFVPK KERTEQVSTT KSVRGNLVRK NVDRLSLQEI NSLIHALKRM QKDRSSDGFE TIASFHALPP LCPNPTAKHR HACCLHGMAT FPQWHRLYVV QFEHSLNRHG AIVGVPYWDW TYPMTEVPGL LTSEKYTDPF TGIETFNPFN HGHISFISPE TMTTREVSEH LFEQPALGKQ TWLFNNIILA LEQTDYCDFE VQFEIVHNSI HSWLGGKELY SLNHLHYAAY DPAFFLHHSN VDRLWVVWQE LQKFRGLPAY ESNCAIELMS QPLKPFSFGA PYNLNPVTTK YSKPSDVFNY KQNFHYEYDM LEMNGMSIAQ LESYIRQERQ KDRVFAGFLL EGFGSSAYAT FQVCPDVGDC YEGSHFSVLG GSTEMPWAFD RLYKMEITDI LQAMALKFDS HFTIKTKIVA HNGTELPESL LPEATIVRIP PSAQNLEVAI PLNRIRRNIN SLESRDVQNL MSALKRLKED ESDFGFQTIA GYHGSLMCPT PEAPEYACCL HGMPTFMHWH RVYLLHFEES MRRHGASVAV PYWDWTMPSD NLPSLLGDAD YYDAWTDSVI ENPFLRGHIK YEDTYTVREI QPELFALAEG QKESTLFKDV MLMFEQEDYC DFEVQAEVIH NSIHYLIGGH QKYAMSSLMF SSFDPIFYVH HSMVDRLWAI WQELQKHRKL PHDKAYCALD QMAFPMKPFI WESNPNPTTR AVSTPSKLFD YKSLGYDYDH LNFHGMSIGQ LEALIQKQKK ADRVFAGFLL HGIKISADVH LKICIEADCQ EAGVIFVLGG ETEMPWHFDR NYKMDITDVL KKRNIPPEAL FEHDSKIRLE VEIKSVDGAV LDPNSLPKPS LIYAPAKGLI IQQVGEYDAG // ID 4YD9X STANDARD; PRT; 394 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 276 3196 THR X 416 416 ASN S Protein A 12 FT #SUB 279 3199 GLN X 416 416 ASN S Protein S 4 FT #SUB 279 3199 GLN X 419 419 THR S Protein S 1 FT #SUB 351 3271 VAL X 1533 1533 ALA S Protein S 2 FT #SUB 352 3272 HIS X 1404 1404 TYR S Protein B 1 FT #SUB 352 3272 HIS X 1535 1535 LEU S Protein A 6 FT #SUB 352 3272 HIS X 1542 1542 ARG S Protein S 1 FT #SUB 352 3272 HIS X 1543 1543 ILE S Protein A 3 FT #SUB 354 3274 ARG X 1401 1401 ARG S Protein B 1 FT #SUB 354 3274 ARG X 1533 1533 ALA S Protein S 2 FT #SUB 359 3279 ASP X 1401 1401 ARG S Protein S 2 FT #SUB 11 2931 THR X 868 2868 GLU W Protein B 1 FT #SUB 12 2932 PRO X 802 2802 ASP W Protein A 6 FT #SUB 12 2932 PRO X 867 2867 PRO W Protein S 5 FT #SUB 12 2932 PRO X 868 2868 GLU W Protein B 11 FT #SUB 12 2932 PRO X 903 2903 TYR W Protein S 2 FT #SUB 13 2933 SER X 868 2868 GLU W Protein A 8 FT #SUB 14 2934 GLU X 868 2868 GLU W Protein B 1 FT #SUB 100 3020 SER X 800 2800 LYS W Protein S 1 FT #SUB 101 3021 LEU X 800 2800 LYS W Protein B 1 FT #SUB 103 3023 ILE X 800 2800 LYS W Protein S 6 FT #HET 41 2961 HIS X 195 3401 CUO X S 8 FT #HET 60 2980 HIS X 195 3401 CUO X S 3 FT #HET 69 2989 HIS X 195 3401 CUO X S 13 FT #HET 121 3041 ALA X 177 5015 CUO P B 6 FT #HET 122 3042 ASP X 177 5015 CUO P A 29 FT #HET 123 3043 THR X 177 5015 CUO P B 4 FT #HET 169 3089 HIS X 195 3401 CUO X S 7 FT #HET 173 3093 HIS X 195 3401 CUO X S 5 FT #HET 196 3116 PHE X 195 3401 CUO X S 5 FT #HET 199 3119 HIS X 195 3401 CUO X S 1 FT #HET 200 3120 HIS X 195 3401 CUO X S 10 FT DISORDER 1 7 FT DISORDER 132 141 FT DISORDER 313 319 FT DISORDER 391 394 CC SEQUENCE 366 AA (ATOM); CC NSLTPSEIEN LRNALAAVQA DKTDAGYQKI ASFHGMPLSC QYPDGTAFAC CQHGMVTFPH CC WHRLYMKQME DALKAKGAKI GIPYWDWTTA FHSLPILVTE PKNNPFHHGY IDVADTKTTR CC DPRPQSFFYR QIAFALEQRD FCDFEIQFEM GHNAIHSWVG GPSPYGMSTL HYTSYDPLFY CC VHHSNTDRIW AIWQALQKYR GLPYNSANCE INKLKKPMMP FSSEDNPNEV TKAHSTGYKS CC FDYQQLNYEY DNLNFHGMTI PQLEVHLKKI QEKDRVFAGF LLRAIGQSAD VNFDVTFGGT CC FCVLGGDYEM PWAFDRLFLY DISKSLVHLR LDAHDDFDIK VTIMGIDGKS LPPNLLPSPT CC ILFKPG CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMVRKNVNSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ATOM -------NSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ******************************************* CC SEQRES DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ATOM DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ************************************************** CC SEQRES LPILVTEPKNNPFHHGYIDVADTKTTRDPRPQLFDDPEQGDQSFFYRQIA CC ATOM LPILVTEPKNNPFHHGYIDVADTKTTRDPRP----------QSFFYRQIA CC ******************************* ********* CC SEQRES FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ATOM FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ************************************************** CC SEQRES SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ATOM SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ************************************************** CC SEQRES HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ATOM HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ************************************************** CC SEQRES AIGQSADVNFDVCRKDGECTFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ATOM AIGQSADVNFDV-------TFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ************ ******************************* CC SEQRES VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPGTGKI CC ATOM VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPG---- CC **************************************** SQ SEQUENCE 394 AA; MW; CN; SMVRKNVNSL TPSEIENLRN ALAAVQADKT DAGYQKIASF HGMPLSCQYP DGTAFACCQH GMVTFPHWHR LYMKQMEDAL KAKGAKIGIP YWDWTTAFHS LPILVTEPKN NPFHHGYIDV ADTKTTRDPR PQLFDDPEQG DQSFFYRQIA FALEQRDFCD FEIQFEMGHN AIHSWVGGPS PYGMSTLHYT SYDPLFYVHH SNTDRIWAIW QALQKYRGLP YNSANCEINK LKKPMMPFSS EDNPNEVTKA HSTGYKSFDY QQLNYEYDNL NFHGMTIPQL EVHLKKIQEK DRVFAGFLLR AIGQSADVNF DVCRKDGECT FGGTFCVLGG DYEMPWAFDR LFLYDISKSL VHLRLDAHDD FDIKVTIMGI DGKSLPPNLL PSPTILFKPG TGKI // ID 4YD9Y STANDARD; PRT; 2000 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 388 388 GLY Y 136 2136 THR T Protein B 5 FT #SUB 389 389 THR Y 136 2136 THR T Protein A 9 FT #SUB 391 391 MET Y 138 2138 LYS T Protein S 2 FT #SUB 328 328 LYS Y 1571 1571 TYR V Protein S 8 FT #SUB 328 328 LYS Y 1631 1631 GLU V Protein S 5 FT #SUB 329 329 ARG Y 1582 1582 ASN V Protein S 2 FT #SUB 329 329 ARG Y 1583 1583 CYS V Protein S 14 FT #SUB 329 329 ARG Y 1584 1584 GLY V Protein S 2 FT #SUB 329 329 ARG Y 1585 1585 ASN V Protein S 2 FT #SUB 330 330 GLN Y 1581 1581 GLU V Protein B 3 FT #SUB 330 330 GLN Y 1582 1582 ASN V Protein S 1 FT #SUB 330 330 GLN Y 1583 1583 CYS V Protein S 1 FT #SUB 331 331 SER Y 1582 1582 ASN V Protein S 1 FT #SUB 335 335 ASP Y 1578 1578 ILE V Protein S 4 FT #SUB 335 335 ASP Y 1579 1579 GLY V Protein S 3 FT #SUB 470 470 ARG Y 1638 1638 SER V Protein S 1 FT #SUB 472 472 ASP Y 1638 1638 SER V Protein S 3 FT #SUB 472 472 ASP Y 1639 1639 LYS V Protein S 5 FT #SUB 472 472 ASP Y 1640 1640 ILE V Protein S 3 FT #SUB 473 473 ALA Y 1640 1640 ILE V Protein S 1 FT #SUB 1365 1365 TYR Y 1437 1437 ARG V Protein S 14 FT #SUB 1372 1372 ILE Y 1390 1390 ASP V Protein S 4 FT #SUB 1372 1372 ILE Y 1438 1438 ASP V Protein S 4 FT #SUB 1390 1390 ASP Y 1372 1372 ILE V Protein S 2 FT #SUB 1392 1392 ASP Y 1372 1372 ILE V Protein S 1 FT #SUB 1437 1437 ARG Y 1365 1365 TYR V Protein S 14 FT #SUB 1438 1438 ASP Y 1372 1372 ILE V Protein S 3 FT #SUB 1571 1571 TYR Y 328 328 LYS V Protein S 2 FT #SUB 1572 1572 ILE Y 329 329 ARG V Protein B 3 FT #SUB 1573 1573 CYS Y 329 329 ARG V Protein B 3 FT #SUB 1574 1574 VAL Y 329 329 ARG V Protein B 5 FT #SUB 1579 1579 GLY Y 335 335 ASP V Protein B 7 FT #SUB 1581 1581 GLU Y 331 331 SER V Protein S 2 FT #SUB 1582 1582 ASN Y 329 329 ARG V Protein B 7 FT #SUB 1582 1582 ASN Y 330 330 GLN V Protein B 1 FT #SUB 1583 1583 CYS Y 328 328 LYS V Protein B 1 FT #SUB 1583 1583 CYS Y 329 329 ARG V Protein B 14 FT #SUB 1583 1583 CYS Y 330 330 GLN V Protein B 1 FT #SUB 1584 1584 GLY Y 329 329 ARG V Protein B 14 FT #SUB 1585 1585 ASN Y 329 329 ARG V Protein A 5 FT #SUB 1631 1631 GLU Y 328 328 LYS V Protein S 1 FT #SUB 1638 1638 SER Y 471 471 PRO V Protein S 2 FT #SUB 1638 1638 SER Y 472 472 ASP V Protein A 9 FT #SUB 1640 1640 ILE Y 472 472 ASP V Protein A 2 FT #SUB 106 106 GLN Y 609 2609 ALA W Protein B 1 FT #SUB 107 107 HIS Y 625 2625 LEU W Protein S 2 FT #SUB 108 108 PRO Y 626 2626 ARG W Protein S 3 FT #SUB 109 109 LEU Y 626 2626 ARG W Protein S 1 FT #SUB 109 109 LEU Y 635 2635 TYR W Protein S 1 FT #SUB 109 109 LEU Y 636 2636 THR W Protein S 1 FT #SUB 111 111 ILE Y 637 2637 VAL W Protein S 1 FT #SUB 113 113 PRO Y 691 2691 GLN W Protein B 1 FT #SUB 116 116 LYS Y 693 2693 TYR W Protein B 2 FT #SUB 117 117 LYS Y 632 2632 GLU W Protein S 1 FT #SUB 117 117 LYS Y 634 2634 THR W Protein A 3 FT #SUB 117 117 LYS Y 693 2693 TYR W Protein S 1 FT #SUB 118 118 ALA Y 637 2637 VAL W Protein S 1 FT #SUB 124 124 TYR Y 610 2610 ASP W Protein S 3 FT #SUB 136 136 ALA Y 619 2619 VAL W Protein S 2 FT #SUB 137 137 ARG Y 610 2610 ASP W Protein B 1 FT #SUB 138 138 ALA Y 612 2612 TYR W Protein S 3 FT #SUB 138 138 ALA Y 619 2619 VAL W Protein A 2 FT #SUB 189 189 ARG Y 612 2612 TYR W Protein S 7 FT #SUB 189 189 ARG Y 617 2617 ASP W Protein B 2 FT #SUB 190 190 PHE Y 617 2617 ASP W Protein S 2 FT #SUB 190 190 PHE Y 618 2618 SER W Protein S 1 FT #SUB 190 190 PHE Y 619 2619 VAL W Protein S 3 FT #SUB 1070 1070 ALA Y 871 2871 PHE c Protein B 1 FT #SUB 1071 1071 ASN Y 871 2871 PHE c Protein S 1 FT #SUB 1071 1071 ASN Y 902 2902 ILE c Protein B 3 FT #SUB 1071 1071 ASN Y 903 2903 TYR c Protein A 6 FT #SUB 1073 1073 ALA Y 901 2901 LEU c Protein B 2 FT #SUB 1074 1074 ILE Y 870 2870 LEU c Protein S 2 FT #SUB 1074 1074 ILE Y 871 2871 PHE c Protein S 3 FT #SUB 1074 1074 ILE Y 901 2901 LEU c Protein A 5 FT #SUB 1075 1075 GLU Y 899 2899 PRO c Protein S 2 FT #SUB 1075 1075 GLU Y 900 2900 SER c Protein S 8 FT #SUB 1075 1075 GLU Y 901 2901 LEU c Protein A 5 FT #SUB 1078 1078 ARG Y 870 2870 LEU c Protein S 2 FT #SUB 1078 1078 ARG Y 872 2872 GLU c Protein S 1 FT #SUB 1078 1078 ARG Y 873 2873 HIS c Protein A 3 FT #SUB 1078 1078 ARG Y 875 2875 SER c Protein S 2 FT #SUB 1078 1078 ARG Y 877 2877 ILE c Protein S 6 FT #SUB 1101 1101 VAL Y 873 2873 HIS c Protein S 4 FT #SUB 1103 1103 PHE Y 871 2871 PHE c Protein S 6 FT #SUB 1147 1147 MET Y 849 2849 ASP c Protein S 2 FT #SUB 1187 1187 LYS Y 847 2847 HIS c Protein S 4 FT #SUB 1188 1188 PHE Y 809 2809 LEU c Protein B 1 FT #SUB 1189 1189 ASP Y 809 2809 LEU c Protein B 1 FT #SUB 1189 1189 ASP Y 851 2851 ASN c Protein S 2 FT #SUB 1191 1191 PRO Y 849 2849 ASP c Protein S 3 FT #SUB 1207 1207 TYR Y 739 2739 LEU c Protein S 3 FT #SUB 1209 1209 ASP Y 743 2743 ALA c Protein B 2 FT #SUB 1210 1210 LYS Y 744 2744 PHE c Protein A 4 FT #SUB 1211 1211 TYR Y 739 2739 LEU c Protein S 2 FT #SUB 1211 1211 TYR Y 744 2744 PHE c Protein B 5 FT #SUB 1232 1232 THR Y 740 2740 ASP c Protein A 13 FT #SUB 1232 1232 THR Y 741 2741 GLN c Protein S 1 FT #SUB 1233 1233 SER Y 849 2849 ASP c Protein S 2 FT #SUB 1234 1234 VAL Y 736 2736 TYR c Protein B 1 FT #SUB 1234 1234 VAL Y 738 2738 ALA c Protein B 1 FT #SUB 1234 1234 VAL Y 739 2739 LEU c Protein A 5 FT #SUB 1235 1235 ILE Y 736 2736 TYR c Protein B 1 FT #SUB 1236 1236 TYR Y 736 2736 TYR c Protein A 7 FT #SUB 1238 1238 PRO Y 736 2736 TYR c Protein S 1 FT #SUB 1245 1245 GLU Y 729 2729 LYS c Protein A 2 FT #SUB 1245 1245 GLU Y 730 2730 LEU c Protein A 3 FT #SUB 1245 1245 GLU Y 731 2731 PRO c Protein S 3 FT #SUB 1245 1245 GLU Y 736 2736 TYR c Protein S 3 FT #SUB 1481 1481 GLU Y 458 2458 PHE c Protein S 1 FT #SUB 1481 1481 GLU Y 459 2459 ASP c Protein S 1 FT #SUB 1483 1483 ASN Y 486 2486 ILE c Protein B 1 FT #SUB 1483 1483 ASN Y 487 2487 VAL c Protein B 2 FT #SUB 1483 1483 ASN Y 488 2488 ARG c Protein A 9 FT #SUB 1483 1483 ASN Y 490 2490 PRO c Protein S 2 FT #SUB 1484 1484 CYS Y 486 2486 ILE c Protein B 3 FT #SUB 1484 1484 CYS Y 487 2487 VAL c Protein B 2 FT #SUB 1486 1486 LEU Y 458 2458 PHE c Protein S 3 FT #SUB 1486 1486 LEU Y 486 2486 ILE c Protein A 5 FT #SUB 1486 1486 LEU Y 488 2488 ARG c Protein S 1 FT #SUB 1487 1487 PRO Y 486 2486 ILE c Protein S 9 FT #SUB 1490 1490 ASN Y 461 2461 HIS c Protein S 9 FT #SUB 1512 1512 ARG Y 461 2461 HIS c Protein S 1 FT #SUB 1558 1558 LEU Y 440 2440 ASP c Protein S 2 FT #SUB 1603 1603 PHE Y 399 2399 LEU c Protein B 1 FT #SUB 1604 1604 ASP Y 399 2399 LEU c Protein A 3 FT #SUB 1604 1604 ASP Y 440 2440 ASP c Protein B 1 FT #SUB 1604 1604 ASP Y 441 2441 ARG c Protein B 2 FT #SUB 1604 1604 ASP Y 442 2442 LEU c Protein A 5 FT #SUB 1605 1605 ARG Y 440 2440 ASP c Protein B 1 FT #SUB 1606 1606 LEU Y 440 2440 ASP c Protein A 6 FT #SUB 1622 1622 GLN Y 326 2326 ILE c Protein B 2 FT #SUB 1623 1623 ASN Y 321 2321 GLU c Protein S 1 FT #SUB 1648 1648 PRO Y 327 2327 GLU c Protein B 3 FT #SUB 1649 1649 THR Y 325 2325 ALA c Protein S 2 FT #SUB 1649 1649 THR Y 327 2327 GLU c Protein A 5 FT #SUB 1650 1650 ILE Y 325 2325 ALA c Protein B 2 FT #SUB 1650 1650 ILE Y 326 2326 ILE c Protein A 5 FT #SUB 1650 1650 ILE Y 327 2327 GLU c Protein A 6 FT #SUB 1651 1651 ILE Y 323 2323 ASN c Protein B 2 FT #SUB 1652 1652 PHE Y 323 2323 ASN c Protein A 9 FT #SUB 1654 1654 PRO Y 323 2323 ASN c Protein S 2 FT #SUB 416 416 ASN Y 276 3196 THR d Protein S 8 FT #SUB 419 419 THR Y 279 3199 GLN d Protein S 1 FT #SUB 1350 1350 GLU Y 354 3274 ARG d Protein S 5 FT #SUB 1404 1404 TYR Y 352 3272 HIS d Protein S 1 FT #SUB 1533 1533 ALA Y 351 3271 VAL d Protein B 2 FT #SUB 1533 1533 ALA Y 354 3274 ARG d Protein B 3 FT #SUB 1534 1534 GLY Y 348 3268 LYS d Protein B 1 FT #SUB 1535 1535 LEU Y 352 3272 HIS d Protein S 5 FT #SUB 1543 1543 ILE Y 352 3272 HIS d Protein S 2 FT #HET 41 41 HIS Y 197 2102 CUO Y S 7 FT #HET 60 60 HIS Y 197 2102 CUO Y S 6 FT #HET 69 69 HIS Y 197 2102 CUO Y S 8 FT #HET 179 179 HIS Y 197 2102 CUO Y S 6 FT #HET 183 183 HIS Y 197 2102 CUO Y S 4 FT #HET 206 206 PHE Y 197 2102 CUO Y S 4 FT #HET 210 210 HIS Y 197 2102 CUO Y S 8 FT #HET 313 313 THR Y 105 1 NAG IA S 6 FT #HET 344 344 LEU Y 197 2102 CUO Y S 1 FT #HET 389 389 THR Y 105 1 NAG IA S 2 FT #HET 391 391 MET Y 106 2 NAG IA S 1 FT #HET 462 462 HIS Y 198 2103 CUO Y S 5 FT #HET 480 480 CYS Y 198 2103 CUO Y S 1 FT #HET 482 482 HIS Y 198 2103 CUO Y S 6 FT #HET 487 487 PHE Y 198 2103 CUO Y S 2 FT #HET 491 491 HIS Y 198 2103 CUO Y S 8 FT #HET 603 603 HIS Y 198 2103 CUO Y S 8 FT #HET 607 607 HIS Y 198 2103 CUO Y S 5 FT #HET 634 634 HIS Y 198 2103 CUO Y S 8 FT #HET 763 763 LEU Y 198 2103 CUO Y S 1 FT #HET 804 804 ASP Y 107 1 NAG JA S 6 FT #HET 808 808 THR Y 107 1 NAG JA S 3 FT #HET 810 810 LEU Y 107 1 NAG JA S 2 FT #HET 877 877 HIS Y 199 2104 CUO Y S 5 FT #HET 895 895 CYS Y 199 2104 CUO Y S 1 FT #HET 897 897 HIS Y 199 2104 CUO Y S 4 FT #HET 906 906 HIS Y 199 2104 CUO Y S 10 FT #HET 1015 1015 HIS Y 199 2104 CUO Y S 5 FT #HET 1019 1019 HIS Y 199 2104 CUO Y S 5 FT #HET 1042 1042 PHE Y 199 2104 CUO Y S 2 FT #HET 1046 1046 HIS Y 199 2104 CUO Y S 8 FT #HET 1294 1294 HIS Y 200 2105 CUO Y S 5 FT #HET 1299 1299 GLN Y 109 1 NAG KA A 10 FT #HET 1301 1301 PRO Y 109 1 NAG KA S 2 FT #HET 1308 1308 VAL Y 110 2 NAG KA S 1 FT #HET 1312 1312 CYS Y 200 2105 CUO Y S 1 FT #HET 1314 1314 HIS Y 200 2105 CUO Y S 6 FT #HET 1323 1323 HIS Y 200 2105 CUO Y S 11 FT #HET 1427 1427 HIS Y 200 2105 CUO Y S 7 FT #HET 1431 1431 HIS Y 200 2105 CUO Y S 4 FT #HET 1454 1454 PHE Y 200 2105 CUO Y S 3 FT #HET 1458 1458 HIS Y 200 2105 CUO Y S 10 FT #HET 1494 1494 ARG Y 109 1 NAG KA S 2 FT #HET 1500 1500 THR Y 109 1 NAG KA S 2 FT #HET 1500 1500 THR Y 110 2 NAG KA S 1 FT #HET 1501 1501 ALA Y 109 1 NAG KA S 1 FT #HET 1563 1563 LYS Y 114 2 NAG LA S 2 FT #HET 1564 1564 ALA Y 113 1 NAG LA S 1 FT #HET 1638 1638 SER Y 113 1 NAG LA B 4 FT #HET 1639 1639 LYS Y 113 1 NAG LA B 1 FT #HET 1641 1641 THR Y 114 2 NAG LA S 2 FT #HET 1644 1644 ILE Y 114 2 NAG LA S 1 FT #MOD 387 387 ASN Y 105 1 NAG IA S FT #MOD 806 806 ASN Y 107 1 NAG JA S FT #MOD 1498 1498 ASN Y 109 1 NAG KA S FT #MOD 1636 1636 ASN Y 113 1 NAG LA S FT DISORDER 1657 2000 CC SEQUENCE 1656 AA (ATOM); CC NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH CC GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK CC NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN CC PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF CC HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA CC RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH CC EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS CC SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC CC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET CC KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE CC VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA CC NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV CC EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE CC ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR CC KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA CC TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF CC SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA CC LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP CC MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN CC RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM CC DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR CC KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP CC HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG CC KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH CC SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST CC ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH CC GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL CC NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSD CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ATOM NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ************************************************** CC SEQRES DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ATOM DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ************************************************** CC SEQRES LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ATOM LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ************************************************** CC SEQRES QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ATOM QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ************************************************** CC SEQRES SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ATOM SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ************************************************** CC SEQRES TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ATOM TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ************************************************** CC SEQRES RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ATOM RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ************************************************** CC SEQRES PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ATOM PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ************************************************** CC SEQRES VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ATOM VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ************************************************** CC SEQRES SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ATOM SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ************************************************** CC SEQRES ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ATOM ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ************************************************** CC SEQRES EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ATOM EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ************************************************** CC SEQRES VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ATOM VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ************************************************** CC SEQRES RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ATOM RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ************************************************** CC SEQRES YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ATOM YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ************************************************** CC SEQRES GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ATOM GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ************************************************** CC SEQRES DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ATOM DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ************************************************** CC SEQRES MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ATOM MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ************************************************** CC SEQRES TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ATOM TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ************************************************** CC SEQRES NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ATOM NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ************************************************** CC SEQRES QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ATOM QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ************************************************** CC SEQRES RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ATOM RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ************************************************** CC SEQRES VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ATOM VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ************************************************** CC SEQRES LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ATOM LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ************************************************** CC SEQRES DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ATOM DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ************************************************** CC SEQRES SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ATOM SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ************************************************** CC SEQRES PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ATOM PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ************************************************** CC SEQRES PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ATOM PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ************************************************** CC SEQRES RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ATOM RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ************************************************** CC SEQRES DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ATOM DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ************************************************** CC SEQRES ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ATOM ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ************************************************** CC SEQRES DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ATOM DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ************************************************** CC SEQRES WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ATOM WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ************************************************** CC SEQRES IFVPSDVEFEEDTWRDVVTSANRIRRNLKDLSKEDMFSLRAAFKRMTDDG CC ATOM IFVPSD-------------------------------------------- CC ****** CC SEQRES RYEEIAAFHGLPAQCPNADGSNIHTCCLHGMPTFPHWHRLYLSLVENELL CC ATOM -------------------------------------------------- CC CC SEQRES ARGSDVAVPYWDWIEPFDSLPGLISDETYKHPKTNEDIENPFHHGKISFA CC ATOM -------------------------------------------------- CC CC SEQRES DAVTVRKPRDQLFNNRYLYEHALFAFEHTDFCDFEVHFEVLHNSIHSWIG CC ATOM -------------------------------------------------- CC CC SEQRES GPNPHSMSSLDFAAYDPIFFLHHSTVDRLWAIWQDLQRYRKLDYNVANCA CC ATOM -------------------------------------------------- CC CC SEQRES LNLLNDPMRPFNNKTANQDHLTFTNSRPNDVFDYQNSLNYKFDSLSFSGL CC ATOM -------------------------------------------------- CC CC SEQRES SIPRLDDLLESRQSHDRVFAGFWLSGIKASADVNIHICVPIGVEHEDCDN CC ATOM -------------------------------------------------- CC CC SEQRES CC ATOM CC SQ SEQUENCE 2000 AA; MW; CN; NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSDVEFE EDTWRDVVTS ANRIRRNLKD LSKEDMFSLR AAFKRMTDDG RYEEIAAFHG LPAQCPNADG SNIHTCCLHG MPTFPHWHRL YLSLVENELL ARGSDVAVPY WDWIEPFDSL PGLISDETYK HPKTNEDIEN PFHHGKISFA DAVTVRKPRD QLFNNRYLYE HALFAFEHTD FCDFEVHFEV LHNSIHSWIG GPNPHSMSSL DFAAYDPIFF LHHSTVDRLW AIWQDLQRYR KLDYNVANCA LNLLNDPMRP FNNKTANQDH LTFTNSRPND VFDYQNSLNY KFDSLSFSGL SIPRLDDLLE SRQSHDRVFA GFWLSGIKAS ADVNIHICVP IGVEHEDCDN // ID 4YD9Z STANDARD; PRT; 920 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 136 2136 THR Z 388 388 GLY A Protein S 5 FT #SUB 136 2136 THR Z 389 389 THR A Protein A 8 FT #SUB 136 2136 THR Z 390 390 LEU A Protein S 1 FT #SUB 138 2138 LYS Z 389 389 THR A Protein S 1 FT #SUB 138 2138 LYS Z 391 391 MET A Protein S 2 FT #SUB 609 2609 ALA Z 106 106 GLN V Protein S 1 FT #SUB 610 2610 ASP Z 124 124 TYR V Protein S 5 FT #SUB 612 2612 TYR Z 138 138 ALA V Protein S 3 FT #SUB 612 2612 TYR Z 189 189 ARG V Protein S 8 FT #SUB 617 2617 ASP Z 189 189 ARG V Protein S 1 FT #SUB 617 2617 ASP Z 190 190 PHE V Protein B 1 FT #SUB 617 2617 ASP Z 191 191 THR V Protein S 1 FT #SUB 619 2619 VAL Z 136 136 ALA V Protein S 2 FT #SUB 619 2619 VAL Z 138 138 ALA V Protein S 2 FT #SUB 619 2619 VAL Z 190 190 PHE V Protein S 2 FT #SUB 625 2625 LEU Z 107 107 HIS V Protein S 4 FT #SUB 625 2625 LEU Z 108 108 PRO V Protein S 2 FT #SUB 626 2626 ARG Z 108 108 PRO V Protein S 1 FT #SUB 626 2626 ARG Z 109 109 LEU V Protein S 2 FT #SUB 632 2632 GLU Z 117 117 LYS V Protein B 3 FT #SUB 634 2634 THR Z 117 117 LYS V Protein S 5 FT #SUB 634 2634 THR Z 118 118 ALA V Protein S 1 FT #SUB 635 2635 TYR Z 109 109 LEU V Protein S 1 FT #SUB 636 2636 THR Z 109 109 LEU V Protein B 1 FT #SUB 637 2637 VAL Z 111 111 ILE V Protein S 1 FT #SUB 637 2637 VAL Z 118 118 ALA V Protein S 1 FT #SUB 691 2691 GLN Z 111 111 ILE V Protein S 3 FT #SUB 691 2691 GLN Z 112 112 ASP V Protein S 1 FT #SUB 691 2691 GLN Z 115 115 GLY V Protein S 3 FT #SUB 693 2693 TYR Z 116 116 LYS V Protein S 2 FT #SUB 693 2693 TYR Z 117 117 LYS V Protein S 4 FT #SUB 887 2887 ASP Z 115 115 GLY V Protein S 1 FT #SUB 800 2800 LYS Z 100 3020 SER a Protein S 3 FT #SUB 800 2800 LYS Z 101 3021 LEU a Protein S 2 FT #SUB 800 2800 LYS Z 103 3023 ILE a Protein A 5 FT #SUB 801 2801 ALA Z 103 3023 ILE a Protein B 2 FT #SUB 802 2802 ASP Z 12 2932 PRO a Protein S 1 FT #SUB 866 2866 PRO Z 12 2932 PRO a Protein S 2 FT #SUB 866 2866 PRO Z 13 2933 SER a Protein A 8 FT #SUB 867 2867 PRO Z 12 2932 PRO a Protein A 15 FT #SUB 868 2868 GLU Z 12 2932 PRO a Protein A 3 FT #SUB 903 2903 TYR Z 12 2932 PRO a Protein S 2 FT #SUB 321 2321 GLU Z 1623 1623 ASN b Protein S 1 FT #SUB 323 2323 ASN Z 1650 1650 ILE b Protein B 1 FT #SUB 323 2323 ASN Z 1651 1651 ILE b Protein B 2 FT #SUB 323 2323 ASN Z 1652 1652 PHE b Protein A 12 FT #SUB 323 2323 ASN Z 1654 1654 PRO b Protein S 2 FT #SUB 324 2324 CYS Z 1650 1650 ILE b Protein B 1 FT #SUB 325 2325 ALA Z 1649 1649 THR b Protein B 2 FT #SUB 325 2325 ALA Z 1650 1650 ILE b Protein B 1 FT #SUB 326 2326 ILE Z 1650 1650 ILE b Protein A 5 FT #SUB 326 2326 ILE Z 1652 1652 PHE b Protein S 1 FT #SUB 327 2327 GLU Z 1647 1647 ASN b Protein S 1 FT #SUB 327 2327 GLU Z 1648 1648 PRO b Protein S 3 FT #SUB 327 2327 GLU Z 1649 1649 THR b Protein S 10 FT #SUB 327 2327 GLU Z 1650 1650 ILE b Protein S 1 FT #SUB 330 2330 SER Z 1625 1625 HIS b Protein S 2 FT #SUB 353 2353 LYS Z 1625 1625 HIS b Protein S 1 FT #SUB 399 2399 LEU Z 1604 1604 ASP b Protein S 2 FT #SUB 401 2401 GLU Z 1602 1602 GLN b Protein S 3 FT #SUB 440 2440 ASP Z 1558 1558 LEU b Protein A 2 FT #SUB 440 2440 ASP Z 1604 1604 ASP b Protein B 1 FT #SUB 440 2440 ASP Z 1605 1605 ARG b Protein B 2 FT #SUB 440 2440 ASP Z 1606 1606 LEU b Protein A 5 FT #SUB 441 2441 ARG Z 1604 1604 ASP b Protein B 2 FT #SUB 442 2442 LEU Z 1604 1604 ASP b Protein A 6 FT #SUB 458 2458 PHE Z 1481 1481 GLU b Protein S 1 FT #SUB 458 2458 PHE Z 1486 1486 LEU b Protein A 2 FT #SUB 459 2459 ASP Z 1481 1481 GLU b Protein S 1 FT #SUB 461 2461 HIS Z 1490 1490 ASN b Protein S 5 FT #SUB 485 2485 THR Z 1485 1485 ALA b Protein S 2 FT #SUB 486 2486 ILE Z 1484 1484 CYS b Protein B 2 FT #SUB 486 2486 ILE Z 1485 1485 ALA b Protein B 3 FT #SUB 486 2486 ILE Z 1486 1486 LEU b Protein B 4 FT #SUB 486 2486 ILE Z 1487 1487 PRO b Protein A 6 FT #SUB 487 2487 VAL Z 1483 1483 ASN b Protein B 2 FT #SUB 487 2487 VAL Z 1484 1484 CYS b Protein S 2 FT #SUB 487 2487 VAL Z 1485 1485 ALA b Protein S 1 FT #SUB 488 2488 ARG Z 1483 1483 ASN b Protein A 10 FT #SUB 490 2490 PRO Z 1483 1483 ASN b Protein S 4 FT #SUB 729 2729 LYS Z 1245 1245 GLU b Protein A 7 FT #SUB 730 2730 LEU Z 1245 1245 GLU b Protein A 3 FT #SUB 731 2731 PRO Z 1245 1245 GLU b Protein S 1 FT #SUB 734 2734 LYS Z 1207 1207 TYR b Protein S 3 FT #SUB 734 2734 LYS Z 1208 1208 THR b Protein S 3 FT #SUB 736 2736 TYR Z 1236 1236 TYR b Protein S 5 FT #SUB 736 2736 TYR Z 1245 1245 GLU b Protein S 2 FT #SUB 738 2738 ALA Z 1232 1232 THR b Protein A 2 FT #SUB 738 2738 ALA Z 1233 1233 SER b Protein A 4 FT #SUB 738 2738 ALA Z 1234 1234 VAL b Protein B 4 FT #SUB 739 2739 LEU Z 1207 1207 TYR b Protein S 5 FT #SUB 739 2739 LEU Z 1211 1211 TYR b Protein S 3 FT #SUB 739 2739 LEU Z 1234 1234 VAL b Protein A 4 FT #SUB 739 2739 LEU Z 1236 1236 TYR b Protein S 1 FT #SUB 740 2740 ASP Z 1211 1211 TYR b Protein S 3 FT #SUB 740 2740 ASP Z 1212 1212 HIS b Protein S 1 FT #SUB 740 2740 ASP Z 1213 1213 VAL b Protein S 3 FT #SUB 740 2740 ASP Z 1232 1232 THR b Protein S 2 FT #SUB 740 2740 ASP Z 1234 1234 VAL b Protein S 1 FT #SUB 743 2743 ALA Z 1208 1208 THR b Protein S 1 FT #SUB 743 2743 ALA Z 1210 1210 LYS b Protein A 4 FT #SUB 744 2744 PHE Z 1164 1164 ASP b Protein S 2 FT #SUB 744 2744 PHE Z 1210 1210 LYS b Protein A 3 FT #SUB 744 2744 PHE Z 1211 1211 TYR b Protein S 1 FT #SUB 744 2744 PHE Z 1212 1212 HIS b Protein S 1 FT #SUB 809 2809 LEU Z 1188 1188 PHE b Protein S 1 FT #SUB 809 2809 LEU Z 1189 1189 ASP b Protein S 2 FT #SUB 847 2847 HIS Z 1187 1187 LYS b Protein S 3 FT #SUB 847 2847 HIS Z 1230 1230 LEU b Protein S 3 FT #SUB 849 2849 ASP Z 1147 1147 MET b Protein B 2 FT #SUB 849 2849 ASP Z 1189 1189 ASP b Protein B 1 FT #SUB 849 2849 ASP Z 1191 1191 PRO b Protein B 1 FT #SUB 849 2849 ASP Z 1233 1233 SER b Protein S 2 FT #SUB 850 2850 ARG Z 1189 1189 ASP b Protein B 1 FT #SUB 851 2851 ASN Z 1189 1189 ASP b Protein S 2 FT #SUB 870 2870 LEU Z 1074 1074 ILE b Protein A 2 FT #SUB 870 2870 LEU Z 1078 1078 ARG b Protein B 2 FT #SUB 871 2871 PHE Z 1070 1070 ALA b Protein S 1 FT #SUB 871 2871 PHE Z 1074 1074 ILE b Protein S 2 FT #SUB 871 2871 PHE Z 1103 1103 PHE b Protein A 5 FT #SUB 872 2872 GLU Z 1078 1078 ARG b Protein B 3 FT #SUB 876 2876 LYS Z 1078 1078 ARG b Protein A 3 FT #SUB 877 2877 ILE Z 1078 1078 ARG b Protein B 7 FT #SUB 899 2899 PRO Z 1075 1075 GLU b Protein B 3 FT #SUB 900 2900 SER Z 1075 1075 GLU b Protein A 6 FT #SUB 901 2901 LEU Z 1071 1071 ASN b Protein B 1 FT #SUB 901 2901 LEU Z 1073 1073 ALA b Protein B 1 FT #SUB 901 2901 LEU Z 1074 1074 ILE b Protein A 4 FT #SUB 901 2901 LEU Z 1075 1075 GLU b Protein B 1 FT #SUB 902 2902 ILE Z 1071 1071 ASN b Protein A 3 FT #SUB 903 2903 TYR Z 1071 1071 ASN b Protein B 5 FT #SUB 905 2905 PRO Z 1071 1071 ASN b Protein S 2 FT #SUB 84 2084 ARG Z 500 2500 ILE c Protein S 2 FT #SUB 84 2084 ARG Z 502 2502 LEU c Protein S 4 FT #SUB 90 2090 LYS Z 375 2375 GLY c Protein S 1 FT #SUB 91 2091 ASN Z 94 2094 ARG c Protein S 1 FT #SUB 94 2094 ARG Z 91 2091 ASN c Protein S 2 FT #SUB 94 2094 ARG Z 94 2094 ARG c Protein S 3 FT #SUB 94 2094 ARG Z 182 2182 TYR c Protein A 7 FT #SUB 94 2094 ARG Z 370 2370 MET c Protein S 7 FT #SUB 96 2096 SER Z 374 2374 ASN c Protein S 3 FT #SUB 97 2097 LEU Z 235 2235 PRO c Protein S 1 FT #SUB 98 2098 GLN Z 244 2244 PHE c Protein S 1 FT #SUB 98 2098 GLN Z 374 2374 ASN c Protein S 3 FT #SUB 98 2098 GLN Z 376 2376 MET c Protein S 1 FT #SUB 99 2099 GLU Z 375 2375 GLY c Protein S 1 FT #SUB 109 2109 ARG Z 503 2503 ASN c Protein S 2 FT #SUB 109 2109 ARG Z 582 2582 ARG c Protein S 1 FT #SUB 109 2109 ARG Z 585 2585 GLY c Protein B 1 FT #SUB 112 2112 LYS Z 583 2583 ARG c Protein S 3 FT #SUB 112 2112 LYS Z 584 2584 HIS c Protein B 1 FT #SUB 113 2113 ASP Z 519 2519 ASN c Protein S 3 FT #SUB 113 2113 ASP Z 585 2585 GLY c Protein A 7 FT #SUB 114 2114 ARG Z 526 2526 ARG c Protein S 5 FT #SUB 115 2115 SER Z 518 2518 GLN c Protein S 1 FT #SUB 115 2115 SER Z 519 2519 ASN c Protein S 4 FT #SUB 115 2115 SER Z 522 2522 SER c Protein A 4 FT #SUB 116 2116 SER Z 615 2615 TRP c Protein S 3 FT #SUB 121 2121 THR Z 615 2615 TRP c Protein S 3 FT #SUB 125 2125 PHE Z 615 2615 TRP c Protein S 1 FT #SUB 167 2167 ASN Z 502 2502 LEU c Protein A 2 FT #SUB 168 2168 ARG Z 502 2502 LEU c Protein A 3 FT #SUB 168 2168 ARG Z 503 2503 ASN c Protein B 4 FT #SUB 168 2168 ARG Z 515 2515 ARG c Protein S 11 FT #SUB 168 2168 ARG Z 516 2516 ASP c Protein S 1 FT #SUB 168 2168 ARG Z 587 2587 SER c Protein A 4 FT #SUB 169 2169 HIS Z 503 2503 ASN c Protein B 2 FT #SUB 169 2169 HIS Z 585 2585 GLY c Protein S 2 FT #SUB 169 2169 HIS Z 587 2587 SER c Protein S 3 FT #SUB 170 2170 GLY Z 503 2503 ASN c Protein B 2 FT #SUB 182 2182 TYR Z 94 2094 ARG c Protein S 5 FT #SUB 199 2199 PRO Z 235 2235 PRO c Protein B 1 FT #SUB 199 2199 PRO Z 236 2236 ALA c Protein B 2 FT #SUB 199 2199 PRO Z 237 2237 LEU c Protein B 2 FT #SUB 200 2200 PHE Z 237 2237 LEU c Protein B 8 FT #SUB 201 2201 THR Z 237 2237 LEU c Protein B 2 FT #SUB 202 2202 GLY Z 237 2237 LEU c Protein B 1 FT #SUB 235 2235 PRO Z 97 2097 LEU c Protein S 2 FT #SUB 236 2236 ALA Z 199 2199 PRO c Protein B 3 FT #SUB 236 2236 ALA Z 200 2200 PHE c Protein B 1 FT #SUB 237 2237 LEU Z 199 2199 PRO c Protein B 2 FT #SUB 237 2237 LEU Z 200 2200 PHE c Protein A 9 FT #SUB 341 2341 PRO Z 614 2614 ALA c Protein S 2 FT #SUB 342 2342 TYR Z 615 2615 TRP c Protein B 1 FT #SUB 344 2344 LEU Z 514 2514 SER c Protein S 3 FT #SUB 344 2344 LEU Z 515 2515 ARG c Protein A 5 FT #SUB 345 2345 ASN Z 515 2515 ARG c Protein S 2 FT #SUB 346 2346 PRO Z 513 2513 GLU c Protein S 1 FT #SUB 346 2346 PRO Z 515 2515 ARG c Protein S 7 FT #SUB 370 2370 MET Z 94 2094 ARG c Protein S 7 FT #SUB 374 2374 ASN Z 96 2096 SER c Protein A 3 FT #SUB 374 2374 ASN Z 98 2098 GLN c Protein S 1 FT #SUB 375 2375 GLY Z 90 2090 LYS c Protein B 1 FT #SUB 375 2375 GLY Z 99 2099 GLU c Protein B 1 FT #SUB 376 2376 MET Z 98 2098 GLN c Protein S 1 FT #SUB 502 2502 LEU Z 84 2084 ARG c Protein S 3 FT #SUB 502 2502 LEU Z 167 2167 ASN c Protein S 4 FT #SUB 502 2502 LEU Z 168 2168 ARG c Protein S 1 FT #SUB 503 2503 ASN Z 109 2109 ARG c Protein S 2 FT #SUB 503 2503 ASN Z 168 2168 ARG c Protein A 4 FT #SUB 503 2503 ASN Z 169 2169 HIS c Protein S 4 FT #SUB 503 2503 ASN Z 170 2170 GLY c Protein S 1 FT #SUB 504 2504 ARG Z 109 2109 ARG c Protein S 1 FT #SUB 514 2514 SER Z 344 2344 LEU c Protein S 2 FT #SUB 515 2515 ARG Z 168 2168 ARG c Protein S 13 FT #SUB 515 2515 ARG Z 344 2344 LEU c Protein A 8 FT #SUB 515 2515 ARG Z 345 2345 ASN c Protein S 2 FT #SUB 515 2515 ARG Z 346 2346 PRO c Protein S 4 FT #SUB 518 2518 GLN Z 115 2115 SER c Protein B 2 FT #SUB 518 2518 GLN Z 344 2344 LEU c Protein S 1 FT #SUB 519 2519 ASN Z 113 2113 ASP c Protein S 3 FT #SUB 519 2519 ASN Z 115 2115 SER c Protein B 3 FT #SUB 522 2522 SER Z 115 2115 SER c Protein S 3 FT #SUB 526 2526 ARG Z 114 2114 ARG c Protein S 5 FT #SUB 582 2582 ARG Z 109 2109 ARG c Protein A 3 FT #SUB 583 2583 ARG Z 112 2112 LYS c Protein B 2 FT #SUB 584 2584 HIS Z 112 2112 LYS c Protein B 1 FT #SUB 585 2585 GLY Z 109 2109 ARG c Protein B 1 FT #SUB 585 2585 GLY Z 113 2113 ASP c Protein B 4 FT #SUB 585 2585 GLY Z 169 2169 HIS c Protein B 1 FT #SUB 587 2587 SER Z 168 2168 ARG c Protein A 4 FT #SUB 587 2587 SER Z 169 2169 HIS c Protein S 5 FT #SUB 614 2614 ALA Z 341 2341 PRO c Protein S 2 FT #SUB 614 2614 ALA Z 342 2342 TYR c Protein S 1 FT #SUB 615 2615 TRP Z 116 2116 SER c Protein S 1 FT #SUB 615 2615 TRP Z 121 2121 THR c Protein S 4 FT #SUB 615 2615 TRP Z 125 2125 PHE c Protein S 1 FT #SUB 615 2615 TRP Z 131 2131 LEU c Protein S 1 FT #SUB 615 2615 TRP Z 341 2341 PRO c Protein S 1 FT #SUB 615 2615 TRP Z 342 2342 TYR c Protein S 4 FT #HET 126 2126 HIS Z 201 3001 CUO Z S 4 FT #HET 138 2138 LYS Z 1 1 NAG e S 3 FT #HET 138 2138 LYS Z 2 2 NAG e S 4 FT #HET 144 2144 CYS Z 201 3001 CUO Z S 1 FT #HET 146 2146 HIS Z 201 3001 CUO Z S 7 FT #HET 151 2151 PHE Z 201 3001 CUO Z S 1 FT #HET 155 2155 HIS Z 201 3001 CUO Z S 8 FT #HET 267 2267 HIS Z 201 3001 CUO Z S 7 FT #HET 271 2271 HIS Z 201 3001 CUO Z S 6 FT #HET 294 2294 PHE Z 201 3001 CUO Z S 2 FT #HET 298 2298 HIS Z 201 3001 CUO Z S 6 FT #HET 404 2404 GLY Z 116 1 NAG MA B 3 FT #HET 405 2405 SER Z 116 1 NAG MA A 2 FT #HET 429 2429 LEU Z 201 3001 CUO Z S 1 FT #HET 476 2476 LEU Z 117 2 NAG MA S 4 FT #HET 480 2480 LEU Z 117 2 NAG MA S 1 FT #HET 543 2543 HIS Z 202 3002 CUO Z S 10 FT #HET 561 2561 HIS Z 202 3002 CUO Z S 5 FT #HET 570 2570 HIS Z 202 3002 CUO Z S 11 FT #HET 680 2680 HIS Z 202 3002 CUO Z S 6 FT #HET 684 2684 HIS Z 202 3002 CUO Z S 5 FT #HET 707 2707 PHE Z 202 3002 CUO Z S 3 FT #HET 711 2711 HIS Z 202 3002 CUO Z S 7 FT #MOD 472 2472 ASN Z 116 1 NAG MA S FT DISORDER 1 82 FT DISORDER 908 920 CC SEQUENCE 825 AA (ATOM); CC VRGNLVRKNV DRLSLQEINS LIHALKRMQK DRSSDGFETI ASFHALPPLC PNPTAKHRHA CC CCLHGMATFP QWHRLYVVQF EHSLNRHGAI VGVPYWDWTY PMTEVPGLLT SEKYTDPFTG CC IETFNPFNHG HISFISPETM TTREVSEHLF EQPALGKQTW LFNNIILALE QTDYCDFEVQ CC FEIVHNSIHS WLGGKELYSL NHLHYAAYDP AFFLHHSNVD RLWVVWQELQ KFRGLPAYES CC NCAIELMSQP LKPFSFGAPY NLNPVTTKYS KPSDVFNYKQ NFHYEYDMLE MNGMSIAQLE CC SYIRQERQKD RVFAGFLLEG FGSSAYATFQ VCPDVGDCYE GSHFSVLGGS TEMPWAFDRL CC YKMEITDILQ AMALKFDSHF TIKTKIVAHN GTELPESLLP EATIVRIPPS AQNLEVAIPL CC NRIRRNINSL ESRDVQNLMS ALKRLKEDES DFGFQTIAGY HGSLMCPTPE APEYACCLHG CC MPTFMHWHRV YLLHFEESMR RHGASVAVPY WDWTMPSDNL PSLLGDADYY DAWTDSVIEN CC PFLRGHIKYE DTYTVREIQP ELFALAEGQK ESTLFKDVML MFEQEDYCDF EVQAEVIHNS CC IHYLIGGHQK YAMSSLMFSS FDPIFYVHHS MVDRLWAIWQ ELQKHRKLPH DKAYCALDQM CC AFPMKPFIWE SNPNPTTRAV STPSKLFDYK SLGYDYDHLN FHGMSIGQLE ALIQKQKKAD CC RVFAGFLLHG IKISADVHLK ICIEADCQEA GVIFVLGGET EMPWHFDRNY KMDITDVLKK CC RNIPPEALFE HDSKIRLEVE IKSVDGAVLD PNSLPKPSLI YAPAK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES YAGTFAVLGGETEMPWAFDRLFRYEISDEMKKLQLTEDSKFRLTTNIIAS CC ATOM -------------------------------------------------- CC CC SEQRES NGSKVSNDIFPTPTVIFVPKKERTEQVSTTKSVRGNLVRKNVDRLSLQEI CC ATOM --------------------------------VRGNLVRKNVDRLSLQEI CC ****************** CC SEQRES NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ATOM NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ************************************************** CC SEQRES FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ATOM FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ************************************************** CC SEQRES TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ATOM TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ************************************************** CC SEQRES LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ATOM LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ************************************************** CC SEQRES VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ATOM VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ************************************************** CC SEQRES YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ATOM YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ************************************************** CC SEQRES EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ATOM EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ************************************************** CC SEQRES LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ATOM LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ************************************************** CC SEQRES PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ATOM PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ************************************************** CC SEQRES PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ATOM PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ************************************************** CC SEQRES NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ATOM NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ************************************************** CC SEQRES QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ATOM QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ************************************************** CC SEQRES SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ATOM SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ************************************************** CC SEQRES WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ATOM WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ************************************************** CC SEQRES ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ATOM ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ************************************************** CC SEQRES NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ATOM NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ************************************************** CC SEQRES LIYAPAKGLIIQQVGEYDAG CC ATOM LIYAPAK------------- CC ******* SQ SEQUENCE 920 AA; MW; CN; YAGTFAVLGG ETEMPWAFDR LFRYEISDEM KKLQLTEDSK FRLTTNIIAS NGSKVSNDIF PTPTVIFVPK KERTEQVSTT KSVRGNLVRK NVDRLSLQEI NSLIHALKRM QKDRSSDGFE TIASFHALPP LCPNPTAKHR HACCLHGMAT FPQWHRLYVV QFEHSLNRHG AIVGVPYWDW TYPMTEVPGL LTSEKYTDPF TGIETFNPFN HGHISFISPE TMTTREVSEH LFEQPALGKQ TWLFNNIILA LEQTDYCDFE VQFEIVHNSI HSWLGGKELY SLNHLHYAAY DPAFFLHHSN VDRLWVVWQE LQKFRGLPAY ESNCAIELMS QPLKPFSFGA PYNLNPVTTK YSKPSDVFNY KQNFHYEYDM LEMNGMSIAQ LESYIRQERQ KDRVFAGFLL EGFGSSAYAT FQVCPDVGDC YEGSHFSVLG GSTEMPWAFD RLYKMEITDI LQAMALKFDS HFTIKTKIVA HNGTELPESL LPEATIVRIP PSAQNLEVAI PLNRIRRNIN SLESRDVQNL MSALKRLKED ESDFGFQTIA GYHGSLMCPT PEAPEYACCL HGMPTFMHWH RVYLLHFEES MRRHGASVAV PYWDWTMPSD NLPSLLGDAD YYDAWTDSVI ENPFLRGHIK YEDTYTVREI QPELFALAEG QKESTLFKDV MLMFEQEDYC DFEVQAEVIH NSIHYLIGGH QKYAMSSLMF SSFDPIFYVH HSMVDRLWAI WQELQKHRKL PHDKAYCALD QMAFPMKPFI WESNPNPTTR AVSTPSKLFD YKSLGYDYDH LNFHGMSIGQ LEALIQKQKK ADRVFAGFLL HGIKISADVH LKICIEADCQ EAGVIFVLGG ETEMPWHFDR NYKMDITDVL KKRNIPPEAL FEHDSKIRLE VEIKSVDGAV LDPNSLPKPS LIYAPAKGLI IQQVGEYDAG // ID 4YD9a STANDARD; PRT; 394 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 12 2932 PRO a 802 2802 ASP Z Protein S 1 FT #SUB 12 2932 PRO a 866 2866 PRO Z Protein B 2 FT #SUB 12 2932 PRO a 867 2867 PRO Z Protein A 15 FT #SUB 12 2932 PRO a 868 2868 GLU Z Protein B 3 FT #SUB 12 2932 PRO a 903 2903 TYR Z Protein S 2 FT #SUB 13 2933 SER a 866 2866 PRO Z Protein A 8 FT #SUB 100 3020 SER a 800 2800 LYS Z Protein S 3 FT #SUB 101 3021 LEU a 800 2800 LYS Z Protein B 2 FT #SUB 103 3023 ILE a 800 2800 LYS Z Protein S 5 FT #SUB 103 3023 ILE a 801 2801 ALA Z Protein S 2 FT #SUB 276 3196 THR a 416 416 ASN b Protein S 12 FT #SUB 279 3199 GLN a 419 419 THR b Protein S 1 FT #SUB 348 3268 LYS a 1539 1539 GLN b Protein S 1 FT #SUB 352 3272 HIS a 1535 1535 LEU b Protein S 3 FT #SUB 354 3274 ARG a 1533 1533 ALA b Protein S 1 FT #HET 41 2961 HIS a 147 3401 CUO F S 8 FT #HET 60 2980 HIS a 147 3401 CUO F S 3 FT #HET 69 2989 HIS a 147 3401 CUO F S 13 FT #HET 121 3041 ALA a 135 5016 CUO A B 10 FT #HET 122 3042 ASP a 135 5016 CUO A A 29 FT #HET 123 3043 THR a 135 5016 CUO A A 5 FT #HET 169 3089 HIS a 147 3401 CUO F S 7 FT #HET 173 3093 HIS a 147 3401 CUO F S 5 FT #HET 196 3116 PHE a 147 3401 CUO F S 5 FT #HET 199 3119 HIS a 147 3401 CUO F S 1 FT #HET 200 3120 HIS a 147 3401 CUO F S 10 FT DISORDER 1 7 FT DISORDER 132 141 FT DISORDER 313 319 FT DISORDER 391 394 CC SEQUENCE 366 AA (ATOM); CC NSLTPSEIEN LRNALAAVQA DKTDAGYQKI ASFHGMPLSC QYPDGTAFAC CQHGMVTFPH CC WHRLYMKQME DALKAKGAKI GIPYWDWTTA FHSLPILVTE PKNNPFHHGY IDVADTKTTR CC DPRPQSFFYR QIAFALEQRD FCDFEIQFEM GHNAIHSWVG GPSPYGMSTL HYTSYDPLFY CC VHHSNTDRIW AIWQALQKYR GLPYNSANCE INKLKKPMMP FSSEDNPNEV TKAHSTGYKS CC FDYQQLNYEY DNLNFHGMTI PQLEVHLKKI QEKDRVFAGF LLRAIGQSAD VNFDVTFGGT CC FCVLGGDYEM PWAFDRLFLY DISKSLVHLR LDAHDDFDIK VTIMGIDGKS LPPNLLPSPT CC ILFKPG CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMVRKNVNSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ATOM -------NSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ******************************************* CC SEQRES DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ATOM DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ************************************************** CC SEQRES LPILVTEPKNNPFHHGYIDVADTKTTRDPRPQLFDDPEQGDQSFFYRQIA CC ATOM LPILVTEPKNNPFHHGYIDVADTKTTRDPRP----------QSFFYRQIA CC ******************************* ********* CC SEQRES FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ATOM FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ************************************************** CC SEQRES SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ATOM SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ************************************************** CC SEQRES HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ATOM HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ************************************************** CC SEQRES AIGQSADVNFDVCRKDGECTFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ATOM AIGQSADVNFDV-------TFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ************ ******************************* CC SEQRES VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPGTGKI CC ATOM VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPG---- CC **************************************** SQ SEQUENCE 394 AA; MW; CN; SMVRKNVNSL TPSEIENLRN ALAAVQADKT DAGYQKIASF HGMPLSCQYP DGTAFACCQH GMVTFPHWHR LYMKQMEDAL KAKGAKIGIP YWDWTTAFHS LPILVTEPKN NPFHHGYIDV ADTKTTRDPR PQLFDDPEQG DQSFFYRQIA FALEQRDFCD FEIQFEMGHN AIHSWVGGPS PYGMSTLHYT SYDPLFYVHH SNTDRIWAIW QALQKYRGLP YNSANCEINK LKKPMMPFSS EDNPNEVTKA HSTGYKSFDY QQLNYEYDNL NFHGMTIPQL EVHLKKIQEK DRVFAGFLLR AIGQSADVNF DVCRKDGECT FGGTFCVLGG DYEMPWAFDR LFLYDISKSL VHLRLDAHDD FDIKVTIMGI DGKSLPPNLL PSPTILFKPG TGKI // ID 4YD9b STANDARD; PRT; 2000 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 328 328 LYS b 1571 1571 TYR A Protein S 7 FT #SUB 328 328 LYS b 1631 1631 GLU A Protein S 4 FT #SUB 329 329 ARG b 1574 1574 VAL A Protein S 4 FT #SUB 329 329 ARG b 1582 1582 ASN A Protein S 4 FT #SUB 329 329 ARG b 1583 1583 CYS A Protein A 15 FT #SUB 329 329 ARG b 1584 1584 GLY A Protein A 8 FT #SUB 329 329 ARG b 1585 1585 ASN A Protein S 2 FT #SUB 330 330 GLN b 1581 1581 GLU A Protein B 4 FT #SUB 330 330 GLN b 1582 1582 ASN A Protein A 3 FT #SUB 330 330 GLN b 1583 1583 CYS A Protein S 2 FT #SUB 331 331 SER b 1581 1581 GLU A Protein B 3 FT #SUB 331 331 SER b 1582 1582 ASN A Protein A 4 FT #SUB 332 332 ASP b 1581 1581 GLU A Protein B 1 FT #SUB 335 335 ASP b 1578 1578 ILE A Protein S 3 FT #SUB 335 335 ASP b 1579 1579 GLY A Protein S 5 FT #SUB 472 472 ASP b 1638 1638 SER A Protein S 1 FT #SUB 472 472 ASP b 1639 1639 LYS A Protein A 2 FT #SUB 474 474 LEU b 1639 1639 LYS A Protein S 1 FT #SUB 1365 1365 TYR b 1437 1437 ARG A Protein S 16 FT #SUB 1372 1372 ILE b 1390 1390 ASP A Protein A 6 FT #SUB 1372 1372 ILE b 1438 1438 ASP A Protein S 4 FT #SUB 1378 1378 PHE b 1362 1362 ALA A Protein S 1 FT #SUB 1390 1390 ASP b 1372 1372 ILE A Protein S 4 FT #SUB 1391 1391 ARG b 1372 1372 ILE A Protein B 1 FT #SUB 1392 1392 ASP b 1365 1365 TYR A Protein S 1 FT #SUB 1437 1437 ARG b 1365 1365 TYR A Protein S 17 FT #SUB 1438 1438 ASP b 1372 1372 ILE A Protein S 3 FT #SUB 1571 1571 TYR b 328 328 LYS A Protein S 8 FT #SUB 1574 1574 VAL b 329 329 ARG A Protein A 3 FT #SUB 1578 1578 ILE b 335 335 ASP A Protein B 3 FT #SUB 1579 1579 GLY b 335 335 ASP A Protein B 4 FT #SUB 1581 1581 GLU b 331 331 SER A Protein S 2 FT #SUB 1581 1581 GLU b 332 332 ASP A Protein S 1 FT #SUB 1582 1582 ASN b 329 329 ARG A Protein B 8 FT #SUB 1582 1582 ASN b 331 331 SER A Protein B 4 FT #SUB 1583 1583 CYS b 329 329 ARG A Protein A 14 FT #SUB 1584 1584 GLY b 329 329 ARG A Protein B 3 FT #SUB 1585 1585 ASN b 329 329 ARG A Protein S 3 FT #SUB 1631 1631 GLU b 328 328 LYS A Protein S 4 FT #SUB 1638 1638 SER b 472 472 ASP A Protein A 2 FT #SUB 1639 1639 LYS b 472 472 ASP A Protein B 3 FT #SUB 1639 1639 LYS b 473 473 ALA A Protein A 3 FT #SUB 1640 1640 ILE b 472 472 ASP A Protein S 3 FT #SUB 106 106 GLN b 609 2609 ALA B Protein B 1 FT #SUB 108 108 PRO b 625 2625 LEU B Protein S 2 FT #SUB 108 108 PRO b 626 2626 ARG B Protein S 2 FT #SUB 109 109 LEU b 626 2626 ARG B Protein S 4 FT #SUB 109 109 LEU b 635 2635 TYR B Protein S 1 FT #SUB 109 109 LEU b 636 2636 THR B Protein S 1 FT #SUB 111 111 ILE b 637 2637 VAL B Protein S 3 FT #SUB 111 111 ILE b 638 2638 ARG B Protein S 1 FT #SUB 111 111 ILE b 691 2691 GLN B Protein S 3 FT #SUB 112 112 ASP b 691 2691 GLN B Protein B 1 FT #SUB 116 116 LYS b 693 2693 TYR B Protein B 2 FT #SUB 117 117 LYS b 632 2632 GLU B Protein S 3 FT #SUB 117 117 LYS b 634 2634 THR B Protein S 6 FT #SUB 117 117 LYS b 693 2693 TYR B Protein S 3 FT #SUB 118 118 ALA b 634 2634 THR B Protein A 2 FT #SUB 119 119 LYS b 635 2635 TYR B Protein S 7 FT #SUB 124 124 TYR b 610 2610 ASP B Protein S 3 FT #SUB 136 136 ALA b 619 2619 VAL B Protein S 1 FT #SUB 137 137 ARG b 610 2610 ASP B Protein B 1 FT #SUB 137 137 ARG b 619 2619 VAL B Protein B 1 FT #SUB 138 138 ALA b 612 2612 TYR B Protein S 3 FT #SUB 138 138 ALA b 619 2619 VAL B Protein A 3 FT #SUB 189 189 ARG b 612 2612 TYR B Protein S 5 FT #SUB 189 189 ARG b 617 2617 ASP B Protein A 5 FT #SUB 190 190 PHE b 617 2617 ASP B Protein A 5 FT #SUB 190 190 PHE b 619 2619 VAL B Protein S 3 FT #SUB 191 191 THR b 617 2617 ASP B Protein S 1 FT #SUB 388 388 GLY b 136 2136 THR E Protein B 4 FT #SUB 389 389 THR b 136 2136 THR E Protein A 7 FT #SUB 1070 1070 ALA b 871 2871 PHE Z Protein B 1 FT #SUB 1071 1071 ASN b 901 2901 LEU Z Protein B 1 FT #SUB 1071 1071 ASN b 902 2902 ILE Z Protein B 3 FT #SUB 1071 1071 ASN b 903 2903 TYR Z Protein A 5 FT #SUB 1071 1071 ASN b 905 2905 PRO Z Protein S 2 FT #SUB 1073 1073 ALA b 901 2901 LEU Z Protein B 1 FT #SUB 1074 1074 ILE b 870 2870 LEU Z Protein S 2 FT #SUB 1074 1074 ILE b 871 2871 PHE Z Protein S 2 FT #SUB 1074 1074 ILE b 901 2901 LEU Z Protein A 4 FT #SUB 1075 1075 GLU b 899 2899 PRO Z Protein S 3 FT #SUB 1075 1075 GLU b 900 2900 SER Z Protein S 6 FT #SUB 1075 1075 GLU b 901 2901 LEU Z Protein S 1 FT #SUB 1078 1078 ARG b 870 2870 LEU Z Protein S 2 FT #SUB 1078 1078 ARG b 872 2872 GLU Z Protein S 3 FT #SUB 1078 1078 ARG b 876 2876 LYS Z Protein S 3 FT #SUB 1078 1078 ARG b 877 2877 ILE Z Protein S 7 FT #SUB 1103 1103 PHE b 871 2871 PHE Z Protein S 5 FT #SUB 1147 1147 MET b 849 2849 ASP Z Protein S 2 FT #SUB 1164 1164 ASP b 744 2744 PHE Z Protein S 2 FT #SUB 1187 1187 LYS b 847 2847 HIS Z Protein S 3 FT #SUB 1188 1188 PHE b 809 2809 LEU Z Protein B 1 FT #SUB 1189 1189 ASP b 809 2809 LEU Z Protein B 2 FT #SUB 1189 1189 ASP b 849 2849 ASP Z Protein B 1 FT #SUB 1189 1189 ASP b 850 2850 ARG Z Protein B 1 FT #SUB 1189 1189 ASP b 851 2851 ASN Z Protein S 2 FT #SUB 1191 1191 PRO b 849 2849 ASP Z Protein S 1 FT #SUB 1207 1207 TYR b 734 2734 LYS Z Protein S 3 FT #SUB 1207 1207 TYR b 739 2739 LEU Z Protein A 5 FT #SUB 1208 1208 THR b 734 2734 LYS Z Protein S 3 FT #SUB 1208 1208 THR b 743 2743 ALA Z Protein B 1 FT #SUB 1210 1210 LYS b 743 2743 ALA Z Protein A 4 FT #SUB 1210 1210 LYS b 744 2744 PHE Z Protein S 3 FT #SUB 1211 1211 TYR b 739 2739 LEU Z Protein S 3 FT #SUB 1211 1211 TYR b 740 2740 ASP Z Protein B 3 FT #SUB 1211 1211 TYR b 744 2744 PHE Z Protein B 1 FT #SUB 1212 1212 HIS b 740 2740 ASP Z Protein B 1 FT #SUB 1212 1212 HIS b 744 2744 PHE Z Protein S 1 FT #SUB 1213 1213 VAL b 740 2740 ASP Z Protein A 3 FT #SUB 1230 1230 LEU b 847 2847 HIS Z Protein S 3 FT #SUB 1232 1232 THR b 738 2738 ALA Z Protein B 2 FT #SUB 1232 1232 THR b 740 2740 ASP Z Protein B 2 FT #SUB 1233 1233 SER b 738 2738 ALA Z Protein A 4 FT #SUB 1233 1233 SER b 849 2849 ASP Z Protein S 2 FT #SUB 1234 1234 VAL b 738 2738 ALA Z Protein B 4 FT #SUB 1234 1234 VAL b 739 2739 LEU Z Protein B 4 FT #SUB 1234 1234 VAL b 740 2740 ASP Z Protein S 1 FT #SUB 1236 1236 TYR b 736 2736 TYR Z Protein A 5 FT #SUB 1236 1236 TYR b 739 2739 LEU Z Protein S 1 FT #SUB 1245 1245 GLU b 729 2729 LYS Z Protein A 7 FT #SUB 1245 1245 GLU b 730 2730 LEU Z Protein A 3 FT #SUB 1245 1245 GLU b 731 2731 PRO Z Protein S 1 FT #SUB 1245 1245 GLU b 736 2736 TYR Z Protein S 2 FT #SUB 1481 1481 GLU b 458 2458 PHE Z Protein S 1 FT #SUB 1481 1481 GLU b 459 2459 ASP Z Protein S 1 FT #SUB 1483 1483 ASN b 487 2487 VAL Z Protein B 2 FT #SUB 1483 1483 ASN b 488 2488 ARG Z Protein A 10 FT #SUB 1483 1483 ASN b 490 2490 PRO Z Protein S 4 FT #SUB 1484 1484 CYS b 486 2486 ILE Z Protein B 2 FT #SUB 1484 1484 CYS b 487 2487 VAL Z Protein B 2 FT #SUB 1485 1485 ALA b 485 2485 THR Z Protein A 2 FT #SUB 1485 1485 ALA b 486 2486 ILE Z Protein B 3 FT #SUB 1485 1485 ALA b 487 2487 VAL Z Protein B 1 FT #SUB 1486 1486 LEU b 458 2458 PHE Z Protein S 2 FT #SUB 1486 1486 LEU b 486 2486 ILE Z Protein A 4 FT #SUB 1487 1487 PRO b 486 2486 ILE Z Protein S 6 FT #SUB 1490 1490 ASN b 461 2461 HIS Z Protein S 5 FT #SUB 1558 1558 LEU b 440 2440 ASP Z Protein S 2 FT #SUB 1602 1602 GLN b 401 2401 GLU Z Protein S 3 FT #SUB 1604 1604 ASP b 399 2399 LEU Z Protein A 2 FT #SUB 1604 1604 ASP b 440 2440 ASP Z Protein B 1 FT #SUB 1604 1604 ASP b 441 2441 ARG Z Protein B 2 FT #SUB 1604 1604 ASP b 442 2442 LEU Z Protein A 6 FT #SUB 1605 1605 ARG b 440 2440 ASP Z Protein B 2 FT #SUB 1606 1606 LEU b 440 2440 ASP Z Protein A 5 FT #SUB 1623 1623 ASN b 321 2321 GLU Z Protein S 1 FT #SUB 1625 1625 HIS b 330 2330 SER Z Protein S 2 FT #SUB 1625 1625 HIS b 353 2353 LYS Z Protein S 1 FT #SUB 1647 1647 ASN b 327 2327 GLU Z Protein B 1 FT #SUB 1648 1648 PRO b 327 2327 GLU Z Protein B 3 FT #SUB 1649 1649 THR b 325 2325 ALA Z Protein S 2 FT #SUB 1649 1649 THR b 327 2327 GLU Z Protein A 10 FT #SUB 1650 1650 ILE b 323 2323 ASN Z Protein B 1 FT #SUB 1650 1650 ILE b 324 2324 CYS Z Protein B 1 FT #SUB 1650 1650 ILE b 325 2325 ALA Z Protein B 1 FT #SUB 1650 1650 ILE b 326 2326 ILE Z Protein A 5 FT #SUB 1650 1650 ILE b 327 2327 GLU Z Protein B 1 FT #SUB 1651 1651 ILE b 323 2323 ASN Z Protein B 2 FT #SUB 1652 1652 PHE b 323 2323 ASN Z Protein A 12 FT #SUB 1652 1652 PHE b 326 2326 ILE Z Protein S 1 FT #SUB 1654 1654 PRO b 323 2323 ASN Z Protein S 2 FT #SUB 416 416 ASN b 276 3196 THR a Protein S 12 FT #SUB 419 419 THR b 279 3199 GLN a Protein S 1 FT #SUB 1533 1533 ALA b 354 3274 ARG a Protein B 1 FT #SUB 1535 1535 LEU b 352 3272 HIS a Protein S 3 FT #SUB 1539 1539 GLN b 348 3268 LYS a Protein S 1 FT #HET 41 41 HIS b 204 2102 CUO b S 6 FT #HET 58 58 CYS b 204 2102 CUO b S 1 FT #HET 60 60 HIS b 204 2102 CUO b S 5 FT #HET 69 69 HIS b 204 2102 CUO b S 9 FT #HET 179 179 HIS b 204 2102 CUO b S 4 FT #HET 183 183 HIS b 204 2102 CUO b S 6 FT #HET 206 206 PHE b 204 2102 CUO b S 3 FT #HET 210 210 HIS b 204 2102 CUO b S 10 FT #HET 311 311 ILE b 119 2 NAG NA B 1 FT #HET 313 313 THR b 118 1 NAG NA S 2 FT #HET 389 389 THR b 118 1 NAG NA S 2 FT #HET 391 391 MET b 118 1 NAG NA S 1 FT #HET 391 391 MET b 119 2 NAG NA S 3 FT #HET 462 462 HIS b 205 2103 CUO b S 6 FT #HET 480 480 CYS b 205 2103 CUO b S 1 FT #HET 482 482 HIS b 205 2103 CUO b S 3 FT #HET 491 491 HIS b 205 2103 CUO b S 7 FT #HET 603 603 HIS b 205 2103 CUO b S 5 FT #HET 607 607 HIS b 205 2103 CUO b S 6 FT #HET 630 630 PHE b 205 2103 CUO b S 3 FT #HET 634 634 HIS b 205 2103 CUO b S 7 FT #HET 763 763 LEU b 205 2103 CUO b S 1 FT #HET 804 804 ASP b 121 1 NAG OA S 6 FT #HET 808 808 THR b 121 1 NAG OA S 4 FT #HET 810 810 LEU b 121 1 NAG OA S 2 FT #HET 810 810 LEU b 122 2 NAG OA S 1 FT #HET 877 877 HIS b 206 2104 CUO b S 6 FT #HET 895 895 CYS b 206 2104 CUO b S 1 FT #HET 897 897 HIS b 206 2104 CUO b S 3 FT #HET 906 906 HIS b 206 2104 CUO b S 7 FT #HET 1015 1015 HIS b 206 2104 CUO b S 6 FT #HET 1019 1019 HIS b 206 2104 CUO b S 6 FT #HET 1042 1042 PHE b 206 2104 CUO b S 3 FT #HET 1046 1046 HIS b 206 2104 CUO b S 7 FT #HET 1294 1294 HIS b 207 2105 CUO b S 7 FT #HET 1298 1298 ALA b 124 2 NAG PA B 1 FT #HET 1299 1299 GLN b 123 1 NAG PA B 4 FT #HET 1301 1301 PRO b 123 1 NAG PA S 3 FT #HET 1312 1312 CYS b 207 2105 CUO b S 1 FT #HET 1314 1314 HIS b 207 2105 CUO b S 5 FT #HET 1323 1323 HIS b 207 2105 CUO b S 9 FT #HET 1427 1427 HIS b 207 2105 CUO b S 5 FT #HET 1431 1431 HIS b 207 2105 CUO b S 6 FT #HET 1454 1454 PHE b 207 2105 CUO b S 5 FT #HET 1458 1458 HIS b 207 2105 CUO b S 7 FT #HET 1494 1494 ARG b 123 1 NAG PA S 5 FT #HET 1500 1500 THR b 123 1 NAG PA A 8 FT #HET 1501 1501 ALA b 123 1 NAG PA B 1 FT #HET 1564 1564 ALA b 127 1 NAG QA S 1 FT #HET 1565 1565 SER b 127 1 NAG QA B 1 FT #HET 1634 1634 ALA b 127 1 NAG QA S 3 FT #HET 1635 1635 VAL b 127 1 NAG QA A 2 FT #HET 1639 1639 LYS b 127 1 NAG QA S 9 FT #MOD 387 387 ASN b 118 1 NAG NA S FT #MOD 806 806 ASN b 121 1 NAG OA S FT #MOD 1498 1498 ASN b 123 1 NAG PA S FT #MOD 1636 1636 ASN b 127 1 NAG QA S FT DISORDER 1657 2000 CC SEQUENCE 1656 AA (ATOM); CC NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH CC GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK CC NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN CC PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF CC HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA CC RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH CC EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS CC SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC CC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET CC KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE CC VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA CC NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV CC EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE CC ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR CC KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA CC TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF CC SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA CC LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP CC MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN CC RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM CC DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR CC KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP CC HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG CC KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH CC SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST CC ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH CC GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL CC NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSD CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ATOM NLIRKNVDTLTPDEILNLQVSLRAMQDDEGASGYQAISAYHGEPADCKAA CC ************************************************** CC SEQRES DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ATOM DGSTIVCCLHGMPTFPMWHRLYLVQFEQALVAHGSTLGIPYWDWTKPMTQ CC ************************************************** CC SEQRES LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ATOM LPELVQHPLFIDPTGKKAKKNVFYSGEIKFENKVTARAVDARLYQASQEG CC ************************************************** CC SEQRES QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ATOM QKNFLLEGVLNALEQEDFCHFEVQLEVAHNPIHYLVGGRFTHSMSSLEYT CC ************************************************** CC SEQRES SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ATOM SYDPLFFLHHSNVERHFALWQALQKHRGLPTRPNCGLNLFHSPMEPFGRD CC ************************************************** CC SEQRES TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ATOM TNPFAITKDNSKASSLFDYEHLGYAFDDLSLNGMTIEELEALLKQRRSGA CC ************************************************** CC SEQRES RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ATOM RAFANFRLGGIKTSANVRIKLCIPTEDKRQSDNCDNDAGQFFILGGTNEM CC ************************************************** CC SEQRES PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ATOM PWNFAFPYLHEITDTVLSLGLALDSNYYVTAEVTAINGTLMPTQTIPRPI CC ************************************************** CC SEQRES VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ATOM VTYIPPQGFKDVNMVNMDTSSLRFRKDISSLTTEEEYELRVAMERFMSDK CC ************************************************** CC SEQRES SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ATOM SINGYQALAEFHGLPAKCPRPDALNRVACCIHGMATFPHWHRLVVMQFEN CC ************************************************** CC SEQRES ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ATOM ALFTRGSPIGVPYWDWTKPFTALPSLLADETYVDPYTKETKPNPFFKAPI CC ************************************************** CC SEQRES EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ATOM EFLKAGVHTSRQIDERLFKQPSKGDHGFLYDGLSLAFEQDDFCDFEVQFE CC ************************************************** CC SEQRES VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ATOM VTHNSIHAWTGGSEPYSMSSLHYTSFDPMFWLHHSQVDRLWAIWQALQIQ CC ************************************************** CC SEQRES RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ATOM RGKPYKTYCANSEVYRPMKPFAFKSPLNNDEKTREHSVPTDVYDYQAELA CC ************************************************** CC SEQRES YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ATOM YTYDTLFFGGLSIRELQRYVEEAKSKDRVFAGFLLMGIQTSANVDLFVVA CC ************************************************** CC SEQRES GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ATOM GGNEFFVGSIAVLGGSKEMTWRFDRVYKHEITDALGALGVDMFAEYTLRV CC ************************************************** CC SEQRES DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ATOM DIKDVNGTALPPTAIPPPIVIFVPGIADANVKFDEQHRSRKNVDSMTVSE CC ************************************************** CC SEQRES MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ATOM MNALRTAMAAFAADKEVTGYQQVAAFHGSTQWCPSPDAAQKYACCHHGMA CC ************************************************** CC SEQRES TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ATOM TFPHWHRLIALNFENGLRRNGWSGGLPYWDWTRPIDALPALVLEAEYTDA CC ************************************************** CC SEQRES NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ATOM NGEAKPNPFFSGAIDSIGASTSRAPTEALYEKPDFGKYTHLANEIISALE CC ************************************************** CC SEQRES QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ATOM QEDFCDFEVQYEIAHNHIHALVGGTEAVSMASLEYSAFDPIFMLHHSNVD CC ************************************************** CC SEQRES RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ATOM RIWATWQALQKFRGKAYNSANCAIEILRKPMSPFSLASDINPDAMTREYS CC ************************************************** CC SEQRES VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ATOM VPFDVFNYKKNFHYEYDTLELNGLSIAQLSREINRRKAKNRVAVTFMLEG CC ************************************************** CC SEQRES LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ATOM LKKSLLVEYFIAADGTDQKMKAGEFYVLGSENEMPWKFDRPYKSDITYVM CC ************************************************** CC SEQRES DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ATOM DAMKLHYTDKYHVELRITDMTGAEVTDLKLVTSVIYEPGIGNFGEGRRWI CC ************************************************** CC SEQRES SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ATOM SPITSASRIRKNLLDFEDGEMESLRNAFKQMADEGRYEEIASFHGVPAQC CC ************************************************** CC SEQRES PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ATOM PSEDGTMVHTCCLHGMPVFPHWHRLYVSLVEDELLARGSGVAVPYWDWVE CC ************************************************** CC SEQRES PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ATOM PFDELPRLINEATFYNSRTLQIEPNPFFKGKISFENAETDRDTQPELFGN CC ************************************************** CC SEQRES RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ATOM RYLYDHTLFVFEQTDFCEFEVHYEVLHNTIHSWLGGRDVHSMSSLDYAAY CC ************************************************** CC SEQRES DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ATOM DPVFFLHHSNVDRLWAIWQELQRYRKLSYNEANCALPLMNQPMRPFSNST CC ************************************************** CC SEQRES ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ATOM ANNDRLTFTNSRPNDVFDYQNVLHYKYDTLNFAGLSIPQLERILQKNQGR CC ************************************************** CC SEQRES DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ATOM DRIFAGFLLHGIKASADVRIYICVPTGIGEENCGNYAGIFSVLGGETEMP CC ************************************************** CC SEQRES WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ATOM WQFDRLFRYEITDELKKLGLNQNSHFRVEMELTAVNGSKITQKIFPNPTI CC ************************************************** CC SEQRES IFVPSDVEFEEDTWRDVVTSANRIRRNLKDLSKEDMFSLRAAFKRMTDDG CC ATOM IFVPSD-------------------------------------------- CC ****** CC SEQRES RYEEIAAFHGLPAQCPNADGSNIHTCCLHGMPTFPHWHRLYLSLVENELL CC ATOM -------------------------------------------------- CC CC SEQRES ARGSDVAVPYWDWIEPFDSLPGLISDETYKHPKTNEDIENPFHHGKISFA CC ATOM -------------------------------------------------- CC CC SEQRES DAVTVRKPRDQLFNNRYLYEHALFAFEHTDFCDFEVHFEVLHNSIHSWIG CC ATOM -------------------------------------------------- CC CC SEQRES GPNPHSMSSLDFAAYDPIFFLHHSTVDRLWAIWQDLQRYRKLDYNVANCA CC ATOM -------------------------------------------------- CC CC SEQRES LNLLNDPMRPFNNKTANQDHLTFTNSRPNDVFDYQNSLNYKFDSLSFSGL CC ATOM -------------------------------------------------- CC CC SEQRES SIPRLDDLLESRQSHDRVFAGFWLSGIKASADVNIHICVPIGVEHEDCDN CC ATOM -------------------------------------------------- CC CC SEQRES CC ATOM CC SQ SEQUENCE 2000 AA; MW; CN; NLIRKNVDTL TPDEILNLQV SLRAMQDDEG ASGYQAISAY HGEPADCKAA DGSTIVCCLH GMPTFPMWHR LYLVQFEQAL VAHGSTLGIP YWDWTKPMTQ LPELVQHPLF IDPTGKKAKK NVFYSGEIKF ENKVTARAVD ARLYQASQEG QKNFLLEGVL NALEQEDFCH FEVQLEVAHN PIHYLVGGRF THSMSSLEYT SYDPLFFLHH SNVERHFALW QALQKHRGLP TRPNCGLNLF HSPMEPFGRD TNPFAITKDN SKASSLFDYE HLGYAFDDLS LNGMTIEELE ALLKQRRSGA RAFANFRLGG IKTSANVRIK LCIPTEDKRQ SDNCDNDAGQ FFILGGTNEM PWNFAFPYLH EITDTVLSLG LALDSNYYVT AEVTAINGTL MPTQTIPRPI VTYIPPQGFK DVNMVNMDTS SLRFRKDISS LTTEEEYELR VAMERFMSDK SINGYQALAE FHGLPAKCPR PDALNRVACC IHGMATFPHW HRLVVMQFEN ALFTRGSPIG VPYWDWTKPF TALPSLLADE TYVDPYTKET KPNPFFKAPI EFLKAGVHTS RQIDERLFKQ PSKGDHGFLY DGLSLAFEQD DFCDFEVQFE VTHNSIHAWT GGSEPYSMSS LHYTSFDPMF WLHHSQVDRL WAIWQALQIQ RGKPYKTYCA NSEVYRPMKP FAFKSPLNND EKTREHSVPT DVYDYQAELA YTYDTLFFGG LSIRELQRYV EEAKSKDRVF AGFLLMGIQT SANVDLFVVA GGNEFFVGSI AVLGGSKEMT WRFDRVYKHE ITDALGALGV DMFAEYTLRV DIKDVNGTAL PPTAIPPPIV IFVPGIADAN VKFDEQHRSR KNVDSMTVSE MNALRTAMAA FAADKEVTGY QQVAAFHGST QWCPSPDAAQ KYACCHHGMA TFPHWHRLIA LNFENGLRRN GWSGGLPYWD WTRPIDALPA LVLEAEYTDA NGEAKPNPFF SGAIDSIGAS TSRAPTEALY EKPDFGKYTH LANEIISALE QEDFCDFEVQ YEIAHNHIHA LVGGTEAVSM ASLEYSAFDP IFMLHHSNVD RIWATWQALQ KFRGKAYNSA NCAIEILRKP MSPFSLASDI NPDAMTREYS VPFDVFNYKK NFHYEYDTLE LNGLSIAQLS REINRRKAKN RVAVTFMLEG LKKSLLVEYF IAADGTDQKM KAGEFYVLGS ENEMPWKFDR PYKSDITYVM DAMKLHYTDK YHVELRITDM TGAEVTDLKL VTSVIYEPGI GNFGEGRRWI SPITSASRIR KNLLDFEDGE MESLRNAFKQ MADEGRYEEI ASFHGVPAQC PSEDGTMVHT CCLHGMPVFP HWHRLYVSLV EDELLARGSG VAVPYWDWVE PFDELPRLIN EATFYNSRTL QIEPNPFFKG KISFENAETD RDTQPELFGN RYLYDHTLFV FEQTDFCEFE VHYEVLHNTI HSWLGGRDVH SMSSLDYAAY DPVFFLHHSN VDRLWAIWQE LQRYRKLSYN EANCALPLMN QPMRPFSNST ANNDRLTFTN SRPNDVFDYQ NVLHYKYDTL NFAGLSIPQL ERILQKNQGR DRIFAGFLLH GIKASADVRI YICVPTGIGE ENCGNYAGIF SVLGGETEMP WQFDRLFRYE ITDELKKLGL NQNSHFRVEM ELTAVNGSKI TQKIFPNPTI IFVPSDVEFE EDTWRDVVTS ANRIRRNLKD LSKEDMFSLR AAFKRMTDDG RYEEIAAFHG LPAQCPNADG SNIHTCCLHG MPTFPHWHRL YLSLVENELL ARGSDVAVPY WDWIEPFDSL PGLISDETYK HPKTNEDIEN PFHHGKISFA DAVTVRKPRD QLFNNRYLYE HALFAFEHTD FCDFEVHFEV LHNSIHSWIG GPNPHSMSSL DFAAYDPIFF LHHSTVDRLW AIWQDLQRYR KLDYNVANCA LNLLNDPMRP FNNKTANQDH LTFTNSRPND VFDYQNSLNY KFDSLSFSGL SIPRLDDLLE SRQSHDRVFA GFWLSGIKAS ADVNIHICVP IGVEHEDCDN // ID 4YD9c STANDARD; PRT; 920 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 609 2609 ALA c 106 106 GLN A Protein S 1 FT #SUB 610 2610 ASP c 124 124 TYR A Protein S 3 FT #SUB 610 2610 ASP c 137 137 ARG A Protein S 1 FT #SUB 612 2612 TYR c 138 138 ALA A Protein S 3 FT #SUB 612 2612 TYR c 189 189 ARG A Protein S 7 FT #SUB 617 2617 ASP c 189 189 ARG A Protein S 5 FT #SUB 617 2617 ASP c 190 190 PHE A Protein B 2 FT #SUB 617 2617 ASP c 191 191 THR A Protein S 1 FT #SUB 618 2618 SER c 190 190 PHE A Protein B 1 FT #SUB 619 2619 VAL c 136 136 ALA A Protein S 1 FT #SUB 619 2619 VAL c 138 138 ALA A Protein S 2 FT #SUB 619 2619 VAL c 190 190 PHE A Protein S 3 FT #SUB 625 2625 LEU c 107 107 HIS A Protein S 1 FT #SUB 626 2626 ARG c 108 108 PRO A Protein S 4 FT #SUB 626 2626 ARG c 109 109 LEU A Protein S 3 FT #SUB 632 2632 GLU c 117 117 LYS A Protein B 3 FT #SUB 634 2634 THR c 117 117 LYS A Protein S 6 FT #SUB 635 2635 TYR c 109 109 LEU A Protein S 1 FT #SUB 636 2636 THR c 109 109 LEU A Protein B 1 FT #SUB 637 2637 VAL c 111 111 ILE A Protein S 1 FT #SUB 692 2692 LYS c 116 116 LYS A Protein S 3 FT #SUB 693 2693 TYR c 116 116 LYS A Protein S 2 FT #SUB 693 2693 TYR c 117 117 LYS A Protein S 2 FT #SUB 136 2136 THR c 387 387 ASN V Protein S 1 FT #SUB 136 2136 THR c 388 388 GLY V Protein S 6 FT #SUB 136 2136 THR c 389 389 THR V Protein A 10 FT #SUB 138 2138 LYS c 389 389 THR V Protein S 1 FT #SUB 321 2321 GLU c 1623 1623 ASN Y Protein S 1 FT #SUB 323 2323 ASN c 1651 1651 ILE Y Protein B 2 FT #SUB 323 2323 ASN c 1652 1652 PHE Y Protein A 9 FT #SUB 323 2323 ASN c 1654 1654 PRO Y Protein S 2 FT #SUB 325 2325 ALA c 1649 1649 THR Y Protein B 2 FT #SUB 325 2325 ALA c 1650 1650 ILE Y Protein B 2 FT #SUB 326 2326 ILE c 1622 1622 GLN Y Protein S 2 FT #SUB 326 2326 ILE c 1650 1650 ILE Y Protein A 5 FT #SUB 327 2327 GLU c 1648 1648 PRO Y Protein S 3 FT #SUB 327 2327 GLU c 1649 1649 THR Y Protein S 5 FT #SUB 327 2327 GLU c 1650 1650 ILE Y Protein S 6 FT #SUB 399 2399 LEU c 1603 1603 PHE Y Protein S 1 FT #SUB 399 2399 LEU c 1604 1604 ASP Y Protein S 3 FT #SUB 440 2440 ASP c 1558 1558 LEU Y Protein A 2 FT #SUB 440 2440 ASP c 1604 1604 ASP Y Protein B 1 FT #SUB 440 2440 ASP c 1605 1605 ARG Y Protein B 1 FT #SUB 440 2440 ASP c 1606 1606 LEU Y Protein A 6 FT #SUB 441 2441 ARG c 1604 1604 ASP Y Protein B 2 FT #SUB 442 2442 LEU c 1604 1604 ASP Y Protein A 5 FT #SUB 458 2458 PHE c 1481 1481 GLU Y Protein S 1 FT #SUB 458 2458 PHE c 1486 1486 LEU Y Protein A 3 FT #SUB 459 2459 ASP c 1481 1481 GLU Y Protein S 1 FT #SUB 461 2461 HIS c 1490 1490 ASN Y Protein A 9 FT #SUB 461 2461 HIS c 1512 1512 ARG Y Protein S 1 FT #SUB 486 2486 ILE c 1483 1483 ASN Y Protein B 1 FT #SUB 486 2486 ILE c 1484 1484 CYS Y Protein B 3 FT #SUB 486 2486 ILE c 1486 1486 LEU Y Protein A 5 FT #SUB 486 2486 ILE c 1487 1487 PRO Y Protein A 9 FT #SUB 487 2487 VAL c 1483 1483 ASN Y Protein B 2 FT #SUB 487 2487 VAL c 1484 1484 CYS Y Protein A 2 FT #SUB 488 2488 ARG c 1483 1483 ASN Y Protein A 9 FT #SUB 488 2488 ARG c 1486 1486 LEU Y Protein B 1 FT #SUB 490 2490 PRO c 1483 1483 ASN Y Protein S 2 FT #SUB 729 2729 LYS c 1245 1245 GLU Y Protein B 2 FT #SUB 730 2730 LEU c 1245 1245 GLU Y Protein A 3 FT #SUB 731 2731 PRO c 1245 1245 GLU Y Protein S 3 FT #SUB 736 2736 TYR c 1234 1234 VAL Y Protein B 1 FT #SUB 736 2736 TYR c 1235 1235 ILE Y Protein B 1 FT #SUB 736 2736 TYR c 1236 1236 TYR Y Protein A 7 FT #SUB 736 2736 TYR c 1238 1238 PRO Y Protein S 1 FT #SUB 736 2736 TYR c 1245 1245 GLU Y Protein S 3 FT #SUB 738 2738 ALA c 1234 1234 VAL Y Protein B 1 FT #SUB 739 2739 LEU c 1207 1207 TYR Y Protein S 3 FT #SUB 739 2739 LEU c 1211 1211 TYR Y Protein S 2 FT #SUB 739 2739 LEU c 1234 1234 VAL Y Protein A 5 FT #SUB 740 2740 ASP c 1232 1232 THR Y Protein S 13 FT #SUB 741 2741 GLN c 1232 1232 THR Y Protein S 1 FT #SUB 743 2743 ALA c 1209 1209 ASP Y Protein S 2 FT #SUB 744 2744 PHE c 1210 1210 LYS Y Protein S 4 FT #SUB 744 2744 PHE c 1211 1211 TYR Y Protein S 5 FT #SUB 809 2809 LEU c 1188 1188 PHE Y Protein S 1 FT #SUB 809 2809 LEU c 1189 1189 ASP Y Protein S 1 FT #SUB 847 2847 HIS c 1187 1187 LYS Y Protein S 4 FT #SUB 849 2849 ASP c 1147 1147 MET Y Protein B 2 FT #SUB 849 2849 ASP c 1191 1191 PRO Y Protein A 3 FT #SUB 849 2849 ASP c 1233 1233 SER Y Protein S 2 FT #SUB 851 2851 ASN c 1189 1189 ASP Y Protein S 2 FT #SUB 870 2870 LEU c 1074 1074 ILE Y Protein A 2 FT #SUB 870 2870 LEU c 1078 1078 ARG Y Protein B 2 FT #SUB 871 2871 PHE c 1070 1070 ALA Y Protein S 1 FT #SUB 871 2871 PHE c 1071 1071 ASN Y Protein S 1 FT #SUB 871 2871 PHE c 1074 1074 ILE Y Protein S 3 FT #SUB 871 2871 PHE c 1103 1103 PHE Y Protein A 6 FT #SUB 872 2872 GLU c 1078 1078 ARG Y Protein B 1 FT #SUB 873 2873 HIS c 1078 1078 ARG Y Protein A 3 FT #SUB 873 2873 HIS c 1101 1101 VAL Y Protein S 4 FT #SUB 875 2875 SER c 1078 1078 ARG Y Protein A 2 FT #SUB 877 2877 ILE c 1078 1078 ARG Y Protein A 6 FT #SUB 899 2899 PRO c 1075 1075 GLU Y Protein B 2 FT #SUB 900 2900 SER c 1075 1075 GLU Y Protein A 8 FT #SUB 901 2901 LEU c 1073 1073 ALA Y Protein B 2 FT #SUB 901 2901 LEU c 1074 1074 ILE Y Protein A 5 FT #SUB 901 2901 LEU c 1075 1075 GLU Y Protein A 5 FT #SUB 902 2902 ILE c 1071 1071 ASN Y Protein A 3 FT #SUB 903 2903 TYR c 1071 1071 ASN Y Protein A 6 FT #SUB 84 2084 ARG c 502 2502 LEU Z Protein S 3 FT #SUB 90 2090 LYS c 375 2375 GLY Z Protein S 1 FT #SUB 91 2091 ASN c 94 2094 ARG Z Protein S 2 FT #SUB 94 2094 ARG c 91 2091 ASN Z Protein S 1 FT #SUB 94 2094 ARG c 94 2094 ARG Z Protein S 3 FT #SUB 94 2094 ARG c 182 2182 TYR Z Protein A 5 FT #SUB 94 2094 ARG c 370 2370 MET Z Protein S 7 FT #SUB 96 2096 SER c 374 2374 ASN Z Protein S 3 FT #SUB 97 2097 LEU c 235 2235 PRO Z Protein S 2 FT #SUB 98 2098 GLN c 374 2374 ASN Z Protein S 1 FT #SUB 98 2098 GLN c 376 2376 MET Z Protein S 1 FT #SUB 99 2099 GLU c 375 2375 GLY Z Protein S 1 FT #SUB 109 2109 ARG c 503 2503 ASN Z Protein S 2 FT #SUB 109 2109 ARG c 504 2504 ARG Z Protein S 1 FT #SUB 109 2109 ARG c 582 2582 ARG Z Protein S 3 FT #SUB 109 2109 ARG c 585 2585 GLY Z Protein B 1 FT #SUB 112 2112 LYS c 583 2583 ARG Z Protein S 2 FT #SUB 112 2112 LYS c 584 2584 HIS Z Protein B 1 FT #SUB 113 2113 ASP c 519 2519 ASN Z Protein S 3 FT #SUB 113 2113 ASP c 585 2585 GLY Z Protein S 4 FT #SUB 114 2114 ARG c 526 2526 ARG Z Protein S 5 FT #SUB 115 2115 SER c 518 2518 GLN Z Protein S 2 FT #SUB 115 2115 SER c 519 2519 ASN Z Protein S 3 FT #SUB 115 2115 SER c 522 2522 SER Z Protein B 3 FT #SUB 116 2116 SER c 615 2615 TRP Z Protein S 1 FT #SUB 121 2121 THR c 615 2615 TRP Z Protein S 4 FT #SUB 125 2125 PHE c 615 2615 TRP Z Protein S 1 FT #SUB 131 2131 LEU c 615 2615 TRP Z Protein S 1 FT #SUB 167 2167 ASN c 502 2502 LEU Z Protein A 4 FT #SUB 168 2168 ARG c 502 2502 LEU Z Protein B 1 FT #SUB 168 2168 ARG c 503 2503 ASN Z Protein B 4 FT #SUB 168 2168 ARG c 515 2515 ARG Z Protein S 13 FT #SUB 168 2168 ARG c 587 2587 SER Z Protein B 4 FT #SUB 169 2169 HIS c 503 2503 ASN Z Protein B 4 FT #SUB 169 2169 HIS c 585 2585 GLY Z Protein S 1 FT #SUB 169 2169 HIS c 587 2587 SER Z Protein S 5 FT #SUB 170 2170 GLY c 503 2503 ASN Z Protein B 1 FT #SUB 182 2182 TYR c 94 2094 ARG Z Protein S 7 FT #SUB 199 2199 PRO c 236 2236 ALA Z Protein B 3 FT #SUB 199 2199 PRO c 237 2237 LEU Z Protein B 2 FT #SUB 200 2200 PHE c 236 2236 ALA Z Protein B 1 FT #SUB 200 2200 PHE c 237 2237 LEU Z Protein A 9 FT #SUB 235 2235 PRO c 97 2097 LEU Z Protein S 1 FT #SUB 235 2235 PRO c 199 2199 PRO Z Protein B 1 FT #SUB 236 2236 ALA c 199 2199 PRO Z Protein B 2 FT #SUB 237 2237 LEU c 199 2199 PRO Z Protein B 2 FT #SUB 237 2237 LEU c 200 2200 PHE Z Protein A 8 FT #SUB 237 2237 LEU c 201 2201 THR Z Protein S 2 FT #SUB 237 2237 LEU c 202 2202 GLY Z Protein S 1 FT #SUB 244 2244 PHE c 98 2098 GLN Z Protein S 1 FT #SUB 341 2341 PRO c 614 2614 ALA Z Protein S 2 FT #SUB 341 2341 PRO c 615 2615 TRP Z Protein B 1 FT #SUB 342 2342 TYR c 614 2614 ALA Z Protein S 1 FT #SUB 342 2342 TYR c 615 2615 TRP Z Protein B 4 FT #SUB 344 2344 LEU c 514 2514 SER Z Protein S 2 FT #SUB 344 2344 LEU c 515 2515 ARG Z Protein A 8 FT #SUB 344 2344 LEU c 518 2518 GLN Z Protein S 1 FT #SUB 345 2345 ASN c 515 2515 ARG Z Protein S 2 FT #SUB 346 2346 PRO c 515 2515 ARG Z Protein S 4 FT #SUB 370 2370 MET c 94 2094 ARG Z Protein S 7 FT #SUB 374 2374 ASN c 96 2096 SER Z Protein A 3 FT #SUB 374 2374 ASN c 98 2098 GLN Z Protein S 3 FT #SUB 375 2375 GLY c 90 2090 LYS Z Protein B 1 FT #SUB 375 2375 GLY c 99 2099 GLU Z Protein B 1 FT #SUB 376 2376 MET c 98 2098 GLN Z Protein S 1 FT #SUB 500 2500 ILE c 84 2084 ARG Z Protein B 2 FT #SUB 502 2502 LEU c 84 2084 ARG Z Protein S 4 FT #SUB 502 2502 LEU c 167 2167 ASN Z Protein S 2 FT #SUB 502 2502 LEU c 168 2168 ARG Z Protein S 3 FT #SUB 503 2503 ASN c 109 2109 ARG Z Protein S 2 FT #SUB 503 2503 ASN c 168 2168 ARG Z Protein A 4 FT #SUB 503 2503 ASN c 169 2169 HIS Z Protein S 2 FT #SUB 503 2503 ASN c 170 2170 GLY Z Protein S 2 FT #SUB 513 2513 GLU c 346 2346 PRO Z Protein S 1 FT #SUB 514 2514 SER c 344 2344 LEU Z Protein A 3 FT #SUB 515 2515 ARG c 168 2168 ARG Z Protein S 11 FT #SUB 515 2515 ARG c 344 2344 LEU Z Protein A 5 FT #SUB 515 2515 ARG c 345 2345 ASN Z Protein S 2 FT #SUB 515 2515 ARG c 346 2346 PRO Z Protein S 7 FT #SUB 516 2516 ASP c 168 2168 ARG Z Protein S 1 FT #SUB 518 2518 GLN c 115 2115 SER Z Protein B 1 FT #SUB 519 2519 ASN c 113 2113 ASP Z Protein S 3 FT #SUB 519 2519 ASN c 115 2115 SER Z Protein A 4 FT #SUB 522 2522 SER c 115 2115 SER Z Protein S 4 FT #SUB 526 2526 ARG c 114 2114 ARG Z Protein S 5 FT #SUB 582 2582 ARG c 109 2109 ARG Z Protein B 1 FT #SUB 583 2583 ARG c 112 2112 LYS Z Protein B 3 FT #SUB 584 2584 HIS c 112 2112 LYS Z Protein B 1 FT #SUB 585 2585 GLY c 109 2109 ARG Z Protein B 1 FT #SUB 585 2585 GLY c 113 2113 ASP Z Protein B 7 FT #SUB 585 2585 GLY c 169 2169 HIS Z Protein B 2 FT #SUB 587 2587 SER c 168 2168 ARG Z Protein A 4 FT #SUB 587 2587 SER c 169 2169 HIS Z Protein S 3 FT #SUB 614 2614 ALA c 341 2341 PRO Z Protein S 2 FT #SUB 615 2615 TRP c 116 2116 SER Z Protein S 3 FT #SUB 615 2615 TRP c 121 2121 THR Z Protein S 3 FT #SUB 615 2615 TRP c 125 2125 PHE Z Protein S 1 FT #SUB 615 2615 TRP c 342 2342 TYR Z Protein S 1 FT #SUB 800 2800 LYS c 100 3020 SER d Protein S 1 FT #SUB 800 2800 LYS c 101 3021 LEU d Protein S 1 FT #SUB 800 2800 LYS c 103 3023 ILE d Protein A 6 FT #SUB 802 2802 ASP c 12 2932 PRO d Protein S 3 FT #SUB 861 2861 LYS c 16 2936 GLU d Protein S 2 FT #SUB 867 2867 PRO c 12 2932 PRO d Protein S 11 FT #SUB 868 2868 GLU c 12 2932 PRO d Protein B 1 FT #SUB 903 2903 TYR c 12 2932 PRO d Protein S 2 FT #HET 126 2126 HIS c 208 3001 CUO c S 6 FT #HET 136 2136 THR c 90 1 NAG DA B 1 FT #HET 137 2137 ALA c 90 1 NAG DA B 1 FT #HET 138 2138 LYS c 90 1 NAG DA S 2 FT #HET 138 2138 LYS c 91 2 NAG DA S 2 FT #HET 144 2144 CYS c 208 3001 CUO c S 1 FT #HET 146 2146 HIS c 208 3001 CUO c S 6 FT #HET 151 2151 PHE c 208 3001 CUO c S 1 FT #HET 155 2155 HIS c 208 3001 CUO c S 8 FT #HET 267 2267 HIS c 208 3001 CUO c S 7 FT #HET 271 2271 HIS c 208 3001 CUO c S 6 FT #HET 294 2294 PHE c 208 3001 CUO c S 1 FT #HET 298 2298 HIS c 208 3001 CUO c S 7 FT #HET 405 2405 SER c 129 1 NAG RA A 5 FT #HET 429 2429 LEU c 208 3001 CUO c S 1 FT #HET 480 2480 LEU c 130 2 NAG RA S 1 FT #HET 543 2543 HIS c 209 3002 CUO c S 6 FT #HET 559 2559 CYS c 209 3002 CUO c S 1 FT #HET 561 2561 HIS c 209 3002 CUO c S 6 FT #HET 570 2570 HIS c 209 3002 CUO c S 8 FT #HET 680 2680 HIS c 209 3002 CUO c S 6 FT #HET 684 2684 HIS c 209 3002 CUO c S 6 FT #HET 707 2707 PHE c 209 3002 CUO c S 5 FT #HET 711 2711 HIS c 209 3002 CUO c S 6 FT #MOD 472 2472 ASN c 129 1 NAG RA S FT DISORDER 1 82 FT DISORDER 908 920 CC SEQUENCE 825 AA (ATOM); CC VRGNLVRKNV DRLSLQEINS LIHALKRMQK DRSSDGFETI ASFHALPPLC PNPTAKHRHA CC CCLHGMATFP QWHRLYVVQF EHSLNRHGAI VGVPYWDWTY PMTEVPGLLT SEKYTDPFTG CC IETFNPFNHG HISFISPETM TTREVSEHLF EQPALGKQTW LFNNIILALE QTDYCDFEVQ CC FEIVHNSIHS WLGGKELYSL NHLHYAAYDP AFFLHHSNVD RLWVVWQELQ KFRGLPAYES CC NCAIELMSQP LKPFSFGAPY NLNPVTTKYS KPSDVFNYKQ NFHYEYDMLE MNGMSIAQLE CC SYIRQERQKD RVFAGFLLEG FGSSAYATFQ VCPDVGDCYE GSHFSVLGGS TEMPWAFDRL CC YKMEITDILQ AMALKFDSHF TIKTKIVAHN GTELPESLLP EATIVRIPPS AQNLEVAIPL CC NRIRRNINSL ESRDVQNLMS ALKRLKEDES DFGFQTIAGY HGSLMCPTPE APEYACCLHG CC MPTFMHWHRV YLLHFEESMR RHGASVAVPY WDWTMPSDNL PSLLGDADYY DAWTDSVIEN CC PFLRGHIKYE DTYTVREIQP ELFALAEGQK ESTLFKDVML MFEQEDYCDF EVQAEVIHNS CC IHYLIGGHQK YAMSSLMFSS FDPIFYVHHS MVDRLWAIWQ ELQKHRKLPH DKAYCALDQM CC AFPMKPFIWE SNPNPTTRAV STPSKLFDYK SLGYDYDHLN FHGMSIGQLE ALIQKQKKAD CC RVFAGFLLHG IKISADVHLK ICIEADCQEA GVIFVLGGET EMPWHFDRNY KMDITDVLKK CC RNIPPEALFE HDSKIRLEVE IKSVDGAVLD PNSLPKPSLI YAPAK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES YAGTFAVLGGETEMPWAFDRLFRYEISDEMKKLQLTEDSKFRLTTNIIAS CC ATOM -------------------------------------------------- CC CC SEQRES NGSKVSNDIFPTPTVIFVPKKERTEQVSTTKSVRGNLVRKNVDRLSLQEI CC ATOM --------------------------------VRGNLVRKNVDRLSLQEI CC ****************** CC SEQRES NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ATOM NSLIHALKRMQKDRSSDGFETIASFHALPPLCPNPTAKHRHACCLHGMAT CC ************************************************** CC SEQRES FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ATOM FPQWHRLYVVQFEHSLNRHGAIVGVPYWDWTYPMTEVPGLLTSEKYTDPF CC ************************************************** CC SEQRES TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ATOM TGIETFNPFNHGHISFISPETMTTREVSEHLFEQPALGKQTWLFNNIILA CC ************************************************** CC SEQRES LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ATOM LEQTDYCDFEVQFEIVHNSIHSWLGGKELYSLNHLHYAAYDPAFFLHHSN CC ************************************************** CC SEQRES VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ATOM VDRLWVVWQELQKFRGLPAYESNCAIELMSQPLKPFSFGAPYNLNPVTTK CC ************************************************** CC SEQRES YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ATOM YSKPSDVFNYKQNFHYEYDMLEMNGMSIAQLESYIRQERQKDRVFAGFLL CC ************************************************** CC SEQRES EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ATOM EGFGSSAYATFQVCPDVGDCYEGSHFSVLGGSTEMPWAFDRLYKMEITDI CC ************************************************** CC SEQRES LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ATOM LQAMALKFDSHFTIKTKIVAHNGTELPESLLPEATIVRIPPSAQNLEVAI CC ************************************************** CC SEQRES PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ATOM PLNRIRRNINSLESRDVQNLMSALKRLKEDESDFGFQTIAGYHGSLMCPT CC ************************************************** CC SEQRES PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ATOM PEAPEYACCLHGMPTFMHWHRVYLLHFEESMRRHGASVAVPYWDWTMPSD CC ************************************************** CC SEQRES NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ATOM NLPSLLGDADYYDAWTDSVIENPFLRGHIKYEDTYTVREIQPELFALAEG CC ************************************************** CC SEQRES QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ATOM QKESTLFKDVMLMFEQEDYCDFEVQAEVIHNSIHYLIGGHQKYAMSSLMF CC ************************************************** CC SEQRES SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ATOM SSFDPIFYVHHSMVDRLWAIWQELQKHRKLPHDKAYCALDQMAFPMKPFI CC ************************************************** CC SEQRES WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ATOM WESNPNPTTRAVSTPSKLFDYKSLGYDYDHLNFHGMSIGQLEALIQKQKK CC ************************************************** CC SEQRES ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ATOM ADRVFAGFLLHGIKISADVHLKICIEADCQEAGVIFVLGGETEMPWHFDR CC ************************************************** CC SEQRES NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ATOM NYKMDITDVLKKRNIPPEALFEHDSKIRLEVEIKSVDGAVLDPNSLPKPS CC ************************************************** CC SEQRES LIYAPAKGLIIQQVGEYDAG CC ATOM LIYAPAK------------- CC ******* SQ SEQUENCE 920 AA; MW; CN; YAGTFAVLGG ETEMPWAFDR LFRYEISDEM KKLQLTEDSK FRLTTNIIAS NGSKVSNDIF PTPTVIFVPK KERTEQVSTT KSVRGNLVRK NVDRLSLQEI NSLIHALKRM QKDRSSDGFE TIASFHALPP LCPNPTAKHR HACCLHGMAT FPQWHRLYVV QFEHSLNRHG AIVGVPYWDW TYPMTEVPGL LTSEKYTDPF TGIETFNPFN HGHISFISPE TMTTREVSEH LFEQPALGKQ TWLFNNIILA LEQTDYCDFE VQFEIVHNSI HSWLGGKELY SLNHLHYAAY DPAFFLHHSN VDRLWVVWQE LQKFRGLPAY ESNCAIELMS QPLKPFSFGA PYNLNPVTTK YSKPSDVFNY KQNFHYEYDM LEMNGMSIAQ LESYIRQERQ KDRVFAGFLL EGFGSSAYAT FQVCPDVGDC YEGSHFSVLG GSTEMPWAFD RLYKMEITDI LQAMALKFDS HFTIKTKIVA HNGTELPESL LPEATIVRIP PSAQNLEVAI PLNRIRRNIN SLESRDVQNL MSALKRLKED ESDFGFQTIA GYHGSLMCPT PEAPEYACCL HGMPTFMHWH RVYLLHFEES MRRHGASVAV PYWDWTMPSD NLPSLLGDAD YYDAWTDSVI ENPFLRGHIK YEDTYTVREI QPELFALAEG QKESTLFKDV MLMFEQEDYC DFEVQAEVIH NSIHYLIGGH QKYAMSSLMF SSFDPIFYVH HSMVDRLWAI WQELQKHRKL PHDKAYCALD QMAFPMKPFI WESNPNPTTR AVSTPSKLFD YKSLGYDYDH LNFHGMSIGQ LEALIQKQKK ADRVFAGFLL HGIKISADVH LKICIEADCQ EAGVIFVLGG ETEMPWHFDR NYKMDITDVL KKRNIPPEAL FEHDSKIRLE VEIKSVDGAV LDPNSLPKPS LIYAPAKGLI IQQVGEYDAG // ID 4YD9d STANDARD; PRT; 394 AA. DT CONVERTED FROM PDB (SEQRES) 4YD9 DE hemocyanin OS Todarodes pacificus CC EXPDTA X-RAY DIFFRACTION CC RESOLU 3.000 CC R-Factor 0.275 FT #SUB 276 3196 THR d 416 416 ASN Y Protein S 8 FT #SUB 279 3199 GLN d 419 419 THR Y Protein S 1 FT #SUB 348 3268 LYS d 1534 1534 GLY Y Protein S 1 FT #SUB 351 3271 VAL d 1533 1533 ALA Y Protein S 2 FT #SUB 352 3272 HIS d 1404 1404 TYR Y Protein B 1 FT #SUB 352 3272 HIS d 1535 1535 LEU Y Protein S 5 FT #SUB 352 3272 HIS d 1543 1543 ILE Y Protein S 2 FT #SUB 354 3274 ARG d 1350 1350 GLU Y Protein S 5 FT #SUB 354 3274 ARG d 1533 1533 ALA Y Protein S 3 FT #SUB 12 2932 PRO d 802 2802 ASP c Protein S 3 FT #SUB 12 2932 PRO d 867 2867 PRO c Protein A 11 FT #SUB 12 2932 PRO d 868 2868 GLU c Protein B 1 FT #SUB 12 2932 PRO d 903 2903 TYR c Protein S 2 FT #SUB 16 2936 GLU d 861 2861 LYS c Protein S 2 FT #SUB 100 3020 SER d 800 2800 LYS c Protein S 1 FT #SUB 101 3021 LEU d 800 2800 LYS c Protein B 1 FT #SUB 103 3023 ILE d 800 2800 LYS c Protein S 6 FT #HET 41 2961 HIS d 210 3401 CUO d S 8 FT #HET 60 2980 HIS d 210 3401 CUO d S 3 FT #HET 69 2989 HIS d 210 3401 CUO d S 13 FT #HET 118 3038 ILE d 192 5018 CUO V B 1 FT #HET 121 3041 ALA d 192 5018 CUO V B 7 FT #HET 122 3042 ASP d 192 5018 CUO V A 29 FT #HET 123 3043 THR d 192 5018 CUO V B 6 FT #HET 169 3089 HIS d 210 3401 CUO d S 7 FT #HET 173 3093 HIS d 210 3401 CUO d S 5 FT #HET 196 3116 PHE d 210 3401 CUO d S 5 FT #HET 199 3119 HIS d 210 3401 CUO d S 1 FT #HET 200 3120 HIS d 210 3401 CUO d S 10 FT DISORDER 1 7 FT DISORDER 132 141 FT DISORDER 313 319 FT DISORDER 391 394 CC SEQUENCE 366 AA (ATOM); CC NSLTPSEIEN LRNALAAVQA DKTDAGYQKI ASFHGMPLSC QYPDGTAFAC CQHGMVTFPH CC WHRLYMKQME DALKAKGAKI GIPYWDWTTA FHSLPILVTE PKNNPFHHGY IDVADTKTTR CC DPRPQSFFYR QIAFALEQRD FCDFEIQFEM GHNAIHSWVG GPSPYGMSTL HYTSYDPLFY CC VHHSNTDRIW AIWQALQKYR GLPYNSANCE INKLKKPMMP FSSEDNPNEV TKAHSTGYKS CC FDYQQLNYEY DNLNFHGMTI PQLEVHLKKI QEKDRVFAGF LLRAIGQSAD VNFDVTFGGT CC FCVLGGDYEM PWAFDRLFLY DISKSLVHLR LDAHDDFDIK VTIMGIDGKS LPPNLLPSPT CC ILFKPG CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES SMVRKNVNSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ATOM -------NSLTPSEIENLRNALAAVQADKTDAGYQKIASFHGMPLSCQYP CC ******************************************* CC SEQRES DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ATOM DGTAFACCQHGMVTFPHWHRLYMKQMEDALKAKGAKIGIPYWDWTTAFHS CC ************************************************** CC SEQRES LPILVTEPKNNPFHHGYIDVADTKTTRDPRPQLFDDPEQGDQSFFYRQIA CC ATOM LPILVTEPKNNPFHHGYIDVADTKTTRDPRP----------QSFFYRQIA CC ******************************* ********* CC SEQRES FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ATOM FALEQRDFCDFEIQFEMGHNAIHSWVGGPSPYGMSTLHYTSYDPLFYVHH CC ************************************************** CC SEQRES SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ATOM SNTDRIWAIWQALQKYRGLPYNSANCEINKLKKPMMPFSSEDNPNEVTKA CC ************************************************** CC SEQRES HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ATOM HSTGYKSFDYQQLNYEYDNLNFHGMTIPQLEVHLKKIQEKDRVFAGFLLR CC ************************************************** CC SEQRES AIGQSADVNFDVCRKDGECTFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ATOM AIGQSADVNFDV-------TFGGTFCVLGGDYEMPWAFDRLFLYDISKSL CC ************ ******************************* CC SEQRES VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPGTGKI CC ATOM VHLRLDAHDDFDIKVTIMGIDGKSLPPNLLPSPTILFKPG---- CC **************************************** SQ SEQUENCE 394 AA; MW; CN; SMVRKNVNSL TPSEIENLRN ALAAVQADKT DAGYQKIASF HGMPLSCQYP DGTAFACCQH GMVTFPHWHR LYMKQMEDAL KAKGAKIGIP YWDWTTAFHS LPILVTEPKN NPFHHGYIDV ADTKTTRDPR PQLFDDPEQG DQSFFYRQIA FALEQRDFCD FEIQFEMGHN AIHSWVGGPS PYGMSTLHYT SYDPLFYVHH SNTDRIWAIW QALQKYRGLP YNSANCEINK LKKPMMPFSS EDNPNEVTKA HSTGYKSFDY QQLNYEYDNL NFHGMTIPQL EVHLKKIQEK DRVFAGFLLR AIGQSADVNF DVCRKDGECT FGGTFCVLGG DYEMPWAFDR LFLYDISKSL VHLRLDAHDD FDIKVTIMGI DGKSLPPNLL PSPTILFKPG TGKI //