ID 1H2UA STANDARD; PRT; 723 AA. DT CONVERTED FROM PDB (SEQRES) 1H2U DE 80 KDA NUCLEAR CAP BINDING PROTEIN OS HOMO SAPIENS CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.400 CC R-Factor 0.194 FT #SUB 361 380 CYS A 701 768 LEU B Protein A 2 FT #SUB 362 381 LYS A 701 768 LEU B Protein S 1 FT #SUB 365 384 PRO A 644 711 LEU B Protein B 2 FT #SUB 366 385 GLY A 641 708 ASN B Protein B 3 FT #SUB 366 385 GLY A 644 711 LEU B Protein B 2 FT #SUB 369 388 PRO A 701 768 LEU B Protein S 1 FT #SUB 404 423 ASN A 700 767 ASN B Protein B 1 FT #SUB 405 424 PHE A 700 767 ASN B Protein S 4 FT #SUB 406 425 GLN A 700 767 ASN B Protein S 8 FT #SUB 408 427 ARG A 696 763 VAL B Protein S 1 FT #SUB 13 32 GLU A 6 6 LEU X Protein S 3 FT #SUB 13 32 GLU A 7 7 LYS X Protein S 2 FT #SUB 17 36 CYS A 6 6 LEU X Protein S 1 FT #SUB 51 70 ARG A 4 4 GLY X Protein S 1 FT #SUB 55 74 THR A 4 4 GLY X Protein S 5 FT #SUB 55 74 THR A 5 5 LEU X Protein S 1 FT #SUB 55 74 THR A 6 6 LEU X Protein S 2 FT #SUB 58 77 ARG A 4 4 GLY X Protein S 8 FT #SUB 59 78 LEU A 4 4 GLY X Protein S 1 FT #SUB 305 324 SER A 9 9 LEU X Protein B 1 FT #SUB 307 326 TRP A 100 100 TYR X Protein A 6 FT #SUB 308 327 LYS A 9 9 LEU X Protein S 1 FT #SUB 308 327 LYS A 11 11 SER X Protein S 1 FT #SUB 308 327 LYS A 12 12 ASP X Protein S 2 FT #SUB 308 327 LYS A 99 99 ARG X Protein B 1 FT #SUB 308 327 LYS A 100 100 TYR X Protein B 2 FT #SUB 309 328 GLU A 11 11 SER X Protein S 7 FT #SUB 309 328 GLU A 12 12 ASP X Protein S 3 FT #SUB 309 328 GLU A 13 13 SER X Protein S 11 FT #SUB 309 328 GLU A 14 14 TYR X Protein S 2 FT #SUB 310 329 ARG A 14 14 TYR X Protein S 7 FT #SUB 310 329 ARG A 99 99 ARG X Protein S 4 FT #SUB 310 329 ARG A 100 100 TYR X Protein S 4 FT #SUB 310 329 ARG A 102 102 ASN X Protein S 3 FT #SUB 310 329 ARG A 103 103 GLY X Protein S 1 FT #SUB 311 330 LYS A 14 14 TYR X Protein A 2 FT #SUB 349 368 ILE A 62 62 LYS X Protein S 1 FT #SUB 349 368 ILE A 63 63 SER X Protein S 1 FT #SUB 349 368 ILE A 96 96 ASN X Protein S 2 FT #SUB 349 368 ILE A 100 100 TYR X Protein S 1 FT #SUB 351 370 VAL A 62 62 LYS X Protein S 1 FT #SUB 351 370 VAL A 101 101 ILE X Protein S 1 FT #SUB 352 371 MET A 100 100 TYR X Protein A 8 FT #SUB 355 374 THR A 100 100 TYR X Protein S 6 FT #SUB 396 415 ASN A 62 62 LYS X Protein B 1 FT #SUB 400 419 HIS A 59 59 LEU X Protein S 1 FT #SUB 400 419 HIS A 62 62 LYS X Protein S 6 FT #SUB 403 422 SER A 105 105 ARG X Protein B 1 FT #SUB 404 423 ASN A 104 104 THR X Protein S 3 FT #SUB 404 423 ASN A 105 105 ARG X Protein A 9 FT #SUB 406 425 GLN A 105 105 ARG X Protein S 2 FT #SUB 406 425 GLN A 108 108 ASP X Protein S 4 FT #SUB 436 455 LYS A 58 58 GLU X Protein S 2 FT #SUB 439 458 ARG A 55 55 GLN X Protein B 1 FT #SUB 439 458 ARG A 58 58 GLU X Protein A 16 FT #SUB 440 459 LEU A 58 58 GLU X Protein S 2 FT #SUB 440 459 LEU A 59 59 LEU X Protein S 1 FT #SUB 441 460 SER A 55 55 GLN X Protein B 5 FT #SUB 539 558 SER A 53 53 GLU X Protein S 2 FT #SUB 539 558 SER A 54 54 GLU X Protein A 3 FT #SUB 540 559 PHE A 53 53 GLU X Protein A 5 FT #SUB 540 559 PHE A 54 54 GLU X Protein A 8 FT #SUB 540 559 PHE A 57 57 TYR X Protein S 1 FT #SUB 541 560 SER A 53 53 GLU X Protein A 9 FT #SUB 544 563 PHE A 57 57 TYR X Protein S 2 FT #SUB 580 599 GLN A 54 54 GLU X Protein S 4 FT #SUB 580 599 GLN A 58 58 GLU X Protein S 3 FT #SUB 588 607 LYS A 67 67 LYS X Protein S 1 FT #SUB 591 610 ARG A 65 65 ASP X Protein S 1 FT #SUB 591 610 ARG A 89 89 TYR X Protein S 14 FT DISORDER 1 6 FT DISORDER 509 519 CC Miss-BB 1 CC Miss-SC 1 CC SEQUENCE 706 AA (ATOM); CC ETEDHLESLI CKVGEKSACS LESNLEGLAG VLEADLPNYK SKILRLLCTV ARLLPEKLTI CC YTTLVGLLNA RNYNFGGEFV EAMIRQLKES LKANNYNEAV YLVRFLSDLV NCHVIAAPSM CC VAMFENFVSV TQEEDVPQVR RDWYVYAFLS SLPWVGKELY EKKDAEMDRI FANTESYLKR CC RQKTHVPMLQ VWTADKPHPQ EEYLDCLWAQ IQKLKKDRWQ ERHILRPYLA FDSILCEALQ CC HNLPPFTPPP HTEDSVYPMP RVIFRMFDYT DDPEGPVMPG SHSVERFVIE ENLHCIIKSH CC WKERKTCAAQ LVSYPGKNKI PLNYHIVEVI FAELFQLPAP PHIDVMYTTL LIELCKLQPG CC SLPQVLAQAT EMLYMRLDTM NTTCVDRFIN WFSHHLSNFQ FRWSWEDWSD CLSQDPESPK CC PKFVREVLEK CMRLSYHQRI LDIVPPTFSA LCPSNPTCIY KYGDESSNSL PGHSVALCLA CC VAFKSKATND EIFSILKDVP NPFNPLKIEV FVQTLLHLAA KSFSHSFSAL AKFHEVFKTL CC AESDEGKLHV LRVMFEVWRN HPQMIAVLVD KMIRTQIVDC AAVANWIFSS ELSRDFTRLF CC VWEILHSTIR KMNKHVGAQS EQKNLFLVIF QRFIMILTEH LVRCETDGTS VLTPWYKNCI CC ERLQQIFLQH HQIIQQYMVT LENLLFTAEL DPHILAVFQQ FCALQA CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES KTSDANETEDHLESLICKVGEKSACSLESNLEGLAGVLEADLPNYKSKIL CC ATOM ------ETEDHLESLICKVGEKSACSLESNLEGLAGVLEADLPNYKSKIL CC ******************************************** CC SEQRES RLLCTVARLLPEKLTIYTTLVGLLNARNYNFGGEFVEAMIRQLKESLKAN CC ATOM RLLCTVARLLPEKLTIYTTLVGLLNARNYNFGGEFVEAMIRQLKESLKAN CC ************************************************** CC SEQRES NYNEAVYLVRFLSDLVNCHVIAAPSMVAMFENFVSVTQEEDVPQVRRDWY CC ATOM NYNEAVYLVRFLSDLVNCHVIAAPSMVAMFENFVSVTQEEDVPQVRRDWY CC ************************************************** CC SEQRES VYAFLSSLPWVGKELYEKKDAEMDRIFANTESYLKRRQKTHVPMLQVWTA CC ATOM VYAFLSSLPWVGKELYEKKDAEMDRIFANTESYLKRRQKTHVPMLQVWTA CC ************************************************** CC SEQRES DKPHPQEEYLDCLWAQIQKLKKDRWQERHILRPYLAFDSILCEALQHNLP CC ATOM DKPHPQEEYLDCLWAQIQKLKKDRWQERHILRPYLAFDSILCEALQHNLP CC ************************************************** CC SEQRES PFTPPPHTEDSVYPMPRVIFRMFDYTDDPEGPVMPGSHSVERFVIEENLH CC ATOM PFTPPPHTEDSVYPMPRVIFRMFDYTDDPEGPVMPGSHSVERFVIEENLH CC ************************************************** CC SEQRES CIIKSHWKERKTCAAQLVSYPGKNKIPLNYHIVEVIFAELFQLPAPPHID CC ATOM CIIKSHWKERKTCAAQLVSYPGKNKIPLNYHIVEVIFAELFQLPAPPHID CC ************************************************** CC SEQRES VMYTTLLIELCKLQPGSLPQVLAQATEMLYMRLDTMNTTCVDRFINWFSH CC ATOM VMYTTLLIELCKLQPGSLPQVLAQATEMLYMRLDTMNTTCVDRFINWFSH CC ************************************************** CC SEQRES HLSNFQFRWSWEDWSDCLSQDPESPKPKFVREVLEKCMRLSYHQRILDIV CC ATOM HLSNFQFRWSWEDWSDCLSQDPESPKPKFVREVLEKCMRLSYHQRILDIV CC ************************************************** CC SEQRES PPTFSALCPSNPTCIYKYGDESSNSLPGHSVALCLAVAFKSKATNDEIFS CC ATOM PPTFSALCPSNPTCIYKYGDESSNSLPGHSVALCLAVAFKSKATNDEIFS CC ************************************************** CC SEQRES ILKDVPNPNQDDDDDEGFSFNPLKIEVFVQTLLHLAAKSFSHSFSALAKF CC ATOM ILKDVPNP-----------FNPLKIEVFVQTLLHLAAKSFSHSFSALAKF CC ******** ******************************* CC SEQRES HEVFKTLAESDEGKLHVLRVMFEVWRNHPQMIAVLVDKMIRTQIVDCAAV CC ATOM HEVFKTLAESDEGKLHVLRVMFEVWRNHPQMIAVLVDKMIRTQIVDCAAV CC ************************************************** CC SEQRES ANWIFSSELSRDFTRLFVWEILHSTIRKMNKHVGAQSEQKNLFLVIFQRF CC ATOM ANWIFSSELSRDFTRLFVWEILHSTIRKMNKHVGAQSEQKNLFLVIFQRF CC ************************************************** CC SEQRES IMILTEHLVRCETDGTSVLTPWYKNCIERLQQIFLQHHQIIQQYMVTLEN CC ATOM IMILTEHLVRCETDGTSVLTPWYKNCIERLQQIFLQHHQIIQQYMVTLEN CC ************************************************** CC SEQRES LLFTAELDPHILAVFQQFCALQA CC ATOM LLFTAELDPHILAVFQQFCALQA CC *********************** SQ SEQUENCE 723 AA; MW; CN; KTSDANETED HLESLICKVG EKSACSLESN LEGLAGVLEA DLPNYKSKIL RLLCTVARLL PEKLTIYTTL VGLLNARNYN FGGEFVEAMI RQLKESLKAN NYNEAVYLVR FLSDLVNCHV IAAPSMVAMF ENFVSVTQEE DVPQVRRDWY VYAFLSSLPW VGKELYEKKD AEMDRIFANT ESYLKRRQKT HVPMLQVWTA DKPHPQEEYL DCLWAQIQKL KKDRWQERHI LRPYLAFDSI LCEALQHNLP PFTPPPHTED SVYPMPRVIF RMFDYTDDPE GPVMPGSHSV ERFVIEENLH CIIKSHWKER KTCAAQLVSY PGKNKIPLNY HIVEVIFAEL FQLPAPPHID VMYTTLLIEL CKLQPGSLPQ VLAQATEMLY MRLDTMNTTC VDRFINWFSH HLSNFQFRWS WEDWSDCLSQ DPESPKPKFV REVLEKCMRL SYHQRILDIV PPTFSALCPS NPTCIYKYGD ESSNSLPGHS VALCLAVAFK SKATNDEIFS ILKDVPNPNQ DDDDDEGFSF NPLKIEVFVQ TLLHLAAKSF SHSFSALAKF HEVFKTLAES DEGKLHVLRV MFEVWRNHPQ MIAVLVDKMI RTQIVDCAAV ANWIFSSELS RDFTRLFVWE ILHSTIRKMN KHVGAQSEQK NLFLVIFQRF IMILTEHLVR CETDGTSVLT PWYKNCIERL QQIFLQHHQI IQQYMVTLEN LLFTAELDPH ILAVFQQFCA LQA // ID 1H2UB STANDARD; PRT; 723 AA. DT CONVERTED FROM PDB (SEQRES) 1H2U DE 80 KDA NUCLEAR CAP BINDING PROTEIN OS HOMO SAPIENS CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.400 CC R-Factor 0.194 FT #SUB 641 708 ASN B 366 385 GLY A Protein S 3 FT #SUB 644 711 LEU B 365 384 PRO A Protein S 2 FT #SUB 644 711 LEU B 366 385 GLY A Protein S 2 FT #SUB 696 763 VAL B 408 427 ARG A Protein S 1 FT #SUB 700 767 ASN B 404 423 ASN A Protein S 1 FT #SUB 700 767 ASN B 405 424 PHE A Protein S 4 FT #SUB 700 767 ASN B 406 425 GLN A Protein S 8 FT #SUB 701 768 LEU B 361 380 CYS A Protein S 2 FT #SUB 701 768 LEU B 362 381 LYS A Protein S 1 FT #SUB 701 768 LEU B 369 388 PRO A Protein S 1 FT #SUB 700 767 ASN B 105 105 ARG X Protein S 3 FT #SUB 13 32 GLU B 6 6 LEU Y Protein S 4 FT #SUB 13 32 GLU B 7 7 LYS Y Protein S 2 FT #SUB 17 36 CYS B 6 6 LEU Y Protein S 2 FT #SUB 55 74 THR B 5 5 LEU Y Protein S 1 FT #SUB 55 74 THR B 6 6 LEU Y Protein S 2 FT #SUB 56 75 VAL B 6 6 LEU Y Protein S 1 FT #SUB 60 79 LEU B 9 9 LEU Y Protein S 1 FT #SUB 305 324 SER B 9 9 LEU Y Protein A 2 FT #SUB 307 326 TRP B 100 100 TYR Y Protein A 6 FT #SUB 308 327 LYS B 9 9 LEU Y Protein S 1 FT #SUB 308 327 LYS B 11 11 SER Y Protein S 1 FT #SUB 308 327 LYS B 12 12 ASP Y Protein S 1 FT #SUB 308 327 LYS B 99 99 ARG Y Protein B 1 FT #SUB 308 327 LYS B 100 100 TYR Y Protein B 2 FT #SUB 309 328 GLU B 11 11 SER Y Protein S 9 FT #SUB 309 328 GLU B 12 12 ASP Y Protein S 3 FT #SUB 309 328 GLU B 13 13 SER Y Protein S 10 FT #SUB 309 328 GLU B 14 14 TYR Y Protein S 2 FT #SUB 310 329 ARG B 14 14 TYR Y Protein S 5 FT #SUB 310 329 ARG B 99 99 ARG Y Protein S 4 FT #SUB 310 329 ARG B 100 100 TYR Y Protein S 3 FT #SUB 310 329 ARG B 102 102 ASN Y Protein S 3 FT #SUB 310 329 ARG B 103 103 GLY Y Protein S 2 FT #SUB 311 330 LYS B 14 14 TYR Y Protein A 2 FT #SUB 349 368 ILE B 62 62 LYS Y Protein S 1 FT #SUB 349 368 ILE B 63 63 SER Y Protein S 1 FT #SUB 349 368 ILE B 96 96 ASN Y Protein S 1 FT #SUB 349 368 ILE B 100 100 TYR Y Protein S 1 FT #SUB 351 370 VAL B 62 62 LYS Y Protein S 2 FT #SUB 352 371 MET B 100 100 TYR Y Protein A 9 FT #SUB 355 374 THR B 100 100 TYR Y Protein S 6 FT #SUB 396 415 ASN B 62 62 LYS Y Protein B 1 FT #SUB 400 419 HIS B 59 59 LEU Y Protein S 1 FT #SUB 400 419 HIS B 62 62 LYS Y Protein S 5 FT #SUB 403 422 SER B 105 105 ARG Y Protein B 1 FT #SUB 404 423 ASN B 104 104 THR Y Protein S 3 FT #SUB 404 423 ASN B 105 105 ARG Y Protein A 9 FT #SUB 406 425 GLN B 108 108 ASP Y Protein A 7 FT #SUB 436 455 LYS B 58 58 GLU Y Protein S 2 FT #SUB 439 458 ARG B 55 55 GLN Y Protein B 2 FT #SUB 439 458 ARG B 58 58 GLU Y Protein A 16 FT #SUB 440 459 LEU B 55 55 GLN Y Protein B 2 FT #SUB 440 459 LEU B 58 58 GLU Y Protein S 2 FT #SUB 440 459 LEU B 59 59 LEU Y Protein S 1 FT #SUB 441 460 SER B 55 55 GLN Y Protein B 6 FT #SUB 445 464 ARG B 108 108 ASP Y Protein S 1 FT #SUB 539 558 SER B 53 53 GLU Y Protein S 1 FT #SUB 539 558 SER B 54 54 GLU Y Protein A 3 FT #SUB 540 559 PHE B 53 53 GLU Y Protein A 2 FT #SUB 540 559 PHE B 54 54 GLU Y Protein A 12 FT #SUB 540 559 PHE B 57 57 TYR Y Protein S 1 FT #SUB 541 560 SER B 53 53 GLU Y Protein A 10 FT #SUB 544 563 PHE B 57 57 TYR Y Protein S 2 FT #SUB 580 599 GLN B 54 54 GLU Y Protein S 4 FT #SUB 580 599 GLN B 58 58 GLU Y Protein S 4 FT #SUB 588 607 LYS B 67 67 LYS Y Protein S 1 FT #SUB 591 610 ARG B 65 65 ASP Y Protein S 2 FT #SUB 591 610 ARG B 89 89 TYR Y Protein S 14 FT #SUB 624 643 SER B 65 65 ASP Y Protein S 1 FT #SUB 627 646 ARG B 65 65 ASP Y Protein S 3 FT DISORDER 1 7 FT DISORDER 509 518 CC Miss-BB 1 CC Miss-SC 1 CC SEQUENCE 706 AA (ATOM); CC TEDHLESLIC KVGEKSACSL ESNLEGLAGV LEADLPNYKS KILRLLCTVA RLLPEKLTIY CC TTLVGLLNAR NYNFGGEFVE AMIRQLKESL KANNYNEAVY LVRFLSDLVN CHVIAAPSMV CC AMFENFVSVT QEEDVPQVRR DWYVYAFLSS LPWVGKELYE KKDAEMDRIF ANTESYLKRR CC QKTHVPMLQV WTADKPHPQE EYLDCLWAQI QKLKKDRWQE RHILRPYLAF DSILCEALQH CC NLPPFTPPPH TEDSVYPMPR VIFRMFDYTD DPEGPVMPGS HSVERFVIEE NLHCIIKSHW CC KERKTCAAQL VSYPGKNKIP LNYHIVEVIF AELFQLPAPP HIDVMYTTLL IELCKLQPGS CC LPQVLAQATE MLYMRLDTMN TTCVDRFINW FSHHLSNFQF RWSWEDWSDC LSQDPESPKP CC KFVREVLEKC MRLSYHQRIL DIVPPTFSAL CPSNPTCIYK YGDESSNSLP GHSVALCLAV CC AFKSKATNDE IFSILKDVPN PSFNPLKIEV FVQTLLHLAA KSFSHSFSAL AKFHEVFKTL CC AESDEGKLHV LRVMFEVWRN HPQMIAVLVD KMIRTQIVDC AAVANWIFSS ELSRDFTRLF CC VWEILHSTIR KMNKHVGAQS EQKNLFLVIF QRFIMILTEH LVRCETDGTS VLTPWYKNCI CC ERLQQIFLQH HQIIQQYMVT LENLLFTAEL DPHILAVFQQ FCALQA CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES KTSDANETEDHLESLICKVGEKSACSLESNLEGLAGVLEADLPNYKSKIL CC ATOM -------TEDHLESLICKVGEKSACSLESNLEGLAGVLEADLPNYKSKIL CC ******************************************* CC SEQRES RLLCTVARLLPEKLTIYTTLVGLLNARNYNFGGEFVEAMIRQLKESLKAN CC ATOM RLLCTVARLLPEKLTIYTTLVGLLNARNYNFGGEFVEAMIRQLKESLKAN CC ************************************************** CC SEQRES NYNEAVYLVRFLSDLVNCHVIAAPSMVAMFENFVSVTQEEDVPQVRRDWY CC ATOM NYNEAVYLVRFLSDLVNCHVIAAPSMVAMFENFVSVTQEEDVPQVRRDWY CC ************************************************** CC SEQRES VYAFLSSLPWVGKELYEKKDAEMDRIFANTESYLKRRQKTHVPMLQVWTA CC ATOM VYAFLSSLPWVGKELYEKKDAEMDRIFANTESYLKRRQKTHVPMLQVWTA CC ************************************************** CC SEQRES DKPHPQEEYLDCLWAQIQKLKKDRWQERHILRPYLAFDSILCEALQHNLP CC ATOM DKPHPQEEYLDCLWAQIQKLKKDRWQERHILRPYLAFDSILCEALQHNLP CC ************************************************** CC SEQRES PFTPPPHTEDSVYPMPRVIFRMFDYTDDPEGPVMPGSHSVERFVIEENLH CC ATOM PFTPPPHTEDSVYPMPRVIFRMFDYTDDPEGPVMPGSHSVERFVIEENLH CC ************************************************** CC SEQRES CIIKSHWKERKTCAAQLVSYPGKNKIPLNYHIVEVIFAELFQLPAPPHID CC ATOM CIIKSHWKERKTCAAQLVSYPGKNKIPLNYHIVEVIFAELFQLPAPPHID CC ************************************************** CC SEQRES VMYTTLLIELCKLQPGSLPQVLAQATEMLYMRLDTMNTTCVDRFINWFSH CC ATOM VMYTTLLIELCKLQPGSLPQVLAQATEMLYMRLDTMNTTCVDRFINWFSH CC ************************************************** CC SEQRES HLSNFQFRWSWEDWSDCLSQDPESPKPKFVREVLEKCMRLSYHQRILDIV CC ATOM HLSNFQFRWSWEDWSDCLSQDPESPKPKFVREVLEKCMRLSYHQRILDIV CC ************************************************** CC SEQRES PPTFSALCPSNPTCIYKYGDESSNSLPGHSVALCLAVAFKSKATNDEIFS CC ATOM PPTFSALCPSNPTCIYKYGDESSNSLPGHSVALCLAVAFKSKATNDEIFS CC ************************************************** CC SEQRES ILKDVPNPNQDDDDDEGFSFNPLKIEVFVQTLLHLAAKSFSHSFSALAKF CC ATOM ILKDVPNP----------SFNPLKIEVFVQTLLHLAAKSFSHSFSALAKF CC ******** ******************************** CC SEQRES HEVFKTLAESDEGKLHVLRVMFEVWRNHPQMIAVLVDKMIRTQIVDCAAV CC ATOM HEVFKTLAESDEGKLHVLRVMFEVWRNHPQMIAVLVDKMIRTQIVDCAAV CC ************************************************** CC SEQRES ANWIFSSELSRDFTRLFVWEILHSTIRKMNKHVGAQSEQKNLFLVIFQRF CC ATOM ANWIFSSELSRDFTRLFVWEILHSTIRKMNKHVGAQSEQKNLFLVIFQRF CC ************************************************** CC SEQRES IMILTEHLVRCETDGTSVLTPWYKNCIERLQQIFLQHHQIIQQYMVTLEN CC ATOM IMILTEHLVRCETDGTSVLTPWYKNCIERLQQIFLQHHQIIQQYMVTLEN CC ************************************************** CC SEQRES LLFTAELDPHILAVFQQFCALQA CC ATOM LLFTAELDPHILAVFQQFCALQA CC *********************** SQ SEQUENCE 723 AA; MW; CN; KTSDANETED HLESLICKVG EKSACSLESN LEGLAGVLEA DLPNYKSKIL RLLCTVARLL PEKLTIYTTL VGLLNARNYN FGGEFVEAMI RQLKESLKAN NYNEAVYLVR FLSDLVNCHV IAAPSMVAMF ENFVSVTQEE DVPQVRRDWY VYAFLSSLPW VGKELYEKKD AEMDRIFANT ESYLKRRQKT HVPMLQVWTA DKPHPQEEYL DCLWAQIQKL KKDRWQERHI LRPYLAFDSI LCEALQHNLP PFTPPPHTED SVYPMPRVIF RMFDYTDDPE GPVMPGSHSV ERFVIEENLH CIIKSHWKER KTCAAQLVSY PGKNKIPLNY HIVEVIFAEL FQLPAPPHID VMYTTLLIEL CKLQPGSLPQ VLAQATEMLY MRLDTMNTTC VDRFINWFSH HLSNFQFRWS WEDWSDCLSQ DPESPKPKFV REVLEKCMRL SYHQRILDIV PPTFSALCPS NPTCIYKYGD ESSNSLPGHS VALCLAVAFK SKATNDEIFS ILKDVPNPNQ DDDDDEGFSF NPLKIEVFVQ TLLHLAAKSF SHSFSALAKF HEVFKTLAES DEGKLHVLRV MFEVWRNHPQ MIAVLVDKMI RTQIVDCAAV ANWIFSSELS RDFTRLFVWE ILHSTIRKMN KHVGAQSEQK NLFLVIFQRF IMILTEHLVR CETDGTSVLT PWYKNCIERL QQIFLQHHQI IQQYMVTLEN LLFTAELDPH ILAVFQQFCA LQA // ID 1H2UX STANDARD; PRT; 156 AA. DT CONVERTED FROM PDB (SEQRES) 1H2U DE 20 KDA NUCLEAR CAP BINDING PROTEIN OS HOMO SAPIENS CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.400 CC R-Factor 0.194 FT #SUB 4 4 GLY X 51 70 ARG A Protein B 1 FT #SUB 4 4 GLY X 55 74 THR A Protein B 5 FT #SUB 4 4 GLY X 58 77 ARG A Protein B 8 FT #SUB 4 4 GLY X 59 78 LEU A Protein B 1 FT #SUB 5 5 LEU X 55 74 THR A Protein B 1 FT #SUB 6 6 LEU X 13 32 GLU A Protein A 3 FT #SUB 6 6 LEU X 17 36 CYS A Protein S 1 FT #SUB 6 6 LEU X 55 74 THR A Protein S 2 FT #SUB 7 7 LYS X 13 32 GLU A Protein A 2 FT #SUB 9 9 LEU X 305 324 SER A Protein S 1 FT #SUB 9 9 LEU X 308 327 LYS A Protein B 1 FT #SUB 11 11 SER X 308 327 LYS A Protein B 1 FT #SUB 11 11 SER X 309 328 GLU A Protein A 7 FT #SUB 12 12 ASP X 308 327 LYS A Protein S 2 FT #SUB 12 12 ASP X 309 328 GLU A Protein B 3 FT #SUB 13 13 SER X 309 328 GLU A Protein A 11 FT #SUB 14 14 TYR X 309 328 GLU A Protein S 2 FT #SUB 14 14 TYR X 310 329 ARG A Protein S 7 FT #SUB 14 14 TYR X 311 330 LYS A Protein S 2 FT #SUB 53 53 GLU X 539 558 SER A Protein S 2 FT #SUB 53 53 GLU X 540 559 PHE A Protein S 5 FT #SUB 53 53 GLU X 541 560 SER A Protein S 9 FT #SUB 54 54 GLU X 539 558 SER A Protein S 3 FT #SUB 54 54 GLU X 540 559 PHE A Protein A 8 FT #SUB 54 54 GLU X 580 599 GLN A Protein A 4 FT #SUB 55 55 GLN X 439 458 ARG A Protein B 1 FT #SUB 55 55 GLN X 441 460 SER A Protein S 5 FT #SUB 57 57 TYR X 540 559 PHE A Protein S 1 FT #SUB 57 57 TYR X 544 563 PHE A Protein S 2 FT #SUB 58 58 GLU X 436 455 LYS A Protein S 2 FT #SUB 58 58 GLU X 439 458 ARG A Protein S 16 FT #SUB 58 58 GLU X 440 459 LEU A Protein S 2 FT #SUB 58 58 GLU X 580 599 GLN A Protein A 3 FT #SUB 59 59 LEU X 400 419 HIS A Protein S 1 FT #SUB 59 59 LEU X 440 459 LEU A Protein B 1 FT #SUB 62 62 LYS X 349 368 ILE A Protein B 1 FT #SUB 62 62 LYS X 351 370 VAL A Protein B 1 FT #SUB 62 62 LYS X 396 415 ASN A Protein S 1 FT #SUB 62 62 LYS X 400 419 HIS A Protein S 6 FT #SUB 63 63 SER X 349 368 ILE A Protein B 1 FT #SUB 65 65 ASP X 591 610 ARG A Protein S 1 FT #SUB 67 67 LYS X 588 607 LYS A Protein B 1 FT #SUB 89 89 TYR X 591 610 ARG A Protein S 14 FT #SUB 96 96 ASN X 349 368 ILE A Protein S 2 FT #SUB 99 99 ARG X 308 327 LYS A Protein S 1 FT #SUB 99 99 ARG X 310 329 ARG A Protein B 4 FT #SUB 100 100 TYR X 307 326 TRP A Protein S 6 FT #SUB 100 100 TYR X 308 327 LYS A Protein S 2 FT #SUB 100 100 TYR X 310 329 ARG A Protein B 4 FT #SUB 100 100 TYR X 349 368 ILE A Protein S 1 FT #SUB 100 100 TYR X 352 371 MET A Protein S 8 FT #SUB 100 100 TYR X 355 374 THR A Protein A 6 FT #SUB 101 101 ILE X 351 370 VAL A Protein S 1 FT #SUB 102 102 ASN X 310 329 ARG A Protein B 3 FT #SUB 103 103 GLY X 310 329 ARG A Protein B 1 FT #SUB 104 104 THR X 404 423 ASN A Protein A 3 FT #SUB 105 105 ARG X 403 422 SER A Protein S 1 FT #SUB 105 105 ARG X 404 423 ASN A Protein A 9 FT #SUB 105 105 ARG X 406 425 GLN A Protein S 2 FT #SUB 108 108 ASP X 406 425 GLN A Protein S 4 FT #SUB 105 105 ARG X 700 767 ASN B Protein S 3 FT #HET 20 20 TYR X 1 1153 GDP X S 7 FT #HET 20 20 TYR X 2 1154 7MG X S 44 FT #HET 22 22 ASP X 2 1154 7MG X S 2 FT #HET 43 43 TYR X 2 1154 7MG X S 42 FT #HET 83 83 PHE X 2 1154 7MG X S 6 FT #HET 85 85 PHE X 2 1154 7MG X S 1 FT #HET 112 112 ARG X 2 1154 7MG X S 1 FT #HET 114 114 ASP X 2 1154 7MG X S 7 FT #HET 115 115 TRP X 2 1154 7MG X B 2 FT #HET 116 116 ASP X 2 1154 7MG X A 6 FT #HET 123 123 ARG X 2 1154 7MG X A 14 FT #HET 125 125 TYR X 2 1154 7MG X B 1 FT #HET 126 126 GLY X 2 1154 7MG X B 2 FT #HET 127 127 ARG X 1 1153 GDP X S 4 FT #HET 127 127 ARG X 2 1154 7MG X A 13 FT #HET 133 133 GLN X 2 1154 7MG X A 8 FT #HET 134 134 VAL X 1 1153 GDP X S 4 FT #HET 134 134 VAL X 2 1154 7MG X A 3 FT #HET 135 135 ARG X 1 1153 GDP X S 2 FT #HET 135 135 ARG X 2 1154 7MG X S 2 FT DISORDER 1 3 FT DISORDER 153 156 CC Miss-BB 1 CC Miss-SC 1 CC SEQUENCE 149 AA (ATOM); CC GLLKALRSDS YVELSQYRDQ HFRGDNEEQE KLLKKSCTLY VGNLSFYTTE EQIYELFSKS CC GDIKKIIMGL DKMKKTACGF CFVEYYSRAD AENAMRYING TRLDDRIIRT DWDAGFKEGR CC QYGRGRSGGQ VRDEYRQDYD AGRGGYGKL CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSGGLLKALRSDSYVELSQYRDQHFRGDNEEQEKLLKKSCTLYVGNLSFY CC ATOM ---GLLKALRSDSYVELSQYRDQHFRGDNEEQEKLLKKSCTLYVGNLSFY CC *********************************************** CC SEQRES TTEEQIYELFSKSGDIKKIIMGLDKMKKTACGFCFVEYYSRADAENAMRY CC ATOM TTEEQIYELFSKSGDIKKIIMGLDKMKKTACGFCFVEYYSRADAENAMRY CC ************************************************** CC SEQRES INGTRLDDRIIRTDWDAGFKEGRQYGRGRSGGQVRDEYRQDYDAGRGGYG CC ATOM INGTRLDDRIIRTDWDAGFKEGRQYGRGRSGGQVRDEYRQDYDAGRGGYG CC ************************************************** CC SEQRES KLAQNQ CC ATOM KL---- CC ** SQ SEQUENCE 156 AA; MW; CN; MSGGLLKALR SDSYVELSQY RDQHFRGDNE EQEKLLKKSC TLYVGNLSFY TTEEQIYELF SKSGDIKKII MGLDKMKKTA CGFCFVEYYS RADAENAMRY INGTRLDDRI IRTDWDAGFK EGRQYGRGRS GGQVRDEYRQ DYDAGRGGYG KLAQNQ // ID 1H2UY STANDARD; PRT; 156 AA. DT CONVERTED FROM PDB (SEQRES) 1H2U DE 20 KDA NUCLEAR CAP BINDING PROTEIN OS HOMO SAPIENS CC EXPDTA X-RAY DIFFRACTION CC RESOLU 2.400 CC R-Factor 0.194 FT #SUB 5 5 LEU Y 55 74 THR B Protein B 1 FT #SUB 6 6 LEU Y 13 32 GLU B Protein A 4 FT #SUB 6 6 LEU Y 17 36 CYS B Protein S 2 FT #SUB 6 6 LEU Y 55 74 THR B Protein S 2 FT #SUB 6 6 LEU Y 56 75 VAL B Protein S 1 FT #SUB 7 7 LYS Y 13 32 GLU B Protein A 2 FT #SUB 9 9 LEU Y 60 79 LEU B Protein S 1 FT #SUB 9 9 LEU Y 305 324 SER B Protein S 2 FT #SUB 9 9 LEU Y 308 327 LYS B Protein B 1 FT #SUB 11 11 SER Y 308 327 LYS B Protein B 1 FT #SUB 11 11 SER Y 309 328 GLU B Protein A 9 FT #SUB 12 12 ASP Y 308 327 LYS B Protein S 1 FT #SUB 12 12 ASP Y 309 328 GLU B Protein B 3 FT #SUB 13 13 SER Y 309 328 GLU B Protein A 10 FT #SUB 14 14 TYR Y 309 328 GLU B Protein S 2 FT #SUB 14 14 TYR Y 310 329 ARG B Protein S 5 FT #SUB 14 14 TYR Y 311 330 LYS B Protein S 2 FT #SUB 53 53 GLU Y 539 558 SER B Protein S 1 FT #SUB 53 53 GLU Y 540 559 PHE B Protein S 2 FT #SUB 53 53 GLU Y 541 560 SER B Protein S 10 FT #SUB 54 54 GLU Y 539 558 SER B Protein S 3 FT #SUB 54 54 GLU Y 540 559 PHE B Protein A 12 FT #SUB 54 54 GLU Y 580 599 GLN B Protein A 4 FT #SUB 55 55 GLN Y 439 458 ARG B Protein A 2 FT #SUB 55 55 GLN Y 440 459 LEU B Protein S 2 FT #SUB 55 55 GLN Y 441 460 SER B Protein S 6 FT #SUB 57 57 TYR Y 540 559 PHE B Protein S 1 FT #SUB 57 57 TYR Y 544 563 PHE B Protein S 2 FT #SUB 58 58 GLU Y 436 455 LYS B Protein S 2 FT #SUB 58 58 GLU Y 439 458 ARG B Protein S 16 FT #SUB 58 58 GLU Y 440 459 LEU B Protein S 2 FT #SUB 58 58 GLU Y 580 599 GLN B Protein A 4 FT #SUB 59 59 LEU Y 400 419 HIS B Protein S 1 FT #SUB 59 59 LEU Y 440 459 LEU B Protein B 1 FT #SUB 62 62 LYS Y 349 368 ILE B Protein B 1 FT #SUB 62 62 LYS Y 351 370 VAL B Protein B 2 FT #SUB 62 62 LYS Y 396 415 ASN B Protein S 1 FT #SUB 62 62 LYS Y 400 419 HIS B Protein S 5 FT #SUB 63 63 SER Y 349 368 ILE B Protein B 1 FT #SUB 65 65 ASP Y 591 610 ARG B Protein S 2 FT #SUB 65 65 ASP Y 624 643 SER B Protein S 1 FT #SUB 65 65 ASP Y 627 646 ARG B Protein S 3 FT #SUB 67 67 LYS Y 588 607 LYS B Protein B 1 FT #SUB 89 89 TYR Y 591 610 ARG B Protein S 14 FT #SUB 96 96 ASN Y 349 368 ILE B Protein S 1 FT #SUB 99 99 ARG Y 308 327 LYS B Protein S 1 FT #SUB 99 99 ARG Y 310 329 ARG B Protein B 4 FT #SUB 100 100 TYR Y 307 326 TRP B Protein S 6 FT #SUB 100 100 TYR Y 308 327 LYS B Protein S 2 FT #SUB 100 100 TYR Y 310 329 ARG B Protein B 3 FT #SUB 100 100 TYR Y 349 368 ILE B Protein S 1 FT #SUB 100 100 TYR Y 352 371 MET B Protein S 9 FT #SUB 100 100 TYR Y 355 374 THR B Protein A 6 FT #SUB 102 102 ASN Y 310 329 ARG B Protein B 3 FT #SUB 103 103 GLY Y 310 329 ARG B Protein B 2 FT #SUB 104 104 THR Y 404 423 ASN B Protein A 3 FT #SUB 105 105 ARG Y 403 422 SER B Protein S 1 FT #SUB 105 105 ARG Y 404 423 ASN B Protein A 9 FT #SUB 108 108 ASP Y 406 425 GLN B Protein S 7 FT #SUB 108 108 ASP Y 445 464 ARG B Protein S 1 FT #HET 20 20 TYR Y 3 1152 GDP Y S 7 FT #HET 20 20 TYR Y 4 1153 7MG Y S 38 FT #HET 22 22 ASP Y 4 1153 7MG Y S 2 FT #HET 43 43 TYR Y 4 1153 7MG Y S 43 FT #HET 83 83 PHE Y 4 1153 7MG Y S 6 FT #HET 85 85 PHE Y 4 1153 7MG Y S 1 FT #HET 112 112 ARG Y 4 1153 7MG Y S 3 FT #HET 114 114 ASP Y 4 1153 7MG Y S 7 FT #HET 115 115 TRP Y 4 1153 7MG Y B 1 FT #HET 116 116 ASP Y 4 1153 7MG Y A 6 FT #HET 123 123 ARG Y 4 1153 7MG Y A 14 FT #HET 125 125 TYR Y 4 1153 7MG Y B 1 FT #HET 126 126 GLY Y 4 1153 7MG Y B 2 FT #HET 127 127 ARG Y 3 1152 GDP Y S 2 FT #HET 127 127 ARG Y 4 1153 7MG Y A 16 FT #HET 133 133 GLN Y 4 1153 7MG Y A 9 FT #HET 134 134 VAL Y 3 1152 GDP Y S 6 FT #HET 134 134 VAL Y 4 1153 7MG Y A 4 FT #HET 135 135 ARG Y 3 1152 GDP Y S 1 FT #HET 135 135 ARG Y 4 1153 7MG Y S 2 FT #HET 138 138 TYR Y 3 1152 GDP Y S 33 FT DISORDER 1 4 FT DISORDER 152 156 CC Miss-BB 1 CC Miss-SC 1 CC SEQUENCE 147 AA (ATOM); CC LLKALRSDSY VELSQYRDQH FRGDNEEQEK LLKKSCTLYV GNLSFYTTEE QIYELFSKSG CC DIKKIIMGLD KMKKTACGFC FVEYYSRADA ENAMRYINGT RLDDRIIRTD WDAGFKEGRQ CC YGRGRSGGQV RDEYRQDYDA GRGGYGK CC !---- CC ALIGNMENT OF SEQRES AND ATOMRES CC SEQRES MSGGLLKALRSDSYVELSQYRDQHFRGDNEEQEKLLKKSCTLYVGNLSFY CC ATOM ----LLKALRSDSYVELSQYRDQHFRGDNEEQEKLLKKSCTLYVGNLSFY CC ********************************************** CC SEQRES TTEEQIYELFSKSGDIKKIIMGLDKMKKTACGFCFVEYYSRADAENAMRY CC ATOM TTEEQIYELFSKSGDIKKIIMGLDKMKKTACGFCFVEYYSRADAENAMRY CC ************************************************** CC SEQRES INGTRLDDRIIRTDWDAGFKEGRQYGRGRSGGQVRDEYRQDYDAGRGGYG CC ATOM INGTRLDDRIIRTDWDAGFKEGRQYGRGRSGGQVRDEYRQDYDAGRGGYG CC ************************************************** CC SEQRES KLAQNQ CC ATOM K----- CC * SQ SEQUENCE 156 AA; MW; CN; MSGGLLKALR SDSYVELSQY RDQHFRGDNE EQEKLLKKSC TLYVGNLSFY TTEEQIYELF SKSGDIKKII MGLDKMKKTA CGFCFVEYYS RADAENAMRY INGTRLDDRI IRTDWDAGFK EGRQYGRGRS GGQVRDEYRQ DYDAGRGGYG KLAQNQ //