ID 6GS4A STANDARD; PRT; 508 AA.
DT CONVERTED FROM PDB (SEQRES) 6GS4
DE Dipeptide and tripeptide permease A
OS Escherichia coli K-12
CC EXPDTA X-RAY DIFFRACTION
CC RESOLU 2.650
CC R-Factor 0.217
FT #SUB 46 46 VAL A 53 53 TRP H Protein S 1
FT #SUB 46 46 VAL A 103 103 SER H Protein S 1
FT #SUB 49 49 VAL A 53 53 TRP H Protein S 2
FT #SUB 50 50 LYS A 31 31 SER H Protein A 3
FT #SUB 50 50 LYS A 53 53 TRP H Protein S 11
FT #SUB 50 50 LYS A 100 100 GLN H Protein S 3
FT #SUB 50 50 LYS A 101 101 TYR H Protein S 3
FT #SUB 50 50 LYS A 102 102 GLY H Protein S 3
FT #SUB 174 174 ALA A 101 101 TYR H Protein B 3
FT #SUB 175 175 ALA A 99 99 LYS H Protein B 3
FT #SUB 175 175 ALA A 102 102 GLY H Protein S 1
FT #SUB 177 177 TYR A 101 101 TYR H Protein B 5
FT #SUB 178 178 GLY A 101 101 TYR H Protein B 4
FT #SUB 179 179 TRP A 101 101 TYR H Protein S 2
FT #SUB 303 303 ALA A 56 56 ARG H Protein B 1
FT #SUB 304 304 ILE A 56 56 ARG H Protein B 3
FT #SUB 304 304 ILE A 103 103 SER H Protein S 2
FT #SUB 305 305 ARG A 54 54 THR H Protein S 4
FT #SUB 305 305 ARG A 56 56 ARG H Protein B 3
FT #SUB 306 306 ASN A 56 56 ARG H Protein B 3
FT #SUB 307 307 VAL A 56 56 ARG H Protein B 9
FT #SUB 308 308 GLU A 56 56 ARG H Protein B 2
FT #SUB 309 309 HIS A 56 56 ARG H Protein A 7
FT #SUB 309 309 HIS A 103 103 SER H Protein S 1
FT #SUB 309 309 HIS A 104 104 ARG H Protein A 2
FT #SUB 315 315 ALA A 104 104 ARG H Protein S 2
FT #SUB 316 316 VAL A 104 104 ARG H Protein B 5
FT #SUB 317 317 GLU A 102 102 GLY H Protein S 1
FT #SUB 317 317 GLU A 103 103 SER H Protein S 1
FT #SUB 317 317 GLU A 104 104 ARG H Protein S 5
FT #SUB 317 317 GLU A 107 107 TYR H Protein S 5
FT #SUB 318 318 PRO A 103 103 SER H Protein S 2
FT #SUB 318 318 PRO A 104 104 ARG H Protein S 1
FT #SUB 319 319 GLU A 103 103 SER H Protein S 4
FT #SUB 379 379 ILE A 56 56 ARG H Protein S 6
FT #SUB 446 446 ASN A 53 53 TRP H Protein B 2
FT #SUB 446 446 ASN A 54 54 THR H Protein B 4
FT #SUB 447 447 VAL A 53 53 TRP H Protein S 1
FT #SUB 447 447 VAL A 54 54 THR H Protein B 3
FT #SUB 447 447 VAL A 55 55 GLY H Protein S 1
FT #SUB 447 447 VAL A 72 72 ARG H Protein S 2
FT #SUB 447 447 VAL A 74 74 ASN H Protein S 3
FT #SUB 448 448 THR A 54 54 THR H Protein B 3
FT #SUB 448 448 THR A 55 55 GLY H Protein B 1
FT #SUB 450 450 PRO A 55 55 GLY H Protein S 1
FT #SUB 450 450 PRO A 56 56 ARG H Protein S 2
FT #HET 38 38 TYR A 1 601 F9E A S 9
FT #HET 130 130 LYS A 1 601 F9E A S 2
FT #HET 156 156 TYR A 1 601 F9E A S 5
FT #HET 160 160 ASN A 1 601 F9E A S 8
FT #HET 163 163 SER A 1 601 F9E A S 1
FT #HET 167 167 MET A 1 601 F9E A S 1
FT #HET 289 289 PHE A 1 601 F9E A S 3
FT #HET 325 325 ASN A 1 601 F9E A S 4
FT #HET 326 326 PRO A 1 601 F9E A S 1
FT #HET 365 365 LEU A 2 602 LMT A A 5
FT #HET 368 368 PRO A 2 602 LMT A A 13
FT #HET 369 369 LEU A 2 602 LMT A B 2
FT #HET 371 371 ALA A 2 602 LMT A S 1
FT #HET 372 372 LYS A 2 602 LMT A S 7
FT #HET 396 396 GLU A 1 601 F9E A S 6
FT #HET 399 399 ILE A 1 601 F9E A B 1
FT #HET 400 400 SER A 1 601 F9E A B 1
FT #HET 402 402 LEU A 1 601 F9E A B 4
FT #HET 458 458 GLY A 2 602 LMT A B 1
FT #HET 462 462 LEU A 2 602 LMT A S 1
FT DISORDER 1 16
FT DISORDER 143 144
FT DISORDER 342 343
FT DISORDER 489 508
CC SEQUENCE 468 AA (ATOM);
CC FKQPKAFYLI FSIELWERFG YYGLQGIMAV YLVKQLGMSE ADSITLFSSF SALVYGLVAI
CC GGWLGDKVLG TKRVIMLGAI VLAIGYALVA WSGHDAGIVY MGMAAIAVGN GLFKANPSSL
CC LSTCYEDPRL DGAFTMYYMS VNIGSFFSMI ATPWLAAKYG WSVAFALSVV GLLITIVNFA
CC FCQRWVKQYG SKPDFEPINY RNLLLTIIGV VALIAIATWL LHNQEVARMA LGVVAFGIVV
CC IFGKEAFAMK GAARRKMIVA FILMLEAIIF FVLYSQMPTS LNFFAIRNVE HSILGLAVEP
CC EQYQALNPFW IIIGSPILAA IYNGDTLPMP TKFAIGMVMC SGAFLILPLG AKFASDAGIV
CC SVSWLVASYG LQSIGELMIS GLGLAMVAQL VPQRLMGFIM GSWFLTTAGA NLIGGYVAGM
CC MAVPDNVTDP LMSLEVYGRV FLQIGVATAV IAVLMLLTAP KLHRMTQD
CC !----
CC ALIGNMENT OF SEQRES AND ATOMRES
CC SEQRES GSTANQKPTESVSLNAFKQPKAFYLIFSIELWERFGYYGLQGIMAVYLVK
CC ATOM   ----------------FKQPKAFYLIFSIELWERFGYYGLQGIMAVYLVK
CC                        **********************************
CC SEQRES QLGMSEADSITLFSSFSALVYGLVAIGGWLGDKVLGTKRVIMLGAIVLAI
CC ATOM   QLGMSEADSITLFSSFSALVYGLVAIGGWLGDKVLGTKRVIMLGAIVLAI
CC        **************************************************
CC SEQRES GYALVAWSGHDAGIVYMGMAAIAVGNGLFKANPSSLLSTCYEKNDPRLDG
CC ATOM   GYALVAWSGHDAGIVYMGMAAIAVGNGLFKANPSSLLSTCYE--DPRLDG
CC        ******************************************  ******
CC SEQRES AFTMYYMSVNIGSFFSMIATPWLAAKYGWSVAFALSVVGLLITIVNFAFC
CC ATOM   AFTMYYMSVNIGSFFSMIATPWLAAKYGWSVAFALSVVGLLITIVNFAFC
CC        **************************************************
CC SEQRES QRWVKQYGSKPDFEPINYRNLLLTIIGVVALIAIATWLLHNQEVARMALG
CC ATOM   QRWVKQYGSKPDFEPINYRNLLLTIIGVVALIAIATWLLHNQEVARMALG
CC        **************************************************
CC SEQRES VVAFGIVVIFGKEAFAMKGAARRKMIVAFILMLEAIIFFVLYSQMPTSLN
CC ATOM   VVAFGIVVIFGKEAFAMKGAARRKMIVAFILMLEAIIFFVLYSQMPTSLN
CC        **************************************************
CC SEQRES FFAIRNVEHSILGLAVEPEQYQALNPFWIIIGSPILAAIYNKMGDTLPMP
CC ATOM   FFAIRNVEHSILGLAVEPEQYQALNPFWIIIGSPILAAIYN--GDTLPMP
CC        *****************************************  *******
CC SEQRES TKFAIGMVMCSGAFLILPLGAKFASDAGIVSVSWLVASYGLQSIGELMIS
CC ATOM   TKFAIGMVMCSGAFLILPLGAKFASDAGIVSVSWLVASYGLQSIGELMIS
CC        **************************************************
CC SEQRES GLGLAMVAQLVPQRLMGFIMGSWFLTTAGANLIGGYVAGMMAVPDNVTDP
CC ATOM   GLGLAMVAQLVPQRLMGFIMGSWFLTTAGANLIGGYVAGMMAVPDNVTDP
CC        **************************************************
CC SEQRES LMSLEVYGRVFLQIGVATAVIAVLMLLTAPKLHRMTQDDAADKAAKAAVA
CC ATOM   LMSLEVYGRVFLQIGVATAVIAVLMLLTAPKLHRMTQD------------
CC        **************************************
CC SEQRES STHHHHHH
CC ATOM   --------
CC
SQ SEQUENCE 508 AA; MW; CN;
     GSTANQKPTE SVSLNAFKQP KAFYLIFSIE LWERFGYYGL QGIMAVYLVK QLGMSEADSI
     TLFSSFSALV YGLVAIGGWL GDKVLGTKRV IMLGAIVLAI GYALVAWSGH DAGIVYMGMA
     AIAVGNGLFK ANPSSLLSTC YEKNDPRLDG AFTMYYMSVN IGSFFSMIAT PWLAAKYGWS
     VAFALSVVGL LITIVNFAFC QRWVKQYGSK PDFEPINYRN LLLTIIGVVA LIAIATWLLH
     NQEVARMALG VVAFGIVVIF GKEAFAMKGA ARRKMIVAFI LMLEAIIFFV LYSQMPTSLN
     FFAIRNVEHS ILGLAVEPEQ YQALNPFWII IGSPILAAIY NKMGDTLPMP TKFAIGMVMC
     SGAFLILPLG AKFASDAGIV SVSWLVASYG LQSIGELMIS GLGLAMVAQL VPQRLMGFIM
     GSWFLTTAGA NLIGGYVAGM MAVPDNVTDP LMSLEVYGRV FLQIGVATAV IAVLMLLTAP
     KLHRMTQDDA ADKAAKAAVA STHHHHHH
//
ID 6GS4H STANDARD; PRT; 132 AA.
DT CONVERTED FROM PDB (SEQRES) 6GS4
DE nanobody
OS Lama glama
CC EXPDTA X-RAY DIFFRACTION
CC RESOLU 2.650
CC R-Factor 0.217
FT #SUB 31 31 SER H 50 50 LYS A Protein A 3
FT #SUB 53 53 TRP H 46 46 VAL A Protein S 1
FT #SUB 53 53 TRP H 49 49 VAL A Protein S 2
FT #SUB 53 53 TRP H 50 50 LYS A Protein S 11
FT #SUB 53 53 TRP H 446 446 ASN A Protein B 2
FT #SUB 53 53 TRP H 447 447 VAL A Protein B 1
FT #SUB 54 54 THR H 305 305 ARG A Protein A 4
FT #SUB 54 54 THR H 446 446 ASN A Protein A 4
FT #SUB 54 54 THR H 447 447 VAL A Protein B 3
FT #SUB 54 54 THR H 448 448 THR A Protein B 3
FT #SUB 55 55 GLY H 447 447 VAL A Protein B 1
FT #SUB 55 55 GLY H 448 448 THR A Protein B 1
FT #SUB 55 55 GLY H 450 450 PRO A Protein B 1
FT #SUB 56 56 ARG H 303 303 ALA A Protein S 1
FT #SUB 56 56 ARG H 304 304 ILE A Protein S 3
FT #SUB 56 56 ARG H 305 305 ARG A Protein S 3
FT #SUB 56 56 ARG H 306 306 ASN A Protein S 3
FT #SUB 56 56 ARG H 307 307 VAL A Protein S 9
FT #SUB 56 56 ARG H 308 308 GLU A Protein S 2
FT #SUB 56 56 ARG H 309 309 HIS A Protein S 7
FT #SUB 56 56 ARG H 379 379 ILE A Protein S 6
FT #SUB 56 56 ARG H 450 450 PRO A Protein A 2
FT #SUB 72 72 ARG H 447 447 VAL A Protein S 2
FT #SUB 74 74 ASN H 447 447 VAL A Protein S 3
FT #SUB 99 99 LYS H 175 175 ALA A Protein S 3
FT #SUB 100 100 GLN H 50 50 LYS A Protein B 3
FT #SUB 101 101 TYR H 50 50 LYS A Protein B 3
FT #SUB 101 101 TYR H 174 174 ALA A Protein S 3
FT #SUB 101 101 TYR H 177 177 TYR A Protein S 5
FT #SUB 101 101 TYR H 178 178 GLY A Protein S 4
FT #SUB 101 101 TYR H 179 179 TRP A Protein S 2
FT #SUB 102 102 GLY H 50 50 LYS A Protein B 3
FT #SUB 102 102 GLY H 175 175 ALA A Protein B 1
FT #SUB 102 102 GLY H 317 317 GLU A Protein B 1
FT #SUB 103 103 SER H 46 46 VAL A Protein S 1
FT #SUB 103 103 SER H 304 304 ILE A Protein A 2
FT #SUB 103 103 SER H 309 309 HIS A Protein B 1
FT #SUB 103 103 SER H 317 317 GLU A Protein B 1
FT #SUB 103 103 SER H 318 318 PRO A Protein A 2
FT #SUB 103 103 SER H 319 319 GLU A Protein S 4
FT #SUB 104 104 ARG H 309 309 HIS A Protein S 2
FT #SUB 104 104 ARG H 315 315 ALA A Protein S 2
FT #SUB 104 104 ARG H 316 316 VAL A Protein S 5
FT #SUB 104 104 ARG H 317 317 GLU A Protein S 5
FT #SUB 104 104 ARG H 318 318 PRO A Protein S 1
FT #SUB 107 107 TYR H 317 317 GLU A Protein S 5
FT DISORDER 126 132
CC SEQUENCE 125 AA (ATOM);
CC QVQLQESGGG LVQAGGSLRL SCAGSGRTFS SYNMGWFRQA PGKEREFVGG ISWTGRSADY
CC PDSVKGRFTI SRDNAKNAVY LQMNSLKPED TAVYYCAAKQ YGSRADYPWD DYDYWGQGTQ
CC VTVSS
CC !----
CC ALIGNMENT OF SEQRES AND ATOMRES
CC SEQRES QVQLQESGGGLVQAGGSLRLSCAGSGRTFSSYNMGWFRQAPGKEREFVGG
CC ATOM   QVQLQESGGGLVQAGGSLRLSCAGSGRTFSSYNMGWFRQAPGKEREFVGG
CC        **************************************************
CC SEQRES ISWTGRSADYPDSVKGRFTISRDNAKNAVYLQMNSLKPEDTAVYYCAAKQ
CC ATOM   ISWTGRSADYPDSVKGRFTISRDNAKNAVYLQMNSLKPEDTAVYYCAAKQ
CC        **************************************************
CC SEQRES YGSRADYPWDDYDYWGQGTQVTVSSGAAEPEA
CC ATOM   YGSRADYPWDDYDYWGQGTQVTVSS-------
CC        *************************
SQ SEQUENCE 132 AA; MW; CN;
     QVQLQESGGG LVQAGGSLRL SCAGSGRTFS SYNMGWFRQA PGKEREFVGG ISWTGRSADY
     PDSVKGRFTI SRDNAKNAVY LQMNSLKPED TAVYYCAAKQ YGSRADYPWD DYDYWGQGTQ
     VTVSSGAAEP EA
//