Gene Information |
Locus tag | Tpen_0910 |
Symbol | |
ID | 4602123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 857529 |
End bp | 858611 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639773689 |
Product | extracellular solute-binding protein |
Protein accession | YP_920314 |
Protein GI | 119719819 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
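The coordinate, length, and GC fields above are related by simple arithmetic, and a short consistency check can make that explicit. The sketch below is illustrative only: the function names are hypothetical, it assumes 1-based inclusive coordinates, and it assumes the reported gene length includes the stop codon (both assumptions agree with the numbers in this record).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon encodes no residue


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the record above.
print(gene_length(857529, 858611))  # 1083
print(protein_length(1083))         # 360
```

Passing the full gene sequence from the Sequence block to `gc_percent` gives a value that can be compared with the reported GC content, which the record appears to round to a whole percent.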
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGCTGA CGAGAAGGCA GTTCCTGGCA TCGCTGGGCG CGGCCGCCGC GCTTGCCAGC ATAGGGGCGT ACTACTGGAT CTTCCCGAAG CGCGAGGAAA GGGTGCTAAG GGTCTACAAC TACAGCGCGT ACATAAACCC GGACATTATA AAGGATTTTG AGTCCAAGTA CGGCGTGAAG GTAATCTACG ACGAGTACGA GTCCGCCGAA GAGGCCTACG CGAAGCTCCA GCTAGGGGGA GGCGGCTACG ACGTCATAGT GCTCACCGAC CAGTACGTCC CCCAAGCGGT GAAGAAGGGT TTAGTGGCCC AGCTGGACAA AACGCGTATA CCCAACCTAG GCAACGTGGA CAGAGTTTTC TTCGAGAACA ACTTCGACCC CGGCCTCAGG TACGCTGTTC CCTACGCGTG GGGTACGACA GGCATAGGCG TTAACAGGGA CTACGTGGCG GAGGGCGAGA TCGAAGGGTA CGAGCAACTC TTCGACACGA AAGTCTTCCT GCCGAGGCAC AAGGGAAAGG TCTCCATGCT GGAAGAGTTC CTCGAAGTCG TCAACGCGGC GAAGCTCTAC CTGGGCATAC CGCTGAACGA CTGGTCCGAG GAGAGCAAGC AACGCATCAT AGAGCTACTT AGGGAGCAAC GCGACTTCCT CGCCGGCTAC TACGGCGCTA GCGTCTACAT ACCGGCGCTC GCGAAGGGCG ACCTCCACGC CGCTCACGCG TGGAGCGGGG ACGTCGTGCA GGCTCAAAGC GAGAACAAGT CCGTAGAGTA CGTTCTGCCG AAAGAAGGCG CGTTCGTGTG GATCGACTTC ATGGTGATAC CGGTCAACGC GAGGTCGCCG GACCTGGCGT ACGAGTGGAT AAACTTCCTC CTGGACCCGG CGGTTGCCGC GAAGAACTCC TCCTACACCT ACTACCCGAG CCCCGTGAAG AGAGAGTTAC TCGAAGGGCT TATAGGCAAG GACGTGCTGG AGAACCCCGC CGTCTACCCG CCGAGCGGGA CAAAGCTCGT GCAGACTTAC CCGTTGGACG AAAGCGCCCT CAAAGTAGTC GAGGAGATAA GCACGGCTGT AAAGAGGGTG TAG
|
Protein sequence | MKLTRRQFLA SLGAAAALAS IGAYYWIFPK REERVLRVYN YSAYINPDII KDFESKYGVK VIYDEYESAE EAYAKLQLGG GGYDVIVLTD QYVPQAVKKG LVAQLDKTRI PNLGNVDRVF FENNFDPGLR YAVPYAWGTT GIGVNRDYVA EGEIEGYEQL FDTKVFLPRH KGKVSMLEEF LEVVNAAKLY LGIPLNDWSE ESKQRIIELL REQRDFLAGY YGASVYIPAL AKGDLHAAHA WSGDVVQAQS ENKSVEYVLP KEGAFVWIDF MVIPVNARSP DLAYEWINFL LDPAVAAKNS SYTYYPSPVK RELLEGLIGK DVLENPAVYP PSGTKLVQTY PLDESALKVV EEISTAVKRV
|
| |
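The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG is an accepted start codon and is read as methionine at the initiation position. The following minimal sketch shows how the protein could be derived from the gene sequence. The names are hypothetical, the start-codon set is a simplified subset of those table 11 permits, and the sense-codon assignments used are those of the standard code, which table 11 shares.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a CDS; whitespace is ignored and translation stops at the first stop codon."""
    sequence = "".join(cds.split()).upper()
    if len(sequence) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for position in range(0, len(sequence), 3):
        codon = sequence[position:position + 3]
        if position == 0 and codon in START_CODONS:
            residues.append("M")  # an initiating codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("GTGAAGCTGA CGAGAAGGCA GTTCCTGGCA"))  # MKLTRRQFLA
```

The output matches the first ten residues of the protein sequence listed above. Translating the full gene sequence the same way gives a string that can be compared with the complete protein entry.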