Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1647 |
Symbol | |
ID | 4601241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1594696 |
End bp | 1596270 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639774420 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_921045 |
Protein GI | 119720550 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0299074 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGGTAG CCAGGGTAAG CGAGATAAAG CTTCTCGACA GGGAGGCCGC GGAGAAGTAC GGCGTAAAGG AGGAGATCCT GATGGAGAAT GCCGGCGCCA GCGTAGCCAG GCTTGCCGTG TCGCTCATAG GGCTCCCCAT GAGCGCGGCA GTTGTCTGCG GGCCGGGGAA CAACGGGGGA GACGGGCTCG TAGCGGCTAG GCACCTTTCA AGCATGGGCG CGGACGTCAA GGTCTTCCTG GTGGCGGCCC CGGACAAGCT GGCAGGCATA GTCAAGGAGA ACTACGAGCG CGTAGTCAAG GCCGGGATAG CCGTGGAAGT AGTGGATGAG GAGAGGGCGG AGGGGCTCTC CGAGGAGCTC TCCCTCTTCG ACGTCGTCGT AGACGCGCTG TTCGGGACGG GCCTCTCCAG GCCCCTGGAG GGTGTCTACA GGAAGGTTGT AGAGGCGATA AACGGTAGCG GTTCCCTAGT GATAAGCGTC GACATCCCCT CGGGGGTCCA CGGGGACACG GGGCAGGTTC TAGGCGTAGC TGTGAGAGCT GACTACACCG TGACGTTCGG GCTCCCGAAG CTCGGGAACC TCATGTACCC CGGCGCCGAG CTAGGAGGCG AGCTGTACGT ACACCACATC TCCTACCCCA GGGCGTTGCT CGAGGACAGC CGGTTGAAGG TGGAGACGAA CGACCCCGTA CCCCTGCCGC CGAGGAGGCC GGACACGCAC AAGGGGGACT ACGGGAAGGC GCTGTTCGTC GCGGGCTCGC GCAGGTACAT GGGGGCACCC CTGCTCTGCT CAAAGTCCTT CCTGAAGGCG GGCGGCGGGT ACTCTAGGCT CGCGACGATT AAGTCCATCG TGCCGTTCCT AGGCGTGAGG GCCCCTGAGG TGGTCTACGA GGCGCTCGAG GAGACAGCCT CGGGCACGGT CGCCTACGGC AACCTCGAGA GGATACTGGA GCTCTCGAAG TCCTCGGACA TAGTGGCCGT GGGGCCGGGG CTCGGGCTCG AGGAGGAGAC GCTGAGGCTT GTCTGCGACC TAGCTAGGAG CGTCGAGAAG CCGCTCATAG TGGACGGCGA CGGCCTCACC GCGGTTGCCC GATGCGGCGA GTACATCTCG GAGAGAAGGG CTCCCACAGT GCTCACCCCG CACGCCGGCG AGATGTCGAG GCTGACGGGG AAGAGCGTGG AGGAGGTAAG GGCGTCCCGG GTCGACGCGG CGCTGGAGCT CGCCGGGAAG CTTAAAGCCT ACGTGGTGCT CAAAGGGGCT CACACGGTCA TAGCAACCCC GGATGGGAGG GCGTACATCA ACCTGTCGGG CAACCCCGGC ATGGCGACCG CGGGCTCCGG GGACGTGCTG GTAGGGGCCA TAGCGGCGCT CTACGGGCTC GGCCTGGGCT TCGAGGAGGC TGTGAGGATG GGCGTCTTCG TGCACGGGCT CGCAGGGGAC ATAGCGGCGG AGGAGCGCGG GCAGGACGGC TTAACTTCAG TGACTCTGAT GAACTACCTC CCCAAGGCTC TGAGGGCGCT GAGGGAAGAC TTCGAGAGCG TGCTCGAAAG GTATACCATT AAGGTCCTGC CGTAG
|
Protein sequence | MKVARVSEIK LLDREAAEKY GVKEEILMEN AGASVARLAV SLIGLPMSAA VVCGPGNNGG DGLVAARHLS SMGADVKVFL VAAPDKLAGI VKENYERVVK AGIAVEVVDE ERAEGLSEEL SLFDVVVDAL FGTGLSRPLE GVYRKVVEAI NGSGSLVISV DIPSGVHGDT GQVLGVAVRA DYTVTFGLPK LGNLMYPGAE LGGELYVHHI SYPRALLEDS RLKVETNDPV PLPPRRPDTH KGDYGKALFV AGSRRYMGAP LLCSKSFLKA GGGYSRLATI KSIVPFLGVR APEVVYEALE ETASGTVAYG NLERILELSK SSDIVAVGPG LGLEEETLRL VCDLARSVEK PLIVDGDGLT AVARCGEYIS ERRAPTVLTP HAGEMSRLTG KSVEEVRASR VDAALELAGK LKAYVVLKGA HTVIATPDGR AYINLSGNPG MATAGSGDVL VGAIAALYGL GLGFEEAVRM GVFVHGLAGD IAAEERGQDG LTSVTLMNYL PKALRALRED FESVLERYTI KVLP
|
| |