Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2696 |
Symbol | |
ID | 4810690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3181575 |
End bp | 3183122 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640108115 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_001039088 |
Protein GI | 125975178 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.590974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGTTG TTACGCCTGT GCAAATGAGA GAGATTGACT CATATACTAT AAACAAAGTG GGGATTCCGG GAATTGTATT AATGGAGAAT GCCGCCGTCA GAGTGACGGA TGAAATCCTG AAGGATTACG GAAGCCTTGT AAATAAAAAC ATAGTTTTAT TGGCCGGTAA AGGAAACAAT GGAGGGGATG CCTTTGCAAT TGCAAGGCAT CTTTGTAATA AAGGGGCAAA TACGGTTGTT TTTATACTGG CAGCTAAAAA AGATATTTCA GGTGACGCGC GGATAAATCT TGACATACTG GAAAACATGG GCGTGGAAAC GGTGGAGGTT TTGGAAGGCA ATGATTTGAC GGATATGGAA AAAAGGCTTG AGAGGGCCGA CCTTGTGGTG GACGGTATTT TCGGTACGGG CTTAAAAGGC GGTGTAAAAG GATTTAGAGG GAATGTTATA AAGCTTGTAA ATCAAAAAAA CAAACCAGTT ATTGCCGTTG ACATACCGTC GGGAGTGGAC GGAGAGACCG GGGAAATAGA AGGAGAGTGT ATAAAAGCTT ACAAAACAGT TACTTTTGGC TATCCCAAAT TCGGCCATTT CCTGCATCCC GGATGTGAAT TTGTGGGTGA ATTGGTAGTG GCGGATATAA GTATTCCCGA CAGTGTGGCA CAAAATTTTG ATATAAAGAG TTATGTTACA GACAAAGAAA CGGTTTCAAG GTTAATACCC GGAAGAAAAG CAGACAGCAA TAAAGGTGAC TACGGGAGAG TATTAGTCAT AACCGGTTCC ACGGGAATGA CCGGAGCCGG ATGCCTTGCC GGAACTGCAT CTTTGCGCTC AGGGACAGGG CTCTTGTATC TCGGTGTTCC AAAGACTCTG TCCTTTATAT ATGAGTGTAA TCTGACGGAA GCCGTAACCC TTCCTTTGGA GGATGAAAAC AAAGGTTTTT TGACTAAAGA GTGTATTCCT ATGCTTTTGG AGTATATGGA AAAAATGGAT GCTGTTGCAA TAGGCCCCGG CTTGTCCACC AAAGAAGACG TGGAAGATGT TGTGTTCAGT GTTGTTGAAA ACTGCAAGGT GCCCATGGTG ATTGATGCCG ACGGGTTGAA CCTGATATCA AGGAATTTGC CGGTGTTAAA GAAAGCCAGG GCACCGGTGG TTCTTACCCC CCATCCCGGA GAGATGGCAA GGCTTACAGG ATTGAGCATT GGAGAGATCC AGAAGAGGAG GGTTGGAACC GCCAGGGAAT TTTCCGAAAA GTGGGGAGTC ACAACTGTGT TGAAAGGAGC CAAAACTGTT GTGGCATCTC CTGACGGAAG GGTATTTATA AATCCCACGG GCAATTCCGG AATGTCAACC GGAGGCACGG GGGATGTCCT TACGGGGATA ATAGCAAGTT TTATCGGACA GGGGTTGGAT CCGGTTGATG CAGCGGTGGC CGGTGTGTAT TTGCATGGAC TTTGCGGTGA CCGTGTTGCA AATGTAAAAG GGGAGCATGG TCTTGTTGCA GGAGATTTGG CGGAGGAAAT TCCTTATGCA ATTAAGTCAC TTATATAG
|
Protein sequence | MKVVTPVQMR EIDSYTINKV GIPGIVLMEN AAVRVTDEIL KDYGSLVNKN IVLLAGKGNN GGDAFAIARH LCNKGANTVV FILAAKKDIS GDARINLDIL ENMGVETVEV LEGNDLTDME KRLERADLVV DGIFGTGLKG GVKGFRGNVI KLVNQKNKPV IAVDIPSGVD GETGEIEGEC IKAYKTVTFG YPKFGHFLHP GCEFVGELVV ADISIPDSVA QNFDIKSYVT DKETVSRLIP GRKADSNKGD YGRVLVITGS TGMTGAGCLA GTASLRSGTG LLYLGVPKTL SFIYECNLTE AVTLPLEDEN KGFLTKECIP MLLEYMEKMD AVAIGPGLST KEDVEDVVFS VVENCKVPMV IDADGLNLIS RNLPVLKKAR APVVLTPHPG EMARLTGLSI GEIQKRRVGT AREFSEKWGV TTVLKGAKTV VASPDGRVFI NPTGNSGMST GGTGDVLTGI IASFIGQGLD PVDAAVAGVY LHGLCGDRVA NVKGEHGLVA GDLAEEIPYA IKSLI
|
| |