Gene Cthe_2696 details

Gene Information

Locus tag: Cthe_2696
Symbol:
ID: 4810690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3181575
End bp: 3183122
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 46%
IMG OID: 640108115
Product: carbohydrate kinase, YjeF related protein
Protein accession: YP_001039088
Protein GI: 125975178
COG category: [G] Carbohydrate transport and metabolism; [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein; [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region
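
The start and end coordinates, gene length, and protein length reported above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the record above.
START_BP = 3181575
END_BP = 3183122
GENE_LENGTH_BP = 1548
PROTEIN_LENGTH_AA = 515


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1548
print(expected_protein_length(GENE_LENGTH_BP))  # 515
```

Both values agree with the Gene Length and Protein Length fields.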


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.590974
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGTTG TTACGCCTGT GCAAATGAGA GAGATTGACT CATATACTAT AAACAAAGTG 
GGGATTCCGG GAATTGTATT AATGGAGAAT GCCGCCGTCA GAGTGACGGA TGAAATCCTG
AAGGATTACG GAAGCCTTGT AAATAAAAAC ATAGTTTTAT TGGCCGGTAA AGGAAACAAT
GGAGGGGATG CCTTTGCAAT TGCAAGGCAT CTTTGTAATA AAGGGGCAAA TACGGTTGTT
TTTATACTGG CAGCTAAAAA AGATATTTCA GGTGACGCGC GGATAAATCT TGACATACTG
GAAAACATGG GCGTGGAAAC GGTGGAGGTT TTGGAAGGCA ATGATTTGAC GGATATGGAA
AAAAGGCTTG AGAGGGCCGA CCTTGTGGTG GACGGTATTT TCGGTACGGG CTTAAAAGGC
GGTGTAAAAG GATTTAGAGG GAATGTTATA AAGCTTGTAA ATCAAAAAAA CAAACCAGTT
ATTGCCGTTG ACATACCGTC GGGAGTGGAC GGAGAGACCG GGGAAATAGA AGGAGAGTGT
ATAAAAGCTT ACAAAACAGT TACTTTTGGC TATCCCAAAT TCGGCCATTT CCTGCATCCC
GGATGTGAAT TTGTGGGTGA ATTGGTAGTG GCGGATATAA GTATTCCCGA CAGTGTGGCA
CAAAATTTTG ATATAAAGAG TTATGTTACA GACAAAGAAA CGGTTTCAAG GTTAATACCC
GGAAGAAAAG CAGACAGCAA TAAAGGTGAC TACGGGAGAG TATTAGTCAT AACCGGTTCC
ACGGGAATGA CCGGAGCCGG ATGCCTTGCC GGAACTGCAT CTTTGCGCTC AGGGACAGGG
CTCTTGTATC TCGGTGTTCC AAAGACTCTG TCCTTTATAT ATGAGTGTAA TCTGACGGAA
GCCGTAACCC TTCCTTTGGA GGATGAAAAC AAAGGTTTTT TGACTAAAGA GTGTATTCCT
ATGCTTTTGG AGTATATGGA AAAAATGGAT GCTGTTGCAA TAGGCCCCGG CTTGTCCACC
AAAGAAGACG TGGAAGATGT TGTGTTCAGT GTTGTTGAAA ACTGCAAGGT GCCCATGGTG
ATTGATGCCG ACGGGTTGAA CCTGATATCA AGGAATTTGC CGGTGTTAAA GAAAGCCAGG
GCACCGGTGG TTCTTACCCC CCATCCCGGA GAGATGGCAA GGCTTACAGG ATTGAGCATT
GGAGAGATCC AGAAGAGGAG GGTTGGAACC GCCAGGGAAT TTTCCGAAAA GTGGGGAGTC
ACAACTGTGT TGAAAGGAGC CAAAACTGTT GTGGCATCTC CTGACGGAAG GGTATTTATA
AATCCCACGG GCAATTCCGG AATGTCAACC GGAGGCACGG GGGATGTCCT TACGGGGATA
ATAGCAAGTT TTATCGGACA GGGGTTGGAT CCGGTTGATG CAGCGGTGGC CGGTGTGTAT
TTGCATGGAC TTTGCGGTGA CCGTGTTGCA AATGTAAAAG GGGAGCATGG TCTTGTTGCA
GGAGATTTGG CGGAGGAAAT TCCTTATGCA ATTAAGTCAC TTATATAG
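
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field of the record. The helper names are hypothetical, and only the first sequence line is used as sample input; the full sequence would need to be pasted in to compare against the reported value.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGAAGGTTG TTACGCCTGT GCAAATGAGA GAGATTGACT CATATACTAT AAACAAAGTG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this 60-base sample only
```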
 
Protein sequence
MKVVTPVQMR EIDSYTINKV GIPGIVLMEN AAVRVTDEIL KDYGSLVNKN IVLLAGKGNN 
GGDAFAIARH LCNKGANTVV FILAAKKDIS GDARINLDIL ENMGVETVEV LEGNDLTDME
KRLERADLVV DGIFGTGLKG GVKGFRGNVI KLVNQKNKPV IAVDIPSGVD GETGEIEGEC
IKAYKTVTFG YPKFGHFLHP GCEFVGELVV ADISIPDSVA QNFDIKSYVT DKETVSRLIP
GRKADSNKGD YGRVLVITGS TGMTGAGCLA GTASLRSGTG LLYLGVPKTL SFIYECNLTE
AVTLPLEDEN KGFLTKECIP MLLEYMEKMD AVAIGPGLST KEDVEDVVFS VVENCKVPMV
IDADGLNLIS RNLPVLKKAR APVVLTPHPG EMARLTGLSI GEIQKRRVGT AREFSEKWGV
TTVLKGAKTV VASPDGRVFI NPTGNSGMST GGTGDVLTGI IASFIGQGLD PVDAAVAGVY
LHGLCGDRVA NVKGEHGLVA GDLAEEIPYA IKSLI
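
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, assuming the standard codon assignments (which translation table 11 shares for elongation codons) and a sequence that starts in frame. It does not handle the alternative start codons that table 11 allows, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above; both start in frame.
first_line = "ATGAAGGTTG TTACGCCTGT GCAAATGAGA GAGATTGACT CATATACTAT AAACAAAGTG"
last_line = "GGAGATTTGG CGGAGGAAAT TCCTTATGCA ATTAAGTCAC TTATATAG"
print(translate(first_line))  # MKVVTPVQMREIDSYTINKV
print(translate(last_line))   # GDLAEEIPYAIKSLI
```

The two outputs match the first 20 and last 15 residues of the protein sequence listed above.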