Gene Cthe_1956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1956 
Symbol 
ID4810739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2330336 
End bp2331268 
Gene Length933 bp 
Protein Length310 aa 
Translation table11 
GC content46% 
IMG OID640107372 
Producttagatose-6-phosphate kinase 
Protein accessionYP_001038367 
Protein GI125974457 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1105] Fructose-1-phosphate kinase and related fructose-6-phosphate kinase (PfkB) 
TIGRFAM ID[TIGR01231] tagatose-6-phosphate kinase
[TIGR03168] hexose kinase, 1-phosphofructokinase family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000247641 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAACAT CTGTGGCTCT CAACCCCGCA GTGGACAAAA TTTATTTTGT TGACAACTTC 
GAACCGGGAA GAATGTACCG GGTGCGGCAA ATGGTAAAAA CGGCCGGGGG AAAAGGTGTA
AATGTTGCGC GGGTTGCCCG TATGTTGGGA GAAAATGTCC GGCTGACAGG CTTTAAAGGC
GGAGAGACAG GTAACTGGCT GGAATCGCAG CTTAAGAAAC TGGGGGTCGT TACAAGATTT
GTTGAAGTAT CCGGTGAGAC AAGAACAAAC AACAATATTA TAGACAGAGT AAGAGACAGT
GAGACGGAAG TACTGGAGCC GGGGCCTTTT ATATCCGGCG AAGACATGGA AAAATTCATG
GAGGTTTATA AAGAGGCTCT TTCCGATTCC AAGGTCGTTG TGCTGTCAGG CGGGCTTCCC
CAGGGAGTGC CTGCATGCTG TTATAAGGCT CTTATTGAAG AGGCAAAAAA CTTTAATATT
CCTGTTATAC TTGACAGCGG CGGAGATGCT TTAAAAGAAG GCATAAAGGC AAAGCCAAAT
GTTATAAAAC CGAATTTGAG GGAATTGGGA AGTCTCATTC AAAAAGAATT AAGGGATATG
GACGAAATTG TTGAGGCGCT GAAAGAAATT AATGCAGACG GAATAGATAT TTCAATGGTT
TCCATGGGCG ACAAGGGAGC TGTTCTGTGC ACGAAAGATT TGTGCCTTAG AGTAAAAGTG
CCGCATGTGG AGACGGTAAA CACCATAGGC TCCGGAGATG CCATGGTGGC AGGGTTTGCA
GCGGGACTTG CAAGAGACAA AACAATGGAA GAGTGCCTAA GGCTTGCGGC AGCCTGCGGC
GTGAGCAATG CGCGCTTTTT GGAAATCGGT GTTGTGGATA AGAATGAAGT CGAAATCCAA
AAGAACAGAG TGGAAATTGA GAGAATATCT TGA
 
Protein sequence
MITSVALNPA VDKIYFVDNF EPGRMYRVRQ MVKTAGGKGV NVARVARMLG ENVRLTGFKG 
GETGNWLESQ LKKLGVVTRF VEVSGETRTN NNIIDRVRDS ETEVLEPGPF ISGEDMEKFM
EVYKEALSDS KVVVLSGGLP QGVPACCYKA LIEEAKNFNI PVILDSGGDA LKEGIKAKPN
VIKPNLRELG SLIQKELRDM DEIVEALKEI NADGIDISMV SMGDKGAVLC TKDLCLRVKV
PHVETVNTIG SGDAMVAGFA AGLARDKTME ECLRLAAACG VSNARFLEIG VVDKNEVEIQ
KNRVEIERIS