Gene Cthe_2442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2442 
Symbol 
ID4809821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2912358 
End bp2913842 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content41% 
IMG OID640107856 
Productcarbohydrate kinase, FGGY 
Protein accessionYP_001038837 
Protein GI125974927 
COG category[C] Energy production and conversion 
COG ID[COG0554] Glycerol kinase 
TIGRFAM ID[TIGR01311] glycerol kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0259159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAATT TTTACGTTCT CAGCATAGAT CAGAGCACTC AAGGGACTAA AGCGGTTATT 
CTAAACGACT CTGGAATTAT TCAAGCAAGA CATGATCTCC CACATAAGCA AATCATAAAT
GAAAATGGAT GGGTTTCCCA TGATCCGGAA GAAATCTATG AAAATGTAAT TAAGACCGTG
AAAATGGTAG TGGAAAAAGC AGGCATAGAT AAAAACCGGA TTCTTTGCGT GGGAATTTCA
AACCAGAGAG AGACAACAGT CGTATGGGAC AAGAAAACAG GCAAGCCTCT TTGCAATGCA
ATTGTTTGGC AATGCAACAG AGCAAAAGAT ATTTGTGAAA GAATAAAAAA GGCGGGATAC
GAGAATTGTA TAGCCGCAAA ATCGGGTTTA AAGCTTTCTC CGTATTATCC GGCCGGTAAG
ATGACATGGT TTATGGAGAA TGTTCCGGAT GTAGACAAAA AAGCAGATGA CGGAGATGCG
GCTTTTGGAA CAATAGATAG CTGGCTTGTT TATAAACTGA CAAAGGGAAA AAGTTATAAA
ACCGATTATT CAAATGCCAG TCGTACCCAG CTTTTAAATT TAACCACACT GAAGTGGGAT
GAACAACTCT GCGACATATT TGGAATACCG GTTAAAGCAC TTCCTGAGAT TTGTGATTCA
AATTCAGTGT TTGGCGAAAC TGATTTTGAA GGTTATCTTG AAAAGCCCAT TCCAATCTGC
GGGGTACTTG GGGATTCCCA TGGTGCGTTG TTTGGACACA ACTGCAGAAA AGAAGGTTCG
ATAAAAGTTA CTTATGGAAC AGGCTCATCC GTTATGCTAA ACACGGGCAA CATACCGATT
TTCAGCAAAC ATGGATTATC CACCTCTCTT GCCTGGGTAA TCGACGGAAA AGCTTCTTAT
GTTCTCGAAG GCAATATTAA CTATACCGGT GCGGTTATTT CATGGCTTAA AGATGCTCTT
GGATTGATTC AGTCTGCGAA AGAAACGGCT GAGTTGTCAA AAAGGGCAAA CCCAAATGAT
GGAACTTATT TGGTTCCCGC ATTTACCGGT TTGGGGGCTC CGTACTGGAA AAGCGAAGCC
AAGGCGATCA TTGCCGGAAT GAGCCGTTCG ACCGGCAAAG CAGAGCTGGT GAAAGCGGCT
AATGAATCTA TTGCTTATCA AATTAATGAT GTTATTTTGG CAATGCGAAA AGATACGGGG
TTGGAAATTT CGGAATTGTG TGTTGACGGA GGACCGACCA GGGATGATTA TCTGATGCAG
TTCCAGAGCG ATATTTCTGA TGCAGATATT AAAATACCCA ATATTGAGGA GCTTTCTGCA
ACAGGAGCGG CTTTTCTGGC CGGAATGTCA GCCAATCTGT ATGATGACAC CGTGTATAAT
GCCATATCAT ATCGATTTTA CCATTCCAAA ATGAATTCTC AAGTACGCAA TGAAAAAGTT
GATGGTTGGA AAGCAGCAGT AAATATGCTT TTAAGCAAGG AGTGA
 
Protein sequence
MNNFYVLSID QSTQGTKAVI LNDSGIIQAR HDLPHKQIIN ENGWVSHDPE EIYENVIKTV 
KMVVEKAGID KNRILCVGIS NQRETTVVWD KKTGKPLCNA IVWQCNRAKD ICERIKKAGY
ENCIAAKSGL KLSPYYPAGK MTWFMENVPD VDKKADDGDA AFGTIDSWLV YKLTKGKSYK
TDYSNASRTQ LLNLTTLKWD EQLCDIFGIP VKALPEICDS NSVFGETDFE GYLEKPIPIC
GVLGDSHGAL FGHNCRKEGS IKVTYGTGSS VMLNTGNIPI FSKHGLSTSL AWVIDGKASY
VLEGNINYTG AVISWLKDAL GLIQSAKETA ELSKRANPND GTYLVPAFTG LGAPYWKSEA
KAIIAGMSRS TGKAELVKAA NESIAYQIND VILAMRKDTG LEISELCVDG GPTRDDYLMQ
FQSDISDADI KIPNIEELSA TGAAFLAGMS ANLYDDTVYN AISYRFYHSK MNSQVRNEKV
DGWKAAVNML LSKE