Gene Cthe_2938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2938 
Symbol 
ID4810221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3453351 
End bp3454304 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content45% 
IMG OID640108361 
Productglucokinase 
Protein accessionYP_001039329 
Protein GI125975419 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTACA TAGGGATTGA TCTTGGCGGT ACGAATATCG CCGTGGGTTT GGTAAATGAG 
GAAGGAAAAA TACTTCACAA GGACAGCGTT CCCACGCTGA GGGAAAGACC GTATCAGGAA
ATAATCAAAG ACATGGCAAT GCTTACCTTA AAAGTAATAA AGGATGCAGA TGTCAGCATT
GACCAGGTTA AAAGCATTGG GGTCGGAAGT CCCGGAACCC CGAACTGCAA GGATGGAATT
CTTATCTATA ACAATAACTT GAATTTCAGA AATGTTCCGA TAAGGTCTGA AATACAAAAA
TATATTGATT TGCCCGTATA CCTTGACAAT GACGCAAACT GTGCGGCACT TGCCGAAAGT
GTGGCAGGGG CGGCAAAAGG CGCAAACACA TCCGTAACCA TAACCTTGGG TACGGGAATA
GGCGGAGGAG TCGTAATAGA CGGCAAAATA TACAGCGGTT TCAACTATGC GGGAGGAGAA
CTGGGGCATA CTGTTTTGAT GATGGACGGT GAGCCTTGCA CCTGCGGAAG AAAAGGCTGC
TGGGAAGCAT ATGCGTCGGC AACGGCTCTT ATAAGGCAGG CCAGAAAGGC GGCCGAGGCA
AATCCCGATT CACTTATAAA CAAGCTTGTG GGAGGGGATT TGTCAAAAAT TGATGCAAAA
ATTCCTTTTG ATGCGGCAAA GCAGGGAGAC AAAACCGGCG AGATGGTGGT GCAGCAATAT
ATAAGATATA TTGCCGAAGG CCTTATCAAC ATGATAAATA TATTTATGCC TGAGGTACTG
GTGATAGGTG GAGGAGTATG CAAAGAAGGA GAATACCTTT TAAAGCCTCT GAGGGAACTT
ATAAAACAGG GAGTTTACAG TAAAGAGGAT ATACCTCAAA CTGAGCTGAG AACGGCCCAA
ATGGGCAATG ACGCCGGAAT AATCGGTGCC GCAATGCTGG GGAAAGAATG TTAG
 
Protein sequence
MYYIGIDLGG TNIAVGLVNE EGKILHKDSV PTLRERPYQE IIKDMAMLTL KVIKDADVSI 
DQVKSIGVGS PGTPNCKDGI LIYNNNLNFR NVPIRSEIQK YIDLPVYLDN DANCAALAES
VAGAAKGANT SVTITLGTGI GGGVVIDGKI YSGFNYAGGE LGHTVLMMDG EPCTCGRKGC
WEAYASATAL IRQARKAAEA NPDSLINKLV GGDLSKIDAK IPFDAAKQGD KTGEMVVQQY
IRYIAEGLIN MINIFMPEVL VIGGGVCKEG EYLLKPLREL IKQGVYSKED IPQTELRTAQ
MGNDAGIIGA AMLGKEC