Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2938 |
Symbol | |
ID | 4810221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3453351 |
End bp | 3454304 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640108361 |
Product | glucokinase |
Protein accession | YP_001039329 |
Protein GI | 125975419 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | [TIGR00744] ROK family protein (putative glucokinase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATTACA TAGGGATTGA TCTTGGCGGT ACGAATATCG CCGTGGGTTT GGTAAATGAG GAAGGAAAAA TACTTCACAA GGACAGCGTT CCCACGCTGA GGGAAAGACC GTATCAGGAA ATAATCAAAG ACATGGCAAT GCTTACCTTA AAAGTAATAA AGGATGCAGA TGTCAGCATT GACCAGGTTA AAAGCATTGG GGTCGGAAGT CCCGGAACCC CGAACTGCAA GGATGGAATT CTTATCTATA ACAATAACTT GAATTTCAGA AATGTTCCGA TAAGGTCTGA AATACAAAAA TATATTGATT TGCCCGTATA CCTTGACAAT GACGCAAACT GTGCGGCACT TGCCGAAAGT GTGGCAGGGG CGGCAAAAGG CGCAAACACA TCCGTAACCA TAACCTTGGG TACGGGAATA GGCGGAGGAG TCGTAATAGA CGGCAAAATA TACAGCGGTT TCAACTATGC GGGAGGAGAA CTGGGGCATA CTGTTTTGAT GATGGACGGT GAGCCTTGCA CCTGCGGAAG AAAAGGCTGC TGGGAAGCAT ATGCGTCGGC AACGGCTCTT ATAAGGCAGG CCAGAAAGGC GGCCGAGGCA AATCCCGATT CACTTATAAA CAAGCTTGTG GGAGGGGATT TGTCAAAAAT TGATGCAAAA ATTCCTTTTG ATGCGGCAAA GCAGGGAGAC AAAACCGGCG AGATGGTGGT GCAGCAATAT ATAAGATATA TTGCCGAAGG CCTTATCAAC ATGATAAATA TATTTATGCC TGAGGTACTG GTGATAGGTG GAGGAGTATG CAAAGAAGGA GAATACCTTT TAAAGCCTCT GAGGGAACTT ATAAAACAGG GAGTTTACAG TAAAGAGGAT ATACCTCAAA CTGAGCTGAG AACGGCCCAA ATGGGCAATG ACGCCGGAAT AATCGGTGCC GCAATGCTGG GGAAAGAATG TTAG
|
Protein sequence | MYYIGIDLGG TNIAVGLVNE EGKILHKDSV PTLRERPYQE IIKDMAMLTL KVIKDADVSI DQVKSIGVGS PGTPNCKDGI LIYNNNLNFR NVPIRSEIQK YIDLPVYLDN DANCAALAES VAGAAKGANT SVTITLGTGI GGGVVIDGKI YSGFNYAGGE LGHTVLMMDG EPCTCGRKGC WEAYASATAL IRQARKAAEA NPDSLINKLV GGDLSKIDAK IPFDAAKQGD KTGEMVVQQY IRYIAEGLIN MINIFMPEVL VIGGGVCKEG EYLLKPLREL IKQGVYSKED IPQTELRTAQ MGNDAGIIGA AMLGKEC
|
| |