Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0390 |
Symbol | |
ID | 4808467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 484746 |
End bp | 485960 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105804 |
Product | ROK domain-containing protein |
Protein accession | YP_001036821 |
Protein GI | 125972911 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | [TIGR00744] ROK family protein (putative glucokinase) |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATAAGT TTACGAAACT TGATTTAAAC AGTATAACAA GCAACAATAG AATGAATATA TTTAATTGTA TTCTTGAAGC AAAGGAAATA AACAGAGCAG TTATAGCCAA AAAAGTAGGG CTGAGCATTC CTGCGGTTAT GTCCATTACC GATGATTTGA TTCAGAAAGG TATTATCTAT GTTATAGGCA AGGGTAAATC CAGTGGAGGG AAACGTCCCG AATTGCTTGC GGTGGTCCCG GACAGGTTTT TCTTTGTCGG AGTGGATGTA GGAAGAACAT CTGTAAGGGT AGTTGTAATG AACAACTGCA GGGATGTTGT TTACAAAGTG AGTAAGCCGA CGGAATCTGT TGAACCAGAT GAATTAATAA ATCAGATAAC CGAAATGACT ATGGAAAGTA TAAATGAATC AAAATTCCCC CTTGACAGAG TTGTTGGTAT AGGTGTTGCA ATGCCGGGCT TAATTGAGCG GGGCACCGGC AGGGTAATTT TTAGCCCGAA TTTTGGCTGG AACAATATTG CTTTACAAGA CGAACTTAAA AAGCACCTTC CTTTCAATGT ACTGGTGGAA AACGCAAATC GCGCCCTGGT TATAGGAGAG ATAAAAAATA CACAGCCAAA TCCTACTTCA TGTATTGTCG GGGTTAACCT TGGATACGGT ATCGGATCGG CGATAGTCTT ACCAAATGGT TTGTATTATG GGGTAAGCGG AACGAGTGGT GAAATTGGAC ATATTATTGT TGAAAACCAT GGTTCATATT GTTCGTGTGG CAATTATGGG TGTATTGAAT CCATTGCAAG CGGAGAAGCG ATAGCTCGCG AGGCCCGTAT TGCAATAGCG AATAAAATAC AAAGCAGCGT TTTTGAAAAG TGTGAAGGGG ATTTGAAGAA AATAGATGCC AAAATGGTTT TTGATGCTGC AAAGGAAGGA GATCACCTTG CCCAGTCCAT AGTGGAAAAA GCGGCTGACT ATATAGGCAA GGGTTTGGCA ATCACCATAA ACATGCTTGA CCCTGAGCAG ATCATTCTTT GTGGTGGTTT GACGTTAAGC GGCGACTTTT TTATCGATAT GATCAAAAAA GCGGTATCCA AGTATCAGAT GCGTTATGCC GGAGGAAATG TTAAAATTGT TGTTGGTAAA AGTGGACTCT ATGCTACTGC CATAGGTGGT GCATGGATTG TTGCAAATAA TATTGATTTT CTGTCAAGTA ACTAA
|
Protein sequence | MDKFTKLDLN SITSNNRMNI FNCILEAKEI NRAVIAKKVG LSIPAVMSIT DDLIQKGIIY VIGKGKSSGG KRPELLAVVP DRFFFVGVDV GRTSVRVVVM NNCRDVVYKV SKPTESVEPD ELINQITEMT MESINESKFP LDRVVGIGVA MPGLIERGTG RVIFSPNFGW NNIALQDELK KHLPFNVLVE NANRALVIGE IKNTQPNPTS CIVGVNLGYG IGSAIVLPNG LYYGVSGTSG EIGHIIVENH GSYCSCGNYG CIESIASGEA IAREARIAIA NKIQSSVFEK CEGDLKKIDA KMVFDAAKEG DHLAQSIVEK AADYIGKGLA ITINMLDPEQ IILCGGLTLS GDFFIDMIKK AVSKYQMRYA GGNVKIVVGK SGLYATAIGG AWIVANNIDF LSSN
|
| |