Gene Cthe_0390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0390 
Symbol 
ID4808467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp484746 
End bp485960 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content41% 
IMG OID640105804 
ProductROK domain-containing protein 
Protein accessionYP_001036821 
Protein GI125972911 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID[TIGR00744] ROK family protein (putative glucokinase) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATAAGT TTACGAAACT TGATTTAAAC AGTATAACAA GCAACAATAG AATGAATATA 
TTTAATTGTA TTCTTGAAGC AAAGGAAATA AACAGAGCAG TTATAGCCAA AAAAGTAGGG
CTGAGCATTC CTGCGGTTAT GTCCATTACC GATGATTTGA TTCAGAAAGG TATTATCTAT
GTTATAGGCA AGGGTAAATC CAGTGGAGGG AAACGTCCCG AATTGCTTGC GGTGGTCCCG
GACAGGTTTT TCTTTGTCGG AGTGGATGTA GGAAGAACAT CTGTAAGGGT AGTTGTAATG
AACAACTGCA GGGATGTTGT TTACAAAGTG AGTAAGCCGA CGGAATCTGT TGAACCAGAT
GAATTAATAA ATCAGATAAC CGAAATGACT ATGGAAAGTA TAAATGAATC AAAATTCCCC
CTTGACAGAG TTGTTGGTAT AGGTGTTGCA ATGCCGGGCT TAATTGAGCG GGGCACCGGC
AGGGTAATTT TTAGCCCGAA TTTTGGCTGG AACAATATTG CTTTACAAGA CGAACTTAAA
AAGCACCTTC CTTTCAATGT ACTGGTGGAA AACGCAAATC GCGCCCTGGT TATAGGAGAG
ATAAAAAATA CACAGCCAAA TCCTACTTCA TGTATTGTCG GGGTTAACCT TGGATACGGT
ATCGGATCGG CGATAGTCTT ACCAAATGGT TTGTATTATG GGGTAAGCGG AACGAGTGGT
GAAATTGGAC ATATTATTGT TGAAAACCAT GGTTCATATT GTTCGTGTGG CAATTATGGG
TGTATTGAAT CCATTGCAAG CGGAGAAGCG ATAGCTCGCG AGGCCCGTAT TGCAATAGCG
AATAAAATAC AAAGCAGCGT TTTTGAAAAG TGTGAAGGGG ATTTGAAGAA AATAGATGCC
AAAATGGTTT TTGATGCTGC AAAGGAAGGA GATCACCTTG CCCAGTCCAT AGTGGAAAAA
GCGGCTGACT ATATAGGCAA GGGTTTGGCA ATCACCATAA ACATGCTTGA CCCTGAGCAG
ATCATTCTTT GTGGTGGTTT GACGTTAAGC GGCGACTTTT TTATCGATAT GATCAAAAAA
GCGGTATCCA AGTATCAGAT GCGTTATGCC GGAGGAAATG TTAAAATTGT TGTTGGTAAA
AGTGGACTCT ATGCTACTGC CATAGGTGGT GCATGGATTG TTGCAAATAA TATTGATTTT
CTGTCAAGTA ACTAA
 
Protein sequence
MDKFTKLDLN SITSNNRMNI FNCILEAKEI NRAVIAKKVG LSIPAVMSIT DDLIQKGIIY 
VIGKGKSSGG KRPELLAVVP DRFFFVGVDV GRTSVRVVVM NNCRDVVYKV SKPTESVEPD
ELINQITEMT MESINESKFP LDRVVGIGVA MPGLIERGTG RVIFSPNFGW NNIALQDELK
KHLPFNVLVE NANRALVIGE IKNTQPNPTS CIVGVNLGYG IGSAIVLPNG LYYGVSGTSG
EIGHIIVENH GSYCSCGNYG CIESIASGEA IAREARIAIA NKIQSSVFEK CEGDLKKIDA
KMVFDAAKEG DHLAQSIVEK AADYIGKGLA ITINMLDPEQ IILCGGLTLS GDFFIDMIKK
AVSKYQMRYA GGNVKIVVGK SGLYATAIGG AWIVANNIDF LSSN