Gene Cthe_0110 details


Gene Information

Locus tag: Cthe_0110
Symbol:
ID: 4808736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 142033
End bp: 142989
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 44%
IMG OID: 640105521
Product: HPr kinase
Protein accession: YP_001036544
Protein GI: 125972634
COG category: [T] Signal transduction mechanisms
COG ID: [COG1493] Serine kinase of the HPr protein, regulates carbohydrate metabolism
TIGRFAM ID: [TIGR00679] Hpr(Ser) kinase/phosphatase
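
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 142033
end_bp = 142989

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (957 bp) and Protein Length (318 aa) fields.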


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCTTCAG ATGCTGCTTA TTCGGTTACG CTAAAAAAGC TGCAAAAAGC TCTTTCGCTT 
GAGCTGGTTA CCGCTGATGA TGAAGATAGA CTGGAAAATA TCCTGGTAAC ATCTCCTGAG
GTGAACAGGC CCGGACTTCA GCTGGTAGGA TACCTGGAAT ATTTCGGAAC CGACAGAATT
CAAATGATTG GAAAGGTTGA GACCAGTTAT CTTGCGGGGC TTACATCGGA GGAACGCTAT
TCCAGGCTTG ACGAGTTTTT TAAATGTGGA TTTCCCTGCA TGGTAGTGGC CAGAGGGCTT
GAGGTTTTCC CGGAAATGCT GGAGGTGTCC AGAAAATACG GTATTCCGAT TTTCAGGACG
AAAGAAACCA CATCAAGAGT ATTGAGCGCT TTAATCAGCT ACCTTAATGT TGAACTTGCC
GAGAGGACGA GAGAACATGG TGTGCTTGTC GAGGTTTTCG GAGAAGGCAT TCTGATATTG
GGAGAAAGCG GTGTGGGCAA AAGCGAGACG GCTTTGGAAC TGGTAAAAAG GGGCCACAGG
CTTGTGGCGG ATGATGTAGT GGAAATCAGA AGAGTGTCTG AGAAGACTTT GTTTGGAACC
GCTCCGGATG AAATAAGGCA CTTCATTGAA ATAAGGGGAG TTGGAATACT TGATGTCAAA
AATCTGTACG GTGTGGGTGC CGTAAAACCT ACTGAAAATA TAAATCTTGT AATACAACTT
GAGTTCTGGG ATCAAAAGAA AGACTATGAA CGGTTGGGTT TGGTGGATGA TTACAAGGAG
ATACTTGGAA TCAATATTCC ATGCCTTACC ATTCCCGTGA GACCGGGCAG AAATCTTGCC
ATTATTGTGG AAGTCGCAGC TTTGAACAAC AGACAGAAAA AAATGGGTTA TAATGCTGCC
AGAGCTTTAA ATGAGAGAAT CATAGGAAAA GGAAGAATGA GGCCCTCTTA TGAATAA
 
Protein sequence
MSSDAAYSVT LKKLQKALSL ELVTADDEDR LENILVTSPE VNRPGLQLVG YLEYFGTDRI 
QMIGKVETSY LAGLTSEERY SRLDEFFKCG FPCMVVARGL EVFPEMLEVS RKYGIPIFRT
KETTSRVLSA LISYLNVELA ERTREHGVLV EVFGEGILIL GESGVGKSET ALELVKRGHR
LVADDVVEIR RVSEKTLFGT APDEIRHFIE IRGVGILDVK NLYGVGAVKP TENINLVIQL
EFWDQKKDYE RLGLVDDYKE ILGINIPCLT IPVRPGRNLA IIVEVAALNN RQKKMGYNAA
RALNERIIGK GRMRPSYE
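
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It is not the database's own pipeline. It assumes the amino-acid assignments of NCBI translation table 11 (the same as the standard code for sense codons) and treats an alternative start codon such as GTG in the first position as methionine. The function name `translate_cds` and the constant names are hypothetical.

```python
# Build the codon table for translation table 11 from the usual
# TCAG-ordered amino-acid string ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Start codons permitted by table 11; each is read as M when it initiates.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_30 = "GTGTCTTCAG ATGCTGCTTA TTCGGTTACG"
print(translate_cds(first_30))  # MSSDAAYSVT
```

The output matches the first ten residues of the protein sequence; the leading GTG is why the protein begins with M rather than V.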