Gene Cthe_2226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2226 
Symbol 
ID4811091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2654298 
End bp2655944 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content38% 
IMG OID640107632 
ProductFkbH like protein 
Protein accessionYP_001038621 
Protein GI125974711 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3882] Predicted enzyme involved in methoxymalonyl-ACP biosynthesis 
TIGRFAM ID[TIGR01681] HAD-superfamily phosphatase, subfamily IIIC
[TIGR01686] FkbH-like domain 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAACTGT CCCTGCTTTC AAACGTAAAT CTGGATTCTC TTATTAGCAG ATTGTCAAAG 
CAGTATGATG TGTATAAAAC TGAAGGATAT GGGACATGGA TACAGGAGAT AATAAATCCT
GATTCCGGAT TGTATTCATT TGGGCCGGAT ATGGTCTTTA TTGTTCTTGA CGGAGAAGAA
ATGTTAAGAG GCCAGAGTGA AAGCAACGAA ATTATTGACG TAAATATAAA CTATATTGAA
GAAGCAGTGA AGAATAATCC CAATATAACC TTCTTTGTGA GCAATATTGA TTTGTGGACG
GCAAAAATTG AAAGTGCGAA ATCCGGGTCA AAGGAAAGAA GGCTTGAATT CCTGTGGGAA
GACAGCCTTT TTTTGCTGTG CGAAAAATAT AAAAATCTTT ATGTTTTTGA TATAAAAAGC
ATTGTGGAGG ATAAAGGAAG AGAACAATTT TATTCTAAAA AGCTGTGGTA TTTGGGCGGA
ATAAAATATT CCATGAAGGC GGAAAAATTG CTGGAGCAGT ATATTAACAG ATGTGTTGCT
TCTTTTAAGG GTAAAAGAAA AAAGTGCCTG GTTTTGGACC TCGATAACAC CTTGTGGGGC
GGCGTTGTAG GAGAAGCCGG TATTGAGGGG ATTGAACTTT CGGATTACAA GGAAGGTGCC
AGATATAAGG ATTTTCAAAG GAAACTCAAG GAAATAAAAG ATTTGGGGAT AATACTTGCC
GTGGTATCCA AAAACAATTT TGATGATGCC ATAAAGGTTA TAAGGGAACA CAAGCATATG
GTGTTAAAAG AGGAAGACTT TGTCGCGCTG AAAATAAATT GGGATTTGAA ATCCCAAAAT
ATAAGGGATT TATCCGAAGA GCTCAATATT GGACTTGATT CCATGGTGTT TATTGACGAT
AATCCGGTGG AAAGGGAAAG TGTAAAAAGG GAGCTTCCTG AGGTGGTTGT ACCGGATTTT
CCGCAGGACA GTTCGGAACT TGTGGATTTT GCAACTGAAC TTTACAATAA TTATTTCTAT
ACATTGGACA CGACTTATGA AGATACCGTG AAAACCGAAA TATACCGTCA GAATATGAAA
CGAAGGGACG CACAAAAATC CAGTGCTTCG TATGAAGATT TTCTAAGGTC TTTGGAAACC
AGGATTGAGA TTCGCAGAAT AAATGCGGAA AATGTTCAGC GGGCTGCGCA GCTTACACAA
AAAACCAATC AGTTCAACTT GACTACAAAA AGGTATAGCG AACAGGAACT TCTGGCTTTA
ATAAATGACC AAGGATTTGA AGGATTTGTG GCTTATGTCA GTGACAAATT CGGGGACAAC
GGAATGGTAA GTGTGGTAAT AACAAGACGC AAAAGCGACA GCGAAGTTGA ACTGGACACC
TTTTTGCTAA GCTGTAGGGT CATGGGCAGG TTTATCGAAG ACCGGATAAT AGGTTTCATT
GAGGATTTAT ATAAAAAAGC CGGTTATAAG AAATTCATTA CATATTACAG GCCAACGGAA
AAAAACGCTC CGGTAAAGGA TTTGTTTGAA AGGCTGGGTT ATACGCTTTT GGATGTTGAC
CCAGAAGGAA ATAAAAAATA TGTTTTGGAT TTCGAAAGGT TAAGTGAGTG TTCCAGAAAA
GAATTCGGGG AGCTGATAGC ATTATGA
 
Protein sequence
MKLSLLSNVN LDSLISRLSK QYDVYKTEGY GTWIQEIINP DSGLYSFGPD MVFIVLDGEE 
MLRGQSESNE IIDVNINYIE EAVKNNPNIT FFVSNIDLWT AKIESAKSGS KERRLEFLWE
DSLFLLCEKY KNLYVFDIKS IVEDKGREQF YSKKLWYLGG IKYSMKAEKL LEQYINRCVA
SFKGKRKKCL VLDLDNTLWG GVVGEAGIEG IELSDYKEGA RYKDFQRKLK EIKDLGIILA
VVSKNNFDDA IKVIREHKHM VLKEEDFVAL KINWDLKSQN IRDLSEELNI GLDSMVFIDD
NPVERESVKR ELPEVVVPDF PQDSSELVDF ATELYNNYFY TLDTTYEDTV KTEIYRQNMK
RRDAQKSSAS YEDFLRSLET RIEIRRINAE NVQRAAQLTQ KTNQFNLTTK RYSEQELLAL
INDQGFEGFV AYVSDKFGDN GMVSVVITRR KSDSEVELDT FLLSCRVMGR FIEDRIIGFI
EDLYKKAGYK KFITYYRPTE KNAPVKDLFE RLGYTLLDVD PEGNKKYVLD FERLSECSRK
EFGELIAL