Gene Cthe_0930 details

Gene Information

Locus tag: Cthe_0930
Symbol:
ID: 4811223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1118172
End bp: 1119188
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 38%
IMG OID: 640106349
Product: radical SAM family protein
Protein accession: YP_001037357
Protein GI: 125973447
COG category: [B] Chromatin structure and dynamics; [K] Transcription
COG ID: [COG1243] Histone acetyltransferase
TIGRFAM ID: [TIGR01210] conserved hypothetical protein TIGR01210; [TIGR01212] radical SAM protein, TIGR01212 family
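
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only; it assumes the coordinates are 1-based and inclusive at both ends, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1118172
end_bp = 1119188
gene_length_bp = 1017
protein_length_aa = 338

# Assumption: Start bp and End bp are 1-based and both inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # coordinate span matches Gene Length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop matches Protein Length
```

Under these assumptions all three checks print True for this record.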


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000118017
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTCTA AGCATATTGT CATACCAATT TTTATTCCTC ACAAAGGATG TCCTTTTGAC 
TGTATATATT GCAATCAAAA ATATATAAGC GGTCAAAAAG ATGACATGAC CGAAGAAAAA
ATGATATCGA TTATCGAGTC CCATATTGAT TCTGCCTGTG AAGATACGTA CATTGAGATA
GGCTTTTACG GAGGCAGCTT TACAGGTATA GAAAGAGAAG AACAATATAG GTATCTTGAG
ACGGCCAACA GGTATATAAA AGAGGGAAAG GTCAAAAGCA TACGGCTCTC AACCAGGCCG
GACTATATAA ACGAAGAAAT TCTTGATTAC CTCGAAAAAT ATTCCGTAAA GACAATAGAG
CTTGGGGTTC AAAGTCTTGA CAGGGAAGTT CTTGAGAAAA GCTGCAGGGG ACACAGTGTT
GAGGATGTTT ACAATGCTTC GGCCCTTATT AAGAAAAGAG GCTTTGTACT TGGGATACAA
ACAATGATAG GGCTTCCGGG AGACAGCAGA AAGAAGGCTC TTCATACTGC AGAGGAAGTT
GTTAAAATAA AGCCTGATAT TTTAAGGATT TATCCCACAT TAGTGGTAAG GGGTACCTAT
CTTGAAAAGA TGTATATAAA AGGTGAATAC ACCCCTTTGG AGCTTGAGGA AGCCGTTGAA
CTTTGTGCCG AGCTTCTTTA TATTTATAAA AAGAACAATA TAAATGTGAT AAGAATCGGG
CTTCAGCCCA CCGAGAGCAT AAACGAGGGT GGCGATGTTA TAGCAGGGCC TTTTCATCCT
GCCTTCAGGC AGCTGGTGGA ATCAAAAATG GCACTTAGTG CTATTGAAAA GGCGATTGTG
GAGAAAAATT TGTCGAAAAA AGACACCCTT GTAATTTGCA CTGATAAAAA AGAGATATCA
AATGTTATAG GCCAAGGAAG GAAAAATGTA GAATATTTAC GAAAAAAGTA TGGCTTTGAT
AAAATAATTG TCAGAGAATA TAATGTGGGA CATGAAATTT ATGATATAAA ATATTAA
 
Protein sequence
MASKHIVIPI FIPHKGCPFD CIYCNQKYIS GQKDDMTEEK MISIIESHID SACEDTYIEI 
GFYGGSFTGI EREEQYRYLE TANRYIKEGK VKSIRLSTRP DYINEEILDY LEKYSVKTIE
LGVQSLDREV LEKSCRGHSV EDVYNASALI KKRGFVLGIQ TMIGLPGDSR KKALHTAEEV
VKIKPDILRI YPTLVVRGTY LEKMYIKGEY TPLELEEAVE LCAELLYIYK KNNINVIRIG
LQPTESINEG GDVIAGPFHP AFRQLVESKM ALSAIEKAIV EKNLSKKDTL VICTDKKEIS
NVIGQGRKNV EYLRKKYGFD KIIVREYNVG HEIYDIKY
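
The gene and protein sequences are printed in space-separated blocks of ten residues. As a minimal sketch of how they could be handled, the Python below strips the block spacing, translates DNA, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon map is the standard assignment, which is assumed here to apply because NCBI translation table 11 shares its codon-to-amino-acid assignments with the standard table and differs in permitted start codons.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
# Assumption: sufficient for translation table 11, which uses the same
# codon-to-amino-acid mapping as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
prefix = "ATGGCTTCTA AGCATATTGT CATACCAATT"
print(translate(prefix))  # MASKHIVIPI
```

The translated prefix matches the first block of the protein sequence in this record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked in the same way.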