Gene Cthe_2855 details


Gene Information

Locus tag: Cthe_2855
Symbol:
ID: 4809135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3373261
End bp: 3374499
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 35%
IMG OID: 640108275
Product: hypothetical protein
Protein accession: YP_001039247
Protein GI: 125975337
COG category:
COG ID:
TIGRFAM ID:
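
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3373261
end_bp = 3374499

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.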


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000208059
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGATC CGTTATGTGA TAAAAATTAT TTATTAAAAA CAATAGAACT TAGAAAGAAA 
TATATTTGTG AAATGAAAGG AGAAATTGTT CAATTAAAAT CTGATATAGA AAAGGGGATT
CAGAGATATC CTAGAGATAA TCAAAGTATA ATTTTTGCTA GATTTGCAAT AATGTTTATG
TATGGTATGG ACATGCTTTT AGCAAAATAT TCCTTGGGCA ATCACCCTGA TACAATGATA
GATGACTATT TAGACAACAT AACATATTTA GAGAATTGCG GTGAAGAAGA GGCCGGCTAC
ATTAACCTTT TATGGATGGT TGGACTGGGT ATCCTTTTGG AAATGGATAA AGAAGTGTTA
AAAAGACTGG CAAGAGTTAT AGAAAGGCAA AGAATAGAAG ACGCACTTAT GGATTTTCTA
TTGAAATCCT GTGATATAGG TTGGAATCAC AGTACAACGA AATATGAAAA AAAGAACCCG
TATGAAAAGA CAGCAGAGAT TATAAAAATA GCATTACACG ACAAAGACAA GGAAGCGGCA
TCAAAAAGGC TTGAAAAATA CATGGGAAAA GAATGGTTCA AGGGACATTA CGACTTTGGG
TGGAGGAATG CCCATAAGGA ACCTGGCTAT TATGGTTTTT GGAGTTTTGA TACAGCGGCA
CTGGCCAAGA TACTGGGACT GGATGACAGT GCGTTAAAAG ACAACAACCA TTATCCTTAT
GATTTGGCAC ACTATAAAAA TGGAATGACC TTTGATTTGA GTTGGTATAG TGTACCAAAG
GAAGAGGAAG ATAAGGAAGA AGAAACGGTG GTATATGGTA TACCGGGTAA TCCTGAGTTG
GAGAGAATAA TACCTGGGAG ATTCCACAGT TTTGTAAATG AGATAATAAA TGATTATAAA
ACACTGCCGG ACGAAGAATT TTGGAAGAAA TACAATTTGA AAGAAATCTG GTTTGATGTG
GAGGAGTATA AGGAGGATAA TAAAGATAAG AATTTGCTAG GAACGATTAT AGTATTCATG
CTTGTGGACA AAGATTATAT TTTGCAGTTG GATTATAAAG AAGAGTTAAT AGACTATATA
GAGAATATAC ATAATTACTG GGCCAAGAAA GAAGTTAAGC TTATAAGCTT TGAATTAGAC
AATGACCAGC AGTACTATGC ATATGTGCCG AAGGATGCGG AGGTTGGTTC GTTGTATGAG
GTAAAACTGA CAGAAGTGGA GAAAATAGAG GAGGTTTAG
 
Protein sequence
MRDPLCDKNY LLKTIELRKK YICEMKGEIV QLKSDIEKGI QRYPRDNQSI IFARFAIMFM 
YGMDMLLAKY SLGNHPDTMI DDYLDNITYL ENCGEEEAGY INLLWMVGLG ILLEMDKEVL
KRLARVIERQ RIEDALMDFL LKSCDIGWNH STTKYEKKNP YEKTAEIIKI ALHDKDKEAA
SKRLEKYMGK EWFKGHYDFG WRNAHKEPGY YGFWSFDTAA LAKILGLDDS ALKDNNHYPY
DLAHYKNGMT FDLSWYSVPK EEEDKEEETV VYGIPGNPEL ERIIPGRFHS FVNEIINDYK
TLPDEEFWKK YNLKEIWFDV EEYKEDNKDK NLLGTIIVFM LVDKDYILQL DYKEELIDYI
ENIHNYWAKK EVKLISFELD NDQQYYAYVP KDAEVGSLYE VKLTEVEKIE EV
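
The sequences above are printed in blocks of ten characters, so they need the whitespace removed before any analysis. The following is a hedged sketch of how the gene sequence could be cleaned, checked for GC content, and translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not taken from the source page. The codon table is the standard one; translation table 11 uses the same codon-to-amino-acid assignments and differs only in the start codons it permits, which does not matter here because the gene starts with ATG. Only the first line of the gene sequence is used as an example.

```python
# Standard codon table, built from the conventional TCAG ordering.
# Assumption: adequate for translation table 11, which shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGAGATC CGTTATGTGA TAAAAATTAT TTATTAAAAA CAATAGAACT TAGAAAGAAA"

print(len(clean(first_line)))
print(translate(first_line))
print(round(gc_fraction(first_line), 3))
```

The translation of this first line gives the first 20 residues of the protein sequence listed above. Running the same functions on the full gene sequence would allow the Gene Length and GC content fields to be checked against the record.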