Gene Cthe_2826 details


Gene Information

Locus tag: Cthe_2826
Symbol:
ID: 4809663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3342816
End bp: 3344054
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 35%
IMG OID: 640108246
Product: hypothetical protein
Protein accession: YP_001039218
Protein GI: 125975308
COG category:
COG ID:
TIGRFAM ID:
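
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3342816
end_bp = 3344054

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1239
print(protein_length)  # 412
```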


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000022412
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGATC CATTATGTGA TAAAAAAGAT TTGATAGAAA CGATAGAATT TAACCAAAAG 
GCTATTTTAA AAATGAAAGA AAAAATTATT AATCTGAAGG CCGACATAGA GAATGGTATA
CAAAGATATC CAAGAGATAA TCAAAGTATA ATTTATGGTA CATTTAAATT AATGTTTATG
TATGGAATGA GTACACTGAG AGCAAAATAT TCTTTGGGAA ATGAGCCGGA TGCAATGATA
AATGATTATT TAGATAATAT AACGTATTTA GAGAATATGG GAGAAGAAGA AATAGGATAT
ATTTTTCTTT TATGGATGGT GGGACTGGGT ATCCTTTTGG AAGTGGATAA AGAAGAATTG
AAGAAGTTGG CGAAAGTTAT AGAGAGACGA AAAACAGAAG ATGCACTTAT AGATTTTCTT
TTGAAATCCT GTGATATAGG TTGGAACCAC AGTACAACGA AATATGAAAA AAAGAACCCG
TATGAAAAGA CAGCAGAGAT TATAAAAATA GCATTGCACG ACAAAGACAA GGAAGCGGCA
TCTAAAAGGC TTGAAAAATA TATGGAAAAA GAATGGTTCA AGGGGCACTA TGACTTTGAA
TGGAGGAATG CGCACAAGAG GCCGGGGTAT TATGGTTTTT GGAGTTTTGA TACAGCGGCA
CTGGCCAAGA TACTGGGACT GGATGACAGT GCACTGAAAA ACAACAACCA TTATCCTTAT
GATTTGGCAC ACTATAAGAA GGGAATGACC TTTGATTTGA GTTGGTATAG TGTACCAAAG
GAAGAGGAAG ATAAGGAAGA AGAAACGGTG GTATATGGTA TACCGGGTAA TCCTGAGTTG
GAGAGGATAA TACCTGGGAA GTTTCACAGT TTTGTAAATG AGATAATAAA TGATTATAAA
ACACTGCCGG ACGAAGAATT TTGGAAGAAA TACAATTTGA AAGAAATCTG GTTTGATGTG
GAGGAGTATA AGGAGGATAA TAAAGATAAG AATTTGCTAG GAACGATTAT AGTATTCATG
CTTGTGGACA AAGATTATAT TTTGCAGTTG GATTATAAAG AAGAGTTAAT AGACTATATA
GAGAATATAC ATAATTACTG GGCCAAGAAA GAAGTTAAGC TTATAAGCTT TGAATTAGAC
AATGACCAGC AGTACTATGC ATATGTGCCG AAGGATGCGG AGGTTGGTTC GTTGTATGAG
GTAAAACTGA CAGAAGTGGA GAAAATAGAG GAGGTTTAG
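
A GC percentage like the one in the GC content field can be computed directly from a nucleotide sequence. The following is a minimal sketch, not the method used by the source database; the function name `gc_content` is hypothetical, and it assumes the input contains only A, C, G, T and whitespace (such as the 10-base grouping used above).

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Demonstration on the last line of the gene sequence above.
print(gc_content("GTAAAACTGA CAGAAGTGGA GAAAATAGAG GAGGTTTAG"))
```

Passing the full gene sequence to the same function gives a value that can be compared with the reported GC content.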
 
Protein sequence
MRDPLCDKKD LIETIEFNQK AILKMKEKII NLKADIENGI QRYPRDNQSI IYGTFKLMFM 
YGMSTLRAKY SLGNEPDAMI NDYLDNITYL ENMGEEEIGY IFLLWMVGLG ILLEVDKEEL
KKLAKVIERR KTEDALIDFL LKSCDIGWNH STTKYEKKNP YEKTAEIIKI ALHDKDKEAA
SKRLEKYMEK EWFKGHYDFE WRNAHKRPGY YGFWSFDTAA LAKILGLDDS ALKNNNHYPY
DLAHYKKGMT FDLSWYSVPK EEEDKEEETV VYGIPGNPEL ERIIPGKFHS FVNEIINDYK
TLPDEEFWKK YNLKEIWFDV EEYKEDNKDK NLLGTIIVFM LVDKDYILQL DYKEELIDYI
ENIHNYWAKK EVKLISFELD NDQQYYAYVP KDAEVGSLYE VKLTEVEKIE EV
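
The protein sequence corresponds to the gene sequence read three bases at a time. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in the permitted start codons), so a plain standard-code lookup is enough for a sketch. The names `CODON_TABLE` and `translate` are hypothetical, and the code assumes the input starts in frame and contains only A, C, G, T and whitespace.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGAGATC CATTATGTGA TAAAAAAGAT"))  # MRDPLCDKKD
```

The printed peptide matches the first ten residues of the protein sequence listed above.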