Gene Cthe_2839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2839 
Symbol 
ID4809676 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3356345 
End bp3357583 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content35% 
IMG OID640108259 
Producthypothetical protein 
Protein accessionYP_001039231 
Protein GI125975321 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0837377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGATC CGTTATGCAG TGAAAGTTAT TTGTTAGAAA CAATAGAATT TGACAAGGAA 
GAAATTTGTG AAAGAAAAAA AAAGATTATT GTGCTGAAAG ATGATATGGA AAAGGGCATA
CAAAGATATC CAAAAGACAA TCAAAGCATA ATTTATGCTA CATATAGAGG AATGTTTATG
TATAATACAG AAATACTTAT AGCTAAATAC TCTTTAGGTA GTCATCCGGA TGAAATGATT
GAAGATTATT TAAACGGTAT AGAGTATTTG GAAAATGTCG GTGAAGAAAA AGTATGGTAT
ATTGATCTTT TGTGGATGCT ATCGTTAGGT ATACTTTTAG AGGTAGACAA ACAAGATTTA
AAAAGGCTTG CTTGTGTGAT AGAGAAGCAA AAAAAAGAAG ACGCACTGAT GGATTTTCTT
TTAAAGGCTT GTGATATAGG ATGGAATCAT AATACAAGTG AATATGAGAG AAAAAATCCA
TATGCAAAGA CGGCTGAAAT TATACAAATG GCATTGCATG ATAAAGACAG GGAAAAAGCT
TCGAAAAGGC TACAACAATA TATAGAGAAA GAGTGGGTTA AGGGACATAA TGATCTGGAC
TTCAAAAATG CGCATAAAGA ACCCGGCTAC GTTGGCTTGT GGAGTTTTGA GGCTGCAGCA
TTGGCAAAGA TACTGGGATT GGACGACAGC GCACTGAAAG ATAACAACCA TTACCCTTAT
GATTTGGCGC ATTATAAAAA TGGAATGAGT TTTGATTTAA GCTGGTATGG TGTGCCAGTT
GAAGAGGAAG CCAAGGAAGA AGAGGCAATA GTGTATGGAA TACCGAACAA ACCTGAGTTG
GAGCAAATAA TACCTGCAAA ATTCCACAGT TTTGTGAATG AAGTGATAGG AGACTACAAT
ACATTGACTG ATGAAGAGTT TTGGAAGAAG TATAATTTGA GAGAAATCTG GTTTGATGTT
AAGGAGTACG AGGAAGATAA TAAAGCCAAA AATATGTTGG GAACGATTAT AGTATTTTTG
CTTGTAGAGA AGGAGTATAT TTTGCAGTTG GATTATAAGG AAGATTTGGT AGATTACATA
GAAGATATAG ATAATTATTG GGGTAAAGAG GAAGTAAAGT TGATAAGCTT TGAAGTGGAC
AATGACCAGC AGTATTATGC ATACGTACCG AAAACCGCAG CAATAGATTC GTTGTATGAG
GTGAAATTGA CAGAAGTGGA GAAGATAGAG GAAGTTTAG
 
Protein sequence
MRDPLCSESY LLETIEFDKE EICERKKKII VLKDDMEKGI QRYPKDNQSI IYATYRGMFM 
YNTEILIAKY SLGSHPDEMI EDYLNGIEYL ENVGEEKVWY IDLLWMLSLG ILLEVDKQDL
KRLACVIEKQ KKEDALMDFL LKACDIGWNH NTSEYERKNP YAKTAEIIQM ALHDKDREKA
SKRLQQYIEK EWVKGHNDLD FKNAHKEPGY VGLWSFEAAA LAKILGLDDS ALKDNNHYPY
DLAHYKNGMS FDLSWYGVPV EEEAKEEEAI VYGIPNKPEL EQIIPAKFHS FVNEVIGDYN
TLTDEEFWKK YNLREIWFDV KEYEEDNKAK NMLGTIIVFL LVEKEYILQL DYKEDLVDYI
EDIDNYWGKE EVKLISFEVD NDQQYYAYVP KTAAIDSLYE VKLTEVEKIE EV