Gene Cthe_1196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1196 
Symbol 
ID4810148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1425450 
End bp1426688 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content35% 
IMG OID640106618 
Producthypothetical protein 
Protein accessionYP_001037621 
Protein GI125973711 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000988852 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGATC CGTTATGCAG TGAAAGTTAT TTATTAGAAA CAATAGAATA TGACAAAGAA 
GGAATTTGTA AAAGTAAAAA AAAGATTGTT ATTCTGAAAG ATGATATGGA AAAAGGTATA
CAAAGATATC CAAGGGATAA TCAAAGCATA ATTTATGCTA CGTTTTTACA TATGTTTATG
TATAACACGG AAATGCTTAC AGCCAAATAC TCTTTAGGTA GTCATCCGGA TGAAATGATT
GAAGATTATT TAAACGGTAT AGAGTATTTG GAAAATGTCG GTGAAGAAAA AGTATGGTAT
ATTGACCTTT TGTGGATGCT ATCGTTAGGT ATACTTTTAG AGGTAGACAA ACAGGATTTA
AAAAGGCTTG CTTGTGTGAT AGAGAAGCAA AAAAAAGAAG ACGCACTGAT GGATTTTCTT
TTAAAAGCTT GTGATATAGG ATGGAATCAT AATACAAGTG AATATGAGAG AAAAAATCCA
TATGCAAAGA CGGCTGAAAT TATACAAATG GCATTGCATG ATAAAGACAG GGAAAAAGCT
TCGAAAAGGC TACAACAATA TATAGAGAAA GAATGGATTA AGGGACATAA TGATCTGGAC
TTCAAAAATG CGCATAAAGA ACCCGGCTAC GTTGGCTTGT GGAGTTTTGA GGCTGCAGCA
TTGGCAAAGA TACTGGGATT GGACGACAGC GCACTGAAAG ATAACAACCA TTACCCTTAT
GATTTGGCGC ATTATAAAAA TGGAATGAGT TTTGATTTAA GCTGGTATGG TGTGCCAGTT
GAAGAGGAAG CCAAGGAAGA AGAGGCAATA GTATATGGAA TACCGAATAA CCCGGAGTTG
GAGCAAATAA TACCTGCAAA GTTCCACAGT TTTGTGAATG AAGTGATAGG AGACTACAAT
ACATTGAGCG ACGAAGAGTT TTGGAAGAAG TATAATTTGA GAGAAATCTG GTTTGATGTT
AAGGAGTACG AGGAAGATAA TAAAGCCAAA AATATGTTGG GAACGATTAT AGTGTTTTTG
CTTGTAGAGA AGGAGTATAT TTTGCAGTTG GATTATAAGG AAGATTTGGT AGATTACATA
GAAGATATAG ATAATTATTG GGGCAAAGAG GAAGTAAAGT TGATAAGCTT TGAAGTGGAC
AATGACCAGC AGTATTATGC ATACGTACCG AAAACCGCAG CAATAGATTC ATTGTACGAG
GTAAAATTGA CAGAAGTGGA GAAGATAGAG GAAGTTTAG
 
Protein sequence
MRDPLCSESY LLETIEYDKE GICKSKKKIV ILKDDMEKGI QRYPRDNQSI IYATFLHMFM 
YNTEMLTAKY SLGSHPDEMI EDYLNGIEYL ENVGEEKVWY IDLLWMLSLG ILLEVDKQDL
KRLACVIEKQ KKEDALMDFL LKACDIGWNH NTSEYERKNP YAKTAEIIQM ALHDKDREKA
SKRLQQYIEK EWIKGHNDLD FKNAHKEPGY VGLWSFEAAA LAKILGLDDS ALKDNNHYPY
DLAHYKNGMS FDLSWYGVPV EEEAKEEEAI VYGIPNNPEL EQIIPAKFHS FVNEVIGDYN
TLSDEEFWKK YNLREIWFDV KEYEEDNKAK NMLGTIIVFL LVEKEYILQL DYKEDLVDYI
EDIDNYWGKE EVKLISFEVD NDQQYYAYVP KTAAIDSLYE VKLTEVEKIE EV