Gene Cthe_1194 details


Gene Information

Locus tag: Cthe_1194
Symbol:
ID: 4810146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1423353
End bp: 1424501
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 36%
IMG OID: 640106616
Product: hypothetical protein
Protein accession: YP_001037619
Protein GI: 125973709
COG category:
COG ID:
TIGRFAM ID:
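
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names `span_length` and `coded_residues` are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1423353
END_BP = 1424501
GENE_LENGTH_BP = 1149


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coded_residues(gene_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming its last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1149, matching Gene Length
print(coded_residues(GENE_LENGTH_BP))  # 382, matching Protein Length
```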


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGAAAG ATGATATGGA AAAGGGCATA CAAAGATATC CAAAAGACAA TCAAAGCATA 
ATTTATGCTA CATATAGAGG AATGTTTATG TATAATACAG AAATACTTAT AGCTAAATAC
TCTTTAGGTA GTCATCCGGA TGAAATGATT GAAGATTATT TAAACGGTAT AGAGTATTTG
GAAAATGTCG GTGAAGAAAA AGTATGGTAT ATTGACCTTT TGTGGATGCT ATCGTTAGGT
ATACTTTTAG AGGTAGACAA ACAGGATTTA AAAAGGCTTG CTTGTGTGAT AGAGAAGCAA
AAAAAAGAAG ACGCACTGAT GGATTTTCTT TTAAAAGCTT GTGATATAGG ATGGAATCAT
AATACAAGTG AATATGAGAG AAAAAATCCA TATGCAAAGA CGGCTGAAAT TATACAAATG
GCATTGCATG ATAAAGACAG GGAAAAAGCT TCGAAAAGGC TACAACAATA TATAGAGAAA
GAATGGATTA AGGGACATAA TGATCTGGAC TTCAAAAATG CGCATAAAGA ACCCGGCTAC
GTTGGCTTGT GGAGTTTTGA GGCTGCAGCA TTGGCAAAGA TACTGGGATT GGACGACAGC
GCACTGAAAG ATAACAACCA TTACCCTTAT GATTTGGCGC ATTATAAAAA TGGAATGAGT
TTTGATTTAA GCTGGTATGG TGTGCCAGTT GAAGAGGAAG CCAAGGAAGA AGAGGCAATA
GTATATGGAA TACCGAATAA CCCGGAGTTG GAGCAAATAA TACCTGCAAA GTTCCACAGT
TTTGTGAATG AAGTGATAGG AGACTACAAT ACATTGAGCG ACGAAGAGTT TTGGAAGAAG
TATAATTTGA GAGAAATCTG GTTTGATGTT AAGGAGTACG AGGAAGATAA TAAAGCCAAA
AATATGTTGG GAACGATTAT AGTGTTTTTG CTTGTAGAGA AGGAGTATAT TTTGCAGTTG
GATTATAAGG AAGATTTGGT AGATTACATA GAAGATATAG ATAATTATTG GGGCAAAGAG
GAAGTAAAGT TGATAAGCTT TGAAGTGGAC AATGACCAGC AGTATTATGC ATACGTACCG
AAAACCGCAG CAATAGATTC ATTGTACGAG GTAAAATTGA CAGAAGTGGA GAAGATAGAG
GAAGTTTAG
 
Protein sequence
MLKDDMEKGI QRYPKDNQSI IYATYRGMFM YNTEILIAKY SLGSHPDEMI EDYLNGIEYL 
ENVGEEKVWY IDLLWMLSLG ILLEVDKQDL KRLACVIEKQ KKEDALMDFL LKACDIGWNH
NTSEYERKNP YAKTAEIIQM ALHDKDREKA SKRLQQYIEK EWIKGHNDLD FKNAHKEPGY
VGLWSFEAAA LAKILGLDDS ALKDNNHYPY DLAHYKNGMS FDLSWYGVPV EEEAKEEEAI
VYGIPNNPEL EQIIPAKFHS FVNEVIGDYN TLSDEEFWKK YNLREIWFDV KEYEEDNKAK
NMLGTIIVFL LVEKEYILQL DYKEDLVDYI EDIDNYWGKE EVKLISFEVD NDQQYYAYVP
KTAAIDSLYE VKLTEVEKIE EV
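
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is illustrative only. It assumes that the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the tables are understood to differ in their permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first and last blocks of the gene sequence are used, to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in the conventional order where codons are enumerated
# with T, C, A, G at each position (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as shown in this record.
first_line = "ATGCTGAAAG ATGATATGGA AAAGGGCATA CAAAGATATC CAAAAGACAA TCAAAGCATA"
print(translate(first_line))   # MLKDDMEKGIQRYPKDNQSI

# Last nine bases of the gene sequence: two codons followed by a stop codon.
print(translate("GAAGTTTAG"))  # EV
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the result to be compared with the protein sequence and the GC content field of the record.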