Gene Cthe_3036 details

Gene Information

Locus tag: Cthe_3036
Symbol:
ID: 4811108
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3559919
End bp: 3561631
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 36%
IMG OID: 640108457
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_001039425
Protein GI: 125975515
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
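
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are hypothetical and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Cthe_3036.
length = gene_length_bp(3559919, 3561631)
print(length)                     # 1713
print(protein_length_aa(length))  # 570
```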


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTTTAACT CTTTGACAAA GAAAATGTTA GGTACGGTAA TATTAATATC TGCGATATGT 
ACCGTGTCAT TTTCCGTAAC CATATTTTAT GTATTGGAGG GTGCGGTTAC AAGGCAAATG
GAAAGTGACG GCAGAGCATT GGTTACGTCG ATAAAAAGAC AGATTGTAAG CAATAATATT
ACTGATACGA ATGAAATCAG AGAAATATTT AAGGAATTAA AGGAGCAAAG CAAAGGAAAC
ATTATTTACA TGTCCATCAC GGATGATAAA CAGAACATTC TGGTTTCTTC CGATGATGAA
AGAGTCAGCG GCAATTTGGA AAACGATGCT TCAACATCTG GAGCTCAAAG CGTGGACGGC
ATGTCCAGTG CAACAGAATT TAAAGGGGAT AAAAACAAGA TAGATGCTAT CACATCTGCA
ACAGTCGGTC TCATCTCAGA AACAGGCGAA GGTGAAAAAG TATACAATGT ATCAATTCCT
TTTTCCTCAG AGTTGATAAA ATCCGGAAGT CTTAATGTTG GTATTTCTTT GGAGAGAATG
TATGATGAAA TTAGAAGTAC TCTGATAAAT ACCATCCTGA TTTCGGTGTT AATAGGGCTT
ATTGCGGTTG CCATAGGATT TGTAATAGCC AGAACAATAA TAAATCCGAT AAAAAATGTT
ATTTTAAAAC TTGACGATTT TTCAAAAGGC GATTTTACTG TTGAATTTTA CAGCAGGGCA
AATGATGAAA CGAAAAAACT TACAGATGCA CTTAACACAT CTATATCAAT TCTGAAAGAG
ACAATAAAAA CTGTCAAGGA AAGTGTAAGC CTGCTCAATA AATTCTCTGA GGAACTTGCC
TCTACAAGTG ATGAAACTGG AACTACCGGA GAACAAATTT CTAAAAGTAT AGAAGATGTA
ACAGAATCAA TTGCGGATCA AGCATCAAAT ATTCAATATA TTATCAGTAT TTTGGAGGAC
TTTGGCAGTA AATTTGATAA GATGCTTGAA GAAACCGGTA TCGTTTTGGA TAGCAACAGC
AAAATGAAAG AAATTACCGA TATGGAATAT GTTACTCTTC AAAATTTGGT AAAAACAGTA
GAGGATACGA AAAAATCCTT TAGTTCGACG GTGGAAGAAG TAAAGTCTTT AAATAACTAT
GTTTTTGAAA TAAACAAAAT AACCGAGGTT ATAAACAGCA TAGCAGGACA GACCAATTTA
CTTGCACTGA ATGCTTCAAT TGAATCGGCA AGAGCCGGGG AGGCGGGCAA GGGGTTTGCT
GTTGTGGCGG ACGAGATAAA GAAACTTGCC GCTCAGGTAA TGGGTTATTC AAACAATATT
AATCAGCTTA TTAATAATGT TACCAAAAAT ACAGAAAAAG TGTTTAATAA CATACAGGTT
ATTTCCGATA AACTGGATGC TCAGGTGCAA ACAATCAAGT ATACTGTCAA ATCCTATGAT
AATATACGTG CTGAAGCAGA CAATGTCATT ACTCAGATTG AAAGTGTAAA CAAGTCGGTG
AATAGTTTAT CTCAGGAGAA GAATACTATA ATAGAAAGGA TTGGAAACAT TTCCGATACA
TTCAACCAGG TTGTGGCTTC AGCAGAAGAG ATAACTGCAT CCGTTCAAAC GGAGGCAGAA
AATGTGCGGC AGCTTTCCGT AATGGCTCAG GAGCTTAACA ACCTTGCAAG CAGACTAAAT
CAGGATGTAG GAGTATTTAG GGTTGAAAAA TAG
 
Protein sequence
MFNSLTKKML GTVILISAIC TVSFSVTIFY VLEGAVTRQM ESDGRALVTS IKRQIVSNNI 
TDTNEIREIF KELKEQSKGN IIYMSITDDK QNILVSSDDE RVSGNLENDA STSGAQSVDG
MSSATEFKGD KNKIDAITSA TVGLISETGE GEKVYNVSIP FSSELIKSGS LNVGISLERM
YDEIRSTLIN TILISVLIGL IAVAIGFVIA RTIINPIKNV ILKLDDFSKG DFTVEFYSRA
NDETKKLTDA LNTSISILKE TIKTVKESVS LLNKFSEELA STSDETGTTG EQISKSIEDV
TESIADQASN IQYIISILED FGSKFDKMLE ETGIVLDSNS KMKEITDMEY VTLQNLVKTV
EDTKKSFSST VEEVKSLNNY VFEINKITEV INSIAGQTNL LALNASIESA RAGEAGKGFA
VVADEIKKLA AQVMGYSNNI NQLINNVTKN TEKVFNNIQV ISDKLDAQVQ TIKYTVKSYD
NIRAEADNVI TQIESVNKSV NSLSQEKNTI IERIGNISDT FNQVVASAEE ITASVQTEAE
NVRQLSVMAQ ELNNLASRLN QDVGVFRVEK
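
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG can act as an initiation codon and is then read as methionine, although it encodes valine at internal positions. The sketch below illustrates this on the first and last stretches of the sequence. The helper name `translate_cds` is hypothetical, and the set of start codons is an assumed subset of those that table 11 allows.

```python
# Codon table in the usual T, C, A, G ordering. Table 11 assigns the same
# amino acids as the standard code and differs in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a subset of the alternative start codons of table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for idx in range(0, usable, 3):
        codon = dna[idx:idx + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon in first position is read as methionine.
        if idx == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate_cds("GTGTTTAACT CTTTGACAAA GAAAATGTTA"))      # MFNSLTKKML
# Last 33 bases, ending in the TAG stop codon.
print(translate_cds("CAGGATGTAG GAGTATTTAG GGTTGAAAAA TAG"))  # QDVGVFRVEK
```

Both outputs match the corresponding ends of the protein sequence listed above.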