Gene Cthe_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0039 
Symbol 
ID4808804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp47345 
End bp49216 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content41% 
IMG OID640105448 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001036473 
Protein GI125972563 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCAGAT GGTATTATAA TTTAAAAATA TCAGCAAAAT TAATTATTGG TTTTCTCCTG 
CTTGCATTGG TTGCAGGAGT TGTCGGAGTT GTTGCCCTTT CAAACATCAA TAATATGAGC
CAGGCGGATG CCGAACTGTA TGAAAAAAAC ACATTGGGTA TCAACTATGC TGCCGGTGCT
TCCTTGAGGT TTCAGAGAAT GAGATATAAT ACTGCAAAAC TTCTGGTATA TGATGCAGAA
CAGGTAAGTA AAGGTATAGA AAAAATTCAA GAACATGTTG AAAATACTGA AAAATATTTA
AGTTTGTATG AAAGTACAGT TATAAACGAA ACCGACCGCA TTCAGCTTCA AGAATTAAAG
GCGTTGTGGG AAAAGTATAA ATCTTTGGTT GACAAAGAAG TTGAACTTGT TAAATCGGGG
AAAACCGAAG AAGCAAGACA GTTGCTTCTT TCAGATATTG ATGATATTGG AGATACTCTG
AGAGACTATT TTGAGGCTTT TGTGGAATAT AATACTACTG CAGCGAAGGA AAAAGTGGAT
GAAAATAAGC AAGTTGCGTC AACTGCTTCA ACTGTAATGA TAGTTGTGAT ATTTGTAGGC
ATATTAATAG CTATTGCTTT GGGAGTGTTT ATATCCAGGA TTATCAGCAA ACCTATCGGC
CAGATGGTGG AAGCTGCCGA CAGGCTTGCC CTTGGAGACG TGGAAGTGGA TGTCAAGGCT
GAAACCAGGG ATGAAATAGG AAAACTGGCC GAATCTTTCA AAAGAATGAT AGAAAATATC
CGTGAACAGG CGTATGTAGT AGAAAGAATT GCTGCGGGAG ACATGACTGT CGATGTAAGA
GTCAAATCCG ACAAAGACTT GCTGGGTAAA AAACTTAAGG AAATGGTTGA TACAAATAAT
GAAGTGCTTT CAAATATCAA TGAAGTTGCT GCACAGGTGG CAGCAGGAGC AAAACAGGTA
TCCGACTCAA GCATGCAGCT TTCGCAAGGA GCAACTGAAC AGGCAAGCTC GATAGAAGAG
CTGACAGCTT CCCTTGAACA GGTGGCGAAC CAGACACAGC TTAGTGCCAA GAATGCGAAT
CAGGCCAATG AACTGGCTGA AGTTGCAAAA AACAATGCAG AGCAAGGGAA CAAGCAAATG
GCTGAAATGC TCAATGCCAT GGAGGAAATC AATAATTCTT CATCAAATAT CTCCAGAATT
ATCAAAGTGA TAGACGAAAT TGCGTTCCAG ACCAATATTC TTGCACTGAA TGCCGCAGTT
GAGGCGGCAA GGGCCGGACA ACACGGAAAA GGATTTGCGG TTGTGGCGGA AGAAGTAAGA
AACCTGGCGG CAAGATCGGC GAATGCTGCG AAAGAAACCA CGGAACTTAT TGAGGGAACA
ATCAAGCGGA CTGAAAATGG TACAAAGATA GCCCGGGAAA CTGCCGAAGC TCTCAACAAA
ATAGTTGAAG GCATATCAAA GGCTGCTACG CTGGTTAATG ATATAGCTGT TGCCTCCAAC
GAACAAGCTG CGGCAATTAC TCAAATAAAT CAGGGAATTG CCCAGGTATC CCAGGTGGTA
CAGACCAACT CGGCAACATC GGAAGAAAGT GCTGCTGCAA GTGAAGAGCT GTCAAGTCAG
GCTGAGCTTT TGAAACGGTC CATTGCAAAA TTCAAGTTAA AAAATATGGG AAAAATGACA
TCCAACAGAT ATAAGGAAGT TAGTCCTGAA ATAATGAGGA TGCTTGAAGA CTATACGGAA
AACAAGCAAC CGAAAAGTTA CAGTAAGGAA GAAAATGGAG AATATAGTGA TGGAAAGGAA
ACAGCTGAGA AGGATGTTGG AGGTTTAAAA CAGAAGATAT TGTTGTCTGA CAGTGAGTTC
GGTAAATACT AG
 
Protein sequence
MLRWYYNLKI SAKLIIGFLL LALVAGVVGV VALSNINNMS QADAELYEKN TLGINYAAGA 
SLRFQRMRYN TAKLLVYDAE QVSKGIEKIQ EHVENTEKYL SLYESTVINE TDRIQLQELK
ALWEKYKSLV DKEVELVKSG KTEEARQLLL SDIDDIGDTL RDYFEAFVEY NTTAAKEKVD
ENKQVASTAS TVMIVVIFVG ILIAIALGVF ISRIISKPIG QMVEAADRLA LGDVEVDVKA
ETRDEIGKLA ESFKRMIENI REQAYVVERI AAGDMTVDVR VKSDKDLLGK KLKEMVDTNN
EVLSNINEVA AQVAAGAKQV SDSSMQLSQG ATEQASSIEE LTASLEQVAN QTQLSAKNAN
QANELAEVAK NNAEQGNKQM AEMLNAMEEI NNSSSNISRI IKVIDEIAFQ TNILALNAAV
EAARAGQHGK GFAVVAEEVR NLAARSANAA KETTELIEGT IKRTENGTKI ARETAEALNK
IVEGISKAAT LVNDIAVASN EQAAAITQIN QGIAQVSQVV QTNSATSEES AAASEELSSQ
AELLKRSIAK FKLKNMGKMT SNRYKEVSPE IMRMLEDYTE NKQPKSYSKE ENGEYSDGKE
TAEKDVGGLK QKILLSDSEF GKY