Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0039 |
Symbol | |
ID | 4808804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 47345 |
End bp | 49216 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105448 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_001036473 |
Protein GI | 125972563 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAGAT GGTATTATAA TTTAAAAATA TCAGCAAAAT TAATTATTGG TTTTCTCCTG CTTGCATTGG TTGCAGGAGT TGTCGGAGTT GTTGCCCTTT CAAACATCAA TAATATGAGC CAGGCGGATG CCGAACTGTA TGAAAAAAAC ACATTGGGTA TCAACTATGC TGCCGGTGCT TCCTTGAGGT TTCAGAGAAT GAGATATAAT ACTGCAAAAC TTCTGGTATA TGATGCAGAA CAGGTAAGTA AAGGTATAGA AAAAATTCAA GAACATGTTG AAAATACTGA AAAATATTTA AGTTTGTATG AAAGTACAGT TATAAACGAA ACCGACCGCA TTCAGCTTCA AGAATTAAAG GCGTTGTGGG AAAAGTATAA ATCTTTGGTT GACAAAGAAG TTGAACTTGT TAAATCGGGG AAAACCGAAG AAGCAAGACA GTTGCTTCTT TCAGATATTG ATGATATTGG AGATACTCTG AGAGACTATT TTGAGGCTTT TGTGGAATAT AATACTACTG CAGCGAAGGA AAAAGTGGAT GAAAATAAGC AAGTTGCGTC AACTGCTTCA ACTGTAATGA TAGTTGTGAT ATTTGTAGGC ATATTAATAG CTATTGCTTT GGGAGTGTTT ATATCCAGGA TTATCAGCAA ACCTATCGGC CAGATGGTGG AAGCTGCCGA CAGGCTTGCC CTTGGAGACG TGGAAGTGGA TGTCAAGGCT GAAACCAGGG ATGAAATAGG AAAACTGGCC GAATCTTTCA AAAGAATGAT AGAAAATATC CGTGAACAGG CGTATGTAGT AGAAAGAATT GCTGCGGGAG ACATGACTGT CGATGTAAGA GTCAAATCCG ACAAAGACTT GCTGGGTAAA AAACTTAAGG AAATGGTTGA TACAAATAAT GAAGTGCTTT CAAATATCAA TGAAGTTGCT GCACAGGTGG CAGCAGGAGC AAAACAGGTA TCCGACTCAA GCATGCAGCT TTCGCAAGGA GCAACTGAAC AGGCAAGCTC GATAGAAGAG CTGACAGCTT CCCTTGAACA GGTGGCGAAC CAGACACAGC TTAGTGCCAA GAATGCGAAT CAGGCCAATG AACTGGCTGA AGTTGCAAAA AACAATGCAG AGCAAGGGAA CAAGCAAATG GCTGAAATGC TCAATGCCAT GGAGGAAATC AATAATTCTT CATCAAATAT CTCCAGAATT ATCAAAGTGA TAGACGAAAT TGCGTTCCAG ACCAATATTC TTGCACTGAA TGCCGCAGTT GAGGCGGCAA GGGCCGGACA ACACGGAAAA GGATTTGCGG TTGTGGCGGA AGAAGTAAGA AACCTGGCGG CAAGATCGGC GAATGCTGCG AAAGAAACCA CGGAACTTAT TGAGGGAACA ATCAAGCGGA CTGAAAATGG TACAAAGATA GCCCGGGAAA CTGCCGAAGC TCTCAACAAA ATAGTTGAAG GCATATCAAA GGCTGCTACG CTGGTTAATG ATATAGCTGT TGCCTCCAAC GAACAAGCTG CGGCAATTAC TCAAATAAAT CAGGGAATTG CCCAGGTATC CCAGGTGGTA CAGACCAACT CGGCAACATC GGAAGAAAGT GCTGCTGCAA GTGAAGAGCT GTCAAGTCAG GCTGAGCTTT TGAAACGGTC CATTGCAAAA TTCAAGTTAA AAAATATGGG AAAAATGACA TCCAACAGAT ATAAGGAAGT TAGTCCTGAA ATAATGAGGA TGCTTGAAGA CTATACGGAA AACAAGCAAC CGAAAAGTTA CAGTAAGGAA GAAAATGGAG AATATAGTGA TGGAAAGGAA ACAGCTGAGA AGGATGTTGG AGGTTTAAAA CAGAAGATAT TGTTGTCTGA CAGTGAGTTC GGTAAATACT AG
|
Protein sequence | MLRWYYNLKI SAKLIIGFLL LALVAGVVGV VALSNINNMS QADAELYEKN TLGINYAAGA SLRFQRMRYN TAKLLVYDAE QVSKGIEKIQ EHVENTEKYL SLYESTVINE TDRIQLQELK ALWEKYKSLV DKEVELVKSG KTEEARQLLL SDIDDIGDTL RDYFEAFVEY NTTAAKEKVD ENKQVASTAS TVMIVVIFVG ILIAIALGVF ISRIISKPIG QMVEAADRLA LGDVEVDVKA ETRDEIGKLA ESFKRMIENI REQAYVVERI AAGDMTVDVR VKSDKDLLGK KLKEMVDTNN EVLSNINEVA AQVAAGAKQV SDSSMQLSQG ATEQASSIEE LTASLEQVAN QTQLSAKNAN QANELAEVAK NNAEQGNKQM AEMLNAMEEI NNSSSNISRI IKVIDEIAFQ TNILALNAAV EAARAGQHGK GFAVVAEEVR NLAARSANAA KETTELIEGT IKRTENGTKI ARETAEALNK IVEGISKAAT LVNDIAVASN EQAAAITQIN QGIAQVSQVV QTNSATSEES AAASEELSSQ AELLKRSIAK FKLKNMGKMT SNRYKEVSPE IMRMLEDYTE NKQPKSYSKE ENGEYSDGKE TAEKDVGGLK QKILLSDSEF GKY
|
| |