Gene Information |
Locus tag | Cthe_3156 |
Symbol | |
ID | 4809606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3727818 |
End bp | 3729068 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108589 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_001039544 |
Protein GI | 125975634 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
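
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the reported gene length includes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 3727818
end_bp = 3729068

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that is not translated,
# so the protein is one residue shorter than the codon count.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the "Gene Length" row
print(protein_length, "aa")  # compare with the "Protein Length" row
```

Under these assumptions the computed values match the reported 1251 bp and 416 aa.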
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000160053 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT CCAATTTTGT CAGCAACAAG CTGCTGCCAT CCCTGATTCC ATTGGTTTTT GTCGTAGCGC TAAGTGCAAT AGCGGGACTG TGGATCGGTA ACTGGCAGGT CTTTTTTGCA GTCTTTGCAA CAAATGTCAT AATGCTTTTG ATAATGTACA TAACCACATC CTTTTACTCC AAACGCGCTG TTATGCAATT TGAAAAGGAA ATAAAGAGCA TAAGAGAGGG CGACTACTCA AAAACCGTCG ATGCCGAAAA ACTTGGCATG CTGCGTCCTT CTGCAGTGGT TTTCAATGAA CTGATGTCAG ATATCAGGTC ATTGATTATG GATTTCCACA ACTTGTCCAA ATCCATAATT GAAGCCACAA AATCAGTAAG CGCAACTGCC CAGCAAGCTT CAACAGCCAT GGAAGAAATA TCAAAAACCA TGGACGACAT TGCAAACGGC GCATCTGATC AGGCAGAACA GGCTCAGCAG GGTGTGGAAG TGGTTGACAA ACTTGCAGAA CAGATAAATT TTGTATATGA AAGTTACAAC AGAATTACCG AAGAAACCAG CAGAATAAAT GAACTTAACA ATATCGGTCT TGATTCTGTA AGTATACTAA GAGACAAATC AAAAGAAAAC TACGAGACTG CCGAAAAGAT ATTCTCAGTG GTTGAAAAGC TTACAAATAC GGTTAAAGAT ATCGGGCTTT TTGTCGAGTC CATTGAAAAC ATTGCCGAAC AGACAAACCT TCTGGCGTTA AATGCAGCTA TTGAAGCTGC GAGAGCAGGC GAAGCCGGAA AAGGATTCGC AGTTGTTGCG GAAGAAGTAA GAAAACTTGC AGATCAAAGC AGAAAGTCGA CTGAAGAAAT AAATATTCTG ATGCAAAGCA TCCATGAAGA ATCACAGCAT GCAATTGAGT CCATGGAAAT AATGAGAAAA GTTTCCCAGG AGCAAAACGG AGCGGTTAAC AAGACCGATA ATGCCTTTAA CAACATAGCC AACGCCATAA CCTATATAGT ATCAAAGATA AATGAAGTAA ACCAGGCCAT AACCAAAATG CAAACAGACA AAACCCAGGT TACTGCTGCC ATAGAAAACA TCTCTTCCGT ATCGGAACAA ACCGCCGCCG CAAGCCAGCA GGTTGCCGCA ACCACAGAGC ACGAATTAAG ATGTATTGAG GAAATCAAAG AATCGGCGAA AAACCTTGAA CATCTCGCGG AAGAGCTTGA GAACAAATTC AAAAAATACA ACTTGGTATA A
Protein sequence | MKKSNFVSNK LLPSLIPLVF VVALSAIAGL WIGNWQVFFA VFATNVIMLL IMYITTSFYS KRAVMQFEKE IKSIREGDYS KTVDAEKLGM LRPSAVVFNE LMSDIRSLIM DFHNLSKSII EATKSVSATA QQASTAMEEI SKTMDDIANG ASDQAEQAQQ GVEVVDKLAE QINFVYESYN RITEETSRIN ELNNIGLDSV SILRDKSKEN YETAEKIFSV VEKLTNTVKD IGLFVESIEN IAEQTNLLAL NAAIEAARAG EAGKGFAVVA EEVRKLADQS RKSTEEINIL MQSIHEESQH AIESMEIMRK VSQEQNGAVN KTDNAFNNIA NAITYIVSKI NEVNQAITKM QTDKTQVTAA IENISSVSEQ TAAASQQVAA TTEHELRCIE EIKESAKNLE HLAEELENKF KKYNLV