Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2029 |
Symbol | |
ID | 3831404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2118680 |
End bp | 2120338 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637829958 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_430868 |
Protein GI | 83590859 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 53 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTACTCA AGTGGCAAAC CAAACTTCTG TATTTTATCC TCCTACTACT CCTGGCTCTG GCGCCGGGTA CTGTGGGAAC GGCCCTGGTG GCCATCCAGG GTATTAGCTG GCAAATTCTC CTTTTGCCGG CTGTCGTCGC CGGTGCCCTG GGGCTGATTT TAAGCCTGGG TATCCTCCTT AACCTGTACA TAAAGTGGAT GCGGCCCCTG CACCGGGTCC TGCAGTTCCT TAATTTACTG GGCAAAGGCG ACCCGGTCCA GGCCGAGAAG TCTTTACAGA ACGCGCGACT GGGGGAGAGC TTCCAGGGGC CGGTAACTGC CGTCCTGGAC AGTTTTTACC GCCTGGTGGG TCGCTTACAA TTAACTGCCG ATGAGCTGGC CCATTTTTCC CGTAACCTCG AGGAGAGTTC CAGTACCACT TCCCGTAACC TGGAGGAAGT AACGGCAGCC ATCCAGGGGA TTACCAGCGG AGCCGATGAA CAGGCCGGTG CAGCCCAGCG GGTAGCGGAG AATATCAATG TTTTACATAA TTTAGCAGAA GATATTAACG ATCGGGCCGC CCTGGGGATG GAGATGGGGG AGGAAGTCAA TAGGAAAGAA AAGGAAGGTC GGGACCTCCT GGAGCATTTG CTCCAAGAGA TAAAGGCCGG GGCCTCCTCC ATCCAGGAAG CTGCCGGGCG GATGCGGCAG CTGGAAGCCA AAATGGACCA GATCAACACC CTGGTCCAGG CAGTAACGGA GATCGCCGAC CAGACCAACC TCCTGGCCCT GAATGCCGCC ATTGAAGCCG CCCGCGCCGG TGAGCAGGGT CGAGGTTTTG CTGTGGTGGC CGAAGAGGTG CGCAAGCTCG CAGAACAGTC GGCGGCAGCC GCCCAGGATA TAACCTCCCT GGCTGCCTCC ATTGGCGATG AAGCGCGCCA GACGGCAGCC CAGGTGGATA AGAATGTGGA ATTGGTCCAG AGCAACATCC AGCGCGGCGC CCAGGTGCGG GAGAACTTTA GCGTTGTCAG TGAGGCGATA AAAAAAGCCG CCGAAGTCAT GACCAATATC AGTCACCAGG CCCAGAACCA GCTGACCAGG GTAAAGGAAG TCGGCGAGGC CGCCGGCCGT ATGGCCGCCG TGGCTCAGGA GACTGCCGCC AGCATTGAAG AAGTAGCGGC GGCTACCGAA GAGCATAAGT CCACCATGGC CGTAGTGGAG GAGCATACGC GCCAGTTTAC GGATATGGCG CGGAATTTCT TTACCATGGT CGCCTCCTTT ACCAGGGACG GCTGGGATGA GGACCTGCGC CGGGAACTCA TTCGCCAGGG ACAGGAGGTG CTGGCAAGGC TGGCCGCCGA CCCGGGAGTT AAAAAAATGG AGGCGACAAC CCTGGCACCC ATCCTGGATG ACACCTTCAG TAAATCACCT TTTATCCAGA CATTGATTGC CGCCCTGCCT GATGGTACGG CTATCTATAA CCGGCCGGAG TCGACTATAA CAAACTGGGC CTTCCGGCCA TGGTTTCAGG CGGCTGTCAG GGGTGAAAAC TACGCTAGCG AGCCCTACGT GACCCAGTGC ACCAACCGGG TGGCCGTTAC CATCTCTGTA CCCATTTTCG GTGATGAAGG CCGCATCGCC GGGGTCCTGG CGGCCAACAT AGCCCCGGCG CGGAGATAA
|
Protein sequence | MVLKWQTKLL YFILLLLLAL APGTVGTALV AIQGISWQIL LLPAVVAGAL GLILSLGILL NLYIKWMRPL HRVLQFLNLL GKGDPVQAEK SLQNARLGES FQGPVTAVLD SFYRLVGRLQ LTADELAHFS RNLEESSSTT SRNLEEVTAA IQGITSGADE QAGAAQRVAE NINVLHNLAE DINDRAALGM EMGEEVNRKE KEGRDLLEHL LQEIKAGASS IQEAAGRMRQ LEAKMDQINT LVQAVTEIAD QTNLLALNAA IEAARAGEQG RGFAVVAEEV RKLAEQSAAA AQDITSLAAS IGDEARQTAA QVDKNVELVQ SNIQRGAQVR ENFSVVSEAI KKAAEVMTNI SHQAQNQLTR VKEVGEAAGR MAAVAQETAA SIEEVAAATE EHKSTMAVVE EHTRQFTDMA RNFFTMVASF TRDGWDEDLR RELIRQGQEV LARLAADPGV KKMEATTLAP ILDDTFSKSP FIQTLIAALP DGTAIYNRPE STITNWAFRP WFQAAVRGEN YASEPYVTQC TNRVAVTISV PIFGDEGRIA GVLAANIAPA RR
|
| |