Gene Information |
Locus tag | Moth_0740 |
Symbol | |
ID | 3831132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 774509 |
End bp | 776299 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637828671 |
Product | methyl-accepting chemotaxis sensory transducer |
Protein accession | YP_429601 |
Protein GI | 83589592 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
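The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the last codon of the CDS is a stop codon that is not counted in Protein Length. Both assumptions agree with the values in this record. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Moth_0740.
length_bp = gene_length(774509, 776299)
print(length_bp)                  # 1791
print(protein_length(length_bp))  # 596
```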
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000672019 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTTTA AACTATTTAA AAAAAAGGGA CTGCAGGTAG AAGGGCCGGA CCCCCGGGCT CAAAAGAAGG CCCGGGCCGG AGGGAAGACG GGCGGTTTCT TCAGCCGCTT GAGCCTGGGG GTCAGGCTGG CGACCGGGTT CTGCCTGGTC ATCGCCATTT TTGTCGCGGT GGTCATTTAC GTCAATTTTA ACCTCTTGCA GGTAGCCGCC CTTACCAACA GGGTAACTAT CATGTACAAC CAGGGCATGC TCTATAACGA GATGACCAGT TCCATCTGGG ACGCTTACCG GCGGGCTACC GATTACATAA TCAATGGTTC CCAGACCCAC GCCTTGGGCT TTGACGACGC CATGAAGCGC TTTGATACCG CCCGCGCCCA GCTAGAAGGG CAACAACTCG ACAGCCAGAC GGCCGGCTAC CTGACGGCCA TGGCGCAGGC TGCCAAGAGT TTTACTGACA CCTTCAAAAA CAGCATCTTG AACACCAGCC AGAGTGACCG CATGGCGGCC CTGCCCATCC TGAGCTTCCA GATGGGGGCC TCCCTGGATA ACATCAATAA CATCGGTACC CATATGAACA AGGGTATAAG CGAAGAAACC GCCGCGGCGG AAGAACAACT GGCTGCCGCC GTCCGGAACG CCCGGGCCAC CCTGCTCTCT GGATTAATCC TGTCCCTGCT CCTGGGCCTG GCGATCGCCT GGTTTATCAA CCGCATGGTG GGCCGTTCCC TGGGCCAGGT GGCGGCCTAT GCCGCCCGGG TGGCCGAGGG GGACCTCACG GCCGAACCCC TTCATATCAC CAGTAAAGAC GAGGTCGGCA AGCTGGCTGC GGCCTTCAAT ACCATGGGCG AAAACCTGCG CCAGCTCATC AGCCGCGTGC GCGACATGAC CGGCCAGGTG GCTTCCGCCA GCCAGGACCT GGTCCGGATG TCCCAGGAAG TAGGAGATGC CGTCCGCCAG GTAGCCGCCA CCGTTCAGGA AATGGCCAAA GGGGCCGAAG ACCAGGCCCA GCAGGTCAGC GAGACGGCGA CAGCCACCGA CGGCCAGGCG GCCAAGGTAG AGGAGGTCCA CCGCGACACC GAGGATATGG CTGCTGCTTC CGATCAGGTG GCGGCCAGGG CCGCCGAAGG GGCCAGAGCC GTGGCCGAGG CCACGGATCA GATGGCGGCC ATATCCCAGC GTATGGAGCG CATGGCCCGC GCCGTCGAGG AACTGGGCAA CCGTTCCCAG CAAATCGGCC AGATTGTCGG CGTCATCTCC GGCATCGCCG AGCAGACCAA CCTCCTGGCC CTCAACGCCG CCATTGAGGC GGCCCGGGCC GGCGAGCAAG GACGGGGTTT CGCCGTGGTC GCCGAAGAGG TACGTAAACT GGCCGAGCAA TCAGCCGGGG CTACCAAGCA GATCGTGGAG CTGGTCCAGG AGATCCAGCG GGAGACTGAA CAAGTGGTCG CCAGTATGGC CGAGGGTTCC CGGGACGTCC AGCAGGGTAC CGAGGTGGTG GCCCGCACCG GCAAAGCCTT CAGCGCCATC GACCAGGCCA TCCATACCCT GGTAGGCAAG ATTAAAAACG TGGCCGAAAA GGCGGAAGAC ATGTACGCCG GTTCCCGCCA GGTCAAGGAA CGAGTAGAGA GCATTGCCGC CGGCATCGAA GAAGCCGCCG CCAGCACCCA GCAGGTTTCG GCCTCCACGG AAGAGCAATC AGCGGCCGTG GATCAGATCA GCCAGGCGGC CCGGCAACTG GCAGCCGCCG CCAGTAACCT GGAGGAGGCT GTGGCCAGGT TTAAACTTTA G
|
Protein sequence | MRFKLFKKKG LQVEGPDPRA QKKARAGGKT GGFFSRLSLG VRLATGFCLV IAIFVAVVIY VNFNLLQVAA LTNRVTIMYN QGMLYNEMTS SIWDAYRRAT DYIINGSQTH ALGFDDAMKR FDTARAQLEG QQLDSQTAGY LTAMAQAAKS FTDTFKNSIL NTSQSDRMAA LPILSFQMGA SLDNINNIGT HMNKGISEET AAAEEQLAAA VRNARATLLS GLILSLLLGL AIAWFINRMV GRSLGQVAAY AARVAEGDLT AEPLHITSKD EVGKLAAAFN TMGENLRQLI SRVRDMTGQV ASASQDLVRM SQEVGDAVRQ VAATVQEMAK GAEDQAQQVS ETATATDGQA AKVEEVHRDT EDMAAASDQV AARAAEGARA VAEATDQMAA ISQRMERMAR AVEELGNRSQ QIGQIVGVIS GIAEQTNLLA LNAAIEAARA GEQGRGFAVV AEEVRKLAEQ SAGATKQIVE LVQEIQRETE QVVASMAEGS RDVQQGTEVV ARTGKAFSAI DQAIHTLVGK IKNVAEKAED MYAGSRQVKE RVESIAAGIE EAAASTQQVS ASTEEQSAAV DQISQAARQL AAAASNLEEA VARFKL
|
| |
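The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is given 5' to 3' on the coding strand and in frame from the first base. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon table is used here. The helper names `CODON_TABLE` and `translate` are hypothetical, and only the first and last few codons of the record are shown.

```python
from itertools import product

# Standard codon table built in the conventional TCAG order.
# Table 11 shares these codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # tolerate the 10-base grouping used in the record
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGAGGTTTA AACTATTTAA AAAAAAGGGA"))  # MRFKLFKKKG
# Last 21 bases of the gene sequence, ending in the TAG stop codon.
print(translate("GTGGCCAGGTTTAAACTTTAG"))             # VARFKL
```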