Gene Moth_2029 details


Gene Information

Locus tag: Moth_2029
Symbol:
ID: 3831404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2118680
End bp: 2120338
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 58%
IMG OID: 637829958
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_430868
Protein GI: 83590859
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
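
The gene and protein lengths listed above follow from the start and end coordinates. The short sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 2118680
end_bp = 2120338

# An inclusive interval contains end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# Every three bases form one codon; the last codon is assumed to be the stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "1659 552"
```

Both values agree with the Gene Length and Protein Length fields of the record.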


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGTACTCA AGTGGCAAAC CAAACTTCTG TATTTTATCC TCCTACTACT CCTGGCTCTG 
GCGCCGGGTA CTGTGGGAAC GGCCCTGGTG GCCATCCAGG GTATTAGCTG GCAAATTCTC
CTTTTGCCGG CTGTCGTCGC CGGTGCCCTG GGGCTGATTT TAAGCCTGGG TATCCTCCTT
AACCTGTACA TAAAGTGGAT GCGGCCCCTG CACCGGGTCC TGCAGTTCCT TAATTTACTG
GGCAAAGGCG ACCCGGTCCA GGCCGAGAAG TCTTTACAGA ACGCGCGACT GGGGGAGAGC
TTCCAGGGGC CGGTAACTGC CGTCCTGGAC AGTTTTTACC GCCTGGTGGG TCGCTTACAA
TTAACTGCCG ATGAGCTGGC CCATTTTTCC CGTAACCTCG AGGAGAGTTC CAGTACCACT
TCCCGTAACC TGGAGGAAGT AACGGCAGCC ATCCAGGGGA TTACCAGCGG AGCCGATGAA
CAGGCCGGTG CAGCCCAGCG GGTAGCGGAG AATATCAATG TTTTACATAA TTTAGCAGAA
GATATTAACG ATCGGGCCGC CCTGGGGATG GAGATGGGGG AGGAAGTCAA TAGGAAAGAA
AAGGAAGGTC GGGACCTCCT GGAGCATTTG CTCCAAGAGA TAAAGGCCGG GGCCTCCTCC
ATCCAGGAAG CTGCCGGGCG GATGCGGCAG CTGGAAGCCA AAATGGACCA GATCAACACC
CTGGTCCAGG CAGTAACGGA GATCGCCGAC CAGACCAACC TCCTGGCCCT GAATGCCGCC
ATTGAAGCCG CCCGCGCCGG TGAGCAGGGT CGAGGTTTTG CTGTGGTGGC CGAAGAGGTG
CGCAAGCTCG CAGAACAGTC GGCGGCAGCC GCCCAGGATA TAACCTCCCT GGCTGCCTCC
ATTGGCGATG AAGCGCGCCA GACGGCAGCC CAGGTGGATA AGAATGTGGA ATTGGTCCAG
AGCAACATCC AGCGCGGCGC CCAGGTGCGG GAGAACTTTA GCGTTGTCAG TGAGGCGATA
AAAAAAGCCG CCGAAGTCAT GACCAATATC AGTCACCAGG CCCAGAACCA GCTGACCAGG
GTAAAGGAAG TCGGCGAGGC CGCCGGCCGT ATGGCCGCCG TGGCTCAGGA GACTGCCGCC
AGCATTGAAG AAGTAGCGGC GGCTACCGAA GAGCATAAGT CCACCATGGC CGTAGTGGAG
GAGCATACGC GCCAGTTTAC GGATATGGCG CGGAATTTCT TTACCATGGT CGCCTCCTTT
ACCAGGGACG GCTGGGATGA GGACCTGCGC CGGGAACTCA TTCGCCAGGG ACAGGAGGTG
CTGGCAAGGC TGGCCGCCGA CCCGGGAGTT AAAAAAATGG AGGCGACAAC CCTGGCACCC
ATCCTGGATG ACACCTTCAG TAAATCACCT TTTATCCAGA CATTGATTGC CGCCCTGCCT
GATGGTACGG CTATCTATAA CCGGCCGGAG TCGACTATAA CAAACTGGGC CTTCCGGCCA
TGGTTTCAGG CGGCTGTCAG GGGTGAAAAC TACGCTAGCG AGCCCTACGT GACCCAGTGC
ACCAACCGGG TGGCCGTTAC CATCTCTGTA CCCATTTTCG GTGATGAAGG CCGCATCGCC
GGGGTCCTGG CGGCCAACAT AGCCCCGGCG CGGAGATAA
 
Protein sequence
MVLKWQTKLL YFILLLLLAL APGTVGTALV AIQGISWQIL LLPAVVAGAL GLILSLGILL 
NLYIKWMRPL HRVLQFLNLL GKGDPVQAEK SLQNARLGES FQGPVTAVLD SFYRLVGRLQ
LTADELAHFS RNLEESSSTT SRNLEEVTAA IQGITSGADE QAGAAQRVAE NINVLHNLAE
DINDRAALGM EMGEEVNRKE KEGRDLLEHL LQEIKAGASS IQEAAGRMRQ LEAKMDQINT
LVQAVTEIAD QTNLLALNAA IEAARAGEQG RGFAVVAEEV RKLAEQSAAA AQDITSLAAS
IGDEARQTAA QVDKNVELVQ SNIQRGAQVR ENFSVVSEAI KKAAEVMTNI SHQAQNQLTR
VKEVGEAAGR MAAVAQETAA SIEEVAAATE EHKSTMAVVE EHTRQFTDMA RNFFTMVASF
TRDGWDEDLR RELIRQGQEV LARLAADPGV KKMEATTLAP ILDDTFSKSP FIQTLIAALP
DGTAIYNRPE STITNWAFRP WFQAAVRGEN YASEPYVTQC TNRVAVTISV PIFGDEGRIA
GVLAANIAPA RR
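
The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed GC value describes that fragment rather than the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from the record, used only as sample input.
first_line = "ATGGTACTCA AGTGGCAAAC CAAACTTCTG TATTTTATCC TCCTACTACT CCTGGCTCTG"
seq = clean_sequence(first_line)

# Report the cleaned length, the start codon, and the GC fraction of the fragment.
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Pasting the full gene sequence into `clean_sequence` in the same way should allow the Gene Length and GC content fields to be checked against the sequence itself.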