Gene Moth_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0804 
Symbol 
ID3832135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp835017 
End bp835988 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content55% 
IMG OID637828735 
Productflagellar motor switch protein FliM 
Protein accessionYP_429665 
Protein GI83589656 
COG category[N] Cell motility 
COG ID[COG1868] Flagellar motor switch protein 
TIGRFAM ID[TIGR01397] flagellar motor switch protein FliM 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.629305 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACG TCCTATCCCA GGCGGAGATT GACGCCCTCC TCCAGGCCCT GAACAGCGGC 
GAGGTTCAGA CGGAGGTCAT TAAAGAAGAG GCTACCCCTA AGGCCAAGAA ATACGACTTT
CGCCGGCCCA ATAAGTTTTC CAAGGAACAC CTGCGCACCC TGTATATGAT CCACGAAAAT
TACGGGCGCC TGGTGGCCAA CTTCCTTTCG GCCTACCTGC GGGCCAGCAT CCAGGTGAAG
ATCGTCTCGG TGGAACAGAT GACCTATGAG GATTTTATTC TCTCGTTGCC GACGCCGACC
CTGATGAACG TCTTCAGCAT GGAGCCTTTA AAGGGTTCGG CGGTCCTGGA GACCAACATG
AATTTCATCT TCCCCATTAT CGACCTGCTC TTCGGCGGTC GGGGGGAGAT GGTGGCCCGT
AACCGGGAGT TGACGGAGAT CGAGCTCCAC GTCCTGCGGC GTTTAAACAG CCGCATGCTG
GAACAGCTCT CCTATTCCTG GTCCGACATC CAAAACATTA CTCCCAAATT GGAGAATATG
GAAACCAACC CCCAGTTTAC CCAGGCCATT TCCCCCAACG AGACGGTTGC CGTCATCACC
ATGGGGACAA CGGTGGGCAA GTATGAGGGT CTTTTAAACC TCTGCCTGCC CTATATGCTC
CTGGAGCCGG TCATTTCCCG CCTTTCGGCC AGCCACTGGT TTGCCACCGG CGGGGAAAGG
GAAGCCAGGC CTGATTACCG GACGGTGGTC GAGAAGATCC TGGCCGAAGT GCCGGTGGAA
TTGATCGCTT ACATAGGCCG CACCCGCTTG CCGGTGCGGG ATTTTATCCA GCTCCAGGTT
GGGGATGTCA TTACCCTGGA AAAAACAGTG GGCGAGGACC TGGAACTCTA TGTAGACGGG
CACCATAAGT TTCAGGTTCA ACCGGGGATT GTGAATAAAA AAATTGCCGT CCAGGTAACA
GAGGTGGTAT AG
 
Protein sequence
MADVLSQAEI DALLQALNSG EVQTEVIKEE ATPKAKKYDF RRPNKFSKEH LRTLYMIHEN 
YGRLVANFLS AYLRASIQVK IVSVEQMTYE DFILSLPTPT LMNVFSMEPL KGSAVLETNM
NFIFPIIDLL FGGRGEMVAR NRELTEIELH VLRRLNSRML EQLSYSWSDI QNITPKLENM
ETNPQFTQAI SPNETVAVIT MGTTVGKYEG LLNLCLPYML LEPVISRLSA SHWFATGGER
EARPDYRTVV EKILAEVPVE LIAYIGRTRL PVRDFIQLQV GDVITLEKTV GEDLELYVDG
HHKFQVQPGI VNKKIAVQVT EVV