Gene Moth_0502 details

Gene Information

Locus tag: Moth_0502
Symbol:
ID: 3832825
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 519708
End bp: 521234
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 52%
IMG OID: 637828436
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_429375
Protein GI: 83589366
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
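
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 519708
END_BP = 521234
GENE_LENGTH_BP = 1527
PROTEIN_LENGTH_AA = 508


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks should agree with the record's stated lengths.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 521234 - 519708 + 1 gives 1527 bp, and 1527 / 3 - 1 gives 508 aa, matching the Gene Length and Protein Length fields.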


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0174896
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTCTT TTGCTTTATG GGGAACCGCC CTAACGGGAT GGGTTGTGGC TGTGGTTCTC 
TGGAAGAGAT ATAGCAATGC CCAGGCAGAT TTAAAGGAAT TCCTCGACAG CCTGGGTCAG
GGCGATATAG TCCGAGCGGA AGGATTCTTT CGAACCCATA GTTTTTCTAT GGCTCTGGGG
GAAAGTACGC GCATGGTTCT CGAAAGATTC TTTGAAATCA TTGCTTTCAT TCAGAAAGTC
GCCGACGAAC TAAATTATAC AGCTTCATTA TTCGTACAAG AAACCGTAGC TGGCAATACC
AGCTTCCGGG AAATGGCCGC CGCCTTGCAG GATATTGCCG GGGGCGCTGA CGAGCAGGCG
CATGCCTCCC AAAAAACAGC CGAAAATGTT GGTCAGTTCA CCAATTTAGC CGAAGAAATC
GCCAGCCGTT CCCAAATGAG TTTTACCCTG GGTAAAGAGG CACTATCCAA GGTACAAGCG
GGCAGGGAAT TACTAGAAAA ACTGATCGAC GAAATGAAAA ATGTAGCCTC TTTTAATACC
GAAGCGGCCA CAGAGATGGA AGACCTGGAA GGAAAAATAG CACGGATCAA TGAATTTGTG
CACACAATCG ATCACCTAGC GGGCCAGACC AATCTCCTTT CCCTTAACGC TGCCATTGAA
GCAGCCCGGG CCGGAGAACA GGGCCGCGGC TTCGCGGTGG TGGCCGAAGA AGTGAGAAAA
CTGGCTGAAG AATCAGCGGG GGCGGCCCGT CATATAACCG AACTGGCAGA AGAGATTCAG
GCCCGGGCCG GCAGGGCTGC TCTCCAGGTA AAAAACAGCG CTGAACTGGT AAATAATAAC
GCCCTTCGCA GCAAGGAGGT AGAAACAGCT TTTGAAGCCA TCGGCCGCGC CGTCAACCAA
GCAGCAAAGG CGAGCGAAGA AATAACCAGG TATACTGAAC AACAATTGAC CCACGTCAAG
ATGGCCAACG ACAACGCTGC TCGCATGGCC GCCGTTGCTG AACAAACGGC AGCCAGCATA
GAGCAGATTT ATGCCTCCAG TTCTGAACAA CAAAACGCAA TGGCCAGGAT AGAGAAGAAT
GCCCGGGAAA TATCGCAGAT GGCCGACAAT TTCTTTAAAC TGGCCTCCGA ATATACCAGA
AACTGCTGGG ATGAAAACCT GTGCCAAAAA TTGGTACGGG ATGGGCTTGA TAATCTCCGC
CGGCTGGCCT CCACGCCGGA GGTTCAATCC ATGGAAGTTG ATAAAATAAA ACCCCTTTTA
GATGAAGCAG CAGCTACAAT GCCAATCGTG AAGACCATCC GGGCTGTAGA CCCGGACGGA
AATACCGTCT ATAGTCAACC GCCGGGCAAA GTAACCAATT GGTCCTTCCG CCCATGGTTC
CAGGCGGCGA AAAGAGGGCA AGAATATCAT ACTCAACCTT TCATTACCCA GGGAACCGGC
CGCCTGGCAA TCACTGTTGC CGTACCGGTG CATGGGGGAG ACGCCGGGAT TGTAGGTGTA
CTGGCCGCCA ACATTGCCCC GGCCTGA
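
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any computation such as the GC content reported in the record. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record as sample input.
first_line = "ATGATTTCTT TTGCTTTATG GGGAACCGCC CTAACGGGAT GGGTTGTGGC TGTGGTTCTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60 bases: six blocks of ten
print(round(gc_fraction(seq), 3))
```

Passing the full gene sequence through the same two functions would give a value to compare against the GC content field, assuming that field is computed over the whole coding sequence.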
 
Protein sequence
MISFALWGTA LTGWVVAVVL WKRYSNAQAD LKEFLDSLGQ GDIVRAEGFF RTHSFSMALG 
ESTRMVLERF FEIIAFIQKV ADELNYTASL FVQETVAGNT SFREMAAALQ DIAGGADEQA
HASQKTAENV GQFTNLAEEI ASRSQMSFTL GKEALSKVQA GRELLEKLID EMKNVASFNT
EAATEMEDLE GKIARINEFV HTIDHLAGQT NLLSLNAAIE AARAGEQGRG FAVVAEEVRK
LAEESAGAAR HITELAEEIQ ARAGRAALQV KNSAELVNNN ALRSKEVETA FEAIGRAVNQ
AAKASEEITR YTEQQLTHVK MANDNAARMA AVAEQTAASI EQIYASSSEQ QNAMARIEKN
AREISQMADN FFKLASEYTR NCWDENLCQK LVRDGLDNLR RLASTPEVQS MEVDKIKPLL
DEAAATMPIV KTIRAVDPDG NTVYSQPPGK VTNWSFRPWF QAAKRGQEYH TQPFITQGTG
RLAITVAVPV HGGDAGIVGV LAANIAPA