Gene Moth_0740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0740 
Symbol 
ID3831132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp774509 
End bp776299 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content62% 
IMG OID637828671 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_429601 
Protein GI83589592 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000672019 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTTTA AACTATTTAA AAAAAAGGGA CTGCAGGTAG AAGGGCCGGA CCCCCGGGCT 
CAAAAGAAGG CCCGGGCCGG AGGGAAGACG GGCGGTTTCT TCAGCCGCTT GAGCCTGGGG
GTCAGGCTGG CGACCGGGTT CTGCCTGGTC ATCGCCATTT TTGTCGCGGT GGTCATTTAC
GTCAATTTTA ACCTCTTGCA GGTAGCCGCC CTTACCAACA GGGTAACTAT CATGTACAAC
CAGGGCATGC TCTATAACGA GATGACCAGT TCCATCTGGG ACGCTTACCG GCGGGCTACC
GATTACATAA TCAATGGTTC CCAGACCCAC GCCTTGGGCT TTGACGACGC CATGAAGCGC
TTTGATACCG CCCGCGCCCA GCTAGAAGGG CAACAACTCG ACAGCCAGAC GGCCGGCTAC
CTGACGGCCA TGGCGCAGGC TGCCAAGAGT TTTACTGACA CCTTCAAAAA CAGCATCTTG
AACACCAGCC AGAGTGACCG CATGGCGGCC CTGCCCATCC TGAGCTTCCA GATGGGGGCC
TCCCTGGATA ACATCAATAA CATCGGTACC CATATGAACA AGGGTATAAG CGAAGAAACC
GCCGCGGCGG AAGAACAACT GGCTGCCGCC GTCCGGAACG CCCGGGCCAC CCTGCTCTCT
GGATTAATCC TGTCCCTGCT CCTGGGCCTG GCGATCGCCT GGTTTATCAA CCGCATGGTG
GGCCGTTCCC TGGGCCAGGT GGCGGCCTAT GCCGCCCGGG TGGCCGAGGG GGACCTCACG
GCCGAACCCC TTCATATCAC CAGTAAAGAC GAGGTCGGCA AGCTGGCTGC GGCCTTCAAT
ACCATGGGCG AAAACCTGCG CCAGCTCATC AGCCGCGTGC GCGACATGAC CGGCCAGGTG
GCTTCCGCCA GCCAGGACCT GGTCCGGATG TCCCAGGAAG TAGGAGATGC CGTCCGCCAG
GTAGCCGCCA CCGTTCAGGA AATGGCCAAA GGGGCCGAAG ACCAGGCCCA GCAGGTCAGC
GAGACGGCGA CAGCCACCGA CGGCCAGGCG GCCAAGGTAG AGGAGGTCCA CCGCGACACC
GAGGATATGG CTGCTGCTTC CGATCAGGTG GCGGCCAGGG CCGCCGAAGG GGCCAGAGCC
GTGGCCGAGG CCACGGATCA GATGGCGGCC ATATCCCAGC GTATGGAGCG CATGGCCCGC
GCCGTCGAGG AACTGGGCAA CCGTTCCCAG CAAATCGGCC AGATTGTCGG CGTCATCTCC
GGCATCGCCG AGCAGACCAA CCTCCTGGCC CTCAACGCCG CCATTGAGGC GGCCCGGGCC
GGCGAGCAAG GACGGGGTTT CGCCGTGGTC GCCGAAGAGG TACGTAAACT GGCCGAGCAA
TCAGCCGGGG CTACCAAGCA GATCGTGGAG CTGGTCCAGG AGATCCAGCG GGAGACTGAA
CAAGTGGTCG CCAGTATGGC CGAGGGTTCC CGGGACGTCC AGCAGGGTAC CGAGGTGGTG
GCCCGCACCG GCAAAGCCTT CAGCGCCATC GACCAGGCCA TCCATACCCT GGTAGGCAAG
ATTAAAAACG TGGCCGAAAA GGCGGAAGAC ATGTACGCCG GTTCCCGCCA GGTCAAGGAA
CGAGTAGAGA GCATTGCCGC CGGCATCGAA GAAGCCGCCG CCAGCACCCA GCAGGTTTCG
GCCTCCACGG AAGAGCAATC AGCGGCCGTG GATCAGATCA GCCAGGCGGC CCGGCAACTG
GCAGCCGCCG CCAGTAACCT GGAGGAGGCT GTGGCCAGGT TTAAACTTTA G
 
Protein sequence
MRFKLFKKKG LQVEGPDPRA QKKARAGGKT GGFFSRLSLG VRLATGFCLV IAIFVAVVIY 
VNFNLLQVAA LTNRVTIMYN QGMLYNEMTS SIWDAYRRAT DYIINGSQTH ALGFDDAMKR
FDTARAQLEG QQLDSQTAGY LTAMAQAAKS FTDTFKNSIL NTSQSDRMAA LPILSFQMGA
SLDNINNIGT HMNKGISEET AAAEEQLAAA VRNARATLLS GLILSLLLGL AIAWFINRMV
GRSLGQVAAY AARVAEGDLT AEPLHITSKD EVGKLAAAFN TMGENLRQLI SRVRDMTGQV
ASASQDLVRM SQEVGDAVRQ VAATVQEMAK GAEDQAQQVS ETATATDGQA AKVEEVHRDT
EDMAAASDQV AARAAEGARA VAEATDQMAA ISQRMERMAR AVEELGNRSQ QIGQIVGVIS
GIAEQTNLLA LNAAIEAARA GEQGRGFAVV AEEVRKLAEQ SAGATKQIVE LVQEIQRETE
QVVASMAEGS RDVQQGTEVV ARTGKAFSAI DQAIHTLVGK IKNVAEKAED MYAGSRQVKE
RVESIAAGIE EAAASTQQVS ASTEEQSAAV DQISQAARQL AAAASNLEEA VARFKL