Gene Moth_1487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1487 
Symbol 
ID3832368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1533988 
End bp1535967 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content53% 
IMG OID637829419 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_430339 
Protein GI83590330 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000879913 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTTA AAATCGCCGA TAGTATTAGC AGCCGGTTAC TAGCGCTGTT CCTTATTTTA 
TCCTTGATAC CAGCCATTAT TATCAGCCTG GTTAACTTTT ATTTCAGTAA GGCCCAATTT
ACAGCTAACA CCTATTTGAC CCTAAGGCAA ATTACCACCA GTATGGCCGA AAATGTGAAC
GATTGGATTA ACAGCCGTTT AACGCAAATG GATAAGGATG CAACTGCAAG TGTCCTGCAA
TCAAATGATA AAGAGCAAAT TAGAGCTTTT GTAAAGATGG TAGCTGAGCA GACGGTTGAT
GCCAATTTAG TTTTTTTTGC CGGGACTAAC GGTATGGTAA TTCCATCCAG CGGGCCAGAA
GTCAATATTA GCGATAGGGA TTATTACCAG CAGGCCATCA AAGGCAAGGC CGCTATTTCC
AATTTGGTTA TCAATAGAAG CACCGGCAAA GAGGGCATAA CCATAGCTGT ACCGGTTAAA
GGGCCCGGGG GGATTATCGG TATACTGGGT ACCCATTATG ACAGCCAGAC ACTATTACAC
CAGATAAACA ACAGCAAGTA CGGGCGGACG GGTTACGCCT ACATGCTCGA TAACACGGGC
GTGGTCATGG CCCACCCGGA TGCTAAAAAA GTTTTGAACG AAAACCTGAC GAAAACTGAA
TCCCAGAGCC TGAACAATGT CGCCCAGAAA ATGCTGCAAA ATAAAGAAGG AGAAGATGAG
TATATCCGCA ATGGTGTCCG GAACCTGGTC GCCTATGCCC CGGTTAAAGC AACCGGCTGG
GTAGTAGCCA TGACGGCCCC CACCAGTGAA GTATACGCCG GGGTCACTGC CATGCAACGT
TTTAACATTA TCCTTATTAC CCTTGCTGCC ATCCTCATAG CCCTGCTGGC CTTTTATATC
AGCCGAAAGA TAGCCAGGCC CATTATCACC CTGGCGGGGC AGGCCGATGT TTTAGCCACA
GGCAACCTGC AGGTAGACAT TAACACCAAC TTCTACGGTG AGCTGGGGAC TTTAGGCCGA
TCGTTAAAGA CTATGGTCAC CAACCTGCGG TCCATAGTCC AAAAAGTTCA GGATAGCGCC
AACCAGATAG CCTCTTCCGC CCAGGAGTTC AGCGCCTCTA CGGAGGAAGC TTCCCGGTCG
GTGGAGCAGG TGGCCAATGC CATTCAGGAT ATGGCCCGGG GCGCCAACGA CCAGGCTACC
CAGTCCCAGA ATATAGCGGA ATTGGTCAAT AACATCACCG GTGCCATTGG CTCAACCAGG
GACAGGGTAG AAGCCCTGGC CAGGTATTCG GAACAAACCG GGGAGCTGGT GGACGACGGC
CTGGCGTCTA TGGAAAACCA GAACGACAAG ATGGCGGAAA ACCTGCAGGC AGCGCAAGCT
GTCAGCGAAG CTATCAATAA GCTGGCCCGG GGGGCACGGG AGGTGGGTCA GATCCTGGAA
ACAATCACCA GCATTGCCGA CCAGACCAAC CTGCTCGCCT TGAATGCGGC CATCGAGGCA
GCCCGGGCCG GAGAACACGG GCGGGGTTTT GCCGTAGTCG CTGAAGAAGT GCGCAAACTG
GCCGAAGGTT CGGCCCAGGC AGCGAGTGAG ATCGGCCAGA TTGTCCAGAA GATCCAGGAC
GAGGCCCAGG GGGCGGTGGC AGAAATGGAT AAGGCTAAAG TCATTGTCGA CGCCCAGCAG
GATGCCGTTA ATCATGCCAA CGAGGTATTC CAGAACATCT CTCAGAAGGT AAAGGCCATG
GTCAAGGGCA TCGAAGAAAT AGCCGCCGCA ACGGAGCAGA TAAACAATGA GGCCCGGAAA
ATTACGGAAG CTATCCAGGG GGTGTCGGCA ATAGCCGAGG AGAATGCGGC GGCAGCTGAA
GAGATATCGG CCAGTACCGA AGAACAGAGC GCCACGGTGG AGGAAATCGC CGCCTCGGCC
AATGCCCTGG CCAGCCTGGG GCAGGAACTG CAGCAGCTCA TTGCCCGATT TAAGCTGTGA
 
Protein sequence
MKLKIADSIS SRLLALFLIL SLIPAIIISL VNFYFSKAQF TANTYLTLRQ ITTSMAENVN 
DWINSRLTQM DKDATASVLQ SNDKEQIRAF VKMVAEQTVD ANLVFFAGTN GMVIPSSGPE
VNISDRDYYQ QAIKGKAAIS NLVINRSTGK EGITIAVPVK GPGGIIGILG THYDSQTLLH
QINNSKYGRT GYAYMLDNTG VVMAHPDAKK VLNENLTKTE SQSLNNVAQK MLQNKEGEDE
YIRNGVRNLV AYAPVKATGW VVAMTAPTSE VYAGVTAMQR FNIILITLAA ILIALLAFYI
SRKIARPIIT LAGQADVLAT GNLQVDINTN FYGELGTLGR SLKTMVTNLR SIVQKVQDSA
NQIASSAQEF SASTEEASRS VEQVANAIQD MARGANDQAT QSQNIAELVN NITGAIGSTR
DRVEALARYS EQTGELVDDG LASMENQNDK MAENLQAAQA VSEAINKLAR GAREVGQILE
TITSIADQTN LLALNAAIEA ARAGEHGRGF AVVAEEVRKL AEGSAQAASE IGQIVQKIQD
EAQGAVAEMD KAKVIVDAQQ DAVNHANEVF QNISQKVKAM VKGIEEIAAA TEQINNEARK
ITEAIQGVSA IAEENAAAAE EISASTEEQS ATVEEIAASA NALASLGQEL QQLIARFKL