Gene GM21_2191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2191 
Symbol 
ID8137527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2557207 
End bp2559063 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content64% 
IMG OID644869806 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_003022001 
Protein GI253700812 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones89 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAACA ATCTGAAAAT CCGCTCCAGG CTCATCGCCG GCTTTGGCGC GATCGTCTGC 
TTCCTGGTGC TGATCGGGGC GCTTTCCCTG AAGAACCTGA AGGCTCAGGG AGACCTGATG
ACCGATTTCC ACGAGCACCC CTTCACCGTG ACCAACGCCA TACGCTCGGT CGACGCCAAC
GTGGTGAGGA TGCACAGATC CATGAAGGAC GTGGTGCTCT ACGCCGGGGA CGCAGAGGAC
CTGGAGGCGT CGCTTGCCGA TATCGACGCC AGCGAAAGGA TCGTCTACCG CGACCTCGCC
CTGGTCAGGG AGCGCTTCCT CGGGGACAAG GGGAAGATCG ACCAGATCAA AGAGTCGATG
GACAAGTGGA AACGGGTGCG CGCGCGGACC ATCGAACTGG TGCGCCAGGG AAAGATCGCC
GAGGCGGTCG CCTTCCACAA GAACTACGCG AGGCGCATGG TCGGAGAGAT CGACGCCCAG
GTGAACGAGG TGCTGAAATT CTCCACCGGC ACGGCGGAAA GCTTCGTCCA GGAATCGCAG
AAAAACCAGC GGACAACCTT CGCTATCACC GCGATTCTGG TGCTTTTGGC CGGAGGTCTT
GCCGTCTGGA CCGCGTTCGC CATCACCTCT TCCATCACCC GCCCGCTGGA TCAGGCGGTG
CAGGCCGCGG AAAGGCTTTC CCAGGGGGAC GTGTCGGTGG AGATCCACAG CGACGCCGAC
GACGAACTGG GGCAGTTGCT GCGCGCCATG GAGAAGATGG TGCTGTCGCT GCGCCGGATG
GGGGAGACTG CGAACCAGAT CGCTCTGGGA GACCTGGACG CCGAGGTGAT CCCGGCCTCG
GAGCGGGACG TCTTCGGCCT TGCCATGTCG AACATGGTGG CGTCGCTGAA AAGGCTCGCC
GCCAGCGCCG ACCGCATCGC CGCCGGGGAC CTCACCATCG ACGTCGTCCC CGCTTCCCCC
AAGGACCGCT TAGGCATCTC CTTCAGCCAG ATGACCCGGA ACCTCCGGGA GTTGAGCCTG
GAGATCCGCG CGGTGGTGAA CGTCCTGGCC GGCTCGGCCG CCGAGATCAT GACCACGGTG
AGCCAGCTCG CCTCCAGTTC AGCCCAGACC GCCACCTCCA TCTCCGAGAC CAATGCCACG
GTGCAGGAGA TCCGCCAGAC CACCGACCTC ACCTCGCAAA AGTCGAGGCA GGTCTACGAG
AGCGCCAACC GCTCGGTGCA GGTCGCCAAG GAGGGGCGCG AGTCGGTTTC CAGCGCCATC
GGCGGGATGC AGGGGATCGA CGAGCGCATG GGGTTCATCG CGGAGCGGAT CGTCAACCTG
AGCGAGCAGA GCCAGGCCAT CGGGGAGATC ATAGCCACCG TGGCCGATCT CGCCGAACAA
TCGAACCTCC TTGCGGTGAA CGCCGCCATC GAGGCTGCCA AGGCCGGCGA GCACGGCAAG
GGATTCGCCG TGGTGGCCCA GGAGGTGAAG AACCTGGCCA CCCAGTCGAA ACAGGCGACC
TCGCAGGTGC GCAACATCAT CGGACAGATC CAGAAGGCGA CCACCGCGGC GGTCCTCGCC
ACCGAACAGG GGAGCAAGGC CGTCGAGGCC GGCGTGAAGC AGTCAAGCGA GGCGGGAGAG
TCGATCCGCG TGCTCGCCTC AAGCATCGAG GAATCCTCCA ACGCGACGCT GCAGATCGTC
ACCTCCACCC AGGAACAGGC GATCGGCATG GACCAGATCG CCATCGCCAT CCACAGCATC
AACCAGGCGA GCGACCAAAA CGTAGAGGGT TCGCGCCAGA TCGAGGCGGC GGCCCGCAAC
CTCTACGAAC TGAACCAGAA GCTCCAGGAA CTGGTGAGCG GGTACAAGGT CGCATGA
 
Protein sequence
MLNNLKIRSR LIAGFGAIVC FLVLIGALSL KNLKAQGDLM TDFHEHPFTV TNAIRSVDAN 
VVRMHRSMKD VVLYAGDAED LEASLADIDA SERIVYRDLA LVRERFLGDK GKIDQIKESM
DKWKRVRART IELVRQGKIA EAVAFHKNYA RRMVGEIDAQ VNEVLKFSTG TAESFVQESQ
KNQRTTFAIT AILVLLAGGL AVWTAFAITS SITRPLDQAV QAAERLSQGD VSVEIHSDAD
DELGQLLRAM EKMVLSLRRM GETANQIALG DLDAEVIPAS ERDVFGLAMS NMVASLKRLA
ASADRIAAGD LTIDVVPASP KDRLGISFSQ MTRNLRELSL EIRAVVNVLA GSAAEIMTTV
SQLASSSAQT ATSISETNAT VQEIRQTTDL TSQKSRQVYE SANRSVQVAK EGRESVSSAI
GGMQGIDERM GFIAERIVNL SEQSQAIGEI IATVADLAEQ SNLLAVNAAI EAAKAGEHGK
GFAVVAQEVK NLATQSKQAT SQVRNIIGQI QKATTAAVLA TEQGSKAVEA GVKQSSEAGE
SIRVLASSIE ESSNATLQIV TSTQEQAIGM DQIAIAIHSI NQASDQNVEG SRQIEAAARN
LYELNQKLQE LVSGYKVA