Gene GM21_0949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0949 
Symbol 
ID8136270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1124965 
End bp1126629 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content59% 
IMG OID644868564 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_003020773 
Protein GI253699584 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.0219131 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAAGCG ATCTGAAAAT TGGTACCAGA TTACTGCTGT GCTTCGGCCT AGTCCTGCTC 
CTGCTTATCG CAGTGGCAGG CACCGGTTAC TGGGGCATCA AGCAAGTTGA AGCCACCACA
GACACAATGC TGTCCACTGA AGGACGCATC GCAGAACACT CAGCCAGGGC CAGGGCAAGC
ATCCTGGGCA TGAGACGCTA TGAGAAGGAC ATCTTCCTCA ACCTGGGCGA TGCCAACAAA
GTGGAGGAAT ACCTCCAGAA ATGGAACGCC GAGGCCGCCA AGACTACCGA GCGCATCGCC
GACCTGGAAA AATACGCCGT CGACAAAGAG GACAAGGATA AGACCAAGGA AATCAAGGAC
AATTTCCAGG TTTACAAGGC AGGTTTTGCG AAGGTCGCAG CCGCTATCCA GAGCGGAAAG
CTAAAATCCG CACAGGAAGC AAACAGCGCC GTCACAGAGT ACAAGAACGA GTCCCACCTG
ATGGAAGCGA CGGCGGTCAC CCTGGCACAG GATGGCGTCA AGTCGATGCA GCAAGCAGGA
GAGGAGATCG ACAAAGAGAC AGGGGAAATT GTTTCTTACC TGCTCGCGAT ATCGTTGATC
GCCGCAATTA TAGCTGTCGC CCTGAGTCTC CTTGTGACGC GCAGCATCAG GCGTCCGCTG
GAGGTCGGGG TCCAAACTGC GAACCGGCTT GCTGCCGGAG ATCTCACCAT GGACATCGAC
GCTACCAGTA AAGACGAGAC CGGCCAACTG CTGGCGGCGA TGGGCAATAT GGTCAACAAG
CTGAGGGAGA TCGTGGGGGA GGTGCAGGCC GCGACCAAGA ACGTCGCCGG GGGAAGCCAG
GAACTCTCCT CCAGTTCCGG GGAGATGTCG CAGGGGGCAA GCGAACAGGC GGCCGCCGCT
GAGGAAGCTT CCGCCAGCAT GGAGCAGATG ACCTCCAACA TCCGCCAGAA CGCGGACAAC
GCGCTGCAGA CCGAGAAGAT TGCGGTGAAA TCCGCGGAAA ACGCCAAGGA AGGGGGCCAG
GCGGTTCAGG AGACCGTGCA GGCCATGAAG GACATCGCCG GCAAGATCAA CATCATAGAG
GAGATCGCAC GCCAGACGAA CCTATTGGCA CTGAACGCGG CCATCGAGGC GGCCCGGGCC
GGCGAGCACG GCAAAGGATT CGCCGTTGTC GCCAGCGAAG TGAGAAAACT GGCAGAGCGG
AGCCAGAAGG CGGCGGCGGA GATTTCCCAG CTCTCCAGCA ACAGCGTCGA TGTCGCGGTG
AAGGCCGGCG ATCTGCTGTC CAAGATGGTG CCCGATATAC AGAAGACGGC GGAACTGGTC
CAGGAAATCA GCGCCTCCAG CAAAGAACAG GACACCGGTG CGGAGCAGAT CAACAAGGCG
ATCCAGCAAC TCGACACGGT CATCCAGCAA AACGCCAGCG CGTCCGAGGA AATGTCGTCG
ACCGCGGAGG AACTCGCCTC GCAGGCGGAG CTGCTGGCGA GCTCCATCGC CTTCTTCAAG
ATCGACCAAG AGGGACATTC GAAGTCCATC GTCAGGCCCA CCAGGAAACC CGGCACCGTG
AAGCCGATGA GCTTTAGCAA GCATCCCGCC GCTCCTGGCG CTGCGGGCGC AAACGTCGTG
GGCACCGACC TCCAGTTGAA CGACGAAGAG TTCGAGCGCT TCTAA
 
Protein sequence
MLSDLKIGTR LLLCFGLVLL LLIAVAGTGY WGIKQVEATT DTMLSTEGRI AEHSARARAS 
ILGMRRYEKD IFLNLGDANK VEEYLQKWNA EAAKTTERIA DLEKYAVDKE DKDKTKEIKD
NFQVYKAGFA KVAAAIQSGK LKSAQEANSA VTEYKNESHL MEATAVTLAQ DGVKSMQQAG
EEIDKETGEI VSYLLAISLI AAIIAVALSL LVTRSIRRPL EVGVQTANRL AAGDLTMDID
ATSKDETGQL LAAMGNMVNK LREIVGEVQA ATKNVAGGSQ ELSSSSGEMS QGASEQAAAA
EEASASMEQM TSNIRQNADN ALQTEKIAVK SAENAKEGGQ AVQETVQAMK DIAGKINIIE
EIARQTNLLA LNAAIEAARA GEHGKGFAVV ASEVRKLAER SQKAAAEISQ LSSNSVDVAV
KAGDLLSKMV PDIQKTAELV QEISASSKEQ DTGAEQINKA IQQLDTVIQQ NASASEEMSS
TAEELASQAE LLASSIAFFK IDQEGHSKSI VRPTRKPGTV KPMSFSKHPA APGAAGANVV
GTDLQLNDEE FERF