Gene GM21_0854 details

Gene Information

Locus tag: GM21_0854
Symbol:
ID: 8136175
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1016790
End bp: 1018481
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 53%
IMG OID: 644868470
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_003020679
Protein GI: 253699490
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
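
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the original record: the function name `cds_lengths` is hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon excluded from the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Values taken from the record for GM21_0854.
gene_len, protein_len = cds_lengths(1016790, 1018481)
print(gene_len, protein_len)  # 1692 563
```

Under these assumptions the result agrees with the listed Gene Length (1692 bp) and Protein Length (563 aa).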


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 94
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTTCC GAAAATATCT TTCTTCCCTG CGCAGTACCT ATATCTTCAT GGTCTGCTTC 
GGCCTTCTTA TGGGAGTTGT TTTCCCTTTC TATTCCTGGC TCTTCTTTGG CGGCAAGGCA
TTCGCGCCGC TGTACGTCTT CGGCTGCATC GCCGCTGGTT TCATCGTCGG CAGTTTTTGC
TATCAGATCA TAAAGGAAGC ACTGAGGCTC TACGTCGAGC ATCAGTTGCA AACACTTTAT
AGAATCACCA ACGATGCGTC CGCAAAGGTT GGTCTCGGTC AGGGAGACGA GCTAAAGCAA
CTGATGGAGT GCAACGAGGC ACTGATGAAC AGGGTGCTGG TAATGGTGGA AAACGTCTCA
CGCCTAGCTG CCGACATTTC TGACAGACAG GGACGCCTTA CTTCCGACTT CAGCAGGACC
GTGGATAACA ACGTCCAGCA AGCCGCCAAA GAGAAAGAGA CAATCAGAGC CATCGATGAC
ATGAACGCCT TCTTCAAGGA CCTGCTTCGT GAAATCAAGG ACATCGCCTC CCGTACAGCT
GAGCGCGCTT CCATTTCAAC TCAAATGAGC GCAGCCACAG ACGCCATTGC CCTCAGTATC
CAGGAATATT CCGCATCGGT CATGGAAACT TCCGGTTCGA TAGAAGAAAT GGCGGCAAGC
ATCAAAGGGA CTTCCACGAA CATAGAGGCG CTCACTGCAT CGACAGAGCA GACCTATAAC
TCCATCAATG GCATTGGGGA TTCCATCGTC GATATTCGTG ACAATGCCCG TCGCACTTCC
GACTGCTCGG ACAAGGTGCG TGTCCAGGCA GTTGAAGGCA TGGATGCGAT GGCTGCTACC
ATTGCGGCGA TGGGTGAGAT TGAGGACCAT AGCGATCGGT CCGTGAATGC GATCAAACGG
CTCTCTTCTC ATTCGTTGCG GGTGGGCGAG TTCCTCGATG TGATAAAGGA AGTCGTCTCA
CAGACGAACC TTCTGTCCCT GAACGCATCC ATTATCGCGG CTCAGGCTGG CGACCGTGGC
AAGGCGTTTG CCGTAGTAGC CGAGGAGGTG CGCGGTCTTG CAAAGAGAAC GTCCGCATCG
ACCGATGAAA TCGAGGAATT GGTCATCAAC ATTCAGAAGG AGACCGTTGC GGCGGAAACC
GCGGCGCGAT TGGGCAAGGA AAAAGTAGCC GAAGGTGTCA AGGTATCGGA AAAGGCCGAC
GCAGCCTTGC ACAGGATAGA GGAAAGCGCG GCTGAGGCTT CGCGAATGGT TCAGCAGATT
GCCGCAGCTA CCGACGAGCA GGCGTTCGGT AGCAGGTTGA TTACCGAGGA GGCCGAGAAG
AACCTGTCAC GCGTGAAGCA GTTCAGTCGC GCCATCCAAG AGGAGGAGGC CGGAGCCCAG
CTCATCGTTC GAAGTCTCGA CCGGATGCGT GACTTGTCCG AGAAGATCAC CATCTCTACC
GACGAGCAGG CACGCGGCAA CCGGCTCTAT CTGATGAGCG TGCAGGACGA TAATAACAAG
GTCAAAAGGT TGAAAGAGAC CTGCATGGAA CAAATCGCCA TAGGTGAGAT GCTGCGCAAT
GATGTAGCAG AGGTCGACCA ACTGATAGAG GGCACCGCTG AAGAAGCCAA GAAGATGCTT
GGAGAAATAG AGACGATCAG CAACCTGATA AACGACATGC ATCGTGAAAT GGAGAGCTTC
AGGAAGTTGT AA
 
Protein sequence
MFFRKYLSSL RSTYIFMVCF GLLMGVVFPF YSWLFFGGKA FAPLYVFGCI AAGFIVGSFC 
YQIIKEALRL YVEHQLQTLY RITNDASAKV GLGQGDELKQ LMECNEALMN RVLVMVENVS
RLAADISDRQ GRLTSDFSRT VDNNVQQAAK EKETIRAIDD MNAFFKDLLR EIKDIASRTA
ERASISTQMS AATDAIALSI QEYSASVMET SGSIEEMAAS IKGTSTNIEA LTASTEQTYN
SINGIGDSIV DIRDNARRTS DCSDKVRVQA VEGMDAMAAT IAAMGEIEDH SDRSVNAIKR
LSSHSLRVGE FLDVIKEVVS QTNLLSLNAS IIAAQAGDRG KAFAVVAEEV RGLAKRTSAS
TDEIEELVIN IQKETVAAET AARLGKEKVA EGVKVSEKAD AALHRIEESA AEASRMVQQI
AAATDEQAFG SRLITEEAEK NLSRVKQFSR AIQEEEAGAQ LIVRSLDRMR DLSEKITIST
DEQARGNRLY LMSVQDDNNK VKRLKETCME QIAIGEMLRN DVAEVDQLIE GTAEEAKKML
GEIETISNLI NDMHREMESF RKL
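
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. The following is a minimal sketch, not the tool used to generate this record: the names `CODON_TABLE`, `clean`, `translate`, and `gc_percent` are hypothetical helpers. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for codons inside a reading frame; the alternative start codons that table 11 permits are not handled, which should not matter here because this gene begins with ATG.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA sequence in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGTTTTTCC GAAAATATCT TTCTTCCCTG"))  # MFFRKYLSSL
# Last 12 bases of the gene sequence, ending in the TAA stop codon.
print(translate("AGGAAGTTGT AA"))  # RKL
```

The two printed fragments match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content listed in the Gene Information section.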