Gene GM21_3345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3345 
Symbol 
ID8138712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3870951 
End bp3872579 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content62% 
IMG OID644870963 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_003023128 
Protein GI253701939 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.0226154 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTATCT TGCAGAACAT GAGACTGGCT CAGAAATTCG GTGTCGTCGG AGGTGTCAGC 
GCGATCCTGC TCTGTCTTTC CTTGGCCGCG TCGGTGATTG GTATCAAGGA TCTATCGGCG
GGATTCTCAA ACTTCGTGGA AAGGGACCAG GCTGTCTCCC TGGCGTTGAA GGACATGTAC
GCCCAAGGAT TGCAGTCGGA GCAGGCTACC AGGAACATCC TGCTGAACCC TGCTGACGAA
AAGGCGGCCA AGAACTACGC CCAGGCCATG GAGGATTTTG ACAAGGCATA CGGCGTGGTG
GTTGCCAAGA GCGGCGGCTT GCCGGAGGTG AAGGGGCAGG TCGAGCAGGT GATGGCTGTG
TGGAAGGAGG CCGCCGCGTT GCGCCTGCAG GTACAGAGCC TGGCCAAGGA GGGGAAGAAG
GACGAGGCGC TTACCCTGCT GGTGAAGGAA GAGACACCGA AGTGGCGCGA GGTGAAGGAG
AAGTTGTTCA AACTGAGCGG CGAGAGGCTG AAGCAGATGG AGCAGACCAA AACCACGGTG
GTTGACCTCT CCAGCAAGGC GCTCACCATT TCGCTGTCGC TCGGCGTGTT CGCCATCCTG
AGCACCATGC TCCTGCTGGG TCTGGTTGCT GCAGGGGTGA CCAGGAGGGT CAGGACGATG
AGCGACCACA TGGACGACAT CGCAAGAGGT GAAGGAGACC TCACCGTTCG ACTGGATGTG
GCATCGCGGG ATGAACTGGG GAACCTCGGG AATTCGTTCA ACCTCTTCCT TGCCAAGTTG
CACGACCTTA TCGCCACTGT CGCGGAGACC ACCAAGCAGG TCTCGTCGGC CGCAGCCGAG
CTTGACGCTA CTGCCGGACG GATGGCGGTC GGCACCGAGG AGGTCGCATC CCAAACCGAG
ACCGTGGCGG CGGCAGGGGA GGAGATGACG GCGACCTCCA CCAGCATCGC TCAAAACTGC
ATGGCTGCGG CCGAAGGGGC GAAGCGCGCC GGCGAGGCCG CGGTAGCGGG CGCGGCGGTG
GTGCAGGAGA CGGTGCACGG GATGGAAAGG ATCGCTGGAA GGGTGAGGGA ATCGGCCCGG
ACTGTTGAGA GTCTCGGATC CAGATCCGAC CAGATCGGAG AGATCATAGG CACTATCGAG
GACATAGCGG ACCAGACCAA CCTTCTCGCC CTAAACGCGG CGATCGAGGC CGCCCGCGCG
GGAGAGCAGG GACGCGGCTT CGCGGTGGTG GCGGATGAGG TGCGCGCCCT GGCTGAAAGG
ACCACCAAGG CCACAGGCGA GATCGGCAAC ATGATCAAGT CGATCCAAAG CGAAACCCGG
AGCGCGGTCG GCGCGATGGA GGGGGGCGTA AAGGAAGTCG AGAAAGGAAC CTCGGAAGCC
GCCAGATCGG GAGCGGCGCT TCAGGACATC ATCGCGCAGA TAGACAGCGT GACTCAGCAG
GTGAATCAGA TAGCCGTTGC TGCCGAGGAA CAGACCGCAA CCACCAGTGA GATCAGCAGC
AATATCCAGC AAATAACCGG GGTCGTGCAT GAAACCGCCG CCGGGGCCCA ACAGTCGGCG
ACGGCGGCGG GGCGGCTTTC GGGTCTGGCG GAGAGGCTGC GGCATGTGGT TGGGCAGTTC
AAGTTGTAG
 
Protein sequence
MSILQNMRLA QKFGVVGGVS AILLCLSLAA SVIGIKDLSA GFSNFVERDQ AVSLALKDMY 
AQGLQSEQAT RNILLNPADE KAAKNYAQAM EDFDKAYGVV VAKSGGLPEV KGQVEQVMAV
WKEAAALRLQ VQSLAKEGKK DEALTLLVKE ETPKWREVKE KLFKLSGERL KQMEQTKTTV
VDLSSKALTI SLSLGVFAIL STMLLLGLVA AGVTRRVRTM SDHMDDIARG EGDLTVRLDV
ASRDELGNLG NSFNLFLAKL HDLIATVAET TKQVSSAAAE LDATAGRMAV GTEEVASQTE
TVAAAGEEMT ATSTSIAQNC MAAAEGAKRA GEAAVAGAAV VQETVHGMER IAGRVRESAR
TVESLGSRSD QIGEIIGTIE DIADQTNLLA LNAAIEAARA GEQGRGFAVV ADEVRALAER
TTKATGEIGN MIKSIQSETR SAVGAMEGGV KEVEKGTSEA ARSGAALQDI IAQIDSVTQQ
VNQIAVAAEE QTATTSEISS NIQQITGVVH ETAAGAQQSA TAAGRLSGLA ERLRHVVGQF
KL