Gene GM21_3964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3964 
Symbol 
ID8139338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4546432 
End bp4548039 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content64% 
IMG OID644871580 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_003023738 
Protein GI253702549 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.00000000000408371 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCACCA TCAGGTCCAA GATCATACTC AACCTCGTCG TTTTGTTGCT CACCATTGTC 
GGGATCGTCG CCTTCGAGTA CAGCAACATA GCAAAGCTTG GCCGGCTTCA GGATGAGGGG
GCGAAACGCT CCAAAGACGC GGTCCTGGCC AAGGAGGCCT CCATGGGGGG GCTCGCGCTG
TACCGGATCA TCGCCGACGC GCAGATAAAC CGGGATCTCG ATGAGACGGC GAAGATTTGG
AACCCGACCA AAACAGAGGT CTTGGCGAAT ATCGCTTTTG TAGAGGCGTC GGCAGATACG
GCGGAGGAGA AGAAGATAAC CGCCGAGGTC AAGGCCGCTC TCAACGATGT GGTGAACCTG
TTCGAGGGGA AGATGCTTCC TCTTCTGAAG ACGACCGACG GCGTCACGCC GGAGCTGAGC
GCGCTGGATG CCCAAATCGA CGAAAAGGTC GTGTTGGTCG AGGCGGGGGT CGACAAGGTG
GTCGCTTCAA TGCAAAAGGA GATGGACGTG GCCGACGGGG AGTTCGACGC TGACCGCAGG
ATGGCGATAG TGGTAGCGGT GGTGGTCGGA GCCGCCGGCC TGTTGCTGCT GCTGGTGGTG
GGGCTCTTGT TGCTTAGGAG CATCATGCGT TCCATCGATG CCATGAGAGT CGTCATGACC
GCGGTGCACG AGGGGGATCT GACCAGGCGG GTGGAGATCG ACAGCAAGGA CGAACTCGGC
ATCATGAGCC GGGAGTTCAA CGGGATCATC GGGAGACTGC AAGAGATGAT CAACCATATT
TCCGATACCT CCAACCAGGT CGTGGCCGCT TCCGCCCAAC TGAGCGCGAC TGCGGAAAGG
ATCGCCAACG GCGCGGAGGA GGTCGCGGCG CAATCCTCGA CCGTCGCCAC CGCGGGTGAG
GAGATGTCGG CGACCTCCGG GGACATAGCT CTCACCTGCC AGAAGGCTTC CGACGGGGCG
AAGCTTGCCG CCGAATCGGC CGGCGGCGGC GCGAGCCTGG TGGAGAGAAC CGTTTCCGTC
ATGGGCGAGA TCGCCGCCAA GGTCCAGGAA TCGGCCCGGA CGGTGGAAAC CCTGGGCGAA
CGCAGCGATC AGATCGGCGC CATCATCGTC ACCATCGAGG ACATCGCGGA CCAGACCAAC
CTCCTGGCGC TGAACGCAGC CATCGAGGCG GCCCGTGCGG GCGAACAGGG GCGGGGCTTC
GCGGTGGTGG CGGACGAGGT GCGCGCCCTC GCCGAGCGCA CGACCCGGGC CACCAGGGAG
ATCGACGAGA TGATCAAGGC GATCCAGAGG GAGACCAAGG GGGCCGTGGC AGCCATGGAG
CAGGGGGTCG GGCAGGTGGA GGCGGGAACC AGGGAGGCGG CCAAGTCAGG CGACGCCCTG
CGCGAAATCC TGGAGCAGGT GCACGACGTC GCCATGCAGG TGAACCAGAT CGCTACCGCC
GCCGAAGAGC AGACCGCCAC CACCAGCGAA ATTTCCAGCA GCATGCAGCA GATATCCCAG
GTCGTGCGGG ACACCGCCTC GGGAGCCCAC CAATCCGCGG CGGCGGCGCA CGAGCTGAAC
GGGACCGCCG AGCAGTTGCA AAGGCTGGTG CGGCAGTTCA GGCTCTGA
 
Protein sequence
MSTIRSKIIL NLVVLLLTIV GIVAFEYSNI AKLGRLQDEG AKRSKDAVLA KEASMGGLAL 
YRIIADAQIN RDLDETAKIW NPTKTEVLAN IAFVEASADT AEEKKITAEV KAALNDVVNL
FEGKMLPLLK TTDGVTPELS ALDAQIDEKV VLVEAGVDKV VASMQKEMDV ADGEFDADRR
MAIVVAVVVG AAGLLLLLVV GLLLLRSIMR SIDAMRVVMT AVHEGDLTRR VEIDSKDELG
IMSREFNGII GRLQEMINHI SDTSNQVVAA SAQLSATAER IANGAEEVAA QSSTVATAGE
EMSATSGDIA LTCQKASDGA KLAAESAGGG ASLVERTVSV MGEIAAKVQE SARTVETLGE
RSDQIGAIIV TIEDIADQTN LLALNAAIEA ARAGEQGRGF AVVADEVRAL AERTTRATRE
IDEMIKAIQR ETKGAVAAME QGVGQVEAGT REAAKSGDAL REILEQVHDV AMQVNQIATA
AEEQTATTSE ISSSMQQISQ VVRDTASGAH QSAAAAHELN GTAEQLQRLV RQFRL