Gene GM21_3467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3467 
Symbol 
ID8138839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4012495 
End bp4014117 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content61% 
IMG OID644871087 
ProductGAF sensor signal transduction histidine kinase 
Protein accessionYP_003023247 
Protein GI253702058 
COG category[T] Signal transduction mechanisms 
COG ID[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones98 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGATTGGG AATGCTGCTG GAGCAAGGAA GAGATCGAAC CGGAGAACTG CCCCTATGTC 
GGTGAGGGGG AGGAGGATCT CTACAGCTCG CACCGCCGCA GGGTGGTGGA GAAGTGCGTC
GAGTGCCCCC GCTTCAAAAA CGACCTGGCC CGGATGAAAG GCTCCGGTTA CCCGCTCTCC
GACGTCCTTC CGTTCATATT GACTGAGTTT CAGGAACAAA AGGCGCAGAT GGGAGCGATG
CTGAGCTTCT TGAACAGCAA GACCCGCGAG ATCAAGTTTC TGCGCGAGGT CGGCATCGTG
CTGCAGACCT CCATCGACCT CGACGAGGTG CTTTCCATCG CCATGACGGC AATCACGGCG
GGAAAAGGGT TCGGCATGAA CCGCGCCTTC TTGCTGATGA CCGACAAGGA GCGCCGCCAT
ATCCGGGGGT ACCTCGGCGT CGGCCCCAGG GATTACGAGG AGGCGTGGCG GACCTGGGAA
GACATCGGCC GCAGCAACTT CACCCTGAGG GAACTGGCCC GCGACTTCCA AAAGACCAAG
CTCACCTCGG AGAAGGTCAA GTTCCACGAT ATCCTGAACC AGCTCACCGT GCCGCTTGCC
GACCAGGGGC ACATCTTCAA CCGCGCCCTG CAGGGGAAAA AGCCCATCCT GGTGGAAAAC
GCGCTGAACA ACCCCGACCT CGACCGGGGG CTCGCCCGCA TACTCGGGGT CGACTGTTTC
CTCGTCATGC CTCTCATCTC GCGCAACCGC CGCATCGGCG TCATCATTGC CGACAACTGC
ATCACGCAAA AGCCGATCAC CCTGCAGGAC ATGCAGTCTC TGGAGACCTT CGCCTTTCCG
GTCGCATTCG CGCTGGAGCG CGCCTCCCTC TACGAGCGGC TCCAGGAGGA GGTCGCCAAG
CAGAAGTCCG CGAACCTCAA GCTGCGGGAG CAGCAGGAAC TGATAGTGAA GATGGAGAAG
ATGGCGCTGG TGGGCAAGAT CACCTCCAGC ATCGCGCACT CCATCCGCAA TCCGCTCATG
GTGATCGGAG GCTTCGCCCG CACCCTCTTG AAAGGAAGCG CCGAGGACGA CGAGAAGCGG
AGTTACCTCG AATCCATCGT GCGAGAGACC CGGCAGTTGG AAGACGTTCT CTCCGAGGTG
CTGGACTACT CCGAATCGCT CTTCCCCGTC ACTGATTTCT GGGACCTGAA CGAACTGGTG
ACCAAGGCGC TGGCCGACCT GGAAGGGGTC ATGGAGCAGG CCGGGGTGGT TTGCCGCCAG
GAGCTAAGCC CCGAGCTTCC CATGGTCCGC ATCGACTACA AGCAGATCAG CTACTGCCTG
AAGACCATCA CCGCGACGGC GCTGGCCTGC ATGGAGCAGG GGGGGGAGCT GCGCATCGAG
AGCCTCAACG ACGGCGACGG CGTTCTGCTC CGGATCAGCG ACAGCGGCAA GTCCCTCACC
GAGACCGCCA AGGAGGCGCT CACCGCTCCT TTCTTCCAGA CCCAGGAGAT GGGGGAGGGG
GTGGGACTCT CCCTGTGCAA GTCGATCCTG GAGCGCCAGG GGAATTCCTT GTCCATATTC
AGCCGCCCGG GTGGCGGCAA TACCTATAGC ATCAGGCTCT TGACGAGAAA GGAGAATATC
TGA
 
Protein sequence
MDWECCWSKE EIEPENCPYV GEGEEDLYSS HRRRVVEKCV ECPRFKNDLA RMKGSGYPLS 
DVLPFILTEF QEQKAQMGAM LSFLNSKTRE IKFLREVGIV LQTSIDLDEV LSIAMTAITA
GKGFGMNRAF LLMTDKERRH IRGYLGVGPR DYEEAWRTWE DIGRSNFTLR ELARDFQKTK
LTSEKVKFHD ILNQLTVPLA DQGHIFNRAL QGKKPILVEN ALNNPDLDRG LARILGVDCF
LVMPLISRNR RIGVIIADNC ITQKPITLQD MQSLETFAFP VAFALERASL YERLQEEVAK
QKSANLKLRE QQELIVKMEK MALVGKITSS IAHSIRNPLM VIGGFARTLL KGSAEDDEKR
SYLESIVRET RQLEDVLSEV LDYSESLFPV TDFWDLNELV TKALADLEGV MEQAGVVCRQ
ELSPELPMVR IDYKQISYCL KTITATALAC MEQGGELRIE SLNDGDGVLL RISDSGKSLT
ETAKEALTAP FFQTQEMGEG VGLSLCKSIL ERQGNSLSIF SRPGGGNTYS IRLLTRKENI