Gene GM21_4130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4130 
Symbol 
ID8139504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4719767 
End bp4721485 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content63% 
IMG OID644871745 
Producthistidine kinase 
Protein accessionYP_003023903 
Protein GI253702714 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value0.964997 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATCA AAACCCTTTT GCTGCTGGCC GCTCTCACCC TCATCGTCTC GTCGGCGCAC 
GCCGAACCGG CACCCGGCCG CGACCTCAGG GTCATCGTGG TGGGGGGCAA CAGCAACTAT
CCTCCGTACC AGTTCCTGGA CGAAAACGGC GAACCTCGCG GTTTCATAGT CGACCTGACC
CGGGCCATCG CCAGGGTGAT GGGGATGCGG ATAGAGATCC GGCTTGACGA TTTCGGCAAG
ATTCTCAAGG AGCTCGACAG CGGCGAGGTC GACATGCTGG AAGGGCTCTC GTATTCCGAT
GCGAGGGCCC GCGAATATGA TTTCTCTACC CCGCATTCCA TCATAGTTCA GGCCATCTTC
GCCCGAAAAG GGACGCCGGC GGTCAAAAGC CTGGAAGAGC TCAAGGGGAA AAAGGTGCTG
GTGCATAGCG GCGGCGGGAT GCACAGCTAC TTGCAGGAAA AGCGTTATGG CGCGGACCTG
GTGCTGACCG GCAGCCCGCG CGAGACACTG CGGCAACTGG CGGCGGGACG CTGCGACTAC
GCCGTCGTGG CCCTGCTCCC CGCCATGTAC ATCATCCGCG AAGAGAAGTT TTCGAACCTA
GTGCCGGTGG CGACCAACGT GGCGCCGCAG CGGTACTACT GCTACGCGGT GAAAAAGGGT
AATGCGGAGC TGGTGGCCCA GATGAACGAA GGGCTTTCCA TATTAAAGAA AACAGGGGAG
TTCAACGAGA TCTACGACAG GTGGATCGGG GTGCTGGAGC CGCAGCGCAC CTCCTGGCTC
GTCGTGGCCA AGTACGCGGC GCTGGTCGTG ATCCCCCTTT CCCTGGTCCT GTTGGGTACG
GTGCTCTGGT CCTACTCGCT GCGCAGGCAG GTGGCGCAGC GCACCGAGTC GCTCTCCAGC
GCCTTGGCCG AGTTGCAGAG AAACCAGCAG CAGCTGGTGC AGGCGGACAA GATGGCCGCG
CTGGGTATCC TGGTCTCCGG GGTGGCCCAC GAGATCAACA ACCCTACCGG AATCATCCTG
ATGAACATGC CCACCCTGAA GAAGATTTTC CGCGACGCGG AGCGGATCCT CGACCGGTAC
CAGGAGCAAG AGGGGGAGCT TACCCTGGGG GGGATCCGAT ACCAGAGGGT GCGGCAGGAG
GTTCCGCTGA TGCTGGACGA GATGCAGGAC GGCGCCGAGC GGATCAAGAA GACCGTCGAC
GATCTGAAGA ACTTCGCCCG CAAGGACGAC GAGGCACGCA AGGAGCTGCT GGATTTCAAC
CAGGTGGTGC AGACGGCGGT ACGCCTGGTG GACGTCGCCA CCCGCAAGTT CACCAACGAT
TTCAGCGTCA GTTATGCAGA AGCGTTGCCT GCCGTGTTCG GCAACGAGCA GCGGCTGGAG
CAGGTGGTGG TGAACCTGGT CATGAACGCC GGGCAGGCCC TCCCCGACCC CAACCGCGCC
ATCGCCCTTG AGACAAGGTA CGACGCGGGG AGCGGCAGGG TGCTGCTCAC GGTCAGCGAC
CAGGGGAGCG GTATCTCGCC GGAGCACCTG AAGCACCTCA CGGACCCTTT CTTCACCACC
AAGCGCGAGA GCGGGGGAAC CGGGCTGGGG CTCTCCATCT CCGCCAACAT AATCAAGGAC
CACGGCGGCG AAATCGCCTT CGATTCCAGG CTCGGGGAGG GGACCACGGT CACCCTTTCC
CTGCCGGGGG CCGTGGCAGG GAGCAAAAAT GGACAGTGA
 
Protein sequence
MSIKTLLLLA ALTLIVSSAH AEPAPGRDLR VIVVGGNSNY PPYQFLDENG EPRGFIVDLT 
RAIARVMGMR IEIRLDDFGK ILKELDSGEV DMLEGLSYSD ARAREYDFST PHSIIVQAIF
ARKGTPAVKS LEELKGKKVL VHSGGGMHSY LQEKRYGADL VLTGSPRETL RQLAAGRCDY
AVVALLPAMY IIREEKFSNL VPVATNVAPQ RYYCYAVKKG NAELVAQMNE GLSILKKTGE
FNEIYDRWIG VLEPQRTSWL VVAKYAALVV IPLSLVLLGT VLWSYSLRRQ VAQRTESLSS
ALAELQRNQQ QLVQADKMAA LGILVSGVAH EINNPTGIIL MNMPTLKKIF RDAERILDRY
QEQEGELTLG GIRYQRVRQE VPLMLDEMQD GAERIKKTVD DLKNFARKDD EARKELLDFN
QVVQTAVRLV DVATRKFTND FSVSYAEALP AVFGNEQRLE QVVVNLVMNA GQALPDPNRA
IALETRYDAG SGRVLLTVSD QGSGISPEHL KHLTDPFFTT KRESGGTGLG LSISANIIKD
HGGEIAFDSR LGEGTTVTLS LPGAVAGSKN GQ