Gene GM21_1832 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1832 
Symbol 
ID8137163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2132349 
End bp2134001 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content62% 
IMG OID644869443 
Producthistidine kinase 
Protein accessionYP_003021643 
Protein GI253700454 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.00156636 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGAAAA GAGTGATAAG GAAATTCCGG CTCTCCCCCA TCACCAAGAG TTTCCGGGTC 
CGCCTCTACT TGATCTTTAC CGGGACCATT GCCCTTTTGA CGGCGGCCTT CGTCTCCTTT
TACGTCGTGA CCGAAAACAA TGCCTACCGC AGCACGCTTG AGCGCGAGGG GAGGCTGCTT
GCCACCATAC TCTCGCAAAA CGCCCGCCTG CCGCTCTTCG CCGAAAACCG TGAGGCTCTT
TCCGTGCTGG CCGAAGGGAC CTCGCGCCAA TCCTCCGTGG TTTCTGTCCT CATCAGCGAC
CAGCAGGGAA GGGTGGCGGC CGAGGCACGC AAGGTAGAGG TCCCTCAGGG GGAGACCGTC
AAGATGGAGG TGGAAATAAC CTCTCCCAGC TCGGTGCTGT CGCCTGAGTC GGTGCTCCTT
GGACACCAGG AGACCGATAA GCAGCAGGTG ATCGGCAGGG TGCACCTGCT GCTCGACATG
TCGGCGGTGC GGGAGCGGCT GGTGAACCTG GTCGCCGCCT CGCTTGCCAT CGGGACGCTT
TTCTGGGTGG CCGTTTCGCT TTTGAGCTAT CAGGTGATCA AGCGGGTCAC CTCGTCCTTC
AACATGCTGA TGGGGGGGGT CGAGGAGATA GGTTCGGGCA AGCTCTCGGC ACGGGTCGAC
CTGGAAGGGG ACGACGAATT GGCGCGCGCA GCCAATGCCA TCAACGCCAT GGCCGCGTCC
CTAGAATTGC GCGAGCTTGA GAACCTGGCC CTGCAGGAAG AGCTCCTGAA GGCTATGCAG
CTCGAGGTGC AGGAGGAGAA AAAGCTGGTC ATGGCGCGGC TGATCCAGAC CAACAAGATG
ACCTCGCTGG GGCTCCTTCT CTCCAGCATG GCCCATGAGA TCAACAACCC CAACGCCTCG
ATCCGCTTCT CCGGTTACAT GATCGGGAAG ATGTGGAGCG ACGCGGTGCC GCTTTTAGAC
CGCGTCCGTG AGGAGGAGGG GGATTTTTAC CTGGGAGGGA TCCCCTTCGA GAAGGCGCGC
CAGGCGCTGA CTGAGAATGC CGGCAAGATC GTGGAGAACT CGGAGAGGAT CGCGCGGGTG
GTGCAGGGGC TCAGGGACTA CGGGGTGGGG GGCGACGCCC ACCTGAAGCA GAAACTGGAG
CTGAACGCCG CGGTGTCGGC GGCCCTGTCG GTGCTCGCCT GCCAGATCAA GAAGGACGTG
CAGTTGAACA CCTCCCTCGG CACCGGGATT CCTGTTATCC CGGGAAGCCA GCAGCAGATC
GAGCAGGTGA TCATCAACCT GATCGTGAAC GCCATGCAGG CCCTTGAAGA CGGGCGGGGG
GAGGTGCATC TGACCACCCG CCATGACGCC CATAACGGCG AGGTGGTGGT GGAAGTAAGC
GACAACGGTG TCGGCATCAA GCCGGAAACC ATGGAGCGCC TGTTCGAACC TTTCTACTCG
ACCAAGTTGG ATCGGGGGGG AAGCGGCCTG GGGCTCTACA TCTCGCAATA CATCGTTGCC
GAACACGGCG GCCGGTTGCA GCTTACCTCC GCCCCGGGCA AGGGGACATT GGCCCGCGTG
GTGCTCCCGG CCGCGCCTGC CGCCTCAGTG CGCGGCATGG TCTCCGCCCA GAACGGTCAG
CATGCCGCCG ATGCGCTCCA TCAAGTCGGT TAA
 
Protein sequence
MKKRVIRKFR LSPITKSFRV RLYLIFTGTI ALLTAAFVSF YVVTENNAYR STLEREGRLL 
ATILSQNARL PLFAENREAL SVLAEGTSRQ SSVVSVLISD QQGRVAAEAR KVEVPQGETV
KMEVEITSPS SVLSPESVLL GHQETDKQQV IGRVHLLLDM SAVRERLVNL VAASLAIGTL
FWVAVSLLSY QVIKRVTSSF NMLMGGVEEI GSGKLSARVD LEGDDELARA ANAINAMAAS
LELRELENLA LQEELLKAMQ LEVQEEKKLV MARLIQTNKM TSLGLLLSSM AHEINNPNAS
IRFSGYMIGK MWSDAVPLLD RVREEEGDFY LGGIPFEKAR QALTENAGKI VENSERIARV
VQGLRDYGVG GDAHLKQKLE LNAAVSAALS VLACQIKKDV QLNTSLGTGI PVIPGSQQQI
EQVIINLIVN AMQALEDGRG EVHLTTRHDA HNGEVVVEVS DNGVGIKPET MERLFEPFYS
TKLDRGGSGL GLYISQYIVA EHGGRLQLTS APGKGTLARV VLPAAPAASV RGMVSAQNGQ
HAADALHQVG