Gene GM21_1722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1722 
Symbol 
ID8137053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2003326 
End bp2007411 
Gene Length4086 bp 
Protein Length1361 aa 
Translation table11 
GC content65% 
IMG OID644869334 
Producthistidine kinase 
Protein accessionYP_003021534 
Protein GI253700345 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value0.169999 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAT TGATTCAATT GCCGACGCTC CGGGCGCGGC TTGCTTTCTG GTTCACGGCG 
ATCTGCCTCG TGCCTTTGTT CCTGCTTACC ACCATCATCT ACACCCAGCG GGTCAAGGTG
GCGCGGGCGC TGATATTCGA CAAGCTGAAC GCCGTCATCT CGCTGCGTCA GCAGCAGATC
GGGAGCATCC TCGACAACAT GATAGTGGAC GCCTCCACTA TCGCCTCCGA CCACTCGGTG
CGCGAAGCGG CGCAGCAGCT TCTGACCGGG CCCGGCCAGG GGCCTACCTT CAGCGACAGC
CTGATCCTGC TGCGGGGTTA CCAGAGCAGC AACCTCTCCG TGGCCGACGT GTCGATAGTC
TCCCCTGACG GGAGCATCGT CCTTTGCACC AGCGACAGCA GGAACGGGAC GCATGTGGCC
CGGCCCGAGA TGGTGGCCAA TGCGCTGCAG GACCTGGCGC CGACCTTTGG AGAGGCCTAC
CTCGCCAGCG ACGGCGTTCC CAGCCTCGAC GTCGCCATCC CCATCAGGGA GGGGGCGGGA
TCGCGCGTCG GCGCCTTGCT GGTGGCGCGT TACAACCTGC ACACCCTGCT CTACGACATC
ACCGCCAACC GCACGGGCAT GGGGGATACC GGGGAAACGG TGCTGGTGAA CCGGGAGATG
ATGCCGCTAA ACGAGCTGCG GGCCCTGTCC GGGGGGATCC TGAAGGTGCC GCTCAAGGGG
AAACCGGCGC AGATGGCGGT GCGCGGGGAA AGCGGCATCG CGCAGGCCAC CGACTATCGC
GGAGTGGAAG TGCTTTCCGC CTTTGTCCAC ATACCCCGCA CCGGCTGGGG GCTCGTTTGC
AAGCAGGACA CCAGCGAGGT ACTGGCGCCG ATCCGGAGCC TCCTCTTCAT TACCTACGGT
CTGGCTGCGT TCATTTCGCT TCTGGCCTGG GTGCTTGCCT TAAGGCTCGC GCGCACCATA
GCCGAACCGC TGGTGAACCT GGCGGAAAGC GCCAAGGAAC TGGAAGACGG GGACTTCGAC
GCCCGGGCCG AGGCGGAGGG TACCGACGAG CAGAAGGCCC TCGCCGGCAG TTTCAACAGC
ATGGCCCGGG CGCTGCAGAC GAAGATGGAG GCCCAGCAAG GGATGTCGCG GCTCTCCCAG
AACCTGGTTG TCGCCGAGGG AATGGACGAT TTTTTCAGCA AACTCCTCCC CGTCTTTTTG
GAGATCACCG GCGCCAAGAT GGCGGTCGCC TACGTGGAGG AGCGGCAGAC AGGGGAGTTC
GCGCCCATGC ATGCCATCGG CCTTGATTCT TCCCTGCTGC GCCGGTTCAA ACGCAGCCAT
CTTGAGGGAG AGCTGGGAGT GATCCTCTCC AGCGGCACCG TGTCCTGCTA TGCGCCGCAG
GAAGCGACTC CCCTTCGCTT TGCTACCAGC TTCGGCGACA TCGTGCCCGC CGAGATAGCC
ACCGTCCCGA TTTCGGTCGA CGCCGAGATC CGCGCCTTCG TTTCCATCGC CTCGGAGCGC
CCCTTCCCGC CTAAGGTGCG GGAGATCCTG GAGCTGGTGC AGATGCCGCT GGGTTCGGCC
TTCGCCAGGA TACTCTCCGG CGAGGACGTC CGGCTGCTGG CGAACGAACT GGCGCTCAAG
AACGCGGAAC TGACCCAGCA GTCCGAGGAA CTGGTGCAGC AGTCCACCGA GCTGGCCCAC
CAGTCGGAGG AGCTTTCCCG CCATAACCAG GTGCTGGATC TGCAAAAGCA GCAATTGGAA
GAGGCGACCC GGCTTAAGAG CGAATTCCTC TCCAACATGA GCCATGAGTT GAGGACGCCG
CTCAACTCGG TGCTGGCGCT CTCCCGTGTG CTGGCGGTGC AGGGGAGCTC TCGCCTCACC
GAGGACGAGA AGGGGTACCT TTCCATCATC GAGAGAAACG GCAAGCACCT TTTGTCCCTG
ATCAACGACA TCCTCGACCT GGCCAAGATC GAGTCCGGAC GCCAGGAGGT TTTCCTGGAG
TCGGTCTCCC TGCGAAAGGT CGCGGCCGAA GTGGTGGACG GGCTTTCGGT GCTGGCCCGC
GAAAAGGGGA TCGCGCTCTC GCTGGAGGCG CCTGAGGGGC TCCCCGCCGC GGTGATCGAC
GTGAAAAGAC TGCGCCAGAT GCTGCAGAAC CTGATAGGCA ACGCGATCAA GTTCACCGCC
GAGGGGTCGG TGACCGTGAG ACTCGACGCC GAGGGGGAAG AGTTCCTGAT CGAGGTGGAG
GATACCGGCA TCGGCATCGC GCCGCAGTAC CTGGAATCCA TCTTCCACGA GTTCCGCCAG
GCCGACGGCA GCACCTCTCG CTCCTTCGAG GGGACGGGGC TCGGGCTTGC CATCGTGAAG
AAGACGGCGC GCCTTTTGGG AGGTGACGTA TCGGTGCGAA GCGAGCTCGG CGAAGGGTCC
GTCTTCACCC TTAGGCTTCC TTGCGACGGC ACCGGGAGCG TGCAGGGACC GGCACCGGGT
AACGCCGCTT TCCCCGCGGA ACGCGCGCAG GCTCCGTTTC AGGGGAGGTC GGTCCTGGTG
GTGGATGACG ACCCCGAGGC GGTGTCGTTG ATCGCGTCGC ATCTCTCCCA GGCGGGTTTC
GAAACCTTGA CAGCGCTAAA CGGCGCCGAC GCCCTGAGGC TCGCTCGCGC GAAGAAGCCT
TTCGCCATCA CGCTCGACAT CATGATGCCG GACATGGACG GTTGGGAGGT GATGCGCGAG
CTGAAGGACC AGCGCGAAAC AGCCGATATC CCCGTGGTGA TCGTGTCGCT CTCCGACGAC
CGTGCGACGG GAATCGCTCT CGGCGCGGTC GGGGTGATAG CCAAACCGGT GAGCAGGGAA
CAGCTCATGG AGACCTTCAC CAAGCTTGCC GGCGCGGGAT GCCGGCTGGT GCTGGTGGTC
GACGACAGCG AACACGACCG TTTCTTCATC GCCTCGCTCT TCAAGGAGAT GGGGATGGAC
GTGCTCTTGG CCGAGGACGG CGCGCAGGCC CTGGAACTCG CCGCGGCTGA TCAGCCCGAC
CTGATCACGC TCGACCTGCT GATGCCGAGG ATGGACGGAG CGGCCGTCTT GGACAAGCTC
AGGTCCAACC AGTCCACCGC GGCGATTCCG GTGGTGGTGA TAACCTCCAA GGATCTGAGC
TCGACTGAGT TGGAGCGGCT CTCCTGCGGG GTGAGCGCCA TCCTTTCCAA GGGGGGGCTC
GAGCGCGAGG TGGTGCTGGA CGAATTGGTG CACAGCCTGG AGCGGATCGG CTGGCGCCTT
CCCTGCGAGC CAAGCGGAAA CGGGGCGAGG ATCCTCATCG TCGAGGACAG CGAGGCGGCC
ACCATCCAGC TCCGCTTCGC GCTGGAATCG GCGGGGTTCT GCGTCGACGC GGTCTCAGGC
GGCAGGCATG CGCTTTCCTA TCTGCAGACC CATGTTCCCG ACGGCATCAT CCTCGACCTG
ATGATGCCTG AGGTGGACGG CTTCGACGTG TTGAAGGCGG TGCGCGTCTC GCGGCTGACC
GAAAAGGTGC CGGTCATGGT GATGACCGCA AAGACCCTTT CGCCGGGCGA GCACGACCGG
CTGCGCCTGC TGGAGGTGAA CCACCTGGTC CAGAAAGGGG ACGTGGAGCA GCACGAACTG
CTGCGCAGAA TTTACGAGAT GCTCGGGGCG GTCTCCCTGT TCAAGCCGGA CACCGCGCCG
GCGGCGACCG GCGCGGCGCT TTTCCCCGAG AGTCAGCGGG GGGACGGATC GGTGTTGGTG
GTCGAGGACA ACCCGGACAA CCTGGTAACG CTTCGGGCCG TCTTGGGGGG GAGCTGCCGC
CTGGTCGAGG CGACAGACGG CGAGTCCGGC CTTCTCCTGG CGCGGACTGG GTCCCCTTCG
CTGATCCTTC TGGACATGCA TCTCCCCGGC ATGGACGGAC TCGCCGTACT CAGGCACCTG
AAGCAGGATA ACGCCACGGC TCTCATCCCG GTGGTGGCCC TCACCGCGAG CGCCATGGCG
GGGGACCGGG AGATGCTGAT GAAGGCGGGG TGCGTGGCGT ACGTCTCGAA GCCGTACCAG
CCGGAAGAGC TGCAGGAACT GGTCATCAAA TTCGTCGCAA CGGGGACTGC GCCAAGCGGG
AAGTAG
 
Protein sequence
MKKLIQLPTL RARLAFWFTA ICLVPLFLLT TIIYTQRVKV ARALIFDKLN AVISLRQQQI 
GSILDNMIVD ASTIASDHSV REAAQQLLTG PGQGPTFSDS LILLRGYQSS NLSVADVSIV
SPDGSIVLCT SDSRNGTHVA RPEMVANALQ DLAPTFGEAY LASDGVPSLD VAIPIREGAG
SRVGALLVAR YNLHTLLYDI TANRTGMGDT GETVLVNREM MPLNELRALS GGILKVPLKG
KPAQMAVRGE SGIAQATDYR GVEVLSAFVH IPRTGWGLVC KQDTSEVLAP IRSLLFITYG
LAAFISLLAW VLALRLARTI AEPLVNLAES AKELEDGDFD ARAEAEGTDE QKALAGSFNS
MARALQTKME AQQGMSRLSQ NLVVAEGMDD FFSKLLPVFL EITGAKMAVA YVEERQTGEF
APMHAIGLDS SLLRRFKRSH LEGELGVILS SGTVSCYAPQ EATPLRFATS FGDIVPAEIA
TVPISVDAEI RAFVSIASER PFPPKVREIL ELVQMPLGSA FARILSGEDV RLLANELALK
NAELTQQSEE LVQQSTELAH QSEELSRHNQ VLDLQKQQLE EATRLKSEFL SNMSHELRTP
LNSVLALSRV LAVQGSSRLT EDEKGYLSII ERNGKHLLSL INDILDLAKI ESGRQEVFLE
SVSLRKVAAE VVDGLSVLAR EKGIALSLEA PEGLPAAVID VKRLRQMLQN LIGNAIKFTA
EGSVTVRLDA EGEEFLIEVE DTGIGIAPQY LESIFHEFRQ ADGSTSRSFE GTGLGLAIVK
KTARLLGGDV SVRSELGEGS VFTLRLPCDG TGSVQGPAPG NAAFPAERAQ APFQGRSVLV
VDDDPEAVSL IASHLSQAGF ETLTALNGAD ALRLARAKKP FAITLDIMMP DMDGWEVMRE
LKDQRETADI PVVIVSLSDD RATGIALGAV GVIAKPVSRE QLMETFTKLA GAGCRLVLVV
DDSEHDRFFI ASLFKEMGMD VLLAEDGAQA LELAAADQPD LITLDLLMPR MDGAAVLDKL
RSNQSTAAIP VVVITSKDLS STELERLSCG VSAILSKGGL EREVVLDELV HSLERIGWRL
PCEPSGNGAR ILIVEDSEAA TIQLRFALES AGFCVDAVSG GRHALSYLQT HVPDGIILDL
MMPEVDGFDV LKAVRVSRLT EKVPVMVMTA KTLSPGEHDR LRLLEVNHLV QKGDVEQHEL
LRRIYEMLGA VSLFKPDTAP AATGAALFPE SQRGDGSVLV VEDNPDNLVT LRAVLGGSCR
LVEATDGESG LLLARTGSPS LILLDMHLPG MDGLAVLRHL KQDNATALIP VVALTASAMA
GDREMLMKAG CVAYVSKPYQ PEELQELVIK FVATGTAPSG K