Gene GM21_2693 details


Gene Information

Locus tag: GM21_2693
Symbol:
ID: 8138035
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3136117
End bp: 3138087
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table: 11
GC content: 61%
IMG OID: 644870297
Product: histidine kinase
Protein accession: YP_003022487
Protein GI: 253701298
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
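
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not contribute a residue. The function names are hypothetical and not part of any database tooling.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 3136117, 3138087
print(span_length(start_bp, end_bp))                            # compare with Gene Length
print(expected_protein_length(span_length(start_bp, end_bp)))   # compare with Protein Length
```

Under those assumptions the span works out to 1971 bp, which is 657 codons, or 656 residues once the stop codon is excluded, in agreement with the Gene Length and Protein Length fields.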


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 2.23882e-16
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCAGAAGT ACCTCTTCGC CTTCATAAAC AAACTGAAAC TGCGCTGGAA GATGGTGGTG 
CTGGTGTTGC CGCTGGTGAT CATACCGATC TTCCTCGTGG GCGGGGTCAT CGGCTACATC
TCGACCAAGC AGGCCTACCT GGGGATCACC CAGACCAGCA AGGACGACCT GCAGCACATG
GCCGGCTTCA CCGTCGACCT CTTGAACTCC CACTACCAGC AATTCCAGGT CTACAAGCAG
GACAAGGAGA AGACCTTCCA CGAAGAGTTG AGGACGGTGA GCGAGCTCGC CTTCAACGTG
GTGAAGTCGC AGCAGAGCCT GCAGCAAAAA GGGCGCATGG ACGTGGCGGC GGCGCAGCGC
GAGGCGCGTA ACGCCCTGGC CAAGGTAAAC GTGGGGAAGA CCGGCTACAT CTACGCCATG
AACAGCCGCG GAGAGCTGAA GGTCCACGTG GCGCAGGAAG GTGTCAACGT CTTTGACAGC
CGGGACGAGA CCGGGCGCTA CTTCATCCGC GAGATGATAG ATAAGGCACA CCGCTCGAAG
CCGGGCGAGG TGCTCTACAT CGTCTACCCC TGGCGCAACG CGGCGCTAGG CGACAAGTCG
CTCAGGAAGA AGGTGGTGGC CTACCGCTAC TTCAAGGAGT GGGACTGGAT CATCGCGGCA
GGCGGATACC TGGAGGAGAC CTACGAGGAC CTCGCCTTCG AGCGCCGCTC CTTCCAGGAG
CTGAAGGAAA AGATCAAGGG GAAGAAGGTG GGCAAGACCG GCTACATCTT CGCCATGGAC
ACCTCGGGTA ACTTCATGAT CCACCCGACC GGGGAGGGAA AGAACTTCCT GAACGCGGTC
GATTTCAGCG GGCAGCATTT CATCAAGGAG ATGTGCGAGA GGAAAACCGG CTGGATCCGC
TACCCCTGGA AGAACAAGGG GGACAGCGGC CCCAGGATGA AGATCGTCCG CTACGAGTAC
TTCCAGCCCT GGAACTGGAT CGTCGCGGTA GGCTCCTACG AGGAGGAGTT CTACCAGGAG
GCGAACGTGA TCAAGGGGCG CATCATGGAG AGCATGGTGG TGCTCACCAT CCTGGTGAGC
GTGATGGCCG TGTTCCTGGT GCTCCTAGCC TCCCGGGTGA TGACCGAGCC GATCTCCAGG
ATGATCGAGG TGATCAGGAG GGTGAAGCAG GGGCGCCTGG ACGAGACCAT GAAGGTGGAG
ACCCAGGACG AACTGGGTGA ACTCGCCACC GCCTTCAACC GGATGACCAA GATCATCAAG
CACAACAAGG AGCTGGAGGC GAACCTGGCC CAGCAGGGGA AGATGGCGTC CCTAGGCGTC
CTCTCCTCGG GCGTCGCCCA CGAGATCAAC AACCCGCTCG GGGTGATACT GGGGTACGCC
GCGTACATAG AGAAGAAGCT CTCGCCCGAC GACCCCAACT ACCGGTTCAT CCACGAGATC
AAGCGCGAGA GCAAGCGCTG CAAGAAGATC GTGCAGGACC TTCTCTCCTA CGCCCGCACC
CCGCAGCCGG TGCTGGAGCC GACCGACCTG AACGCGCTTT TGGAGCAGAT CGTGGACTTC
GCCGCGAACC ACACCGACAT GCACCACGTC TCGGTGGAAA AAAGCTTCGA TCCGACGCTG
CCCGAGATCA TGGTCGACGG CGACCAGCTG CGCCAGGTGG CGATCAACCT GATCCTGAAC
GCCGGCGCCG CCATGCAAAG GGGGGGAAAG CTCGTGGTCA GCACCCAAAA CGGTGAAGAC
AACTGCGTGA GCCTCAAGTT CTCCGACAAC GGCGCGGGAA TCGCGGCCGA GCACATGGAG
CGGATCTTCG AGCCGTTTTT CACCACCAAG GTCAAGGGGA CGGGGCTCGG TCTCGCCATA
ACCAGGCAGA TCGTCGAGCA GCACCACGGC AAGATCGGCA TCGAGAGCGA GATCGGCGCC
GGGACCACGG TCGAGGTACG GCTGCCGATC AACCGGGACG ACTACTGTTA A
 
Protein sequence
MQKYLFAFIN KLKLRWKMVV LVLPLVIIPI FLVGGVIGYI STKQAYLGIT QTSKDDLQHM 
AGFTVDLLNS HYQQFQVYKQ DKEKTFHEEL RTVSELAFNV VKSQQSLQQK GRMDVAAAQR
EARNALAKVN VGKTGYIYAM NSRGELKVHV AQEGVNVFDS RDETGRYFIR EMIDKAHRSK
PGEVLYIVYP WRNAALGDKS LRKKVVAYRY FKEWDWIIAA GGYLEETYED LAFERRSFQE
LKEKIKGKKV GKTGYIFAMD TSGNFMIHPT GEGKNFLNAV DFSGQHFIKE MCERKTGWIR
YPWKNKGDSG PRMKIVRYEY FQPWNWIVAV GSYEEEFYQE ANVIKGRIME SMVVLTILVS
VMAVFLVLLA SRVMTEPISR MIEVIRRVKQ GRLDETMKVE TQDELGELAT AFNRMTKIIK
HNKELEANLA QQGKMASLGV LSSGVAHEIN NPLGVILGYA AYIEKKLSPD DPNYRFIHEI
KRESKRCKKI VQDLLSYART PQPVLEPTDL NALLEQIVDF AANHTDMHHV SVEKSFDPTL
PEIMVDGDQL RQVAINLILN AGAAMQRGGK LVVSTQNGED NCVSLKFSDN GAGIAAEHME
RIFEPFFTTK VKGTGLGLAI TRQIVEQHHG KIGIESEIGA GTTVEVRLPI NRDDYC
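
The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch rather than the database's own method: it assumes the amino acid assignments of the standard genetic code (which translation table 11 shares), and it reads an initial ATG, GTG, or TTG codon as methionine, as is usual for bacterial start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments and differs mainly in which codons may act as starts.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these three initiators are handled in this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("GTGCAGAAGT ACCTCTTCGC CTTCATAAAC"))
```

Applied to the first 30 bases of the gene sequence, `translate` yields the first ten residues of the protein sequence (MQKYLFAFIN). Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field.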