Gene GM21_0057 details


Gene Information

Locus tag: GM21_0057
Symbol:
ID: 8135356
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 69362
End bp: 71128
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 62%
IMG OID: 644867674
Product: histidine kinase
Protein accession: YP_003019902
Protein GI: 253698713
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
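
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The helper names `span_length` and `coding_codons` are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 69362
end_bp = 71128
gene_length_bp = 1767
protein_length_aa = 588


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in an unspliced CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_codons(gene_length_bp) == protein_length_aa
```

Under these assumptions the reported gene length and protein length agree with the coordinates: 71128 - 69362 + 1 = 1767 bp, and 1767 / 3 - 1 = 588 aa.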


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 1.85994e-13
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGAAACG AACGAGACAT TCTGATCGTC GACGACAACC AAGTGGTCTG CGATGTTCTG 
GCTGAACTAT TCCGCAACGA GGGGTTCGAC AGCTGGGGTG TTGCCACCGG CGAAGCGTGC
CTCGATGAAG TGACCCGCGC CTCCTGGAAG CTGGTGATGC TGGACGTGCG CCTTCCAGGG
ATCAGCGGCA TCGAGGTTCT CGAGGCCATC CGGCGCGACC ACCCCAAGAC CGAAGTGATC
ATCATGACCA GCCACGTATC GCTGGAGACG GCGGTCCAGG CCCTGCGTCT GGGCGCCCAG
GATTACCTTT TCAAGCCCTT CGACGACCTG GAGATGGTGA TCGCCACCGT GAACAAGGCG
CTGGAGCGGC GCCGCCTCGT CGAGGAGCGC GACAAGCTGG TGCGCACCCT GGCCGAGCTG
GCCATCGAGA ACGGCCGCAT CCTCGCCGAA TGCCGCCGGG TAAACAGCAG CCTGGAGGAA
AAGGTGGCGC AGCGTACGGC CGAACTTTCC AAGGCGAACC TGCAGCAAAA GGCGATCATC
GCCGAATTGC GCGAAGCGAA GGAAGCGGCC GAAGCGGCCA ACCGGGCCAA GTCGCAGTTT
CTCGCCAACA TGAGCCACGA GATCCGCACC CCGATGAACG GCGTCCTCGG CATGGCCGAG
CTCCTGCTGC ATTCAGAGCT CGACGAGAAG CAGAAGAGCT ACGCCAAGAT GCTGCACCAC
TCCGGCGAGT CGCTCTTGGA CATCATCAAC GACATCCTCA ACATCTCCAA GATCGAGGCG
GGGAAGCTGG AGATCGAGAG GATTCCTTTC GATCTGCACG AAACCGCGCG CGGCGCGGTG
GAGCTGTACC GTGAGGTGGG CCGGGGCAAG GGTGTCGCGG TGGAGCTGCA GATCGAGGAG
GACGTTCCGC GCTGCGTGGC CGGCGACCCG AACCGCCTGC GGCAGGTCCT GATCAACGTC
GTCAACAACG GCCTGAAATT CACCGAAAAG GGGTCGGTAC AGGTCCGGGT CTCCCTGGTG
GAGCAGAACC AAAACGGGCA GTACGTAGGT TTCGAGGTGA AGGACACGGG GATAGGGATC
CCCGCCGACA GCATCGGCGC CATCTTCGAC TTGTTCGCCC AGGTGGACGG CTCGACCACG
CGGAAGTACG GGGGGACCGG GCTGGGACTG GCCATTGCGA AGCAATTGGT GGAGTTGATG
GGGGGAGAAA TAGGGGTAGA GAGCGAGCCG GGACAGGGGT CCACCTTCAC CTTTATCGTA
TTCCTGCACC AGCAGGTCGA CCAAGCTCTA TGCGAAGAGG AGGGGGCTGA CATGCCCGTT
GATAAGGATA ATTGCACGGC AGAAGCTCGG CAGATCGGGA AGTTCAACGC ACGCGTGCTT
CTGGCCGAGG ACAACCCGGT AAACTGCGAG GTCGCCTTCG CGATGATCGC CGCGCTGGGT
TGCCAAGTCG ACGTGGCCCA GGACGGTAGA GAAGCAGTCG AAGCCTTTTC GCGCCAACCG
TACGACCTGA TTTTCATGGA CTGCCAGATG CCGGAAATGG ACGGCTACCA GGCCACCCGC
GCCATCCGGC AGCGGGAACT CGGCTCCGGC AAGCACACCA CCGTGATCGC ACTCACCGCT
CACGCCATGG CGGGGGCCAG GGAATATTGC CTCACTGCCG GAATGGACGA CTACCTCAGC
AAGCCTTTCA ACCTTGAACA GCTCCAGGAG CTGATCGCCA AATGGACCTC CCCCTCGCCT
CTTAGCCTTC CCTTAGCCTT CCCTTAG
 
Protein sequence
MGNERDILIV DDNQVVCDVL AELFRNEGFD SWGVATGEAC LDEVTRASWK LVMLDVRLPG 
ISGIEVLEAI RRDHPKTEVI IMTSHVSLET AVQALRLGAQ DYLFKPFDDL EMVIATVNKA
LERRRLVEER DKLVRTLAEL AIENGRILAE CRRVNSSLEE KVAQRTAELS KANLQQKAII
AELREAKEAA EAANRAKSQF LANMSHEIRT PMNGVLGMAE LLLHSELDEK QKSYAKMLHH
SGESLLDIIN DILNISKIEA GKLEIERIPF DLHETARGAV ELYREVGRGK GVAVELQIEE
DVPRCVAGDP NRLRQVLINV VNNGLKFTEK GSVQVRVSLV EQNQNGQYVG FEVKDTGIGI
PADSIGAIFD LFAQVDGSTT RKYGGTGLGL AIAKQLVELM GGEIGVESEP GQGSTFTFIV
FLHQQVDQAL CEEEGADMPV DKDNCTAEAR QIGKFNARVL LAEDNPVNCE VAFAMIAALG
CQVDVAQDGR EAVEAFSRQP YDLIFMDCQM PEMDGYQATR AIRQRELGSG KHTTVIALTA
HAMAGAREYC LTAGMDDYLS KPFNLEQLQE LIAKWTSPSP LSLPLAFP
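
Both sequences above are printed in blocks of ten residues, so the spacing has to be removed before they can be used. The following is a minimal sketch of one way to do that and to compute a GC fraction for a nucleotide string. The function names `join_blocks` and `gc_fraction` are hypothetical, and the example input is only the first line of the gene sequence, not the full record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks between blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among all bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a small example input.
first_line = "ATGGGAAACG AACGAGACAT TCTGATCGTC GACGACAACC AAGTGGTCTG CGATGTTCTG"
seq = join_blocks(first_line)
print(len(seq))  # 60: six blocks of ten bases
print(round(gc_fraction(seq), 2))
```

Applying `join_blocks` to the complete gene sequence and passing the result to `gc_fraction` would allow a comparison with the GC content reported in the Gene Information section.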