Gene GM21_1524 details


Gene Information

Locus tag: GM21_1524
Symbol:
ID: 8136853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1785693
End bp: 1787471
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 64%
IMG OID: 644869136
Product: histidine kinase
Protein accession: YP_003021338
Protein GI: 253700149
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
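
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions; the record does not state them, although they agree with the reported values. The function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1785693, 1787471)
print(length)                           # 1779
print(expected_protein_length(length))  # 592
```

Under these assumptions the computed values match the Gene Length (1779 bp) and Protein Length (592 aa) fields.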


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 2.64718e-19
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCATACCA GATTAAAATT TCCGCTCAGG TTCAAGATGC TCCTCTCCCA GTTGCTGGTG 
GTGTCGGTGG TGCTGAGCCT GATCACCTTC ACCATGGCGA ACCTGTTCCA GGTCGACAAG
ACCGCCTACA TCCACGACCT CACCTCGACG GTGGTGCTGC ATACGGCGGA GGAGGCGAAC
GCGCTTTTGG CCGGTTACCG GGAGCGGTTG AAGCTCTTCG GGCGCGTCCT GGCCGAGCCG
GAGCTTTCGG GGCGGGACCA GGTGCTGCAA AGCTTTTTCG AGGAGTTCCG CGACTTCGTG
CTGGTTACCC GCAGCGGCCC CGGGGGGGAA CAGACCGTCT ACGACGGCGC TGCGCTGCAG
GCGGCCGGGG TGACCAAGGA GGAGGTCGTG GCGAACCTGC AGGCGCATCC CGCGCCGGAA
TCGATTCCGG CGGGGCAGGT GTATCTGGTG AATTCCACCG TATCGCCCAA GCTTCCCACG
CTCGCCCTCA CCATCTCCGA GCCCGCGGCG GGGGGGGCGC CGGTCATCAC GACGGCGGTG
CTGCGCCTGG ACCGGTTGCA GGAGCTCGCC AAGCGTTCGC GGGTCTTTGA CATATTCTTA
TTGGACTCGG CCGGGCGTTA CCTGGCGCAC AAGGCGCCGG GGCGGGTGGG GGTCGCTGCC
AATCTCGAAT GGTGGAACCG GGTGAAGGCC CCGCGCAGCT CCGGGATGAC CATGGAGTAC
AAAAATCTCG GCAAGGAGAT GGTGGCAGGA TTCTCGCGCT GCTCGCTGGG AGGGCTGGTG
GTCGGGGTGG AGATACCCAA AAGCGCCGCC TACCTCACCT CGCGAGAACT TCTCAGCGAT
CTCTTGCTCC TGTCGCTGGC GCTTTTGGGG GGGGCGGCCC TTTTGAGCCA GTTCTGGTCG
CGGCATTTCA CGAGCCCCCT GGAGAAGCTC TCGGAGGCGA CCCGGATGGT GGGGCAGGGG
CGCTTCGAGA TCGAGGTGAA GGCCGAATCA GGCGACGAGA TCGGCGCGCT GGCCCGCTCC
TTCAACCAGA TGGCCGCCGA GTTGAAAGTG CGCGAGAAGG CCCTCAAGGA CCTCTACGGG
CAATTGGTCC ACTCGGAGAA GATGGCGGCC TTTGGCGCCC TCGGCGCGGG GATCGCCCAC
GAGGTGAAGA ACCCGCTGGC GGGTATACTC GGCATCACCC AGCTCTCGCT CAGGGGGGCG
GGAGCCGGGC ACCCGCTGGA GAAGAATCTT CTGATCATCG AGAAGGAGAC CAAGCGCTGC
AAGACCATCA TCGAGCACCT GCTCAAGTTC GCGCGCCAGG AGCAGGTCGA GTTCGGCGAG
GTCGACCTGC AGCAGGTGGT GGCTGATGCC CTTGCCATCG TCGACCACCA GTTGGGGATC
AACAGCATAA AAGTGGAGCA GGAACTGGAG CCGGGAATGC CGACCTGCCG CGGCAACGCG
AACCAGTTGC AGCAGGTGCT GATGAACCTG ATGCTCAACG CGCAGCAGGC GATGAGCGGC
AAGACCGGCA CGGTGAAGCT TTCCGCGCGC AGGCTGGAGC AGGGGGGGGT GGAATTGCGG
GTGGCGGACA ACGGCCCCGG TATCAGCAAG GAGATCCAGG GGAAGATCTT CGATCCCTTC
TTCACCACGA AGCCGGCGGG GCAGGGGACG GGGCTTGGCC TCTCGGTCAC CTACGGCATC
GTCAAGGATC ACGGCGGCGA GATACACCTG GAGAGCGAGG AGGGGGTGGG GACTACCTTC
ATCATCACCC TGCCACCCTC CGCGGCAGCC ACAGGCTAA
 
Protein sequence
MHTRLKFPLR FKMLLSQLLV VSVVLSLITF TMANLFQVDK TAYIHDLTST VVLHTAEEAN 
ALLAGYRERL KLFGRVLAEP ELSGRDQVLQ SFFEEFRDFV LVTRSGPGGE QTVYDGAALQ
AAGVTKEEVV ANLQAHPAPE SIPAGQVYLV NSTVSPKLPT LALTISEPAA GGAPVITTAV
LRLDRLQELA KRSRVFDIFL LDSAGRYLAH KAPGRVGVAA NLEWWNRVKA PRSSGMTMEY
KNLGKEMVAG FSRCSLGGLV VGVEIPKSAA YLTSRELLSD LLLLSLALLG GAALLSQFWS
RHFTSPLEKL SEATRMVGQG RFEIEVKAES GDEIGALARS FNQMAAELKV REKALKDLYG
QLVHSEKMAA FGALGAGIAH EVKNPLAGIL GITQLSLRGA GAGHPLEKNL LIIEKETKRC
KTIIEHLLKF ARQEQVEFGE VDLQQVVADA LAIVDHQLGI NSIKVEQELE PGMPTCRGNA
NQLQQVLMNL MLNAQQAMSG KTGTVKLSAR RLEQGGVELR VADNGPGISK EIQGKIFDPF
FTTKPAGQGT GLGLSVTYGI VKDHGGEIHL ESEEGVGTTF IITLPPSAAA TG
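
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own method: it strips the whitespace, computes a GC fraction, and translates a nucleotide string with the standard codon assignments, which translation table 11 shares for elongation codons. It does not handle the alternative start codons that table 11 allows (this gene begins with ATG, so that case does not arise here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the listing.
print(translate("ATGCATACCA GATTAAAATT TCCGCTCAGG"))  # MHTRLKFPLR
```

Applied to the first thirty bases of the gene sequence, the sketch reproduces the first ten residues of the protein sequence listed above.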