Gene GM21_1985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1985 
Symbol 
ID8137319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2297983 
End bp2300001 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content62% 
IMG OID644869598 
Producthistidine kinase 
Protein accessionYP_003021795 
Protein GI253700606 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAAGC GCTTCCCGAT CAAGGCCAAA CTCACCTTCG GGGCGCTGGC GCCGCTTTTC 
GTCGCCTTCT TCATCTGCTC CCTGGCCGGC CTTTACATCA TCAACGAGAA GATCGCGAGC
CAGGCGCAGG AAAAGGTGCG CACCGACCTA AACTCCGCGC GCGAGGTCTA CAGAAACGAG
CTGGACCGGA TCCGCGAGTT CATCGATCTC ACCGCCACCA ACCCCTACAA CTCCTCCTCC
ATCGTCGCCG GCGACCATGA AATCCGTGCG CTTTTGCGCC AGCGTCTCTA CAAGAAACGG
CTGGACATCC TGACCGCCGT GGACGCGAAG GGGCGGGTTT TGTACCGCGC CCACACACCT
AGCCTTGCCG GTGACCTGCA AAAGAGCTAC TTCGTGCAGC AGGCGCTGAA AGGGGTGGCG
GTAACGGGGA CGGCGCTTAT CGGCGAGCAG GAACTGGCCC GGGAGGGTGT GGCGCTTTCC
GGCCGCGCCA CCATTCCGCT GGTCTCCACG CCGCATGCCC GTCCTCGCAA CGACGCGACC
GAAAAAACGG GGATGGCAAT GGTCTCCGCC GCCCCCTTGA GAAATCATGC GGGGCAGGTC
ATAGGCGCCC TGTACGGCGC GGTCCTGCTC AACAACAACA ACGCCCTGGT CGACAAGATC
AAGGAGATCG TGTACGAGGG GGTGCAGTTC AACGGCACGG ATGTGGGAAG CGCCACCATC
TTCTTGGGCG ACGCCCGGAT CGCCACCAAC GTCCGCCTCA CCGACGGCGC CCGCGCCATC
GGGACCAGGG TTTCCGAAGA GGTGTACCAG CGGGTGATCG TGGAGAAAAA GAAGTGGATC
CGGCGCGCCT TCGTGGTGAA CGACTGGTAC TTCACCGCCT ACGAGCCGAT CCTGGACCTG
CAGGGAAAGG CGATCGGCTC GCTCTACGTG GGGATGCTGG AGAAGCCTTA CACCCACATG
CAAAAGAGCG TCAACTCGAT CCTGTACATG GTGCTCTTCG TCACCTCGCT GATCGGGCTG
GCTGTCTCCG GCTTCATCGC CACCCTCCTC GCCCGTCCCA TCAAGGAGCT GGAGAAGCTG
GCGCACCGGG TGGCGCGCGG GGAGCGCAAC CTGCAGATGG AAGTGCACAC AAAGGACGAA
GTCGGGGACC TGGCCGACGC CTTCAACCTG ATGACGAAGG CGCTAAGCCG CCAGGAGGCG
GAAATTGGTC TCTTGCACCG GGCACTTGAG CTTAAGGTGG AGGAGCGGAC AGCGCAACTC
TCCGACAAAA ACCGCCTGCT TTTGCAGACC CAGGCGGACC TGGCTCGCGC CGAGAAGCTC
GCCGACCTTG GGATCGTCGC CGCCGGCGTA GCCCACGAGA TCAACACCCC GCTCGCAATC
ATCCGCGGCA ACGCCGAGGT GCTGGAGATG TGCCTTCCCC CCGAGCATCC CAACCACGAG
GAGGTCGATA TCATCAGCAT GCAGACCGAG CGCATGGCGA AGATCGTCGG CAACCTGCTC
ACCTTCGCGC GCCAGAAATC CCTGAACCAG AGGGAGTTCA TGGTGCACGA GATCCTGGAC
GACATCGTGG CGCAGATCAG GCACCAGGTT CCCATGGACG CCATCTCGGT GCAGTGGGAA
TACGACATGA ACCTGGGGAC GGTTATCGGA GACACCGACC AGCTGCGGCA GGTGTTCAGC
AACATCATCC TGAACGCGGT GCAGGCGATG CTTCCCAAGG GGGGGACCCT CAGGCTCACC
ACCCGGCCGC ACGGCCCGGG CAACGGCTGC GAAGTGGAGA TCCGCGACAC CGGCAAGGGG
ATCCCAGCCG AGCATCTGGA AAAGATCTTC ACCCCGTTTT TCACCACCAG GGACAGCGGC
ACCGGGCTCG GCCTCTCCGT TTCCTACGGC ATCGTCAGGG ACCACGGCGG CGACATCCAG
GTTTCCAGCA CGCCGGGAGC CGGAACCTGT TTCAAGATCT TCTTGCCTGG GGGAAGAAAA
ACGGGACAGG AAACGCCGGA AAACTTCGAA GATCTTTAA
 
Protein sequence
MLKRFPIKAK LTFGALAPLF VAFFICSLAG LYIINEKIAS QAQEKVRTDL NSAREVYRNE 
LDRIREFIDL TATNPYNSSS IVAGDHEIRA LLRQRLYKKR LDILTAVDAK GRVLYRAHTP
SLAGDLQKSY FVQQALKGVA VTGTALIGEQ ELAREGVALS GRATIPLVST PHARPRNDAT
EKTGMAMVSA APLRNHAGQV IGALYGAVLL NNNNALVDKI KEIVYEGVQF NGTDVGSATI
FLGDARIATN VRLTDGARAI GTRVSEEVYQ RVIVEKKKWI RRAFVVNDWY FTAYEPILDL
QGKAIGSLYV GMLEKPYTHM QKSVNSILYM VLFVTSLIGL AVSGFIATLL ARPIKELEKL
AHRVARGERN LQMEVHTKDE VGDLADAFNL MTKALSRQEA EIGLLHRALE LKVEERTAQL
SDKNRLLLQT QADLARAEKL ADLGIVAAGV AHEINTPLAI IRGNAEVLEM CLPPEHPNHE
EVDIISMQTE RMAKIVGNLL TFARQKSLNQ REFMVHEILD DIVAQIRHQV PMDAISVQWE
YDMNLGTVIG DTDQLRQVFS NIILNAVQAM LPKGGTLRLT TRPHGPGNGC EVEIRDTGKG
IPAEHLEKIF TPFFTTRDSG TGLGLSVSYG IVRDHGGDIQ VSSTPGAGTC FKIFLPGGRK
TGQETPENFE DL