Gene GM21_2365 details


Gene Information

Locus tag: GM21_2365
Symbol:
ID: 8137706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2754372
End bp: 2755679
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 62%
IMG OID: 644869980
Product: histidine kinase
Protein accession: YP_003022171
Protein GI: 253700982
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID:
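
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 2754372
end_bp = 2755679
gene_length_bp = 1308
protein_length_aa = 435


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coordinates_consistent, protein_consistent)
```

Under these assumptions both checks hold: 2755679 - 2754372 + 1 = 1308 bp, and 1308 / 3 - 1 = 435 aa.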


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 169
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCGG ACGGGCTGAC TTATGTTGTC GGGGAGGAAA GGGAGTTGCG CGACCTCCTC 
TGTGATGCCG ACGTGCTTCC CCTTTTGCAG GGCGCTGTGT CTGCTGGCGC CGTCAGCGCA
TGTTTGGTGG ACCAGACGGG GGAGGGGCTC TGGAAAGTGG ACGCGCCCGC GGAGGGAAGC
GAGCTGGTCG CAGACCTCGC GCTCATTCTG GAAGGGGAGC CGGTGGCTCG GGTTCAGGTG
AACGGGGAGC CGTCGCGGCG AGAACAGCTG CAGGCACTCG CGGAGTTGCT TAGGGTCGCG
CTGGATACGG TGGTAAGAAA CAACCTGAAA CGCATGCTCA CCACCGAGAT CCACACCACG
GTGGTGAAAA GGTCGTTTGA GGAACAGCTG TTGATCAACG CGGAGCTGTC GAAATCCGAG
GCGCGTTACC GCGAACTGGC ACAAAACCTG GAACTGCGCG TGAAGGAACG GACTGAGCAG
CTCGAAAAAG CACAGATCCA CCTACTCCAG CAAGAGAAGA TGGCGGCCAT AGGACAGCTC
GCCGCGGGGA TTGCCCACGA GATCAACAAC CCGCTCGGGT TCGTGATCTC CAACCTGCAC
ACCTTGCAAA AGTACGTGGC TCGTTTCACG GCGATGCTGC AGTTCTACCG ACAACACCTG
GAACTGGACC TTCCCATCGG GATGCTCGTC GAGCAGGCCG ACCAGAAGTG GCGCGAGTTG
AAGCTACAGC AGGTTGTGGC GGATGTCGGT GATTTGATGT CGCAGAGCGT GGGAGGGGCG
GAGAGGGTGG CCCGGATCGT GGCGGACCTC AAGGGGTTTT CCCACGTAGA CGAGGCGGAG
GCGATGCCGG TCGACTTGAA CGTCGAACTG GAGCGGACCT TGAGCGTGCT CTCGCACCAG
CTCCGCGACA AGGCCCACAT CGTCCGCAAG CTGCAGGCAC TCCCCCCGAT CACCTGCCCG
CCTCAACTCG CTGGTCAGAT CTTTCTCAAC CTGATACAGA ACGCACTGGT GCACTGCGGC
CCGCATCCCA CCATTACGAT CGCCAGCTGC TATGATGGAG AACGCATCAG GGTGAGCGTC
GCCGACGATG GTCCCGGCGT TCCCGAAGAG CTCCGCGACC AGATCTTCCA TCCCTTCTTC
ACCACACGCC CGGTCGGCTC TGGGACCGGC ATGGGGCTCG CCGTCGCCTG GGAGGCGGCC
CTGCAGCTTA ACGGAGGCAT CACCGTCACC GACGCCGCCG GAGGCGGCGC CGAGTTCGTA
TTGACAATAG CCGTACCAAA AGGATCAGCC GATGTCGAAG TACTCTGA
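
GC content is the percentage of bases in a sequence that are G or C. As a minimal sketch, the helper below computes it for a sequence written in the 10-base blocks used above; the name `gc_percent` is hypothetical, and how the source database rounds its reported value is an assumption not confirmed by this record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc_count / len(cleaned)


# First line of the gene sequence above, spaces included as displayed.
first_line = "ATGACGGCGG ACGGGCTGAC TTATGTTGTC GGGGAGGAAA GGGAGTTGCG CGACCTCCTC"
print(round(gc_percent(first_line), 1))
```

Passing the full 1308 bp gene sequence instead of the first line gives the value to compare with the GC content field.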
 
Protein sequence
MTADGLTYVV GEERELRDLL CDADVLPLLQ GAVSAGAVSA CLVDQTGEGL WKVDAPAEGS 
ELVADLALIL EGEPVARVQV NGEPSRREQL QALAELLRVA LDTVVRNNLK RMLTTEIHTT
VVKRSFEEQL LINAELSKSE ARYRELAQNL ELRVKERTEQ LEKAQIHLLQ QEKMAAIGQL
AAGIAHEINN PLGFVISNLH TLQKYVARFT AMLQFYRQHL ELDLPIGMLV EQADQKWREL
KLQQVVADVG DLMSQSVGGA ERVARIVADL KGFSHVDEAE AMPVDLNVEL ERTLSVLSHQ
LRDKAHIVRK LQALPPITCP PQLAGQIFLN LIQNALVHCG PHPTITIASC YDGERIRVSV
ADDGPGVPEE LRDQIFHPFF TTRPVGSGTG MGLAVAWEAA LQLNGGITVT DAAGGGAEFV
LTIAVPKGSA DVEVL
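
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the NCBI amino-acid string, which is identical for the standard code and translation table 11 (the two differ only in which codons may act as start codons). The `translate` helper is illustrative rather than a library function, and it assumes the input is already in frame and in coding orientation.

```python
from itertools import product

# NCBI codon order: each position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - len(cleaned) % 3, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding region
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
prefix = "ATGACGGCGG ACGGGCTGAC TTATGTTGTC GGGGAGGAAA GGGAGTTGCG CGACCTCCTC"
print(translate(prefix))  # MTADGLTYVVGEERELRDLL
```

The output matches the first 20 residues of the protein sequence above. A table 11 gene that begins with an alternative start codon such as GTG or TTG would still need its first residue set to M, which this sketch does not handle.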