Gene GM21_1439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1439 
Symbol 
ID8136767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1692789 
End bp1694522 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content50% 
IMG OID644869051 
Producthistidine kinase 
Protein accessionYP_003021254 
Protein GI253700065 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAAACT TTCTCGAAGC CAATAAAAAC CGCATCGTCA TCGTCACCTA TGTCCTGCTC 
TTCTGCATGT TCTGCTTTTT TTCCCTGATG GATCTGCACA AGCGGACCTC CGACTACGCC
TATTTCGTCA ACAAGATCAA CAGTTTCATC AAGAATGCCT ACGATAGCGA ATCGGAGTAC
CTCTGCTCCC GACTGGTCAA CTACGTCAGC GCAGTGGCAA AAGAGAGGAA TTTTGCCGAG
CTGTTGAAAA GCGGCGATGC GGAATCCCTC CATAAGGCTG CCGAACCCAT CTTCCAGGAT
TTCGGGAAAC GGTATGTTCC GGTTATCTCG GTCCTGATCT ATGACGGAAA CGGAAAACTC
AGATATCACA TGACACCTCC ATCCATAGCC CACGACCATG AACAATTCAG GCCGACTGTC
GCCAAAGAGG TCTTTGCCTC CAAAAAATTT ACCTACGGGT ATGAAACGAA TATCGTTCCC
CTTTACTTCT GCACGGTCCT TCCTATCAAG GACAGTCTTT CAAAAGTAGT CGGAACGATA
GAGTTAGGTG TCGACCTGTC CTTTTTTCAC TACAATCTCA AATGGTTTTT CAAGGACGTC
AAGACAGCTA TTTTTTCCAA CGGTAAGCTC ATAGATAATC AAGGCGTTTA TAAAATCGAT
TCCGCAGAAT ACGTGTACGA TTACTACAGG AAGCTGAAGG CAAATGACGC GGAATTTTTC
AGTAAGTTCA TTGACAGGAT CGATTTCAAA CGTGCTGAAA ACGACATAGA ACTCGGAGAC
AAATATTACC TGGTAAGTAC CTCGCAGATA CTTAGGAACA ACAGAGGTCA CGAAGTCGGG
AAATATCTCG TGGCCTATGA CATGACCGAA CTGCGGCGCC GTCACTGGGG TTACTTCTAC
GTCTGGCTGC TGTTCTTTGC TGTAACCGCT TCTGTCATGC TCTGTATTAA CATGGTCGGG
TTCAGGAAGT ATGAGCGCAT CATCACAGAG CAGGGAAACA TGCTGGCCCA GCGCTCGAAA
CAGTGCGCTC TCGGTGAAAT GCTGGGGCAT ATCGGGCATC AATGGCGGCA ACCGCTCTAC
AACCTGTCGC TCATCGTTCA GAATATCGGA CTGCAGAACC AGTTCGGCAA GTTGGACGAC
ACCCTGCTCA GCAAGCAGAT TACCCAGGCG AACCAGAATA TTGAGTACAT GTCGAACATC
ATAGATGACT GGCGCTCTTT GCTGATGTCC GGCAGCAGCC GAACCGTGAT CGAACTCCAG
GCGTCCGTTG AGCGGGCCAT CGCGATGGTG GCACCGGTCA TGGAACAGAG CCGGATCACC
ATAGAAAACA GGATCAACTC TCCGGAGCAT ACTATGGGTT TCGTCAATGA CTTGGTGCAG
TTGACTATCA ACGTGCTGTT GAATGCCCGG GATCAGCTCT CCCTGGTTGC CGGTGAGCGC
GTTATCCTTC TCTCCAGCCG TGAAGAGGCC GACTCCCTGG TAACCGTCAC CTTCCAGGAC
AACGGTGGCG GGATCCCCAA TCATCTGTTG AAAAGAATTT TCGAGCCCTA TGTAACCACC
AAGGACAAGG CGGACGGCAC TGGGCTCGGG CTTTATCTCT GTCGCCAGAT TGTCGAAAAT
CTCGACCAGG GGAGGGTCTG GGCGGAAAAC AGGCGCTTCG AGCTGCAAGG GAAAGAGCTC
TATGGTGCCT GTATCTGTCT GCAATTTGCC AAAATAAAAA CGGAGGAAAT ATGA
 
Protein sequence
MRNFLEANKN RIVIVTYVLL FCMFCFFSLM DLHKRTSDYA YFVNKINSFI KNAYDSESEY 
LCSRLVNYVS AVAKERNFAE LLKSGDAESL HKAAEPIFQD FGKRYVPVIS VLIYDGNGKL
RYHMTPPSIA HDHEQFRPTV AKEVFASKKF TYGYETNIVP LYFCTVLPIK DSLSKVVGTI
ELGVDLSFFH YNLKWFFKDV KTAIFSNGKL IDNQGVYKID SAEYVYDYYR KLKANDAEFF
SKFIDRIDFK RAENDIELGD KYYLVSTSQI LRNNRGHEVG KYLVAYDMTE LRRRHWGYFY
VWLLFFAVTA SVMLCINMVG FRKYERIITE QGNMLAQRSK QCALGEMLGH IGHQWRQPLY
NLSLIVQNIG LQNQFGKLDD TLLSKQITQA NQNIEYMSNI IDDWRSLLMS GSSRTVIELQ
ASVERAIAMV APVMEQSRIT IENRINSPEH TMGFVNDLVQ LTINVLLNAR DQLSLVAGER
VILLSSREEA DSLVTVTFQD NGGGIPNHLL KRIFEPYVTT KDKADGTGLG LYLCRQIVEN
LDQGRVWAEN RRFELQGKEL YGACICLQFA KIKTEEI