Gene GM21_1199 details

Gene Information

Locus tag: GM21_1199
Symbol:
ID: 8136524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1402962
End bp: 1405850
Gene Length: 2889 bp
Protein Length: 962 aa
Translation table: 11
GC content: 59%
IMG OID: 644868813
Product: multi-sensor hybrid histidine kinase
Protein accession: YP_003021018
Protein GI: 253699829
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
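
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1402962
END_BP = 1405850
GENE_LENGTH_BP = 2889
PROTEIN_LENGTH_AA = 962


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))    # 2889
    print(coding_codons(GENE_LENGTH_BP))    # 962
```

Under these assumptions both computed values agree with the listed gene length and protein length.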


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 175
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACAA AGACAGAGCC TGCTGAGAGA AACGTCACAA TTCCTTTCAT CGTTTTTACT 
TTTGCCGTCT GCGGACTGTA TTTAAGCAGC CTTAGCTCCT ACCTCTTGTT TCATACCTTG
GTGGAAATTT TCTTTATCCT GGTACTGCTC GGTTCTTTTG TGGTGGCATG GAACTCCCGC
AGGCAGCTGG ATAATCACTA CTTCTTGTTT CTCGCCATAT CCTTTCTCTC TTCCGGTACC
TTTGAGCTTT TGCACACTCT GGCCTACAAG GGTTTGTCGA TATTCCCCGG GTATGACGCC
AATCTCCCAA CCCAGTTCTG GATAGTTTCG CGATACGTCT TCAGCCTATC CTTCCTCATC
GCCCCTGTGT TCATTGCCCG GAAGCTTAAA GTGGCGGCTA CGCTGACTGG ATTTATCTGC
ATCACCGCCT TGCTGGTCGG CGCCGTATTC TCGGGCAACT TCCCCGACTG CTTCATTGAA
GGAACAGGGT TGACGCCGTT CAAGGTGTAC AGCGAATACG TGATAATCGC CGTCCTGGCA
GCTGCCATCA TGCTACTGCT TGCCAAAAAG GAGGCCTTCG ATACTCGCGT CCTGCGGGCG
CTCATCTGGT CGCTGACGGC CGCGGCCATG GCCGATGTGG CCTTTACCAA ATACGTCAGT
GTTTATGGCC CGGCAAACCT CTTGGGGCAT CTCTTTCTCC TCCTTTCCGC CTTCCTGGTG
TACCGGGCCA TCGTGGTGAC GGGGATAGAG GATCCCGCAT CCCTGTTGTT CCGCAGCCTT
GAGCTCAGCA GGGAGCAGTT CCGCAGGGCG ATAGCAGACG CGCCCATTCC CGTCATCATG
CACGCCGAGG ATGGCAAGGT GTTACAGGTG AGCGGGTCGT GGACGGAGCT CACCGGCTAC
GAGCTGGAAG ATATCCCGAC CCTCGAGGCC TGGCTCGACC GGGCCGTTCA AGGCCCAGAC
GGTGATGCCG CACGCGCCGC TGTAAGCGCG CTTTGCACCG GGAACAGGCA GAACATCAGG
TGGGATATGC CTATCCTCAC CCGTGACGGG CGCTCTCGTT ATTGGAACAT CAGTGCATCT
TCGCCCGGAA GCCTGCAGGA CGGCCGGCGC TTCGTCGTCT GCATGGCGGT TGATATCACC
GAGCGCCGGG AAAGTGAAGC CGCCCTGCAA AGAGGTAACC TGCGCCTTGA TCTGCTGGCC
TCCACCGCGA GCAGGTTGCT GGAGAGCGAC GCGCCACAGC AACTGGTCGA CGAGTTGTGT
CAGAAGGTGA TGTCCTATCT TTCCTGCGAT ACCTATTTCA ATTTCCTGGT GGATGAAGAG
GCCGGATGCT TGCGCTTGAA TTCCTGCGCC GGGATCAGCG CCGAGTCTGC GGAGTCCCTG
CAGTACCTTG AGTACGGGGT GGCGATCTGT GGCTGCGCTG CCAGGGATGC CTGCCGCATA
GTGGCCGAGA ATATCCCAGA GGTGCCAGAT CCACGCACGG AACTGGTGAA ATCCCTCGGC
ATCACGGCCT ATGCCTGTCA TCCCCTCATG TCCCACGGGC GGGTGCTGGG GACCCTTTCC
TTCGGGACCA GGACCAGGAA GGCCTTCAGC GAGGATGACC TATCGCTCAT GAAGGCGGTG
GCGGATCAGG TTTCGATTGC CATGGAGCGG AAGACTACCG AGGAGAAACT GCAGCAGGCC
AAGCAGGCGG CAGAGGCGGC CAGCAGGACC AAGAGCCAGT TCCTTGCCAA CATGAGCCAC
GAGCTGAGAA CGCCGATGAC TGGGGTGCTG GGGATGCTGG ACCTGGCCCT GCAATCCACC
TCCGAACTGC AGCAAAGCGA TTACATCCGG ACCGCTTACC GGTCAGGGCG GTCGCTGTTG
CAGATACTAA ACGACATCCT CGATCTATCT AAAGTGGAGG CCGGGAAGTT CTCGCTGGAT
CAGAAGCCTT TCTCCCTCAG GAGTTGCGTA ACGGAAGCGG CAGATATCGC CGCTCCCGAG
GCGAAGCGCA AGGGGATCGA ACTGGAGCTC GACCTCGCGC AAGACCTCCC TTGGCATGTG
GTCGGCGATC AGGTCAGGCT GCGCCAGGTA CTGACCAACC TGGTCGGCAA TGCCGTGAAG
TTCACGGAGC GAGGGCGGGT AACGGTGCAG GTCGAGGTTG CGGATAAAAC GGCTGAGGCA
GAGCTGCTGC TCCGCTTCCG CGTGAGCGAC ACAGGGATAG GGATACCGCA GGACAAGAGG
CATCTGCTCT TCCAGGCCTT CAGCCAGGTC GACGATTCCA ATACCCGCAA CTATGGGGGG
ACCGGCCTCG GCCTTGCCAT CAGCAAGGAG ATCGTTCAGC GCATGGGGGG AGAGATCGGT
TTCGAAAGCG GGGAAGGGGT GGGAAGCACC TTCACCTTTA CCGCGCGGCT GGGTATCGCC
GAAGATTGTG ACGATCTTGC TGTGGCAGCC GGCAGTGAAA GCGTCGTCCC AACTCGCACT
CCGCAGGACG TCAGTGAGGG CAATCGGCAG GAGCGCAATC GGCAGGAGGG GAGCGGTTTG
CGCCTGCTGA TCGCTGAAGA CGACGCCACT ATCAGGCAGG TACTGGCAGC GATGCTGCAG
AAGCTGCAAT TCGTCGTTGA TTTCGCCGAG GACGGCGAAA CTGTCGTTGA AAAGTGGCGG
CAAGGGGAGT ACGACCTGAT ACTGATGGAT GTGCAGATGC CCCGTCTGGA CGGCTTTCAG
GCCACTCGCG CCATCCGGGA ACAGGAACAG GAACAGGGCG CGCGCATTCC GATCATCGCC
ATGACCGCTC ACGCCATGAA AGAAGACCAA CGGCGCTGTC TCGACGCCGG CATGGACGAC
TACATATCCA AGCCGATCAA CTTCAAGGAG TGCATCGAAA AGGTGAAGGG GTTCGCCTGC
AACCGGTGA
 
Protein sequence
MATKTEPAER NVTIPFIVFT FAVCGLYLSS LSSYLLFHTL VEIFFILVLL GSFVVAWNSR 
RQLDNHYFLF LAISFLSSGT FELLHTLAYK GLSIFPGYDA NLPTQFWIVS RYVFSLSFLI
APVFIARKLK VAATLTGFIC ITALLVGAVF SGNFPDCFIE GTGLTPFKVY SEYVIIAVLA
AAIMLLLAKK EAFDTRVLRA LIWSLTAAAM ADVAFTKYVS VYGPANLLGH LFLLLSAFLV
YRAIVVTGIE DPASLLFRSL ELSREQFRRA IADAPIPVIM HAEDGKVLQV SGSWTELTGY
ELEDIPTLEA WLDRAVQGPD GDAARAAVSA LCTGNRQNIR WDMPILTRDG RSRYWNISAS
SPGSLQDGRR FVVCMAVDIT ERRESEAALQ RGNLRLDLLA STASRLLESD APQQLVDELC
QKVMSYLSCD TYFNFLVDEE AGCLRLNSCA GISAESAESL QYLEYGVAIC GCAARDACRI
VAENIPEVPD PRTELVKSLG ITAYACHPLM SHGRVLGTLS FGTRTRKAFS EDDLSLMKAV
ADQVSIAMER KTTEEKLQQA KQAAEAASRT KSQFLANMSH ELRTPMTGVL GMLDLALQST
SELQQSDYIR TAYRSGRSLL QILNDILDLS KVEAGKFSLD QKPFSLRSCV TEAADIAAPE
AKRKGIELEL DLAQDLPWHV VGDQVRLRQV LTNLVGNAVK FTERGRVTVQ VEVADKTAEA
ELLLRFRVSD TGIGIPQDKR HLLFQAFSQV DDSNTRNYGG TGLGLAISKE IVQRMGGEIG
FESGEGVGST FTFTARLGIA EDCDDLAVAA GSESVVPTRT PQDVSEGNRQ ERNRQEGSGL
RLLIAEDDAT IRQVLAAMLQ KLQFVVDFAE DGETVVEKWR QGEYDLILMD VQMPRLDGFQ
ATRAIREQEQ EQGARIPIIA MTAHAMKEDQ RRCLDAGMDD YISKPINFKE CIEKVKGFAC
NR
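
The gene and protein sequences above are printed in blocks of ten characters. A minimal Python sketch for working with that layout follows: it strips the whitespace, computes a GC fraction, and translates DNA using the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code. The helper names are hypothetical, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Standard genetic code in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence above.
    first_line = (
        "ATGGCAACAA AGACAGAGCC TGCTGAGAGA "
        "AACGTCACAA TTCCTTTCAT CGTTTTTACT"
    )
    print(translate(clean(first_line)))  # MATKTEPAERNVTIPFIVFT
```

Applying `clean` to the full gene sequence and passing the result to `gc_fraction` and `translate` gives values that can be compared with the listed GC content and protein sequence.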