Gene GM21_1798 details


Gene Information

Locus tag: GM21_1798
Symbol:
ID: 8137129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2089099
End bp: 2092170
Gene Length: 3072 bp
Protein Length: 1023 aa
Translation table: 11
GC content: 63%
IMG OID: 644869410
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_003021610
Protein GI: 253700421
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
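
The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2089099
end_bp = 2092170

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# a stop codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under those assumptions the computed values match the Gene Length (3072 bp) and Protein Length (1023 aa) fields.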


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value: 0.100128
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCTG CAACATCGCA GAGAAACATC AGGCAATCCG GCAGGAACCT GATGGTCTGG 
ACCGCCTCGG TGGTCCTGCT GGTAAACCTG TTCCTGGCGC TGACGGGTTT CTTCTCCCTC
TGGCAGAGCA GGGAAACCTA CCTGAACAAC GCGCAGGTAC AAAGCGACAA CCTCTTGCAG
GCGCTCTCCT ACAGCATCGC AGGGGCACTG GAGTCCGCGG ACATCGCGCT TTTGTCCATG
GTGGACGAAG TGCAGCAGGA GCTGCAGCAG GGACCAGTCG AGGAGGGACG CCTGAACCGT
TTGCTGGCGG GGCAGCATTC CCGGCTGCCG CTGCTGGACA GCATGCGGAT GGCCGATGCG
CAAGGGGATA TCCGTTACGG CACCGGGTTG CAGGCGGGCG CCTTGAAAAA CGTCTCACAG
AGGAGCTACT TCAAGAAACT GGCCGCCGAT CCGAAGGCGG GGCTCGTGAT TTCGGAACCG
ATCCTGGGGC TCATCAGCGG GAAAAGGGTC ATTATCCTCG CCCGAAGGGT TGAGCGGCGC
GGCGGCGCTT TTGCCGGCGT GGTGTACTGT GCCATCGATC TGGAGCGTGT CCGAGGCATG
CTCCAGCCGA TGCAGGTCGG TTCCCACGGA AAAATTACCC TCACCGACGC AACGCTGGCC
ACCATCGTGC GGCATCCGCA GGGGACGGAT GCCGGCGACG CCCCCGGGGA AAAGGGGGCC
AGCGGCGAGA TTTGGGACCA AATCGCCCGG GGGCGCGAGA GTGGGAGCTA CTTTGGCGAA
AGCGACGCTT GGGGCTCCTC CCGGCTCGCA TCCTTCCGAA AGATAGGCCG GTTCCCGCTG
TACCTCTCCC TGGAGCTCGC GCCTGAAGAC TTCCTGGCCC GCTGGCGCGG CGAGTTGCTC
CAGATCAGCA CCATGGTCTG CGTATTCATC CTGGTCACCC TGATCCTTTC CAGGCTCATC
TACTTGAGGT GCCGCCGGCA GGCAGCGGCG GAAGACGAGC TGACTCAGGC CAAGGAGGAA
CTGGAGCTGC GGGTGGCTGA GCGGACGGCT GAGCTGCATC TGGCAAACCT GAAGCTTACG
ACCGAGCTAG CCGAGCGTGA GCGGGCTGAG AAGAGGGAGC GCGAAGGGCG CAACATGCTG
GCGCAGATCA TCGACACCAT CCCGCAGTAC GTGTTCTGGA AGGACCGCCA GAGCATCTAC
GAGGGATGCA ACGCCGTCTT TGCCAAGGGG GCCGGGATGG CGCACAGCGA CGAGATCAGG
GGCAAGAGCG ACTTCGACCT CCCCTGGCTG CGTGAGGAGA GCGAGGCGTA CCGCAGCGAC
GACTGCGTGG TCATGGAGCA AAAACGCGCC AAGTTCCACA TCATAGAGCA GCAACTGCAG
GCGGGTGGGA AGCGTCTATG GGTGGATACC ACCAAGGTGC CGCTTCTGAA CGAGCAGGGA
GAGGTGACCG GGATACTGGG AGTGTACGAG GATATCTCGG AGAGAAAGGC GGTCGAGGAG
TCGCGCGACA AGGCGCTGGC GCTGGTTGAG TCACTTTTGG CCGCCTCGCC CACCGGCATC
CTGGTCTACG AGGGGGCGAG CGGCTGCTGC GTCATGGCGA ACCAGGCGAT GGCCGCGATG
GTGGGGGTGA CTCGGCAGCA GATGCCGGCC TTGAACTTCA GGGAGATCGG ACTCTGGCGG
GAGACGGGGA TTCTCCAGCT TGCCGAGCAG GTCCTCGTCG ACGGGAGGAC ACGGAGCATC
GAGGTCTGTG CTCAAACGAG TTTCGAGAAG AGCCTCCAGG CCGAGTTTCT CCTCTCCCGC
TTCGAAGTGG AGGGGCGTCC GCACCTCATG TTCATCGCGG TCGACATAGC GGCGCGAAAG
GGGCTGGAGG AGGAGAAGCG GCAGATCCAG GCCCAGATGC TGCACGTGCA GAAGCTGGAG
AGCCTGGGGG TCCTGGCCGG CGGCATCGCG CACGACTTCA ACAACATCCT GATGGTGGTG
CTTGGCAACG CGGACCTTGC CTTGATGCGA CTTCCCGAAG GGACGCCGGC GCGGGAAAAC
CTGCAGCAGA TCGAGCAGGC CGCGAGCAGG GCGGCGGACT TGGCTCAGCA GATGCTGGCC
TACTCCGGTA GAGGAAACTT CGTGATCGAG AAGCTGGATC TGGCCCAGAC GGTCAAGGAG
ATGGCCCAGA TGCTGGAGAT CTCCATCTCC AAGAAGACGC GGTTGCATTA CGATTTCGCC
CCAGACCTCC CGGCCATCAG CGGCGACGCC ACGCAACTGC GCCAGGTGAT CCTGAACCTG
GTGTTGAACG CCTCAGAGGC GATCGGCGAC AACATCGGCG TGATAGGGAT CAGGACAGGC
TACCTGGAGT GCGACCGCGC CTACCTTTCC GAAACGTGGA TCGACGACCG CCTCCCCGAG
GGCCCCTACC TGATGCTGGA GATCTCCGAT AACGGTTGCG GCATCAAGAA AGAGATCATC
CCCAAGATCT TCGACCCCTT CTTCACCACC AAGTTCACCG GCCGCGGGCT CGGCATGGCT
GCAGTCCTAG GCATCGTGCG CTCCCATAAC GGCGCCATCA AGATCTACAG CGAGGAAGGG
AAGGGGAGCA CCTTCAGGCT GCTTCTTCCG TGTATGTCGG CGCAGGGGGA GGCCTTGCAA
CCGACGGCGG AGGAGGATCT TTGGCGCGGC AGCGGTACGG TGCTTCTGGC CGACGACGAG
GAATCGATCA GGGCCCTTGG GCAGGAGATG CTGGAGACGC TGGGATTCCG GGTGCTGACC
GCCTGCGACG GGTGCGCGGC GGTCGAACTC TTCAGGGAAA AACGGGAGGA GATCGCCTGC
GTGGTCCTGG ACCTTACCAT GCCCGAGTTG GACGGCGAGC AGGCCTTTAA CGTACTGCGC
GAGCTGGATC CGGGGGCGAG GGTGATCCTT TCCAGCGGCT ACAACGAGCG GGAGGTGAGC
CGGAAATTCG CCGGGGCCGG GGTTTCCGGG TTCATGCAGA AGCCGTACAA GCTGGCGGAG
ATGAGCAGGA AGCTGCGGCA GATACTGGAG CCGGGGGGGG GCTCACAGCA GGCGGCAGAG
CTGGGCGGCT AG
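
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch, assuming the GC content field is the fraction of G and C bases over the full gene sequence; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence, used as sample input.
first_line = "ATGAACGCTG CAACATCGCA GAGAAACATC AGGCAATCCG GCAGGAACCT GATGGTCTGG "
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing all displayed lines of the gene sequence to `clean_sequence` instead of a single line would give the value to compare against the GC content field.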
 
Protein sequence
MNAATSQRNI RQSGRNLMVW TASVVLLVNL FLALTGFFSL WQSRETYLNN AQVQSDNLLQ 
ALSYSIAGAL ESADIALLSM VDEVQQELQQ GPVEEGRLNR LLAGQHSRLP LLDSMRMADA
QGDIRYGTGL QAGALKNVSQ RSYFKKLAAD PKAGLVISEP ILGLISGKRV IILARRVERR
GGAFAGVVYC AIDLERVRGM LQPMQVGSHG KITLTDATLA TIVRHPQGTD AGDAPGEKGA
SGEIWDQIAR GRESGSYFGE SDAWGSSRLA SFRKIGRFPL YLSLELAPED FLARWRGELL
QISTMVCVFI LVTLILSRLI YLRCRRQAAA EDELTQAKEE LELRVAERTA ELHLANLKLT
TELAERERAE KREREGRNML AQIIDTIPQY VFWKDRQSIY EGCNAVFAKG AGMAHSDEIR
GKSDFDLPWL REESEAYRSD DCVVMEQKRA KFHIIEQQLQ AGGKRLWVDT TKVPLLNEQG
EVTGILGVYE DISERKAVEE SRDKALALVE SLLAASPTGI LVYEGASGCC VMANQAMAAM
VGVTRQQMPA LNFREIGLWR ETGILQLAEQ VLVDGRTRSI EVCAQTSFEK SLQAEFLLSR
FEVEGRPHLM FIAVDIAARK GLEEEKRQIQ AQMLHVQKLE SLGVLAGGIA HDFNNILMVV
LGNADLALMR LPEGTPAREN LQQIEQAASR AADLAQQMLA YSGRGNFVIE KLDLAQTVKE
MAQMLEISIS KKTRLHYDFA PDLPAISGDA TQLRQVILNL VLNASEAIGD NIGVIGIRTG
YLECDRAYLS ETWIDDRLPE GPYLMLEISD NGCGIKKEII PKIFDPFFTT KFTGRGLGMA
AVLGIVRSHN GAIKIYSEEG KGSTFRLLLP CMSAQGEALQ PTAEEDLWRG SGTVLLADDE
ESIRALGQEM LETLGFRVLT ACDGCAAVEL FREKREEIAC VVLDLTMPEL DGEQAFNVLR
ELDPGARVIL SSGYNEREVS RKFAGAGVSG FMQKPYKLAE MSRKLRQILE PGGGSQQAAE
LGG