Gene GM21_1812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1812 
Symbol 
ID8137143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2111537 
End bp2113858 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content61% 
IMG OID644869423 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_003021623 
Protein GI253700434 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.021882 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCACG CACTGCGTAT CCTGATCGTG GACGATTCCC CCGAAGATGC CGCACTCATC 
GTGCGTCAAC TGCAGAGGGA GTTCTCCGTA AGCTACGAGC GCGTCGAGAC CCCGGACGAA
ATGGAGTCGG CACTGGACAA GGGGGGATGG CACATCGTCA TCTCGGACTA CGTCATGCCG
AAGTTCAGCG GGCTCGCGGC GCTCAAGCTC CTTCACGACC GCGGGATGGA ACTCCCTTTC
ATCATGGTCT CCGGGCAGAT GGGCGAGGAC GCGGCCGTGG AAGCGATGCG CGCCGGCGCA
CATGACTACC TGCTGAAGGA CAGGCTCTCC AGGCTGATCC CCGCGATCAA GCGGGAATTG
AACGACGCCG TGGTGCGCCG CGAGAGGAGG ATGGCGGAGG AGGCCCTCTC GGCGACGGAG
GCGCGCTTCC ACAGTCTGGT GGAGCAGTCA CTGGTCGGCA TATTCATGCT GCAGGACGAC
ATCTTCATCT ACGTGAACCC AAAGTTCGGC GAGATCTTCG CCTACGAGCC GCAACAGCTG
ATCGAATCGA GATCTCTGCT GGAACTGGTC GCCCCCGAGG ATCAGATCAA GGTGATGACC
CGGTTCCTGC GCCCCCTCAT GGAGGGAAAC GACAGTCTGC ACTACTTCTT CCGCGGCAAG
CGCCGCGACG GGTCCCTGAT CGATCTCGAG GTGAACGGCA CCCGGACCAG GGTGAACGGC
AGCGCTGCCA TCATAGGCAC CCTGCTCGAC ATAACCGAGC GCAAGCACGC CGAGGCGGAA
CTGAGCAAGC TGTGGCGCGC CGTCGAGCAA AGCCCGGTAT CGGTGGTGAT AACCGATCTC
TTCGGCAGGA TCGAGTACGT GAACCCGAAG TTCATCGAGG TGACCGGCTA CACCGAGGAG
GAACTGATCG GGAAGAACCC TAACATCCTG AAATCCGGGA TGACCGACGC CAAGGTGTAC
GAGGAACTTT GGTCCACCAT AACCTCGGGC CGGGAGTGGC ACGGCGAGCT GCACAACAAG
AAAAAGAACG GGGAGCTCTT CTGGGAGAGC GGGCACATCT CCGCCATAAA GAACGCCGAG
GGGCAGATCA CCCATTTCGT CGGGGTCAAG GAGGATGTAA CCGAGAGGAA GCTTGCCATC
GAACAGTTGA GGCAGGTACA GAAGATGGAG GCCATAGGCC GGTTGGCAGG CGGCATCGCC
CACGACTTCA ACAACCTCCT CACCGTCATC AACGGTTACT CCACGCTCCT GGTTAGATCC
CTCGACAAGG GGTCGCCGAC ACACAAGGAA GCGGAGCAGA TCCTGCGCGC CGGCGAGCGG
GCCGCCGACC TTACCAGGCA GCTGTTGAGC TTCAGCAGAA GGCAGATCAT GGAGCCGAGG
GTGCTGAACA TCAACAAGCA AGTCAGAGCG GTACAGAAGA TGCTGGAACG GCTGATCGGG
GAGCACATCG GGCTCGTCAC CACCCTCTCC GAGGACGCCG GCTTCGTCAA GATGGACCCG
GGGCAGATGG AACAGATCGT CATGAACCTG ATCGTCAACG CCCGCGACGC CTCCGAGACC
GGCGCGGTGA TCGCCATGTC CACTGACAAC GTCGACCTGG ACCAGAATTT TTCGCACCTG
CACCCGGGGT CGGTGCCCGG AAGCTATGTC AGGCTGAGCG TGGCGGACCA GGGGCAAGGG
ATGACCGAGG AGGTGAAGCA GCACCTCTTC GAACCCTTCT TCACCACAAA GGAGATGGGT
CGCGGCACGG GCCTCGGTCT CGCCACCGTG TACGGCATCG TGAAGCAAAG CGGAGGTTAC
ATCGAGGTGG TCAGCGAGCC GGGACGGGGA GCCTGCTTCA ACATCTATCT CCCCCGCGTC
TCGGAGCCCG CTCCGGCGCC GCCCGCGCCG CCGGCCGACG AAGAGATAGA TTCCTCCCAC
GTCATACTCG TCGTGGAGGA CGAACCGGGG GTGCTCAACC TGGTGGTGCA CACCTTGCGC
ATGCGTGGTT TCACCGTCTT CGAAGCAACG GACCCCGACC AGGGGATCAC CCTTTTCGAG
GAGCACGCCC ATGAGATAGA CATGCTCCTC ACCGACGTGG TGATGCCGTT CATGAGCGGC
CCGGCCCTGG CGGAGCTGTT GATCTCCAAA AAACCCGGAT TGAAGGTGCT GTTCATGTCC
GGGCATACCG ACGACAGGGC CGGTTTCGAG AAGATATTGG AAAAAGGGAT GCAATTCCTG
CCCAAACCGT TTGCCAGCGA CGCACTGATC AGAAAGGTGA GAGACACGCT GAGCGAGGGG
GCGGCAAAGG CGGCAGGAGC GATTTCAGGA GGTAGCAATT GA
 
Protein sequence
MKHALRILIV DDSPEDAALI VRQLQREFSV SYERVETPDE MESALDKGGW HIVISDYVMP 
KFSGLAALKL LHDRGMELPF IMVSGQMGED AAVEAMRAGA HDYLLKDRLS RLIPAIKREL
NDAVVRRERR MAEEALSATE ARFHSLVEQS LVGIFMLQDD IFIYVNPKFG EIFAYEPQQL
IESRSLLELV APEDQIKVMT RFLRPLMEGN DSLHYFFRGK RRDGSLIDLE VNGTRTRVNG
SAAIIGTLLD ITERKHAEAE LSKLWRAVEQ SPVSVVITDL FGRIEYVNPK FIEVTGYTEE
ELIGKNPNIL KSGMTDAKVY EELWSTITSG REWHGELHNK KKNGELFWES GHISAIKNAE
GQITHFVGVK EDVTERKLAI EQLRQVQKME AIGRLAGGIA HDFNNLLTVI NGYSTLLVRS
LDKGSPTHKE AEQILRAGER AADLTRQLLS FSRRQIMEPR VLNINKQVRA VQKMLERLIG
EHIGLVTTLS EDAGFVKMDP GQMEQIVMNL IVNARDASET GAVIAMSTDN VDLDQNFSHL
HPGSVPGSYV RLSVADQGQG MTEEVKQHLF EPFFTTKEMG RGTGLGLATV YGIVKQSGGY
IEVVSEPGRG ACFNIYLPRV SEPAPAPPAP PADEEIDSSH VILVVEDEPG VLNLVVHTLR
MRGFTVFEAT DPDQGITLFE EHAHEIDMLL TDVVMPFMSG PALAELLISK KPGLKVLFMS
GHTDDRAGFE KILEKGMQFL PKPFASDALI RKVRDTLSEG AAKAAGAISG GSN