Gene GM21_1033 details


Gene Information

Locus tag: GM21_1033
Symbol:
ID: 8136355
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1212061
End bp: 1215183
Gene Length: 3123 bp
Protein Length: 1040 aa
Translation table: 11
GC content: 61%
IMG OID: 644868644
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_003020852
Protein GI: 253699663
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
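
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1212061
end_bp = 1215183
gene_length_bp = 3123
protein_length_aa = 1040

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_aa = codons - 1

print(span_bp == gene_length_bp)         # True
print(remainder == 0)                    # True
print(expected_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree with one another: 3123 bp is 1041 codons, which gives 1040 amino acids plus a stop.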


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 97
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCCATT CGCTTCAAAA CTGGCAGACG ATGTTCGACA GCATGTCCCA GGGCATGTTC 
TGCCAGGACG CATACGGAAG ACCCGTCGAG GCGAATGCAT CGGCACTCTC CCTGCTGGGC
CTGGATCGCG AAGCGTTCAT GGCGCACACC CCGCAAAATC CGCGATGGCG CCTCCTCGCG
CCGGACGGAG GCGACCTCCC CCCTGAACAA CACCCGGCCT CCGTGGCGCT CAGTAAAGGA
GAGGCCGTGA CCGGCTTCAC CGCCGGCATC CATACCCCCC ACAATGACGC CACGATCTGG
GTAAAGATCG ATGCCATCCC GCTCCAGCAG GGACATGTCT GCGTCGTCCT GAACGACATC
ACGCAACAAA AGCTGACAAA GGAGCAGGAA CACCAGGCCG CGCTGCAGTA CGAGCTTCTT
GCCAACACCT CCATGGACGG CTTTTGGGTC ATCGACCTGG AAGGAAAGAT CCTCTCCGCG
AACGAAGCCG CCTGCCGCAT GTACGGCTAC AACCGCGACG AGTTCACCTT GATGTCTGTC
TACGAGATCG AGGCGCGGGA GGACCGCCGC GAGATCAAGG AGCATACCGA GAAGGTAGTG
GCGACGCGCT ACGACCGCTT CGAGACCGTG CATCGCAGGA AGGACGGATC CCTCATCGAG
GTGGAGGTGA GCACCGCCTT CATACCGGAG AGCGGGCGTT TCCTCACCTT TTTGCAGGAC
ATAACCAGCA AGAAGGTGGC GGAAAGGGCA CTGCAGCAAA GCGAACTGCG GTACCGGGCG
ATCGTGCAGA CCCAGGCCGA GTTCGTGGTG CGCTACCGCC GGGGGGGCTT CCTCACCTTC
GTCAACGACA CCTTCTGCAA ATACATGCAT ATGTCCAGCG AGGAACTGCT GGGCCGGAGC
CTGTACCCCT ATTTTTTCCT ACAGGACCGC GAGCACCTGA TCCGCACCGT GGAGTCGATG
GACACGGAGC ACCTGGAGCA GGTTCTGGAA ATAAGGGCCT GGCTGCCGGA CGGCCGCCTG
GTGTGGCAGA AATGGAGCAA CAGCGTCATT CTCGACGACG TTGGGCAAGT GGTGGAGTTC
CAGGCGACCG GCATGGACAT CACCCGCAGC AAGCACGCCG AAGAGAGCCT GCGCAAAAGC
GAGGAGAAGT ACCGTTCGCT GTTCGACAAC ATGCTAAACG GCTTCGCCTA CTGCAGGATG
ATCCTGGATT CCGATCTCCC CATGGACTTC GTCTTCATGG AAGTGAACCA GAGCTTCGAG
AAACTGACGG GGCTGCGCGG GGTAAAGGGG AAGCGGATGA GCGAGGTGCT GCCGGGGGTC
GGCAAGTCGG CCCCTCACCT TCTCGCCGCC TTCAGGCGCG TCGCCCTGAG CGCAGAACCC
GAGCAGGTTG AGTACTTCCT GTCCGCCATC AACGAGTGGC TCGCCGTCTC CGTGTACAGC
CCGGAGGCGG GGTGCTTCGT CGCGGTCTTC GACGTGATAA CCAAGCGCAA GAGGACCGAG
GAGTGCCTGG CATTCCTGGC CCAGGCGGTC TCAGAGCCGG GCGAGCAGTT CTTCCACCGG
CTGGCGAAAT TCCTGGCGCA AGCCCTCGAC ATGGAATTCA TCTGCATAGA CCAACTGGAG
GAGGGAAACC AGTACGCGCG CACCCTCGCG GTCTACTTCG ACGGCAGCTT CGAGGACAAC
ATCCGCTACA CCCTGCGGGA CACACCCTGC GGCGAAATGG TGGGAAACAG CGTCTGCTGT
TACCGCCAAG GGGTGCGCCA CCTCTTCCCG ACCGACACCC TGCTGCAGCA GATCAAGGCG
GAGAGCTACG TGGGGACGGT GCTTTGGGGG TCCAACGGCG TGCCGATAGG GTTGATCGCC
GCCATCGGCA GGAAGCCCTT GGGAAACCGG GACCTGCCCC AGGAAATCTT CCAGATGGTC
AGCCCGCGCG CCGCCGCGGA GATGGAGCGC GGCCTGCACG AAGAGGAGCG GTTGAGGCTG
GAGCATCAGC TTTTGCACGC GCAGAAGCTG GAAAGCCTCG GCATCCTTGC CGGCGGCATC
GCGCACGATT TCAACAACAT CCTCACCGGC ATCCTTGGCA ACTCCAGCCT CGGGCTGATG
CGCATAGACC CCGACTCTCC CGCCGCCGAA AACCTGCAGA ACATCGAGAA GGCCGCAGTC
AGGGCGGCTG ACCTGGCCAA GCAGATGCTC GCCTACTCCG GCAAGGGGAT GTTCGTCGTG
GAACCGGTGA ACCTCAACCT GCTGCTGGAG GAGATGATTC ACCTTTTGGA AGTCTCGGTA
TCGAAAAAGG CCGAGCTGAA GCTCTCCCTG GCCCAGGAGC TCCCTCCGGT GCAGGCCGAC
CCGACGCAGT TGCGCCAGAT CGTGATGAAC CTGGTCATCA ACGCTTCGGA GGCCATCGGA
GAAGAGGGTG GGAGCATCAC GATCGGCACC GGATACCGGC ATTTCGATCA GAGCTACCTG
AAAGAGGCCT GGTTCGACTG CGAGCTTGTC GAAGGGGAAT TCGTCTTCCT GCAGGTGGCG
GATACCGGCT GCGGCATGGA CGAGAGCACG CGTTCGCGCA TCTTCGACCC CTTTTTCACC
ACCAAATTCA CCGGTCGCGG GCTCGGCATG TCCGCGGTCC TCGGTATCAT CAGGGGGCAC
AAGGGCGCCA TCAAGGTGCA AAGCAAGCCC GGAGAGGGAA CGACTTTCAC CGTGCTGCTG
CCGGCCAGCG ACCTCCCGGT GCCGGTGAAG GAAGCCGACC AGAAGATGGA CGACTGGCAG
GGGAGCGGAA CCATCCTTCT GGTCGACGAC GAGGAGACCA TCTGCGACAT CGGTGCGATG
ATGCTGGGAC AGCTGGGATA CGAGGTGGTG ACGGCACTTT CCGGCAGCGG CGCGCTGCAG
GCCTACCGGT CGCGGCCGGA TATAAAACTC GTGATCCTGG ACCTCACCAT GCCGCAGATG
GACGGCGAAC AGACCTTCGT CGCATTGAAA GCGTTGGACC CGGAGGTGAA GGTGATCATG
TCCAGCGGCT ACAGTGCTCA GGAGGTAACC GGGAAATTCA CCGGAACGGG TTTGCTCGAT
TTCATCCAGA AGCCGTACAG CATGCAGGCC CTTCTCGAAG TGATGAAGAG GTGCGACAGG
TAG
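
The gene sequence above is laid out in blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field; the helper names clean_sequence and gc_fraction are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for the 10-base block layout."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Sample input: the first line of the gene sequence, as printed above.
first_line = "ATGTTCCATT CGCTTCAAAA CTGGCAGACG ATGTTCGACA GCATGTCCCA GGGCATGTTC"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on that line
print(round(gc_fraction(seq), 2))  # GC fraction of that line only
```

To compare against the reported 61%, the full sequence would be passed through clean_sequence instead of a single line.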
 
Protein sequence
MFHSLQNWQT MFDSMSQGMF CQDAYGRPVE ANASALSLLG LDREAFMAHT PQNPRWRLLA 
PDGGDLPPEQ HPASVALSKG EAVTGFTAGI HTPHNDATIW VKIDAIPLQQ GHVCVVLNDI
TQQKLTKEQE HQAALQYELL ANTSMDGFWV IDLEGKILSA NEAACRMYGY NRDEFTLMSV
YEIEAREDRR EIKEHTEKVV ATRYDRFETV HRRKDGSLIE VEVSTAFIPE SGRFLTFLQD
ITSKKVAERA LQQSELRYRA IVQTQAEFVV RYRRGGFLTF VNDTFCKYMH MSSEELLGRS
LYPYFFLQDR EHLIRTVESM DTEHLEQVLE IRAWLPDGRL VWQKWSNSVI LDDVGQVVEF
QATGMDITRS KHAEESLRKS EEKYRSLFDN MLNGFAYCRM ILDSDLPMDF VFMEVNQSFE
KLTGLRGVKG KRMSEVLPGV GKSAPHLLAA FRRVALSAEP EQVEYFLSAI NEWLAVSVYS
PEAGCFVAVF DVITKRKRTE ECLAFLAQAV SEPGEQFFHR LAKFLAQALD MEFICIDQLE
EGNQYARTLA VYFDGSFEDN IRYTLRDTPC GEMVGNSVCC YRQGVRHLFP TDTLLQQIKA
ESYVGTVLWG SNGVPIGLIA AIGRKPLGNR DLPQEIFQMV SPRAAAEMER GLHEEERLRL
EHQLLHAQKL ESLGILAGGI AHDFNNILTG ILGNSSLGLM RIDPDSPAAE NLQNIEKAAV
RAADLAKQML AYSGKGMFVV EPVNLNLLLE EMIHLLEVSV SKKAELKLSL AQELPPVQAD
PTQLRQIVMN LVINASEAIG EEGGSITIGT GYRHFDQSYL KEAWFDCELV EGEFVFLQVA
DTGCGMDEST RSRIFDPFFT TKFTGRGLGM SAVLGIIRGH KGAIKVQSKP GEGTTFTVLL
PASDLPVPVK EADQKMDDWQ GSGTILLVDD EETICDIGAM MLGQLGYEVV TALSGSGALQ
AYRSRPDIKL VILDLTMPQM DGEQTFVALK ALDPEVKVIM SSGYSAQEVT GKFTGTGLLD
FIQKPYSMQA LLEVMKRCDR