Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1033 |
Symbol | |
ID | 8136355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1212061 |
End bp | 1215183 |
Gene Length | 3123 bp |
Protein Length | 1040 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644868644 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_003020852 |
Protein GI | 253699663 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 97 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCATT CGCTTCAAAA CTGGCAGACG ATGTTCGACA GCATGTCCCA GGGCATGTTC TGCCAGGACG CATACGGAAG ACCCGTCGAG GCGAATGCAT CGGCACTCTC CCTGCTGGGC CTGGATCGCG AAGCGTTCAT GGCGCACACC CCGCAAAATC CGCGATGGCG CCTCCTCGCG CCGGACGGAG GCGACCTCCC CCCTGAACAA CACCCGGCCT CCGTGGCGCT CAGTAAAGGA GAGGCCGTGA CCGGCTTCAC CGCCGGCATC CATACCCCCC ACAATGACGC CACGATCTGG GTAAAGATCG ATGCCATCCC GCTCCAGCAG GGACATGTCT GCGTCGTCCT GAACGACATC ACGCAACAAA AGCTGACAAA GGAGCAGGAA CACCAGGCCG CGCTGCAGTA CGAGCTTCTT GCCAACACCT CCATGGACGG CTTTTGGGTC ATCGACCTGG AAGGAAAGAT CCTCTCCGCG AACGAAGCCG CCTGCCGCAT GTACGGCTAC AACCGCGACG AGTTCACCTT GATGTCTGTC TACGAGATCG AGGCGCGGGA GGACCGCCGC GAGATCAAGG AGCATACCGA GAAGGTAGTG GCGACGCGCT ACGACCGCTT CGAGACCGTG CATCGCAGGA AGGACGGATC CCTCATCGAG GTGGAGGTGA GCACCGCCTT CATACCGGAG AGCGGGCGTT TCCTCACCTT TTTGCAGGAC ATAACCAGCA AGAAGGTGGC GGAAAGGGCA CTGCAGCAAA GCGAACTGCG GTACCGGGCG ATCGTGCAGA CCCAGGCCGA GTTCGTGGTG CGCTACCGCC GGGGGGGCTT CCTCACCTTC GTCAACGACA CCTTCTGCAA ATACATGCAT ATGTCCAGCG AGGAACTGCT GGGCCGGAGC CTGTACCCCT ATTTTTTCCT ACAGGACCGC GAGCACCTGA TCCGCACCGT GGAGTCGATG GACACGGAGC ACCTGGAGCA GGTTCTGGAA ATAAGGGCCT GGCTGCCGGA CGGCCGCCTG GTGTGGCAGA AATGGAGCAA CAGCGTCATT CTCGACGACG TTGGGCAAGT GGTGGAGTTC CAGGCGACCG GCATGGACAT CACCCGCAGC AAGCACGCCG AAGAGAGCCT GCGCAAAAGC GAGGAGAAGT ACCGTTCGCT GTTCGACAAC ATGCTAAACG GCTTCGCCTA CTGCAGGATG ATCCTGGATT CCGATCTCCC CATGGACTTC GTCTTCATGG AAGTGAACCA GAGCTTCGAG AAACTGACGG GGCTGCGCGG GGTAAAGGGG AAGCGGATGA GCGAGGTGCT GCCGGGGGTC GGCAAGTCGG CCCCTCACCT TCTCGCCGCC TTCAGGCGCG TCGCCCTGAG CGCAGAACCC GAGCAGGTTG AGTACTTCCT GTCCGCCATC AACGAGTGGC TCGCCGTCTC CGTGTACAGC CCGGAGGCGG GGTGCTTCGT CGCGGTCTTC GACGTGATAA CCAAGCGCAA GAGGACCGAG GAGTGCCTGG CATTCCTGGC CCAGGCGGTC TCAGAGCCGG GCGAGCAGTT CTTCCACCGG CTGGCGAAAT TCCTGGCGCA AGCCCTCGAC ATGGAATTCA TCTGCATAGA CCAACTGGAG GAGGGAAACC AGTACGCGCG CACCCTCGCG GTCTACTTCG ACGGCAGCTT CGAGGACAAC ATCCGCTACA CCCTGCGGGA CACACCCTGC GGCGAAATGG TGGGAAACAG CGTCTGCTGT TACCGCCAAG GGGTGCGCCA CCTCTTCCCG ACCGACACCC TGCTGCAGCA GATCAAGGCG GAGAGCTACG TGGGGACGGT GCTTTGGGGG TCCAACGGCG TGCCGATAGG GTTGATCGCC GCCATCGGCA GGAAGCCCTT GGGAAACCGG GACCTGCCCC AGGAAATCTT CCAGATGGTC AGCCCGCGCG CCGCCGCGGA GATGGAGCGC GGCCTGCACG AAGAGGAGCG GTTGAGGCTG GAGCATCAGC TTTTGCACGC GCAGAAGCTG GAAAGCCTCG GCATCCTTGC CGGCGGCATC GCGCACGATT TCAACAACAT CCTCACCGGC ATCCTTGGCA ACTCCAGCCT CGGGCTGATG CGCATAGACC CCGACTCTCC CGCCGCCGAA AACCTGCAGA ACATCGAGAA GGCCGCAGTC AGGGCGGCTG ACCTGGCCAA GCAGATGCTC GCCTACTCCG GCAAGGGGAT GTTCGTCGTG GAACCGGTGA ACCTCAACCT GCTGCTGGAG GAGATGATTC ACCTTTTGGA AGTCTCGGTA TCGAAAAAGG CCGAGCTGAA GCTCTCCCTG GCCCAGGAGC TCCCTCCGGT GCAGGCCGAC CCGACGCAGT TGCGCCAGAT CGTGATGAAC CTGGTCATCA ACGCTTCGGA GGCCATCGGA GAAGAGGGTG GGAGCATCAC GATCGGCACC GGATACCGGC ATTTCGATCA GAGCTACCTG AAAGAGGCCT GGTTCGACTG CGAGCTTGTC GAAGGGGAAT TCGTCTTCCT GCAGGTGGCG GATACCGGCT GCGGCATGGA CGAGAGCACG CGTTCGCGCA TCTTCGACCC CTTTTTCACC ACCAAATTCA CCGGTCGCGG GCTCGGCATG TCCGCGGTCC TCGGTATCAT CAGGGGGCAC AAGGGCGCCA TCAAGGTGCA AAGCAAGCCC GGAGAGGGAA CGACTTTCAC CGTGCTGCTG CCGGCCAGCG ACCTCCCGGT GCCGGTGAAG GAAGCCGACC AGAAGATGGA CGACTGGCAG GGGAGCGGAA CCATCCTTCT GGTCGACGAC GAGGAGACCA TCTGCGACAT CGGTGCGATG ATGCTGGGAC AGCTGGGATA CGAGGTGGTG ACGGCACTTT CCGGCAGCGG CGCGCTGCAG GCCTACCGGT CGCGGCCGGA TATAAAACTC GTGATCCTGG ACCTCACCAT GCCGCAGATG GACGGCGAAC AGACCTTCGT CGCATTGAAA GCGTTGGACC CGGAGGTGAA GGTGATCATG TCCAGCGGCT ACAGTGCTCA GGAGGTAACC GGGAAATTCA CCGGAACGGG TTTGCTCGAT TTCATCCAGA AGCCGTACAG CATGCAGGCC CTTCTCGAAG TGATGAAGAG GTGCGACAGG TAG
|
Protein sequence | MFHSLQNWQT MFDSMSQGMF CQDAYGRPVE ANASALSLLG LDREAFMAHT PQNPRWRLLA PDGGDLPPEQ HPASVALSKG EAVTGFTAGI HTPHNDATIW VKIDAIPLQQ GHVCVVLNDI TQQKLTKEQE HQAALQYELL ANTSMDGFWV IDLEGKILSA NEAACRMYGY NRDEFTLMSV YEIEAREDRR EIKEHTEKVV ATRYDRFETV HRRKDGSLIE VEVSTAFIPE SGRFLTFLQD ITSKKVAERA LQQSELRYRA IVQTQAEFVV RYRRGGFLTF VNDTFCKYMH MSSEELLGRS LYPYFFLQDR EHLIRTVESM DTEHLEQVLE IRAWLPDGRL VWQKWSNSVI LDDVGQVVEF QATGMDITRS KHAEESLRKS EEKYRSLFDN MLNGFAYCRM ILDSDLPMDF VFMEVNQSFE KLTGLRGVKG KRMSEVLPGV GKSAPHLLAA FRRVALSAEP EQVEYFLSAI NEWLAVSVYS PEAGCFVAVF DVITKRKRTE ECLAFLAQAV SEPGEQFFHR LAKFLAQALD MEFICIDQLE EGNQYARTLA VYFDGSFEDN IRYTLRDTPC GEMVGNSVCC YRQGVRHLFP TDTLLQQIKA ESYVGTVLWG SNGVPIGLIA AIGRKPLGNR DLPQEIFQMV SPRAAAEMER GLHEEERLRL EHQLLHAQKL ESLGILAGGI AHDFNNILTG ILGNSSLGLM RIDPDSPAAE NLQNIEKAAV RAADLAKQML AYSGKGMFVV EPVNLNLLLE EMIHLLEVSV SKKAELKLSL AQELPPVQAD PTQLRQIVMN LVINASEAIG EEGGSITIGT GYRHFDQSYL KEAWFDCELV EGEFVFLQVA DTGCGMDEST RSRIFDPFFT TKFTGRGLGM SAVLGIIRGH KGAIKVQSKP GEGTTFTVLL PASDLPVPVK EADQKMDDWQ GSGTILLVDD EETICDIGAM MLGQLGYEVV TALSGSGALQ AYRSRPDIKL VILDLTMPQM DGEQTFVALK ALDPEVKVIM SSGYSAQEVT GKFTGTGLLD FIQKPYSMQA LLEVMKRCDR
|
| |