Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1798 |
Symbol | |
ID | 8137129 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2089099 |
End bp | 2092170 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869410 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_003021610 |
Protein GI | 253700421 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 0.100128 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCTG CAACATCGCA GAGAAACATC AGGCAATCCG GCAGGAACCT GATGGTCTGG ACCGCCTCGG TGGTCCTGCT GGTAAACCTG TTCCTGGCGC TGACGGGTTT CTTCTCCCTC TGGCAGAGCA GGGAAACCTA CCTGAACAAC GCGCAGGTAC AAAGCGACAA CCTCTTGCAG GCGCTCTCCT ACAGCATCGC AGGGGCACTG GAGTCCGCGG ACATCGCGCT TTTGTCCATG GTGGACGAAG TGCAGCAGGA GCTGCAGCAG GGACCAGTCG AGGAGGGACG CCTGAACCGT TTGCTGGCGG GGCAGCATTC CCGGCTGCCG CTGCTGGACA GCATGCGGAT GGCCGATGCG CAAGGGGATA TCCGTTACGG CACCGGGTTG CAGGCGGGCG CCTTGAAAAA CGTCTCACAG AGGAGCTACT TCAAGAAACT GGCCGCCGAT CCGAAGGCGG GGCTCGTGAT TTCGGAACCG ATCCTGGGGC TCATCAGCGG GAAAAGGGTC ATTATCCTCG CCCGAAGGGT TGAGCGGCGC GGCGGCGCTT TTGCCGGCGT GGTGTACTGT GCCATCGATC TGGAGCGTGT CCGAGGCATG CTCCAGCCGA TGCAGGTCGG TTCCCACGGA AAAATTACCC TCACCGACGC AACGCTGGCC ACCATCGTGC GGCATCCGCA GGGGACGGAT GCCGGCGACG CCCCCGGGGA AAAGGGGGCC AGCGGCGAGA TTTGGGACCA AATCGCCCGG GGGCGCGAGA GTGGGAGCTA CTTTGGCGAA AGCGACGCTT GGGGCTCCTC CCGGCTCGCA TCCTTCCGAA AGATAGGCCG GTTCCCGCTG TACCTCTCCC TGGAGCTCGC GCCTGAAGAC TTCCTGGCCC GCTGGCGCGG CGAGTTGCTC CAGATCAGCA CCATGGTCTG CGTATTCATC CTGGTCACCC TGATCCTTTC CAGGCTCATC TACTTGAGGT GCCGCCGGCA GGCAGCGGCG GAAGACGAGC TGACTCAGGC CAAGGAGGAA CTGGAGCTGC GGGTGGCTGA GCGGACGGCT GAGCTGCATC TGGCAAACCT GAAGCTTACG ACCGAGCTAG CCGAGCGTGA GCGGGCTGAG AAGAGGGAGC GCGAAGGGCG CAACATGCTG GCGCAGATCA TCGACACCAT CCCGCAGTAC GTGTTCTGGA AGGACCGCCA GAGCATCTAC GAGGGATGCA ACGCCGTCTT TGCCAAGGGG GCCGGGATGG CGCACAGCGA CGAGATCAGG GGCAAGAGCG ACTTCGACCT CCCCTGGCTG CGTGAGGAGA GCGAGGCGTA CCGCAGCGAC GACTGCGTGG TCATGGAGCA AAAACGCGCC AAGTTCCACA TCATAGAGCA GCAACTGCAG GCGGGTGGGA AGCGTCTATG GGTGGATACC ACCAAGGTGC CGCTTCTGAA CGAGCAGGGA GAGGTGACCG GGATACTGGG AGTGTACGAG GATATCTCGG AGAGAAAGGC GGTCGAGGAG TCGCGCGACA AGGCGCTGGC GCTGGTTGAG TCACTTTTGG CCGCCTCGCC CACCGGCATC CTGGTCTACG AGGGGGCGAG CGGCTGCTGC GTCATGGCGA ACCAGGCGAT GGCCGCGATG GTGGGGGTGA CTCGGCAGCA GATGCCGGCC TTGAACTTCA GGGAGATCGG ACTCTGGCGG GAGACGGGGA TTCTCCAGCT TGCCGAGCAG GTCCTCGTCG ACGGGAGGAC ACGGAGCATC GAGGTCTGTG CTCAAACGAG TTTCGAGAAG AGCCTCCAGG CCGAGTTTCT CCTCTCCCGC TTCGAAGTGG AGGGGCGTCC GCACCTCATG TTCATCGCGG TCGACATAGC GGCGCGAAAG GGGCTGGAGG AGGAGAAGCG GCAGATCCAG GCCCAGATGC TGCACGTGCA GAAGCTGGAG AGCCTGGGGG TCCTGGCCGG CGGCATCGCG CACGACTTCA ACAACATCCT GATGGTGGTG CTTGGCAACG CGGACCTTGC CTTGATGCGA CTTCCCGAAG GGACGCCGGC GCGGGAAAAC CTGCAGCAGA TCGAGCAGGC CGCGAGCAGG GCGGCGGACT TGGCTCAGCA GATGCTGGCC TACTCCGGTA GAGGAAACTT CGTGATCGAG AAGCTGGATC TGGCCCAGAC GGTCAAGGAG ATGGCCCAGA TGCTGGAGAT CTCCATCTCC AAGAAGACGC GGTTGCATTA CGATTTCGCC CCAGACCTCC CGGCCATCAG CGGCGACGCC ACGCAACTGC GCCAGGTGAT CCTGAACCTG GTGTTGAACG CCTCAGAGGC GATCGGCGAC AACATCGGCG TGATAGGGAT CAGGACAGGC TACCTGGAGT GCGACCGCGC CTACCTTTCC GAAACGTGGA TCGACGACCG CCTCCCCGAG GGCCCCTACC TGATGCTGGA GATCTCCGAT AACGGTTGCG GCATCAAGAA AGAGATCATC CCCAAGATCT TCGACCCCTT CTTCACCACC AAGTTCACCG GCCGCGGGCT CGGCATGGCT GCAGTCCTAG GCATCGTGCG CTCCCATAAC GGCGCCATCA AGATCTACAG CGAGGAAGGG AAGGGGAGCA CCTTCAGGCT GCTTCTTCCG TGTATGTCGG CGCAGGGGGA GGCCTTGCAA CCGACGGCGG AGGAGGATCT TTGGCGCGGC AGCGGTACGG TGCTTCTGGC CGACGACGAG GAATCGATCA GGGCCCTTGG GCAGGAGATG CTGGAGACGC TGGGATTCCG GGTGCTGACC GCCTGCGACG GGTGCGCGGC GGTCGAACTC TTCAGGGAAA AACGGGAGGA GATCGCCTGC GTGGTCCTGG ACCTTACCAT GCCCGAGTTG GACGGCGAGC AGGCCTTTAA CGTACTGCGC GAGCTGGATC CGGGGGCGAG GGTGATCCTT TCCAGCGGCT ACAACGAGCG GGAGGTGAGC CGGAAATTCG CCGGGGCCGG GGTTTCCGGG TTCATGCAGA AGCCGTACAA GCTGGCGGAG ATGAGCAGGA AGCTGCGGCA GATACTGGAG CCGGGGGGGG GCTCACAGCA GGCGGCAGAG CTGGGCGGCT AG
|
Protein sequence | MNAATSQRNI RQSGRNLMVW TASVVLLVNL FLALTGFFSL WQSRETYLNN AQVQSDNLLQ ALSYSIAGAL ESADIALLSM VDEVQQELQQ GPVEEGRLNR LLAGQHSRLP LLDSMRMADA QGDIRYGTGL QAGALKNVSQ RSYFKKLAAD PKAGLVISEP ILGLISGKRV IILARRVERR GGAFAGVVYC AIDLERVRGM LQPMQVGSHG KITLTDATLA TIVRHPQGTD AGDAPGEKGA SGEIWDQIAR GRESGSYFGE SDAWGSSRLA SFRKIGRFPL YLSLELAPED FLARWRGELL QISTMVCVFI LVTLILSRLI YLRCRRQAAA EDELTQAKEE LELRVAERTA ELHLANLKLT TELAERERAE KREREGRNML AQIIDTIPQY VFWKDRQSIY EGCNAVFAKG AGMAHSDEIR GKSDFDLPWL REESEAYRSD DCVVMEQKRA KFHIIEQQLQ AGGKRLWVDT TKVPLLNEQG EVTGILGVYE DISERKAVEE SRDKALALVE SLLAASPTGI LVYEGASGCC VMANQAMAAM VGVTRQQMPA LNFREIGLWR ETGILQLAEQ VLVDGRTRSI EVCAQTSFEK SLQAEFLLSR FEVEGRPHLM FIAVDIAARK GLEEEKRQIQ AQMLHVQKLE SLGVLAGGIA HDFNNILMVV LGNADLALMR LPEGTPAREN LQQIEQAASR AADLAQQMLA YSGRGNFVIE KLDLAQTVKE MAQMLEISIS KKTRLHYDFA PDLPAISGDA TQLRQVILNL VLNASEAIGD NIGVIGIRTG YLECDRAYLS ETWIDDRLPE GPYLMLEISD NGCGIKKEII PKIFDPFFTT KFTGRGLGMA AVLGIVRSHN GAIKIYSEEG KGSTFRLLLP CMSAQGEALQ PTAEEDLWRG SGTVLLADDE ESIRALGQEM LETLGFRVLT ACDGCAAVEL FREKREEIAC VVLDLTMPEL DGEQAFNVLR ELDPGARVIL SSGYNEREVS RKFAGAGVSG FMQKPYKLAE MSRKLRQILE PGGGSQQAAE LGG
|
| |