Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_4097 |
Symbol | |
ID | 8139471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4677434 |
End bp | 4679734 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871712 |
Product | GAF sensor signal transduction histidine kinase |
Protein accession | YP_003023870 |
Protein GI | 253702681 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 135 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGGCC GGAAGGGACA AAACCTCCTG GAGCAAGCAT TAACTATTGC CCAGGCCTCA GGTCGGTCTC ATCAGGTTCG GCTCAACAGC CTGCTTCGCC TGGCGGTCCG TGGGCACTCC CTCGCCTCGG CCACCATCTA TCTCCCCGAT CCCAAAGGGG CGGCGTTGCA GCGTCGCTTC AGCACGCTTG CCACCCCCTC CGGCCATAGC TGCCACATCC CTTACGGCGC AGGTCTCGCC GGGCGCGTCG CCGCCACCCT TTCACCCCAA TCCAGCAGTA CCGTTTGGCT GCATGGCGAC GAGCCCTTCT CCGGTGACGG TTCCACGATA GCGCTTCCCC TTTTGGACGG GGACCGGCTC TTCGCCGTGC TCGCGCTGGA GAGCGGCGCC GCCGACGTCG CCCAGGAAGC CGTCGACGCG GCCGGCATCC TGGCGCCGGT CTTCTCCCTG ACCGTAACCG GCCTGGCCGC AGCCGAAGAG GCCGAGGAGG CACGTCGCAA CCTGTCGCTA CTTTCGGCGC TGGCCAAGCT CTTGAGCTCG CCGCAGCCGC GCGGGGTATT GCTGCACCGG TTGATGCAGC TTTGTACCGG TTCCGGCCTC TCCAGTTGCG CCATCGTCCG CCTGAAGCAA AGAAACTCCG GCAAGGAGAG GGTGATCCGG AGCTGCCGCA GGGGAATGGG CGACAAGCTC CCCGACCTGC TGGAAAAGGA AGCCGCGCTT GCGGTTCATG TCTGCGCCAC CGAGGCCACC TGCGCCGAGG AACTCGGAGT CGACTCCAGC TACCGGTACG CCCTCTGCAC CCCGCTGGGA AGCAACGGTG CCGCGCTCGG GACCATGACC CTCTTCGGGG GTCCCGAACT GACCGCGCCC AAGCAGATCG AGCTTGCCGA AACGGTGGCG CGCCTTTTGT CCGGCGCCAT GGCCGAGGCG ATCTGCAAGG AGCAGATCAA GACCTACGAC AGCGAGAACG AGAAGAAGCT GAAAGAGCTC TCCCTTCTCT ACAGGATGAG CAACACCATG CTCTCCACCA TTCAGCTGAA CAAGCTGATC CACCTCACCC TGACCGCGCT CACCTCAGGT CCCACCCCCT TCTTCGACCG GGCCATGCTC TTTCTCACCA ACGAGCGCTC CGGCATGCTC CTCGGCATGC TGGGGGTGAC CACGGAAACC TCCCCTTCCC TTTCAACCCA AAATGGAGGG AGCGACGACG TCCTCTCCAG CCGGTGGGAC ATCTCAGACG ACGAGATGGC CGCCCAGCGC AACTCCGAGT TCTGCCGCCA GGTACAGGGA AGGCGACTCG AACTGGACGG CACGCTCAAC ATCGCTTCCC AGGCCGTGCT GGAGAAGAGG CTGATCTACA TCCCGGAAGA AGAAGGGTTC GACGGCGGCG CGCTCCATTC CGGCCGCAGC GCCCTGGCGG CCTCTCCGCT CATCGCGCAT GGGCAGGCGG TAGGGGCGGT ACTGGTGGAC AACGCCCTCA CACATAAGCC GATCAACCAG GAGCACCTGC GTTTCCTGCA GCTCTTCACC AACCAGGCGG GGATGGCCAT CGAGAACTCG ATGCTCTACA ACAAGATCGA GGACGCGAAC CGGCAGTTGA GCGAGGCGCA GGAGCACCTG CTCCAGAAGG AGCGGCTCGC CGCCATAGGC GAGATGGCCG CCGGCATCGC GCACGAGTTG AAGGGGCCGC TGGTCTCCAT CGGCGGCTTC GCCGGCAGGC TCGCGAAAAA GCTCCCCCAG GAGACCAGCG AGTGGGCCCA TGCCGACCTC ATCGTGCGCG AAGTGCTCCG GTTGGAGGGG ATCCTCTCCG AGATCCTGCT CTTTTCGAAG AAGACAACCA TCTGTTACAC CCGGTGCGAC TTATCCGAGA TCGTGAAGGA GTCGCTCGCC GTGGTCACCC CTCCCCTGGA GGAGAAGCGG ATCAGCGTGA ACGCCAAATT CCCGCGGCAA AAGCTCGTGC TTTTGGGCGA CGGGCAGCAG TTGAAGCAGG TTTTCATCAA CATCATCCTG AACGCCCTCG ACGCCATGGG GACCGGCGGA ACGCTGAACA TCCAGGTCTT GGCGGCGGAA ATGGACGGCA AGGAAGCCGT CCAGGTGAAG ATATCCGACA CCGGCGGCGG CATACCGCTT GAGTCCCTGC ACAGCATCTT CACCCCGTTC TTCACCACCA AAGGAAGCGG CACCGGCCTC GGGCTCCCCA TCGCCAACCG CATCATAACC AACCACGGCG GGAAGATCCA AGTCACCAAC CACCCCGGCC TCGGGGTCGA GTTCAGGGTC ATCCTGCCGA AACACTGGTG A
|
Protein sequence | MAGRKGQNLL EQALTIAQAS GRSHQVRLNS LLRLAVRGHS LASATIYLPD PKGAALQRRF STLATPSGHS CHIPYGAGLA GRVAATLSPQ SSSTVWLHGD EPFSGDGSTI ALPLLDGDRL FAVLALESGA ADVAQEAVDA AGILAPVFSL TVTGLAAAEE AEEARRNLSL LSALAKLLSS PQPRGVLLHR LMQLCTGSGL SSCAIVRLKQ RNSGKERVIR SCRRGMGDKL PDLLEKEAAL AVHVCATEAT CAEELGVDSS YRYALCTPLG SNGAALGTMT LFGGPELTAP KQIELAETVA RLLSGAMAEA ICKEQIKTYD SENEKKLKEL SLLYRMSNTM LSTIQLNKLI HLTLTALTSG PTPFFDRAML FLTNERSGML LGMLGVTTET SPSLSTQNGG SDDVLSSRWD ISDDEMAAQR NSEFCRQVQG RRLELDGTLN IASQAVLEKR LIYIPEEEGF DGGALHSGRS ALAASPLIAH GQAVGAVLVD NALTHKPINQ EHLRFLQLFT NQAGMAIENS MLYNKIEDAN RQLSEAQEHL LQKERLAAIG EMAAGIAHEL KGPLVSIGGF AGRLAKKLPQ ETSEWAHADL IVREVLRLEG ILSEILLFSK KTTICYTRCD LSEIVKESLA VVTPPLEEKR ISVNAKFPRQ KLVLLGDGQQ LKQVFINIIL NALDAMGTGG TLNIQVLAAE MDGKEAVQVK ISDTGGGIPL ESLHSIFTPF FTTKGSGTGL GLPIANRIIT NHGGKIQVTN HPGLGVEFRV ILPKHW
|
| |