Gene Information |
Locus tag | GM21_1722 |
Symbol | |
ID | 8137053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2003326 |
End bp | 2007411 |
Gene Length | 4086 bp |
Protein Length | 1361 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644869334 |
Product | histidine kinase |
Protein accession | YP_003021534 |
Protein GI | 253700345 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
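The coordinate and length fields in the table above are related by simple arithmetic, which the sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state these conventions, but they are consistent with the values it gives. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced here for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for GM21_1722.
length = gene_length_bp(2003326, 2007411)
print(length)                     # 4086
print(protein_length_aa(length))  # 1361
```

Both results agree with the Gene Length (4086 bp) and Protein Length (1361 aa) rows of the record.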
Plasmid Coverage Information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 0.169999 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAT TGATTCAATT GCCGACGCTC CGGGCGCGGC TTGCTTTCTG GTTCACGGCG ATCTGCCTCG TGCCTTTGTT CCTGCTTACC ACCATCATCT ACACCCAGCG GGTCAAGGTG GCGCGGGCGC TGATATTCGA CAAGCTGAAC GCCGTCATCT CGCTGCGTCA GCAGCAGATC GGGAGCATCC TCGACAACAT GATAGTGGAC GCCTCCACTA TCGCCTCCGA CCACTCGGTG CGCGAAGCGG CGCAGCAGCT TCTGACCGGG CCCGGCCAGG GGCCTACCTT CAGCGACAGC CTGATCCTGC TGCGGGGTTA CCAGAGCAGC AACCTCTCCG TGGCCGACGT GTCGATAGTC TCCCCTGACG GGAGCATCGT CCTTTGCACC AGCGACAGCA GGAACGGGAC GCATGTGGCC CGGCCCGAGA TGGTGGCCAA TGCGCTGCAG GACCTGGCGC CGACCTTTGG AGAGGCCTAC CTCGCCAGCG ACGGCGTTCC CAGCCTCGAC GTCGCCATCC CCATCAGGGA GGGGGCGGGA TCGCGCGTCG GCGCCTTGCT GGTGGCGCGT TACAACCTGC ACACCCTGCT CTACGACATC ACCGCCAACC GCACGGGCAT GGGGGATACC GGGGAAACGG TGCTGGTGAA CCGGGAGATG ATGCCGCTAA ACGAGCTGCG GGCCCTGTCC GGGGGGATCC TGAAGGTGCC GCTCAAGGGG AAACCGGCGC AGATGGCGGT GCGCGGGGAA AGCGGCATCG CGCAGGCCAC CGACTATCGC GGAGTGGAAG TGCTTTCCGC CTTTGTCCAC ATACCCCGCA CCGGCTGGGG GCTCGTTTGC AAGCAGGACA CCAGCGAGGT ACTGGCGCCG ATCCGGAGCC TCCTCTTCAT TACCTACGGT CTGGCTGCGT TCATTTCGCT TCTGGCCTGG GTGCTTGCCT TAAGGCTCGC GCGCACCATA GCCGAACCGC TGGTGAACCT GGCGGAAAGC GCCAAGGAAC TGGAAGACGG GGACTTCGAC GCCCGGGCCG AGGCGGAGGG TACCGACGAG CAGAAGGCCC TCGCCGGCAG TTTCAACAGC ATGGCCCGGG CGCTGCAGAC GAAGATGGAG GCCCAGCAAG GGATGTCGCG GCTCTCCCAG AACCTGGTTG TCGCCGAGGG AATGGACGAT TTTTTCAGCA AACTCCTCCC CGTCTTTTTG GAGATCACCG GCGCCAAGAT GGCGGTCGCC TACGTGGAGG AGCGGCAGAC AGGGGAGTTC GCGCCCATGC ATGCCATCGG CCTTGATTCT TCCCTGCTGC GCCGGTTCAA ACGCAGCCAT CTTGAGGGAG AGCTGGGAGT GATCCTCTCC AGCGGCACCG TGTCCTGCTA TGCGCCGCAG GAAGCGACTC CCCTTCGCTT TGCTACCAGC TTCGGCGACA TCGTGCCCGC CGAGATAGCC ACCGTCCCGA TTTCGGTCGA CGCCGAGATC CGCGCCTTCG TTTCCATCGC CTCGGAGCGC CCCTTCCCGC CTAAGGTGCG GGAGATCCTG GAGCTGGTGC AGATGCCGCT GGGTTCGGCC TTCGCCAGGA TACTCTCCGG CGAGGACGTC CGGCTGCTGG CGAACGAACT GGCGCTCAAG AACGCGGAAC TGACCCAGCA GTCCGAGGAA CTGGTGCAGC AGTCCACCGA GCTGGCCCAC CAGTCGGAGG AGCTTTCCCG CCATAACCAG GTGCTGGATC TGCAAAAGCA GCAATTGGAA GAGGCGACCC GGCTTAAGAG CGAATTCCTC TCCAACATGA GCCATGAGTT GAGGACGCCG 
CTCAACTCGG TGCTGGCGCT CTCCCGTGTG CTGGCGGTGC AGGGGAGCTC TCGCCTCACC GAGGACGAGA AGGGGTACCT TTCCATCATC GAGAGAAACG GCAAGCACCT TTTGTCCCTG ATCAACGACA TCCTCGACCT GGCCAAGATC GAGTCCGGAC GCCAGGAGGT TTTCCTGGAG TCGGTCTCCC TGCGAAAGGT CGCGGCCGAA GTGGTGGACG GGCTTTCGGT GCTGGCCCGC GAAAAGGGGA TCGCGCTCTC GCTGGAGGCG CCTGAGGGGC TCCCCGCCGC GGTGATCGAC GTGAAAAGAC TGCGCCAGAT GCTGCAGAAC CTGATAGGCA ACGCGATCAA GTTCACCGCC GAGGGGTCGG TGACCGTGAG ACTCGACGCC GAGGGGGAAG AGTTCCTGAT CGAGGTGGAG GATACCGGCA TCGGCATCGC GCCGCAGTAC CTGGAATCCA TCTTCCACGA GTTCCGCCAG GCCGACGGCA GCACCTCTCG CTCCTTCGAG GGGACGGGGC TCGGGCTTGC CATCGTGAAG AAGACGGCGC GCCTTTTGGG AGGTGACGTA TCGGTGCGAA GCGAGCTCGG CGAAGGGTCC GTCTTCACCC TTAGGCTTCC TTGCGACGGC ACCGGGAGCG TGCAGGGACC GGCACCGGGT AACGCCGCTT TCCCCGCGGA ACGCGCGCAG GCTCCGTTTC AGGGGAGGTC GGTCCTGGTG GTGGATGACG ACCCCGAGGC GGTGTCGTTG ATCGCGTCGC ATCTCTCCCA GGCGGGTTTC GAAACCTTGA CAGCGCTAAA CGGCGCCGAC GCCCTGAGGC TCGCTCGCGC GAAGAAGCCT TTCGCCATCA CGCTCGACAT CATGATGCCG GACATGGACG GTTGGGAGGT GATGCGCGAG CTGAAGGACC AGCGCGAAAC AGCCGATATC CCCGTGGTGA TCGTGTCGCT CTCCGACGAC CGTGCGACGG GAATCGCTCT CGGCGCGGTC GGGGTGATAG CCAAACCGGT GAGCAGGGAA CAGCTCATGG AGACCTTCAC CAAGCTTGCC GGCGCGGGAT GCCGGCTGGT GCTGGTGGTC GACGACAGCG AACACGACCG TTTCTTCATC GCCTCGCTCT TCAAGGAGAT GGGGATGGAC GTGCTCTTGG CCGAGGACGG CGCGCAGGCC CTGGAACTCG CCGCGGCTGA TCAGCCCGAC CTGATCACGC TCGACCTGCT GATGCCGAGG ATGGACGGAG CGGCCGTCTT GGACAAGCTC AGGTCCAACC AGTCCACCGC GGCGATTCCG GTGGTGGTGA TAACCTCCAA GGATCTGAGC TCGACTGAGT TGGAGCGGCT CTCCTGCGGG GTGAGCGCCA TCCTTTCCAA GGGGGGGCTC GAGCGCGAGG TGGTGCTGGA CGAATTGGTG CACAGCCTGG AGCGGATCGG CTGGCGCCTT CCCTGCGAGC CAAGCGGAAA CGGGGCGAGG ATCCTCATCG TCGAGGACAG CGAGGCGGCC ACCATCCAGC TCCGCTTCGC GCTGGAATCG GCGGGGTTCT GCGTCGACGC GGTCTCAGGC GGCAGGCATG CGCTTTCCTA TCTGCAGACC CATGTTCCCG ACGGCATCAT CCTCGACCTG ATGATGCCTG AGGTGGACGG CTTCGACGTG TTGAAGGCGG TGCGCGTCTC GCGGCTGACC GAAAAGGTGC CGGTCATGGT GATGACCGCA AAGACCCTTT CGCCGGGCGA GCACGACCGG CTGCGCCTGC TGGAGGTGAA CCACCTGGTC CAGAAAGGGG ACGTGGAGCA GCACGAACTG CTGCGCAGAA 
TTTACGAGAT GCTCGGGGCG GTCTCCCTGT TCAAGCCGGA CACCGCGCCG GCGGCGACCG GCGCGGCGCT TTTCCCCGAG AGTCAGCGGG GGGACGGATC GGTGTTGGTG GTCGAGGACA ACCCGGACAA CCTGGTAACG CTTCGGGCCG TCTTGGGGGG GAGCTGCCGC CTGGTCGAGG CGACAGACGG CGAGTCCGGC CTTCTCCTGG CGCGGACTGG GTCCCCTTCG CTGATCCTTC TGGACATGCA TCTCCCCGGC ATGGACGGAC TCGCCGTACT CAGGCACCTG AAGCAGGATA ACGCCACGGC TCTCATCCCG GTGGTGGCCC TCACCGCGAG CGCCATGGCG GGGGACCGGG AGATGCTGAT GAAGGCGGGG TGCGTGGCGT ACGTCTCGAA GCCGTACCAG CCGGAAGAGC TGCAGGAACT GGTCATCAAA TTCGTCGCAA CGGGGACTGC GCCAAGCGGG AAGTAG
|
Protein sequence | MKKLIQLPTL RARLAFWFTA ICLVPLFLLT TIIYTQRVKV ARALIFDKLN AVISLRQQQI GSILDNMIVD ASTIASDHSV REAAQQLLTG PGQGPTFSDS LILLRGYQSS NLSVADVSIV SPDGSIVLCT SDSRNGTHVA RPEMVANALQ DLAPTFGEAY LASDGVPSLD VAIPIREGAG SRVGALLVAR YNLHTLLYDI TANRTGMGDT GETVLVNREM MPLNELRALS GGILKVPLKG KPAQMAVRGE SGIAQATDYR GVEVLSAFVH IPRTGWGLVC KQDTSEVLAP IRSLLFITYG LAAFISLLAW VLALRLARTI AEPLVNLAES AKELEDGDFD ARAEAEGTDE QKALAGSFNS MARALQTKME AQQGMSRLSQ NLVVAEGMDD FFSKLLPVFL EITGAKMAVA YVEERQTGEF APMHAIGLDS SLLRRFKRSH LEGELGVILS SGTVSCYAPQ EATPLRFATS FGDIVPAEIA TVPISVDAEI RAFVSIASER PFPPKVREIL ELVQMPLGSA FARILSGEDV RLLANELALK NAELTQQSEE LVQQSTELAH QSEELSRHNQ VLDLQKQQLE EATRLKSEFL SNMSHELRTP LNSVLALSRV LAVQGSSRLT EDEKGYLSII ERNGKHLLSL INDILDLAKI ESGRQEVFLE SVSLRKVAAE VVDGLSVLAR EKGIALSLEA PEGLPAAVID VKRLRQMLQN LIGNAIKFTA EGSVTVRLDA EGEEFLIEVE DTGIGIAPQY LESIFHEFRQ ADGSTSRSFE GTGLGLAIVK KTARLLGGDV SVRSELGEGS VFTLRLPCDG TGSVQGPAPG NAAFPAERAQ APFQGRSVLV VDDDPEAVSL IASHLSQAGF ETLTALNGAD ALRLARAKKP FAITLDIMMP DMDGWEVMRE LKDQRETADI PVVIVSLSDD RATGIALGAV GVIAKPVSRE QLMETFTKLA GAGCRLVLVV DDSEHDRFFI ASLFKEMGMD VLLAEDGAQA LELAAADQPD LITLDLLMPR MDGAAVLDKL RSNQSTAAIP VVVITSKDLS STELERLSCG VSAILSKGGL EREVVLDELV HSLERIGWRL PCEPSGNGAR ILIVEDSEAA TIQLRFALES AGFCVDAVSG GRHALSYLQT HVPDGIILDL MMPEVDGFDV LKAVRVSRLT EKVPVMVMTA KTLSPGEHDR LRLLEVNHLV QKGDVEQHEL LRRIYEMLGA VSLFKPDTAP AATGAALFPE SQRGDGSVLV VEDNPDNLVT LRAVLGGSCR LVEATDGESG LLLARTGSPS LILLDMHLPG MDGLAVLRHL KQDNATALIP VVALTASAMA GDREMLMKAG CVAYVSKPYQ PEELQELVIK FVATGTAPSG K
|
| |