Gene Information |
Locus tag | GM21_4130 |
Symbol | |
ID | 8139504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4719767 |
End bp | 4721485 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644871745 |
Product | histidine kinase |
Protein accession | YP_003023903 |
Protein GI | 253702714 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
| 

|
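The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the record uses 1-based, inclusive coordinates and that the gene length includes the stop codon. Both are assumptions about the database's conventions, although the listed values are consistent with them. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 4719767
END_BP = 4721485


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp")  # 1719 bp
print(length_aa, "aa")  # 572 aa
```

Both results match the Gene Length and Protein Length rows. The strand (`-`) does not affect the length, only the direction in which the sequence is read.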
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 0.964997 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATCA AAACCCTTTT GCTGCTGGCC GCTCTCACCC TCATCGTCTC GTCGGCGCAC GCCGAACCGG CACCCGGCCG CGACCTCAGG GTCATCGTGG TGGGGGGCAA CAGCAACTAT CCTCCGTACC AGTTCCTGGA CGAAAACGGC GAACCTCGCG GTTTCATAGT CGACCTGACC CGGGCCATCG CCAGGGTGAT GGGGATGCGG ATAGAGATCC GGCTTGACGA TTTCGGCAAG ATTCTCAAGG AGCTCGACAG CGGCGAGGTC GACATGCTGG AAGGGCTCTC GTATTCCGAT GCGAGGGCCC GCGAATATGA TTTCTCTACC CCGCATTCCA TCATAGTTCA GGCCATCTTC GCCCGAAAAG GGACGCCGGC GGTCAAAAGC CTGGAAGAGC TCAAGGGGAA AAAGGTGCTG GTGCATAGCG GCGGCGGGAT GCACAGCTAC TTGCAGGAAA AGCGTTATGG CGCGGACCTG GTGCTGACCG GCAGCCCGCG CGAGACACTG CGGCAACTGG CGGCGGGACG CTGCGACTAC GCCGTCGTGG CCCTGCTCCC CGCCATGTAC ATCATCCGCG AAGAGAAGTT TTCGAACCTA GTGCCGGTGG CGACCAACGT GGCGCCGCAG CGGTACTACT GCTACGCGGT GAAAAAGGGT AATGCGGAGC TGGTGGCCCA GATGAACGAA GGGCTTTCCA TATTAAAGAA AACAGGGGAG TTCAACGAGA TCTACGACAG GTGGATCGGG GTGCTGGAGC CGCAGCGCAC CTCCTGGCTC GTCGTGGCCA AGTACGCGGC GCTGGTCGTG ATCCCCCTTT CCCTGGTCCT GTTGGGTACG GTGCTCTGGT CCTACTCGCT GCGCAGGCAG GTGGCGCAGC GCACCGAGTC GCTCTCCAGC GCCTTGGCCG AGTTGCAGAG AAACCAGCAG CAGCTGGTGC AGGCGGACAA GATGGCCGCG CTGGGTATCC TGGTCTCCGG GGTGGCCCAC GAGATCAACA ACCCTACCGG AATCATCCTG ATGAACATGC CCACCCTGAA GAAGATTTTC CGCGACGCGG AGCGGATCCT CGACCGGTAC CAGGAGCAAG AGGGGGAGCT TACCCTGGGG GGGATCCGAT ACCAGAGGGT GCGGCAGGAG GTTCCGCTGA TGCTGGACGA GATGCAGGAC GGCGCCGAGC GGATCAAGAA GACCGTCGAC GATCTGAAGA ACTTCGCCCG CAAGGACGAC GAGGCACGCA AGGAGCTGCT GGATTTCAAC CAGGTGGTGC AGACGGCGGT ACGCCTGGTG GACGTCGCCA CCCGCAAGTT CACCAACGAT TTCAGCGTCA GTTATGCAGA AGCGTTGCCT GCCGTGTTCG GCAACGAGCA GCGGCTGGAG CAGGTGGTGG TGAACCTGGT CATGAACGCC GGGCAGGCCC TCCCCGACCC CAACCGCGCC ATCGCCCTTG AGACAAGGTA CGACGCGGGG AGCGGCAGGG TGCTGCTCAC GGTCAGCGAC CAGGGGAGCG GTATCTCGCC GGAGCACCTG AAGCACCTCA CGGACCCTTT CTTCACCACC AAGCGCGAGA GCGGGGGAAC CGGGCTGGGG CTCTCCATCT CCGCCAACAT AATCAAGGAC CACGGCGGCG AAATCGCCTT CGATTCCAGG CTCGGGGAGG GGACCACGGT CACCCTTTCC CTGCCGGGGG CCGTGGCAGG GAGCAAAAAT GGACAGTGA
|
Protein sequence | MSIKTLLLLA ALTLIVSSAH AEPAPGRDLR VIVVGGNSNY PPYQFLDENG EPRGFIVDLT RAIARVMGMR IEIRLDDFGK ILKELDSGEV DMLEGLSYSD ARAREYDFST PHSIIVQAIF ARKGTPAVKS LEELKGKKVL VHSGGGMHSY LQEKRYGADL VLTGSPRETL RQLAAGRCDY AVVALLPAMY IIREEKFSNL VPVATNVAPQ RYYCYAVKKG NAELVAQMNE GLSILKKTGE FNEIYDRWIG VLEPQRTSWL VVAKYAALVV IPLSLVLLGT VLWSYSLRRQ VAQRTESLSS ALAELQRNQQ QLVQADKMAA LGILVSGVAH EINNPTGIIL MNMPTLKKIF RDAERILDRY QEQEGELTLG GIRYQRVRQE VPLMLDEMQD GAERIKKTVD DLKNFARKDD EARKELLDFN QVVQTAVRLV DVATRKFTND FSVSYAEALP AVFGNEQRLE QVVVNLVMNA GQALPDPNRA IALETRYDAG SGRVLLTVSD QGSGISPEHL KHLTDPFFTT KRESGGTGLG LSISANIIKD HGGEIAFDSR LGEGTTVTLS LPGAVAGSKN GQ
|
| |
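The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the compact NCBI layout, in which translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons. Because this gene starts with ATG, alternative starts are not handled here. The helper names `translate` and `gc_fraction` are hypothetical, and only the first 30 and last 9 bases of the gene are used as input.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
# (same assignments in NCBI translation tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a sequence."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


first_30_bases = "ATGTCCATCA AAACCCTTTT GCTGCTGGCC"
last_9_bases = "GGACAGTGA"
print(translate(first_30_bases))  # MSIKTLLLLA
print(translate(last_9_bases))    # GQ
```

The two outputs match the first ten and last two residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content row.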