Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1832 |
Symbol | |
ID | 8137163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2132349 |
End bp | 2134001 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644869443 |
Product | histidine kinase |
Protein accession | YP_003021643 |
Protein GI | 253700454 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.00156636 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGAAAA GAGTGATAAG GAAATTCCGG CTCTCCCCCA TCACCAAGAG TTTCCGGGTC CGCCTCTACT TGATCTTTAC CGGGACCATT GCCCTTTTGA CGGCGGCCTT CGTCTCCTTT TACGTCGTGA CCGAAAACAA TGCCTACCGC AGCACGCTTG AGCGCGAGGG GAGGCTGCTT GCCACCATAC TCTCGCAAAA CGCCCGCCTG CCGCTCTTCG CCGAAAACCG TGAGGCTCTT TCCGTGCTGG CCGAAGGGAC CTCGCGCCAA TCCTCCGTGG TTTCTGTCCT CATCAGCGAC CAGCAGGGAA GGGTGGCGGC CGAGGCACGC AAGGTAGAGG TCCCTCAGGG GGAGACCGTC AAGATGGAGG TGGAAATAAC CTCTCCCAGC TCGGTGCTGT CGCCTGAGTC GGTGCTCCTT GGACACCAGG AGACCGATAA GCAGCAGGTG ATCGGCAGGG TGCACCTGCT GCTCGACATG TCGGCGGTGC GGGAGCGGCT GGTGAACCTG GTCGCCGCCT CGCTTGCCAT CGGGACGCTT TTCTGGGTGG CCGTTTCGCT TTTGAGCTAT CAGGTGATCA AGCGGGTCAC CTCGTCCTTC AACATGCTGA TGGGGGGGGT CGAGGAGATA GGTTCGGGCA AGCTCTCGGC ACGGGTCGAC CTGGAAGGGG ACGACGAATT GGCGCGCGCA GCCAATGCCA TCAACGCCAT GGCCGCGTCC CTAGAATTGC GCGAGCTTGA GAACCTGGCC CTGCAGGAAG AGCTCCTGAA GGCTATGCAG CTCGAGGTGC AGGAGGAGAA AAAGCTGGTC ATGGCGCGGC TGATCCAGAC CAACAAGATG ACCTCGCTGG GGCTCCTTCT CTCCAGCATG GCCCATGAGA TCAACAACCC CAACGCCTCG ATCCGCTTCT CCGGTTACAT GATCGGGAAG ATGTGGAGCG ACGCGGTGCC GCTTTTAGAC CGCGTCCGTG AGGAGGAGGG GGATTTTTAC CTGGGAGGGA TCCCCTTCGA GAAGGCGCGC CAGGCGCTGA CTGAGAATGC CGGCAAGATC GTGGAGAACT CGGAGAGGAT CGCGCGGGTG GTGCAGGGGC TCAGGGACTA CGGGGTGGGG GGCGACGCCC ACCTGAAGCA GAAACTGGAG CTGAACGCCG CGGTGTCGGC GGCCCTGTCG GTGCTCGCCT GCCAGATCAA GAAGGACGTG CAGTTGAACA CCTCCCTCGG CACCGGGATT CCTGTTATCC CGGGAAGCCA GCAGCAGATC GAGCAGGTGA TCATCAACCT GATCGTGAAC GCCATGCAGG CCCTTGAAGA CGGGCGGGGG GAGGTGCATC TGACCACCCG CCATGACGCC CATAACGGCG AGGTGGTGGT GGAAGTAAGC GACAACGGTG TCGGCATCAA GCCGGAAACC ATGGAGCGCC TGTTCGAACC TTTCTACTCG ACCAAGTTGG ATCGGGGGGG AAGCGGCCTG GGGCTCTACA TCTCGCAATA CATCGTTGCC GAACACGGCG GCCGGTTGCA GCTTACCTCC GCCCCGGGCA AGGGGACATT GGCCCGCGTG GTGCTCCCGG CCGCGCCTGC CGCCTCAGTG CGCGGCATGG TCTCCGCCCA GAACGGTCAG CATGCCGCCG ATGCGCTCCA TCAAGTCGGT TAA
|
Protein sequence | MKKRVIRKFR LSPITKSFRV RLYLIFTGTI ALLTAAFVSF YVVTENNAYR STLEREGRLL ATILSQNARL PLFAENREAL SVLAEGTSRQ SSVVSVLISD QQGRVAAEAR KVEVPQGETV KMEVEITSPS SVLSPESVLL GHQETDKQQV IGRVHLLLDM SAVRERLVNL VAASLAIGTL FWVAVSLLSY QVIKRVTSSF NMLMGGVEEI GSGKLSARVD LEGDDELARA ANAINAMAAS LELRELENLA LQEELLKAMQ LEVQEEKKLV MARLIQTNKM TSLGLLLSSM AHEINNPNAS IRFSGYMIGK MWSDAVPLLD RVREEEGDFY LGGIPFEKAR QALTENAGKI VENSERIARV VQGLRDYGVG GDAHLKQKLE LNAAVSAALS VLACQIKKDV QLNTSLGTGI PVIPGSQQQI EQVIINLIVN AMQALEDGRG EVHLTTRHDA HNGEVVVEVS DNGVGIKPET MERLFEPFYS TKLDRGGSGL GLYISQYIVA EHGGRLQLTS APGKGTLARV VLPAAPAASV RGMVSAQNGQ HAADALHQVG
|
| |