Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1302 |
Symbol | |
ID | 8136629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1528066 |
End bp | 1529856 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868916 |
Product | histidine kinase |
Protein accession | YP_003021120 |
Protein GI | 253699931 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0000000000000333005 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGAAAAC TCCGTCAATC CGTATCCCTT TTCTTCGCCG GCTCCGCCTT GCCGTGCCGG CGGCAGCCTC TGCTCGTCAT AGCCCAACTG CTCCTTTGGG CCCTGCTCGT CGTCTCCATC TGCGGCGCAG CCGAATCGGC CGCAGGCGTC AGGCCGGTCA TCGTCGGCGG CGACCGCGAC TACCCCCCTT ATGAATTCAT CGACAAATCG GGCCATCCTG CCGGCTACAA CGTCGACCTC ACCCGGGCCA TAGCCGACGT CATGGGGATG AAGGTCGAGT TTCGCCTGGG CGGCTGGGCC GGGATGCGCA GCGCCCTGCA AAGCGGGAAG GTGGACGTGC TGCAGGGGAT GTCGTACTCG CTGGAGCGTT CCTCGGAAGT CGACTTCTCC GTCCCCCATA CCGTCGTCAA CCATGCCGTC TTCGCCCGCA AGGAATCCCC TTATCTCGCC TCCCTGACCG GTCTGAAGGG GAAGACCGTG GCGGTGCATC GCGGCGGGAT CATGCACGAC TACCTGGTCC GGCAAGGTGT GGGGGCTAAG CTGACCCTGA CCGAGACCCC GGCCGACGCC CTCAGGATGG TCGCTTCCGG CCGGACCGAG TTCGCCGTGG TCGCCATCGT TCCCGGGATG TACATGATCC GGGAGTTGAA GCTCTCCAAC CTGGTCCCGG TGCTGCGCAA CGTCGCCACC CATCGCTACT GCTATGCGGT CAAGAATGGG AACGTCGAGC TTTTGTCGCG CTTCAACGAG GGGCTGGCGA TACTGAAGAA GACCGGCCAA TACGACGTCA TTCACAACCG GTGGCTGGGG GTCGTGGAAC CGCAGTTGAT AGACTGGTGG ACCTTCGTGA AATATGCGGC GGTCGTGGTG GTGCCGCTGG TGCTCCTTCT GGGGGGCTTC GCCCTTTGGT CCCGCACCTT GCACCGGCAG GTCGCCCTGC GCACGGCGGA CCTCACCCGG GAGATCGCCG AAAGGCGCCA GGTCGAGGAG GAACTGCGCC TGAACCAGCA GCAACTGGTG CAGGCGGACA AGATGGCGGC CCTGGGGGTG CTGGTTTCCG GCGTCGCCCA TGAGATAAAC AACCCGACCG GACTCATCCT TCTGGAGGTC CCGATCCTGA AGCGGTTCCA TGCGGACTCG GTGAAGATCC TGGAGCGCTA TTACGAGGAG AACGGCGACT TCACCTGCGG CGGGCTCCCC TATTCACGGA TGCGCCAGGA GATTCCCCGG TCCCTGGAGA AGATTCAGGA CGCCGGCAAG CGCATCAAGC GGATCGTGGC GGATCTGAAA GACTTCGCCC GCCGCGACGA AACCGATTGC AACGAAATCC TTGATCTGAA CGCGGCGGCT AAGGCCGCGG TACGCCTGGC CGAGCCGACC ATAAACAAGG CGACCACCCG CTTCAGCGCC GAGTATCGCA AGTCGCTGCC GCGCATCCGG GGGAACCGCC AGCGCATCGA GCAGGTGCTG GTCAACCTGA TCCTCAACGC CTGCCAGGCG CTTCCCGACC CGGAGCGGGC CATCGAGCTG ATGACCTGGC ACGACGCTTT CCGGGATCAG GTGGTCCTAC GGCTGCGGGA CGAGGGGACC GGCATCGCTC CAGAGCACCT GTCGCGCCTG ACCGATCCGT TTTTCACCAC GAAGCAGGAC CAGGGGGGGA CCGGTCTCGG GCTTTCCGTC TCGGCTGGGA TAGTCAAGGA GCATGGCGGG ACCCTGCAAT TCGAGTCGAA CGGGGAGGGG GCCACGGTCA CCCTGACCTT GCCGGTGTAC CACGAGGAGA ACAACGGATG A
|
Protein sequence | MGKLRQSVSL FFAGSALPCR RQPLLVIAQL LLWALLVVSI CGAAESAAGV RPVIVGGDRD YPPYEFIDKS GHPAGYNVDL TRAIADVMGM KVEFRLGGWA GMRSALQSGK VDVLQGMSYS LERSSEVDFS VPHTVVNHAV FARKESPYLA SLTGLKGKTV AVHRGGIMHD YLVRQGVGAK LTLTETPADA LRMVASGRTE FAVVAIVPGM YMIRELKLSN LVPVLRNVAT HRYCYAVKNG NVELLSRFNE GLAILKKTGQ YDVIHNRWLG VVEPQLIDWW TFVKYAAVVV VPLVLLLGGF ALWSRTLHRQ VALRTADLTR EIAERRQVEE ELRLNQQQLV QADKMAALGV LVSGVAHEIN NPTGLILLEV PILKRFHADS VKILERYYEE NGDFTCGGLP YSRMRQEIPR SLEKIQDAGK RIKRIVADLK DFARRDETDC NEILDLNAAA KAAVRLAEPT INKATTRFSA EYRKSLPRIR GNRQRIEQVL VNLILNACQA LPDPERAIEL MTWHDAFRDQ VVLRLRDEGT GIAPEHLSRL TDPFFTTKQD QGGTGLGLSV SAGIVKEHGG TLQFESNGEG ATVTLTLPVY HEENNG
|
| |