Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3902 |
Symbol | |
ID | 8139276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4487799 |
End bp | 4489205 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644871519 |
Product | histidine kinase |
Protein accession | YP_003023677 |
Protein GI | 253702488 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.000116044 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAACTCA ATACCAAGCT GGTGATGATC ATGCTCACCA TGCTTATCGT GGCGACGGCG ATGCTCTTCG TCCTGAACCA GTTCAGCCAG AACGACCTGG TGGGGGAGAT CCAGGAGAGT TCCACCGTGG TATCGAAAGC CATTCAGCTC AGCGTGGAAG ACCTGACCTC CGAGGTCGAA TCATCGCGCC TGACCGAGTA CCTCCAGCAG GCGAAGAGCA AAGGGCTCAA CGAGATCAAC ATCATCAACA ACGAGGGGGA GATCATCAAC TCCTCGGATC CCGCCCAAGT CGGCAAAAAA CGCGAAATCA ACAAGCTGGA GAAGGGGCTT CGGGCCTCGC GCCGCGGCGG CGGCGGGGGG CCGCTCAAGC CGTACGACCT GGTGGTGCCG GTCATCGTGG GCGACGAGCA GCTGGGTTAC GTGCAGGTGA ACCTCCTTCT CGACAACATA CGCGACATCC AGCACGCCAA CTTCGTCAAT CGTCTGGCCG CGACCACCAT GGTGTTCCTG ATGGGGATGA TACTGATCAT CTACCTGGCG CGCCGCTACA CCTCGCCGAT ACACCGGCTC GCCACCGGCG TCAAGCACGT CTCCGGGGGG GATCTGAGCG TCACCTTCCA GGTGGGGAGC GGCGACGAGA TCGGGGAACT GGCCGAGAAC CTGAACGAGA TGGTGGAGAA GCTGAAGGAA AAGGAGCAAC TCGAAAAGCG GCTCTACGAG GCGGAGCACC TCTCCAAGGT GGGGCAACTG GCCGCGGGGA TCGCGCACGA GATCAGGAAC CCGCTCAATT ACATAAGCCT CGCCATCGAC CACCTGAAGA GCGAATCCCT CCCCTCCTGC CCCGAAAAGG CCAAGGAGCT GGAGTCGATC GCCAACAACA TCAAGGAAGA GGTGCGCAAG GCGAACTACA TGGTGCTCAA TTTCATGAAT TACGGCCGAC CCTTGAAGCT GCGGCTGCAG CGGGTATGCT ACCCTGAGCT CGTGGACAAG GCGATGCAAC TCATGAAAGA TCGGCTCGAC GAGAGGGGGA TCGAAGTGGT GCGGGACATA CCCGAGTACC TGCCGCCGAT GCTGGCGGAC CCGGAGCTGA TGCGCAACTG CCTGTGCAAC TTCATCAGCA ACAGCACCCA GGCGATGCCG GAGGGGGGGA AGTTCACCAT CGGCGCGAGC ATCGCCCCCG AAACCGGCGA GTTCCGCCTC ACCTTCAGCG ACGAAGGGTC GGGGATCGAG CCGCAGGATC TGGAGAAGGT GTTTCAGCCC TACTTCACCA CCAAGGAGGC GGGGATCGGC CTAGGACTCG CCATCACCGA ACGGATCGTG AGGGAGCACG GCGGCGGCAT CGCGGTTCAG AGCACGAAAG GGGAAGGGAC CACCTTCTCG GTCACCCTCC CGGCGGCAAC GGCATAA
|
Protein sequence | MKLNTKLVMI MLTMLIVATA MLFVLNQFSQ NDLVGEIQES STVVSKAIQL SVEDLTSEVE SSRLTEYLQQ AKSKGLNEIN IINNEGEIIN SSDPAQVGKK REINKLEKGL RASRRGGGGG PLKPYDLVVP VIVGDEQLGY VQVNLLLDNI RDIQHANFVN RLAATTMVFL MGMILIIYLA RRYTSPIHRL ATGVKHVSGG DLSVTFQVGS GDEIGELAEN LNEMVEKLKE KEQLEKRLYE AEHLSKVGQL AAGIAHEIRN PLNYISLAID HLKSESLPSC PEKAKELESI ANNIKEEVRK ANYMVLNFMN YGRPLKLRLQ RVCYPELVDK AMQLMKDRLD ERGIEVVRDI PEYLPPMLAD PELMRNCLCN FISNSTQAMP EGGKFTIGAS IAPETGEFRL TFSDEGSGIE PQDLEKVFQP YFTTKEAGIG LGLAITERIV REHGGGIAVQ STKGEGTTFS VTLPAATA
|
| |