Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1524 |
Symbol | |
ID | 8136853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1785693 |
End bp | 1787471 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644869136 |
Product | histidine kinase |
Protein accession | YP_003021338 |
Protein GI | 253700149 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 2.64718e-19 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATACCA GATTAAAATT TCCGCTCAGG TTCAAGATGC TCCTCTCCCA GTTGCTGGTG GTGTCGGTGG TGCTGAGCCT GATCACCTTC ACCATGGCGA ACCTGTTCCA GGTCGACAAG ACCGCCTACA TCCACGACCT CACCTCGACG GTGGTGCTGC ATACGGCGGA GGAGGCGAAC GCGCTTTTGG CCGGTTACCG GGAGCGGTTG AAGCTCTTCG GGCGCGTCCT GGCCGAGCCG GAGCTTTCGG GGCGGGACCA GGTGCTGCAA AGCTTTTTCG AGGAGTTCCG CGACTTCGTG CTGGTTACCC GCAGCGGCCC CGGGGGGGAA CAGACCGTCT ACGACGGCGC TGCGCTGCAG GCGGCCGGGG TGACCAAGGA GGAGGTCGTG GCGAACCTGC AGGCGCATCC CGCGCCGGAA TCGATTCCGG CGGGGCAGGT GTATCTGGTG AATTCCACCG TATCGCCCAA GCTTCCCACG CTCGCCCTCA CCATCTCCGA GCCCGCGGCG GGGGGGGCGC CGGTCATCAC GACGGCGGTG CTGCGCCTGG ACCGGTTGCA GGAGCTCGCC AAGCGTTCGC GGGTCTTTGA CATATTCTTA TTGGACTCGG CCGGGCGTTA CCTGGCGCAC AAGGCGCCGG GGCGGGTGGG GGTCGCTGCC AATCTCGAAT GGTGGAACCG GGTGAAGGCC CCGCGCAGCT CCGGGATGAC CATGGAGTAC AAAAATCTCG GCAAGGAGAT GGTGGCAGGA TTCTCGCGCT GCTCGCTGGG AGGGCTGGTG GTCGGGGTGG AGATACCCAA AAGCGCCGCC TACCTCACCT CGCGAGAACT TCTCAGCGAT CTCTTGCTCC TGTCGCTGGC GCTTTTGGGG GGGGCGGCCC TTTTGAGCCA GTTCTGGTCG CGGCATTTCA CGAGCCCCCT GGAGAAGCTC TCGGAGGCGA CCCGGATGGT GGGGCAGGGG CGCTTCGAGA TCGAGGTGAA GGCCGAATCA GGCGACGAGA TCGGCGCGCT GGCCCGCTCC TTCAACCAGA TGGCCGCCGA GTTGAAAGTG CGCGAGAAGG CCCTCAAGGA CCTCTACGGG CAATTGGTCC ACTCGGAGAA GATGGCGGCC TTTGGCGCCC TCGGCGCGGG GATCGCCCAC GAGGTGAAGA ACCCGCTGGC GGGTATACTC GGCATCACCC AGCTCTCGCT CAGGGGGGCG GGAGCCGGGC ACCCGCTGGA GAAGAATCTT CTGATCATCG AGAAGGAGAC CAAGCGCTGC AAGACCATCA TCGAGCACCT GCTCAAGTTC GCGCGCCAGG AGCAGGTCGA GTTCGGCGAG GTCGACCTGC AGCAGGTGGT GGCTGATGCC CTTGCCATCG TCGACCACCA GTTGGGGATC AACAGCATAA AAGTGGAGCA GGAACTGGAG CCGGGAATGC CGACCTGCCG CGGCAACGCG AACCAGTTGC AGCAGGTGCT GATGAACCTG ATGCTCAACG CGCAGCAGGC GATGAGCGGC AAGACCGGCA CGGTGAAGCT TTCCGCGCGC AGGCTGGAGC AGGGGGGGGT GGAATTGCGG GTGGCGGACA ACGGCCCCGG TATCAGCAAG GAGATCCAGG GGAAGATCTT CGATCCCTTC TTCACCACGA AGCCGGCGGG GCAGGGGACG GGGCTTGGCC TCTCGGTCAC CTACGGCATC GTCAAGGATC ACGGCGGCGA GATACACCTG GAGAGCGAGG AGGGGGTGGG GACTACCTTC ATCATCACCC TGCCACCCTC CGCGGCAGCC ACAGGCTAA
|
Protein sequence | MHTRLKFPLR FKMLLSQLLV VSVVLSLITF TMANLFQVDK TAYIHDLTST VVLHTAEEAN ALLAGYRERL KLFGRVLAEP ELSGRDQVLQ SFFEEFRDFV LVTRSGPGGE QTVYDGAALQ AAGVTKEEVV ANLQAHPAPE SIPAGQVYLV NSTVSPKLPT LALTISEPAA GGAPVITTAV LRLDRLQELA KRSRVFDIFL LDSAGRYLAH KAPGRVGVAA NLEWWNRVKA PRSSGMTMEY KNLGKEMVAG FSRCSLGGLV VGVEIPKSAA YLTSRELLSD LLLLSLALLG GAALLSQFWS RHFTSPLEKL SEATRMVGQG RFEIEVKAES GDEIGALARS FNQMAAELKV REKALKDLYG QLVHSEKMAA FGALGAGIAH EVKNPLAGIL GITQLSLRGA GAGHPLEKNL LIIEKETKRC KTIIEHLLKF ARQEQVEFGE VDLQQVVADA LAIVDHQLGI NSIKVEQELE PGMPTCRGNA NQLQQVLMNL MLNAQQAMSG KTGTVKLSAR RLEQGGVELR VADNGPGISK EIQGKIFDPF FTTKPAGQGT GLGLSVTYGI VKDHGGEIHL ESEEGVGTTF IITLPPSAAA TG
|
| |