Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0057 |
Symbol | |
ID | 8135356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 69362 |
End bp | 71128 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644867674 |
Product | histidine kinase |
Protein accession | YP_003019902 |
Protein GI | 253698713 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.000000000000185994 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGAAACG AACGAGACAT TCTGATCGTC GACGACAACC AAGTGGTCTG CGATGTTCTG GCTGAACTAT TCCGCAACGA GGGGTTCGAC AGCTGGGGTG TTGCCACCGG CGAAGCGTGC CTCGATGAAG TGACCCGCGC CTCCTGGAAG CTGGTGATGC TGGACGTGCG CCTTCCAGGG ATCAGCGGCA TCGAGGTTCT CGAGGCCATC CGGCGCGACC ACCCCAAGAC CGAAGTGATC ATCATGACCA GCCACGTATC GCTGGAGACG GCGGTCCAGG CCCTGCGTCT GGGCGCCCAG GATTACCTTT TCAAGCCCTT CGACGACCTG GAGATGGTGA TCGCCACCGT GAACAAGGCG CTGGAGCGGC GCCGCCTCGT CGAGGAGCGC GACAAGCTGG TGCGCACCCT GGCCGAGCTG GCCATCGAGA ACGGCCGCAT CCTCGCCGAA TGCCGCCGGG TAAACAGCAG CCTGGAGGAA AAGGTGGCGC AGCGTACGGC CGAACTTTCC AAGGCGAACC TGCAGCAAAA GGCGATCATC GCCGAATTGC GCGAAGCGAA GGAAGCGGCC GAAGCGGCCA ACCGGGCCAA GTCGCAGTTT CTCGCCAACA TGAGCCACGA GATCCGCACC CCGATGAACG GCGTCCTCGG CATGGCCGAG CTCCTGCTGC ATTCAGAGCT CGACGAGAAG CAGAAGAGCT ACGCCAAGAT GCTGCACCAC TCCGGCGAGT CGCTCTTGGA CATCATCAAC GACATCCTCA ACATCTCCAA GATCGAGGCG GGGAAGCTGG AGATCGAGAG GATTCCTTTC GATCTGCACG AAACCGCGCG CGGCGCGGTG GAGCTGTACC GTGAGGTGGG CCGGGGCAAG GGTGTCGCGG TGGAGCTGCA GATCGAGGAG GACGTTCCGC GCTGCGTGGC CGGCGACCCG AACCGCCTGC GGCAGGTCCT GATCAACGTC GTCAACAACG GCCTGAAATT CACCGAAAAG GGGTCGGTAC AGGTCCGGGT CTCCCTGGTG GAGCAGAACC AAAACGGGCA GTACGTAGGT TTCGAGGTGA AGGACACGGG GATAGGGATC CCCGCCGACA GCATCGGCGC CATCTTCGAC TTGTTCGCCC AGGTGGACGG CTCGACCACG CGGAAGTACG GGGGGACCGG GCTGGGACTG GCCATTGCGA AGCAATTGGT GGAGTTGATG GGGGGAGAAA TAGGGGTAGA GAGCGAGCCG GGACAGGGGT CCACCTTCAC CTTTATCGTA TTCCTGCACC AGCAGGTCGA CCAAGCTCTA TGCGAAGAGG AGGGGGCTGA CATGCCCGTT GATAAGGATA ATTGCACGGC AGAAGCTCGG CAGATCGGGA AGTTCAACGC ACGCGTGCTT CTGGCCGAGG ACAACCCGGT AAACTGCGAG GTCGCCTTCG CGATGATCGC CGCGCTGGGT TGCCAAGTCG ACGTGGCCCA GGACGGTAGA GAAGCAGTCG AAGCCTTTTC GCGCCAACCG TACGACCTGA TTTTCATGGA CTGCCAGATG CCGGAAATGG ACGGCTACCA GGCCACCCGC GCCATCCGGC AGCGGGAACT CGGCTCCGGC AAGCACACCA CCGTGATCGC ACTCACCGCT CACGCCATGG CGGGGGCCAG GGAATATTGC CTCACTGCCG GAATGGACGA CTACCTCAGC AAGCCTTTCA ACCTTGAACA GCTCCAGGAG CTGATCGCCA AATGGACCTC CCCCTCGCCT CTTAGCCTTC CCTTAGCCTT CCCTTAG
|
Protein sequence | MGNERDILIV DDNQVVCDVL AELFRNEGFD SWGVATGEAC LDEVTRASWK LVMLDVRLPG ISGIEVLEAI RRDHPKTEVI IMTSHVSLET AVQALRLGAQ DYLFKPFDDL EMVIATVNKA LERRRLVEER DKLVRTLAEL AIENGRILAE CRRVNSSLEE KVAQRTAELS KANLQQKAII AELREAKEAA EAANRAKSQF LANMSHEIRT PMNGVLGMAE LLLHSELDEK QKSYAKMLHH SGESLLDIIN DILNISKIEA GKLEIERIPF DLHETARGAV ELYREVGRGK GVAVELQIEE DVPRCVAGDP NRLRQVLINV VNNGLKFTEK GSVQVRVSLV EQNQNGQYVG FEVKDTGIGI PADSIGAIFD LFAQVDGSTT RKYGGTGLGL AIAKQLVELM GGEIGVESEP GQGSTFTFIV FLHQQVDQAL CEEEGADMPV DKDNCTAEAR QIGKFNARVL LAEDNPVNCE VAFAMIAALG CQVDVAQDGR EAVEAFSRQP YDLIFMDCQM PEMDGYQATR AIRQRELGSG KHTTVIALTA HAMAGAREYC LTAGMDDYLS KPFNLEQLQE LIAKWTSPSP LSLPLAFP
|
| |