Gene Information |
Locus tag | GM21_0452 |
Symbol | |
ID | 8135761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 545432 |
End bp | 548299 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644868070 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_003020290 |
Protein GI | 253699101 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 95 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCAAA AGTTAGTCAA AAACGGAGGG CTTGCCCTCC TCATCCTTGC CTTCACCTGT CTGTGCACGT CCATCTTCTA TTTCTCCTAC CAAAAGGCAA AACAATCGGC CATCAACAGG CTGAACGACG AGCAGTTCAC ACATGCCAAG CAGGCGGCCA GGGGGATCGA AGAACACTTC GTCACCTGGA CCGGCATCCT CACCTCGCTT GCGAAGCTCG ACTCCGTCGT CGCCATGGAT CCCGACGGCA AGCGCCAGAT GGAATTCTTC TACGACGCCC ACAAGGACCA GATCAGATCG TTCACCCGGA TGGACGAGAA GGGGAACATA CTCTTCACCG TTCCGGATCT GAGAATGGCG GGAAGGAACA TCACGGGGCA AAAGCACATA CAGGAACTGA TCGCGACGCA AAAGCCTGTG GTGAGCGACG TCTTCCGGAC CATCCAGGGG TATGACGCCA TCGCCCTTCA CGTTCCGGTC TTCGATGGAC TCCGGTTCAG GGGTAGCATC GCCATCATCA TCAACTTCCA GAGCCTCGCC CAGCGCTACC TCGAGGTAAT CAAGATCGGC AAGACCGGCT ACGCCTGGGT GGTAAGCCGG GACGGCACCG AACTCTACTG CCCCGTCCAC GGCCACAGCG GCAAGAGCGT TTTCGCCACC GCGAACGAGT TTCCGGCGGT TCTCGCCATG GTAAAGGCGA TGCTCAAAGG GGAATCGGGG ACGGCGAGCT ACGACTTCAA CACGAGGGGC GGCGCCTCCG CCAAATCTGT AAAGAAACAT GCCGTCTACC TTCCCATCAA CCTCGGCAAC ACCTTTTGGT CCATCGTCGT CGTTTCGTCG GAGCAGGAGA TTCTCTCCTC GCTTTCCTCC TACCGCAACA GGCTGGCTCT GGTCTTCGGC GGCATCCTGC TGGGTGGGAT CATCATCTGC ATCCTGGTGC TACGGGCCCT GCTGATAGTG AGGGAGCAAG CCGTGCACAA GGAAGCGGAG GCGGAGTTGC GAGCCAGCGA GCAGAGGTAC CGCTACCTCT TCGAGCAAAA TCCCGCCCCG ATGCTCATCT ACGAAAGGGG AACCATGCAG ATGCTGGCGG TGAACGACGC CTTCGCCGTC GGCTACGGCT ACAGCAACGA GGAAGCGCTG GCGCTTGGGC TCACCGACCT CTACCCCGAG GAGGAGAAAC AGAAGATCAC CGAGGTCGCC GCCGGGCTCA GCGGCCACAC CTATGTCGGG GAATGGCACC ACCGCCGCAA GGACGGCACC GTCTTCCCCA TCGTTGTCAC CTCCCACGAC ATGACCTACG GAGGCAGGAC CGCTCGCATC GCGGTCATCA CGGACATAAC GGATCGTAAG GCGATGGAGA AGGCAATCGA GGAGGAGTCG ACCTTCAACC GCCTCCTTTT AGAGCATTCG CCCGACGGCA TAGTCATCAT CGACCCGAAG ACCGCCCGCT TCATCAACTT CAACGCCGCC GTCTGCCGGC AACTCGGGTA CTCCCGCGAG GAGTTCGCCC AACTAAGCGT CTTCGATATT GAGGCCGTGG AGACACGGGA AGACACCCGC CGCCGCATCG AGGGTATCGT GCGGGAGGGA CGGGGCGACT TCGAGACCAT GCAGCGGACC AGTCAGGGAG AGCTACGAAA CGTCCAGGTG ACAGCGCAGA TCCTGACCAT CCAGGACCAA CAGGTCTACT ACTGCATCTG GCGCGACATA ACCGAGCACA AGAAGCTGGA GGAGCAGTTA AGGCAATCGC AGAAGATGGA ATCGGTGGGA CGGCTGGCGG GGGGAGTCGC CCACGACTTC 
AACAATATGC TCGGCGTGAT CATCGGGTCC GCCGACCTCT GCCAGCACCA GGTACCGGCG GATAGCCCGC TGCAAAAGTA TCTCGATCAC ATCCTGAAAG CGGCGAAACG GTCGAGCGAC ATAACGCGCC AGTTGCTCGC TTTCTCCCGC AAGGAGGTAG TTTCGCCCAA GCCGGTGAAC CTTAACAGCC TCATCATCGA CTCCGAGAAG ATGCTCTGCC GTTTGATCGG CGAGGACGTC AAGCTCACCT TCAAACCCTC CACCGGCCTT TGGACCGTGA TGATCGACCC GGCCCAGTTC GACCAGATAC TCATGAACCT CTCCGCCAAC TCCCGCGACG CCATGCCCGA CGGCGGCACG CTCGACATAG CGACCGGCAA CGTTCACCTC GACGCAGGCT ACTGCCGCCA CCACTCGGAC ACCGTTCCCG GCGACTACGT CAAGATCACC GTCTCCGACA CCGGGACGGG GATGAATCGC GAAACCAGGG ATCACATCTT CGAGCCCTTC TTCACCACCA AGGGGGTCGG GGTAGGGACC GGTCTCGGTC TCGCCACGGT CTACGGCATC GTCACCCAGA ACAACGGGTT CATCAACGTC TACAGCGAGC TTGGCCAAGG GTCGGTCTTC AACATCTACC TGCCGCGCCT TTTGGAAGAT GGCGCGACAG AGGAAGAGGC CGAGGCGGCG CCCCCCCCGA AAGGAACCGG AACCATCCTC CTGGTCGAGG ACGAGGAGAT GCTGCTCTGG ACCACGACGA AAATCCTGGA GGAGATGGGT TATACCGTGC AGCAGGCCGA ATCCCCGGCA AAGGCGATAG CGATCTGCGA AAACGGCAAA CAGATAGACC TGGTGCTGAC CGATGTGGTG ATGCCCGGCA TGAACGGCCG GGAGATGGTG GACAGGATAA GGAGCGCCAG GCCTGACATA AAGGTGCTGT TCATGTCCGG CTATACAGCG GATATAGTGG CCCAGCGGGG AATCGTGGAA GAAGGGATGT TCTACATCTC CAAGCCGTTG GATTCCAAAC AGTTGCACGA GAAGATCGTC CAGACGCTGG CGTCGTAG
|
Protein sequence | MIQKLVKNGG LALLILAFTC LCTSIFYFSY QKAKQSAINR LNDEQFTHAK QAARGIEEHF VTWTGILTSL AKLDSVVAMD PDGKRQMEFF YDAHKDQIRS FTRMDEKGNI LFTVPDLRMA GRNITGQKHI QELIATQKPV VSDVFRTIQG YDAIALHVPV FDGLRFRGSI AIIINFQSLA QRYLEVIKIG KTGYAWVVSR DGTELYCPVH GHSGKSVFAT ANEFPAVLAM VKAMLKGESG TASYDFNTRG GASAKSVKKH AVYLPINLGN TFWSIVVVSS EQEILSSLSS YRNRLALVFG GILLGGIIIC ILVLRALLIV REQAVHKEAE AELRASEQRY RYLFEQNPAP MLIYERGTMQ MLAVNDAFAV GYGYSNEEAL ALGLTDLYPE EEKQKITEVA AGLSGHTYVG EWHHRRKDGT VFPIVVTSHD MTYGGRTARI AVITDITDRK AMEKAIEEES TFNRLLLEHS PDGIVIIDPK TARFINFNAA VCRQLGYSRE EFAQLSVFDI EAVETREDTR RRIEGIVREG RGDFETMQRT SQGELRNVQV TAQILTIQDQ QVYYCIWRDI TEHKKLEEQL RQSQKMESVG RLAGGVAHDF NNMLGVIIGS ADLCQHQVPA DSPLQKYLDH ILKAAKRSSD ITRQLLAFSR KEVVSPKPVN LNSLIIDSEK MLCRLIGEDV KLTFKPSTGL WTVMIDPAQF DQILMNLSAN SRDAMPDGGT LDIATGNVHL DAGYCRHHSD TVPGDYVKIT VSDTGTGMNR ETRDHIFEPF FTTKGVGVGT GLGLATVYGI VTQNNGFINV YSELGQGSVF NIYLPRLLED GATEEEAEAA PPPKGTGTIL LVEDEEMLLW TTTKILEEMG YTVQQAESPA KAIAICENGK QIDLVLTDVV MPGMNGREMV DRIRSARPDI KVLFMSGYTA DIVAQRGIVE EGMFYISKPL DSKQLHEKIV QTLAS
|
| |
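The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are hypothetical, the values are copied from the record above, and it assumes that the start and end coordinates are inclusive and that the stop codon is counted in the gene length but not in the protein length.

```python
# Values copied from the Gene Information table (names are illustrative).
start_bp = 545432
end_bp = 548299
gene_length_bp = 2868
protein_length_aa = 955

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the final codon is the stop
# codon (the gene sequence ends in TAG), which adds no residue to the protein.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the record: the span is 2868 bp, which is 956 codons, giving 955 residues once the stop codon is excluded.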