Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2521 |
Symbol | |
ID | 8137863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2948183 |
End bp | 2949898 |
Gene Length | 1716 bp |
Protein Length | 571 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644870130 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_003022320 |
Protein GI | 253701131 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 0.0595753 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTGG ACACCAACGA GGGCGATAAC CTGTACCGCT ACATCGTCGA CATGATTCCG CAGATCGTCT GGACGGCTAC GCCCGACGGA CAACAGGATT TTGCCAACCT CCGGTGGTAC GAGTTCAACG GACTTACGCC GGGGGAGCCG GATCCCGAAC CATGGCGCAG CATCATCCAT CCGGACGACG TGGCCATGAC CGCCGAGAAA TGGCAGCATT CGCTGGCGAC CGGGGAACCT TACTACTGCC TGCACCGAAA CAAAAGGCAC GACGGCGAGT ACCGCTGGAT GCTGTCGCGG GCACTGGCCC AAAAGGACGA TGAGGGGCGG GTGGTACGCT GGATCGGCAG CGGAACCGAC ATCACGGAGC AGAAGATCGC CGAGGCGGAG CTGATACGGT ACCGCGACCA TCTGGAGGAA CTGGTCCGGG AGCGGACGGC TGAACTTGTG CGGGCCAAGG AGACGGCCGA GATTGCGGCC CGGGCCGTTC AGGAAGCCAA CGAACTGCTG GAAAAGCGGG TGGAGGAGCG GACCGAGGAA CTGAGGAAGA CCGAAAAGGA GCTGCGCCAG GCACAGAAGA TGGAGGCTGT CGGGACGCTT GCGGCAGGTA TCGCGCATGA CTTCAACAAC ATCCTCACCT CCATCCTCGG GTTCACCGAC ATGGTCCTGC ACAAGATTCC GGAGGGAGAA ATGGGGCGGC GGGAGATGGA ACAGGTGTTC GTCTCGGCGC AGCGAGCCGC GGATCTCGTG CGCCAGATCC TCAGCTTCAG CAGGAGAAAC GATCAGGAAA GGCAGCCGGT GCATGTCTCC GGCATCATCG AAGACACCTG CAAACTGCTG CGTTCCTCGC TTCCTGCCAC AGTGGAGTTC GTCACTGAAT TTTTCGTTTC CGAGGATGAT GACAAGGTCC TGGCCGACCC GATACAACTG CACCAGGTGC TGATGAATCT CTGCACCAAC GCAGCCCACG CCATGCAGCC CGACGGCGGG ACGCTGACCA TCACCCTGAC CGCGGCGGAG GCAGGGTCGC CGGGGCTTAC CTCTCTTCCT GTCCTTACTT CGCGGGACTA CATCAGGGTC GCCGTGAGCG ACACCGGTCG CGGCATTGAG CCGTTGGTGC TGGAGAGGAT CTTCGATCCC TATTTCACCA CCAAGCCTGC GGGGGAGGGG ACGGGGCTCG GTCTTGCCGT GGTGCAGGGG ATCGTGAAGA ATCACGGCGG CGCCATCACG GTTCACAGCG AGCCGGGAAA GGGAACCTGT TTCGAGGTCT TCCTCCCCAC CGTGATAAGC GACGTGCTCG AGGAGGTACA GGTTCGCGAG CAGCTTCTGC ATGGTTCCGA ACGCGTCCTG TTCGTCGACG ACGAGGAATC GCTCACCGTC CTCGGCAAGG GGATTCTGGA GGACCTCGGC TACAACGTGG TCACCAGTAA CAGCAGCCGC AGGGCCATGG AGATGTTCCG TGCCGACCCG GCCCTCTTCG ACCTGGTGAT TACCGACCTG ACCATGCCGG GATTGACGGG TAAGGCCATC GCCAAAGAGA TCCACGCGCT GAGACCTGAC ATCCCAATCA TTCTTTGCAC CGGGTACACG GAGAGCTTTG ACGAGAAGGA CCGGGAATAC GGCATTCGCG CCTGCCTTAT GAAGCCTTAC ACCTCGAAAA TGCTGGGGCG GACCATACGG ATGGTGCTGG AAGGGAAGAC GACATCAACC TGCTGA
|
Protein sequence | MNLDTNEGDN LYRYIVDMIP QIVWTATPDG QQDFANLRWY EFNGLTPGEP DPEPWRSIIH PDDVAMTAEK WQHSLATGEP YYCLHRNKRH DGEYRWMLSR ALAQKDDEGR VVRWIGSGTD ITEQKIAEAE LIRYRDHLEE LVRERTAELV RAKETAEIAA RAVQEANELL EKRVEERTEE LRKTEKELRQ AQKMEAVGTL AAGIAHDFNN ILTSILGFTD MVLHKIPEGE MGRREMEQVF VSAQRAADLV RQILSFSRRN DQERQPVHVS GIIEDTCKLL RSSLPATVEF VTEFFVSEDD DKVLADPIQL HQVLMNLCTN AAHAMQPDGG TLTITLTAAE AGSPGLTSLP VLTSRDYIRV AVSDTGRGIE PLVLERIFDP YFTTKPAGEG TGLGLAVVQG IVKNHGGAIT VHSEPGKGTC FEVFLPTVIS DVLEEVQVRE QLLHGSERVL FVDDEESLTV LGKGILEDLG YNVVTSNSSR RAMEMFRADP ALFDLVITDL TMPGLTGKAI AKEIHALRPD IPIILCTGYT ESFDEKDREY GIRACLMKPY TSKMLGRTIR MVLEGKTTST C
|
| |