Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_0047 |
Symbol | |
ID | 7272216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 45210 |
End bp | 48152 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643568705 |
Product | signal transduction histidine kinase |
Protein accession | YP_002465165 |
Protein GI | 219850733 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3920] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTGGA AGTGGTGCCC CGTGATCTCA GTGCTGTATG TCGATGATGA GCCGGTTCCT CTTGAGATGA CAAAACTCTC TCTGGAGAAG GGATTCGGCT TCTCTGTTGA CACGGCGATC AGTGCACACG ATGCCCTGGA ACACCTCAAA ATCCAGCGAT ACGATCTCAT CATATCTGAT TACCAGATGC CGGAGATGGA TGGAATAGAA CTCCTGAAAC TTTTGCGCAG AACCGGTAAT GCAATTCCAT TCATCCTCTT CACCGGGCGG GGACAAGAAG AGGTTGTAGA CGATGCCTAC AATGCAGGGG TTGATTCGTT CCTTGTCAAA GGTGGTGATC CCAGGATCCT GTATATGGAT CTCGCCCGAA GGATCGAACA GATCGTCAGC CATAAAAAAA CGGAGCAGGC GTTACAGTTC AGTAACATCC TCCTCTCCAC ACAACTGGAG ACTTCGATGG ATGGAATTCT CGTCGTTGAT GAACACGGGG CGATAATCTC GTCCAATACG CGTTTCATGG ATATGTGGAA TATTCCGGTG GACGTTATTG CTCCGGGCTC GGATGAATGT GCATTGCTGT CGGTTCTTGG CAACCTTGTT GATTTCGAAG CCTCTTCCCT ATCTGCGAAT TATCTGTCTG ATCGTCGAGA TGAGAAGAGC CATGAGGAGA TCGTTTTAAA GGACAGCAGG GTATTTGACC AGTACTCAGC TCCAATGGTA GAGAGCGATG GTGAATACTA TGGAAGGGTT TGGCACTTCC GGGATATCAC AGCCCGAAGG CAGGCGGAAG AGGCGCTCCG TGAAGATGAA GATAAATCTC GCATGATCTT TGAAAATTCC AATGATGCGA TCTTCCTTAT GGAGGTGGCT AGGGAGGGGG AGATCTGCCG GATCATCGAT GTAAACAGTG TGGCTGTCAG ACAAACCGGG TATTCGAAGG AGGAACTTCT GTCTGGATCT GCAGTCGGCA TCGATTTCCC AACCCTGATC CAGAGACTGC CGTCGACCAT CTCTGAACTG CTCGCCGGAG GCAATACTAC CTTTGAGAGT GATGTGACCA GAAAGGATGG CATACTCCTG CCGGTAGAGG TGAACATCCA GGCCGTTCAA TTCAAGAATA CCTCCTGCAT CATTGTTACC ATTCGTGATA TTTCCAGGCG CAAACGTACA GAGGAAACAC TGCAGAAGAG TGTGGCAGAT CTGGCACGCG CCCAACAGGT GGCAAATATC GGGAACTGGA GTCTTGACCC TGCCACAAAT GAAGTGGCAT GGTCAGATCA GATGTATGAA ATCTTCACCA TCCCCCGATC ATCCCTGATA ACTCGTGACC TCTATCTTGC TACCATCCAT CCAGATGATA GTGAACATGT CATACAGGTA CTGACAGAGG CCCTTAATGG TTCTCTGGAC TTTTTCGACC TGGAGTACCG GATCATCATA CAGGGAGGTC ATATCAGGAA TATCTATGAA CTCGCAGAGA TCTCCCGCGA TGAGCAGGGA ACCCCGCTTC TTGTCTTTGG CACTGCACAG GATGTCACGG AAAGGATTCT TGCCGAAATG ATATGCAGAG AGAGCGAGTT GAAATATCGA ACGCTCATCG AGAGCGCAAA TGAGGGTATC TTCATCATTC AGGATGGGTT TGTTCCGTAT GGGAATCCCA AGGCGATGGA GATCATGGGA TTTTCAGCAG ATGAATTTGC AGGGAATCAT TTCCTTGAAC TGGTTCATCC CGATGATCGG CAAGAGACTC TGGAACGGTA TCAAAAGCGG ATACGTGGCG AATTCGATGA ACCGATTGCC TACTCACGGA TCATCGATCG GAATAACAAT GTCCACTGGC TTGAGATCAA TGCGGTCAGG ATTCTGTGGA ATGGTAGACC GGCGACACTC AATTTTGTGA CCGATATCAC CAGGAGGAGA CGTGCTGAAG AGGCACTCAG AGAGAGTGAA GAGAATTACC GGATGGTCGT TGAAAATATC TCGGATGTTT TTTACCGAAC AGACAAAAAC GGGATTCTCA CCATGATCAG TTCGAGTCTG AAGACGGTGC TGGGATATGA TTCGGTGGAA GAGTGTCTTG GGCAGTCCAT CGCTGAAAAA CTTTATTTCG AGCCGGAGAA ACGGAGAGAC TTCCTGGAAG TGCTTTTGAA GAATGGTTCA GTCAAAGATT TCGAAGTGAC CCTGAAACGA AAGGATGGGA GCCCGGTGAC GGTGGCAACC AGCAGCCAGG TCTATTATGA CGAATCCGGC AACGTTCTCA GGATTGAAGG GGTTTTCCGG GATATTACTG AACGCAAGCG GACTGAGGCC GCTCTGGTTG AGTCTCTGCA TGAGAAAGAA CTGCTCTTAA AGGAGATCCA TCACCGGGTA AAGAATAATC TCCAGACGAT TTCGAGTCTC CTGTACCTCC AGTCACTCTC AACAGATAAT ATAGGGCAGA TCTCATTACT CAGGGAAGCC CGGTCGCGGG TCATATCCAT GGGTCTGATT CACCAGAAAC TCTACCAGTC CGCAGACATT GCCCATATCC AGTTCATGGA TTATATCCGG GGTCTCATCG ATTTTCTTGA AGAGTCCTAT GGTGTTGATC CGGATAAGAT CCAGACATTT GTGGATGTCA GCCCCTCCGA TCTCACGATG GACCTCGATA CTGGTATCCC CTGCGGACTC ATAATCAACG AACTGGTCAC CAATGCTCTC AAGTATGCAT TCCGGGAGTA TGGGTGTGGG ACTATCCGGA TCCGAATGGA GCGGGACGAG CATGAGTACC TTCTCACGGT CAGCGATGAT GGGGTGGGAA TCCCGGAAGA TCTGGATCTC TCCACGGTAA AATCCCTTGG GATGACCATC GTCACCGATC TCGTCAGTCA GCTCGATGGA TCGCTTGAGA TCATCCGCCA GCCGGGTGCA ACCTTCAGAA TTCGGTTCCC TGATCAGAAA TGA
|
Protein sequence | MEWKWCPVIS VLYVDDEPVP LEMTKLSLEK GFGFSVDTAI SAHDALEHLK IQRYDLIISD YQMPEMDGIE LLKLLRRTGN AIPFILFTGR GQEEVVDDAY NAGVDSFLVK GGDPRILYMD LARRIEQIVS HKKTEQALQF SNILLSTQLE TSMDGILVVD EHGAIISSNT RFMDMWNIPV DVIAPGSDEC ALLSVLGNLV DFEASSLSAN YLSDRRDEKS HEEIVLKDSR VFDQYSAPMV ESDGEYYGRV WHFRDITARR QAEEALREDE DKSRMIFENS NDAIFLMEVA REGEICRIID VNSVAVRQTG YSKEELLSGS AVGIDFPTLI QRLPSTISEL LAGGNTTFES DVTRKDGILL PVEVNIQAVQ FKNTSCIIVT IRDISRRKRT EETLQKSVAD LARAQQVANI GNWSLDPATN EVAWSDQMYE IFTIPRSSLI TRDLYLATIH PDDSEHVIQV LTEALNGSLD FFDLEYRIII QGGHIRNIYE LAEISRDEQG TPLLVFGTAQ DVTERILAEM ICRESELKYR TLIESANEGI FIIQDGFVPY GNPKAMEIMG FSADEFAGNH FLELVHPDDR QETLERYQKR IRGEFDEPIA YSRIIDRNNN VHWLEINAVR ILWNGRPATL NFVTDITRRR RAEEALRESE ENYRMVVENI SDVFYRTDKN GILTMISSSL KTVLGYDSVE ECLGQSIAEK LYFEPEKRRD FLEVLLKNGS VKDFEVTLKR KDGSPVTVAT SSQVYYDESG NVLRIEGVFR DITERKRTEA ALVESLHEKE LLLKEIHHRV KNNLQTISSL LYLQSLSTDN IGQISLLREA RSRVISMGLI HQKLYQSADI AHIQFMDYIR GLIDFLEESY GVDPDKIQTF VDVSPSDLTM DLDTGIPCGL IINELVTNAL KYAFREYGCG TIRIRMERDE HEYLLTVSDD GVGIPEDLDL STVKSLGMTI VTDLVSQLDG SLEIIRQPGA TFRIRFPDQK
|
| |