Gene Information |
Locus tag | Mpal_1818 |
Symbol | |
ID | 7270364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 1924185 |
End bp | 1927115 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643570433 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_002466847 |
Protein GI | 219852415 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
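The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 1924185
end_bp = 1927115

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should consist of whole codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2931
print(protein_length)  # 976
```

Under these assumptions the computed values agree with the listed Gene Length (2931 bp) and Protein Length (976 aa).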
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTCC GTGAGAAGAT CCTGCTTGCA CTCGCCGTCA CCCTTATCTC TGTGCTTGTG GTGATGCTCT TCTTTACGGC GACGGTCATA CGGGACGGCT ACGTCAACCT CGAGACCGAT CAGATGCAGC ACGAAGCGAG TCAGGCCCGG GCAGCAGTCG ATGCCGATCT CTCCACCATC AACTCCCATC TGATGGAACG GGCTGCATGG AATGACACCT TCACCTTTGT CACCGATCCC ACTGATCGGA GTTATGTCAG TACAAACAAT ATCTCCCAGA CCTTTACAAC CTGCGACATC AATGTTCTGT TGATCTATGA TCATGATCAC CGGTTGCTAT ACGGTCAGGG GTTCAACCTG AGCAGCGATA CTTTTGAGCA GTTCTCTCCT CTCCTCCTCG ATACCATCTC AACAACGCCA GGTTTTTTGA ATGTGACTGC AGAGGAGGGG TCGAAGACCG GGATTCTGAT GGTCGGGGAT CAGCCGATGC TGCTGGCTTC GCAACCGGTT CTGACCAGTT CCTTTCTCGG TCCGAGTCCA GGGGTCATGA TGATGGGGCA GTATCTGGAC CCCGCACAGG TGAACCGTCT TGCAGAACAA TCTGGTGTTC CCTTCACTCT GGTTCAGAAC CGGTCTGCAG TCGGTCCCGT CTCGATGGAT AGCGTATCGA TCTCTTCACT GAACAGTTCC CAGGTCGCTG CTGTGCTGCC ACTCTCCGAC GTCGCCGGCA CCGGATCGCT GCTGATCCGG GTGCAGGAGC CACGCACGAT CGTTTTGAAT GGGGCTGAAA GTGCCAGAAC CTTCATCATG ACCACGATGC TGATCAGTCT CCTCTTCGTG TTTCTGAGCC TTGGATTTGT GGACCGGACG GTCCTTACAC GGTTGAACCG TCTGATTACG GGTGTCAGGG CGATAAGGAT GAATGGCGAG AATGCACGAA TAGAAGGTAT CGCCGGCAGT GACGAACTGG CCCTCCTCTC AGCTGCGATC AACGGGATGC TGGATGAACT CTCACTGGCC CATCAGTCGA CCCGGGAGAG CGAAGAACGG TACCGGACCC TTGCAGAGTC CGCGCAGGAC CTGATCTTCC TCTTTGGAAC GGACGGCCGG CTCACCTATG CGAATCCGAC CGCTCTGGCC ACGCTCAGGA TGACCAGCAG GGAGTCTTTT GGGCAGACGT TCACCGGGCT CTTTTCATCA GAGGAGAGTT CTTCGGCCGC CACTGTGTTT CAGATGGTCC TCGAGACCAG GACCCCCAAT CGTTTTGAGG TCGATGGTAC GATCGCCGGT TCTGACCATT GCTTTGATGT GGAGGTGATC CCCCTCATCA ATCCTGACGG GGGATTTGGT GAAGTGATGG GAATTGCCCG GGACATCACC GAGCGGAAAC GAGTGGAAGA GACGATACGG TATATGAATG CCTACAACCG GTCGCTGATC GAGATCAGTC GTGATCTGCT GATCATCGTC GATCCGGATG GAATCATCAC GGATCTGAAT GCGACCTCTG AACAGGTGCT TGGATTCCTT CGCCAGGACA TGATCGGTAC CCGGCTCGCC GAGCACTTCA CTGAGCCTGC CCTTATTAGA GACTGCTGTC AGGGGGTGCT GGATAACGGT ATTCGCAGGG AGTCAGAAGT TTGTATCCGT CATCGTGAAG GTCATATCAT TCCACTTCAC TGTACTGCAT CACTCTTTCG GGATTATGAG GGAACCCTGA TCGGAGTGCT GGTGGCGGCC AGAGATATCA GTGAGGAGAA TCAGATGGAG GAGGATCGGC TCAGACTGGC AAAGCTCGAG 
ACTCTGGGGG TGATGTCCGG TTCACTGGCT CACCAGTTCA ACAACCTGCA CACCAGTATC CTTGGGAACC TGACACTTGC CAGGTCGATG CTCTATGACC GGGAGGCTCT GCTCGGTCGA CTTGACGAGG CTGAAGACCA GCTCACCAGG GCGAGGATGG TGACCAACAA ACTGCTCACC TTCTCCAGAG GCGGCGAGCC GCTCCGGTCT TTCCAGGAGG TTGAACCGCT GCTCCACGAG GCTGCTGAAA ACTGTAATGG ACGTGGATCG TATCAGATCG AATACCGGGT CAGTGACGAT CTCCCCAGGG TATTCCTCGA TCGGGATCAG ATCGTTGAGG CCTTGCAGCA ACTGATCACG AATGCCATGG AGGCGATGCC CAGAGGTGGC ACCATCACTA TTGTTGCAGA ACTCTGCGAC AGAACAGATG GAGAAAGGGA GCCTCAACTC TGCATTCGTG TGATTGACAT CGGTCAGGGG ATCCCTGACG AGAACCAGAA GAAGATCTTT GAGCTCAACT TCACCACCAA GGACGGGGCA GCGGGACTCG GACTTCCCCT CGCCCGCTCG GTGATCCAGA AGCACGGGGG GGAGATCGAG GTATCTTCTG GTCCTGGCGG GGGGACCATC GTCACCTTTA TGATCCCGAT CGGACATGAC CGCCCGCTGG ACCTTGCCCC GCGCCTCCCC ACCGGCCGCA CCCGTGTCCT GATCATGGAT GATGAGGAGG CCATCACCGA TATCCTCAGA ATCTGGCTCA CCCGGCGTGG ATACGATCCG GTGATCACCG ATGATGGGGT CTCTGCGATT CAGGCCTATC AGGAGGCGAT GATCCAGCAC CGGCCGTTCG ATATCGTCTT TCTCGACCTG ATCGTCCCCG GGGGGATGGG TGGTGAGGAG ACGATGAGAG GTCTTCTCTC CCTTGATCGG GCGGTGCGGG CCGTCGTCTG CAGCGGTTAC TCCAACGACC CGGTGATGGC CTCGTATCTG GACTATGGTT TCGTCGGTCT TCTCCCCAAA CCGTTTCAAC TCACTGCGAT GGAAGAACTG ATCCAGTCGA TTCTGCTGGG AACAAAAGGT GCGATGCCAT CCGATCCGAA CAACTCAGTC ATCTCTGACC AGCCTGAGTG A
|
Protein sequence | MKLREKILLA LAVTLISVLV VMLFFTATVI RDGYVNLETD QMQHEASQAR AAVDADLSTI NSHLMERAAW NDTFTFVTDP TDRSYVSTNN ISQTFTTCDI NVLLIYDHDH RLLYGQGFNL SSDTFEQFSP LLLDTISTTP GFLNVTAEEG SKTGILMVGD QPMLLASQPV LTSSFLGPSP GVMMMGQYLD PAQVNRLAEQ SGVPFTLVQN RSAVGPVSMD SVSISSLNSS QVAAVLPLSD VAGTGSLLIR VQEPRTIVLN GAESARTFIM TTMLISLLFV FLSLGFVDRT VLTRLNRLIT GVRAIRMNGE NARIEGIAGS DELALLSAAI NGMLDELSLA HQSTRESEER YRTLAESAQD LIFLFGTDGR LTYANPTALA TLRMTSRESF GQTFTGLFSS EESSSAATVF QMVLETRTPN RFEVDGTIAG SDHCFDVEVI PLINPDGGFG EVMGIARDIT ERKRVEETIR YMNAYNRSLI EISRDLLIIV DPDGIITDLN ATSEQVLGFL RQDMIGTRLA EHFTEPALIR DCCQGVLDNG IRRESEVCIR HREGHIIPLH CTASLFRDYE GTLIGVLVAA RDISEENQME EDRLRLAKLE TLGVMSGSLA HQFNNLHTSI LGNLTLARSM LYDREALLGR LDEAEDQLTR ARMVTNKLLT FSRGGEPLRS FQEVEPLLHE AAENCNGRGS YQIEYRVSDD LPRVFLDRDQ IVEALQQLIT NAMEAMPRGG TITIVAELCD RTDGEREPQL CIRVIDIGQG IPDENQKKIF ELNFTTKDGA AGLGLPLARS VIQKHGGEIE VSSGPGGGTI VTFMIPIGHD RPLDLAPRLP TGRTRVLIMD DEEAITDILR IWLTRRGYDP VITDDGVSAI QAYQEAMIQH RPFDIVFLDL IVPGGMGGEE TMRGLLSLDR AVRAVVCSGY SNDPVMASYL DYGFVGLLPK PFQLTAMEEL IQSILLGTKG AMPSDPNNSV ISDQPE
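As a rough consistency check between the two sequences above, the gene sequence can be translated codon by codon and compared with the listed protein. The sketch below is a minimal illustration, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The `translate` helper and the constant names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of 10.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGAAACTCC GTGAGAAGAT CCTGCTTGCA"
print(translate(first_blocks))  # MKLREKILLA

# Last 21 bases of the gene sequence (in frame, ending with the stop codon).
last_blocks = "ATCTCTGACC AGCCTGAGTG A"
print(translate(last_blocks))   # ISDQPE
```

Both fragments match the start (MKLREKILLA) and end (ISDQPE) of the protein sequence listed above. Passing the full gene sequence to `translate` in the same way would allow a complete comparison.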