Gene Mpal_0047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0047 
Symbol 
ID7272216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp45210 
End bp48152 
Gene Length2943 bp 
Protein Length980 aa 
Translation table11 
GC content50% 
IMG OID643568705 
Productsignal transduction histidine kinase 
Protein accessionYP_002465165 
Protein GI219850733 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTGGA AGTGGTGCCC CGTGATCTCA GTGCTGTATG TCGATGATGA GCCGGTTCCT 
CTTGAGATGA CAAAACTCTC TCTGGAGAAG GGATTCGGCT TCTCTGTTGA CACGGCGATC
AGTGCACACG ATGCCCTGGA ACACCTCAAA ATCCAGCGAT ACGATCTCAT CATATCTGAT
TACCAGATGC CGGAGATGGA TGGAATAGAA CTCCTGAAAC TTTTGCGCAG AACCGGTAAT
GCAATTCCAT TCATCCTCTT CACCGGGCGG GGACAAGAAG AGGTTGTAGA CGATGCCTAC
AATGCAGGGG TTGATTCGTT CCTTGTCAAA GGTGGTGATC CCAGGATCCT GTATATGGAT
CTCGCCCGAA GGATCGAACA GATCGTCAGC CATAAAAAAA CGGAGCAGGC GTTACAGTTC
AGTAACATCC TCCTCTCCAC ACAACTGGAG ACTTCGATGG ATGGAATTCT CGTCGTTGAT
GAACACGGGG CGATAATCTC GTCCAATACG CGTTTCATGG ATATGTGGAA TATTCCGGTG
GACGTTATTG CTCCGGGCTC GGATGAATGT GCATTGCTGT CGGTTCTTGG CAACCTTGTT
GATTTCGAAG CCTCTTCCCT ATCTGCGAAT TATCTGTCTG ATCGTCGAGA TGAGAAGAGC
CATGAGGAGA TCGTTTTAAA GGACAGCAGG GTATTTGACC AGTACTCAGC TCCAATGGTA
GAGAGCGATG GTGAATACTA TGGAAGGGTT TGGCACTTCC GGGATATCAC AGCCCGAAGG
CAGGCGGAAG AGGCGCTCCG TGAAGATGAA GATAAATCTC GCATGATCTT TGAAAATTCC
AATGATGCGA TCTTCCTTAT GGAGGTGGCT AGGGAGGGGG AGATCTGCCG GATCATCGAT
GTAAACAGTG TGGCTGTCAG ACAAACCGGG TATTCGAAGG AGGAACTTCT GTCTGGATCT
GCAGTCGGCA TCGATTTCCC AACCCTGATC CAGAGACTGC CGTCGACCAT CTCTGAACTG
CTCGCCGGAG GCAATACTAC CTTTGAGAGT GATGTGACCA GAAAGGATGG CATACTCCTG
CCGGTAGAGG TGAACATCCA GGCCGTTCAA TTCAAGAATA CCTCCTGCAT CATTGTTACC
ATTCGTGATA TTTCCAGGCG CAAACGTACA GAGGAAACAC TGCAGAAGAG TGTGGCAGAT
CTGGCACGCG CCCAACAGGT GGCAAATATC GGGAACTGGA GTCTTGACCC TGCCACAAAT
GAAGTGGCAT GGTCAGATCA GATGTATGAA ATCTTCACCA TCCCCCGATC ATCCCTGATA
ACTCGTGACC TCTATCTTGC TACCATCCAT CCAGATGATA GTGAACATGT CATACAGGTA
CTGACAGAGG CCCTTAATGG TTCTCTGGAC TTTTTCGACC TGGAGTACCG GATCATCATA
CAGGGAGGTC ATATCAGGAA TATCTATGAA CTCGCAGAGA TCTCCCGCGA TGAGCAGGGA
ACCCCGCTTC TTGTCTTTGG CACTGCACAG GATGTCACGG AAAGGATTCT TGCCGAAATG
ATATGCAGAG AGAGCGAGTT GAAATATCGA ACGCTCATCG AGAGCGCAAA TGAGGGTATC
TTCATCATTC AGGATGGGTT TGTTCCGTAT GGGAATCCCA AGGCGATGGA GATCATGGGA
TTTTCAGCAG ATGAATTTGC AGGGAATCAT TTCCTTGAAC TGGTTCATCC CGATGATCGG
CAAGAGACTC TGGAACGGTA TCAAAAGCGG ATACGTGGCG AATTCGATGA ACCGATTGCC
TACTCACGGA TCATCGATCG GAATAACAAT GTCCACTGGC TTGAGATCAA TGCGGTCAGG
ATTCTGTGGA ATGGTAGACC GGCGACACTC AATTTTGTGA CCGATATCAC CAGGAGGAGA
CGTGCTGAAG AGGCACTCAG AGAGAGTGAA GAGAATTACC GGATGGTCGT TGAAAATATC
TCGGATGTTT TTTACCGAAC AGACAAAAAC GGGATTCTCA CCATGATCAG TTCGAGTCTG
AAGACGGTGC TGGGATATGA TTCGGTGGAA GAGTGTCTTG GGCAGTCCAT CGCTGAAAAA
CTTTATTTCG AGCCGGAGAA ACGGAGAGAC TTCCTGGAAG TGCTTTTGAA GAATGGTTCA
GTCAAAGATT TCGAAGTGAC CCTGAAACGA AAGGATGGGA GCCCGGTGAC GGTGGCAACC
AGCAGCCAGG TCTATTATGA CGAATCCGGC AACGTTCTCA GGATTGAAGG GGTTTTCCGG
GATATTACTG AACGCAAGCG GACTGAGGCC GCTCTGGTTG AGTCTCTGCA TGAGAAAGAA
CTGCTCTTAA AGGAGATCCA TCACCGGGTA AAGAATAATC TCCAGACGAT TTCGAGTCTC
CTGTACCTCC AGTCACTCTC AACAGATAAT ATAGGGCAGA TCTCATTACT CAGGGAAGCC
CGGTCGCGGG TCATATCCAT GGGTCTGATT CACCAGAAAC TCTACCAGTC CGCAGACATT
GCCCATATCC AGTTCATGGA TTATATCCGG GGTCTCATCG ATTTTCTTGA AGAGTCCTAT
GGTGTTGATC CGGATAAGAT CCAGACATTT GTGGATGTCA GCCCCTCCGA TCTCACGATG
GACCTCGATA CTGGTATCCC CTGCGGACTC ATAATCAACG AACTGGTCAC CAATGCTCTC
AAGTATGCAT TCCGGGAGTA TGGGTGTGGG ACTATCCGGA TCCGAATGGA GCGGGACGAG
CATGAGTACC TTCTCACGGT CAGCGATGAT GGGGTGGGAA TCCCGGAAGA TCTGGATCTC
TCCACGGTAA AATCCCTTGG GATGACCATC GTCACCGATC TCGTCAGTCA GCTCGATGGA
TCGCTTGAGA TCATCCGCCA GCCGGGTGCA ACCTTCAGAA TTCGGTTCCC TGATCAGAAA
TGA
 
Protein sequence
MEWKWCPVIS VLYVDDEPVP LEMTKLSLEK GFGFSVDTAI SAHDALEHLK IQRYDLIISD 
YQMPEMDGIE LLKLLRRTGN AIPFILFTGR GQEEVVDDAY NAGVDSFLVK GGDPRILYMD
LARRIEQIVS HKKTEQALQF SNILLSTQLE TSMDGILVVD EHGAIISSNT RFMDMWNIPV
DVIAPGSDEC ALLSVLGNLV DFEASSLSAN YLSDRRDEKS HEEIVLKDSR VFDQYSAPMV
ESDGEYYGRV WHFRDITARR QAEEALREDE DKSRMIFENS NDAIFLMEVA REGEICRIID
VNSVAVRQTG YSKEELLSGS AVGIDFPTLI QRLPSTISEL LAGGNTTFES DVTRKDGILL
PVEVNIQAVQ FKNTSCIIVT IRDISRRKRT EETLQKSVAD LARAQQVANI GNWSLDPATN
EVAWSDQMYE IFTIPRSSLI TRDLYLATIH PDDSEHVIQV LTEALNGSLD FFDLEYRIII
QGGHIRNIYE LAEISRDEQG TPLLVFGTAQ DVTERILAEM ICRESELKYR TLIESANEGI
FIIQDGFVPY GNPKAMEIMG FSADEFAGNH FLELVHPDDR QETLERYQKR IRGEFDEPIA
YSRIIDRNNN VHWLEINAVR ILWNGRPATL NFVTDITRRR RAEEALRESE ENYRMVVENI
SDVFYRTDKN GILTMISSSL KTVLGYDSVE ECLGQSIAEK LYFEPEKRRD FLEVLLKNGS
VKDFEVTLKR KDGSPVTVAT SSQVYYDESG NVLRIEGVFR DITERKRTEA ALVESLHEKE
LLLKEIHHRV KNNLQTISSL LYLQSLSTDN IGQISLLREA RSRVISMGLI HQKLYQSADI
AHIQFMDYIR GLIDFLEESY GVDPDKIQTF VDVSPSDLTM DLDTGIPCGL IINELVTNAL
KYAFREYGCG TIRIRMERDE HEYLLTVSDD GVGIPEDLDL STVKSLGMTI VTDLVSQLDG
SLEIIRQPGA TFRIRFPDQK