Gene Mpal_0994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0994 
Symbol 
ID7271727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1024681 
End bp1026603 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content57% 
IMG OID643569633 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_002466068 
Protein GI219851636 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.620121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0418978 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCCTG GAATTGAGAT GTACCTGCTC AGATTATATG TGGCTGGTGA GACTCAGGAA 
TCTGCCCGGG CAATCCAGAA CCTTGTGCGT ATCTGTGAGA CATACCTGGT GGGACGGTAC
GACCTGGAGG TGATCGATAT CTCCCGGCAT CCGAGTCTCT CACGGGATGA GCAGATCGTT
GTGACCCCGA CGCTGATCAG GAAGGGGCCG GCTCCTGAGC GGCGACTGAT CGGCGACCTG
TCAGACACCC GGGTGGTCCT TCGGGGTCTT GGTTTACCTG AGGGGGGGCT TCAAAACGAA
CCGAACGATT CAGGATCTTC TGCAAACCCG CAGGTCATAA AACTCTGTGA ACGTTTGCGG
GAGACTGGCG TCGATAGGAT GGATATCTGG ACCGGCCAGG TGGATGGAAT CGTCGTCCCG
ACCTCTGATG AAGACAGGCT ATTCATAAAC CCCGGGGGGG CATCTCCGTA CTGGACCTTT
GTCGAGACGA TGAATGAGGG GGCGGTGGTC ATCGATCGGG CCGGAACGAT TCTGTACTGC
AACCGACGGT TTGCTGCGAT CATCGAGGCG CGTATGGAGA TGATCTCAGG TTCATTCTTC
GATAGTTGGG TATGTTCTCA TGATCACCTC CTCTTTGAAG CACTCTCTGT GGCCGGTGCA
GATCGGCAAT CTTCAGGGGA TCTGCAGATG GTCAATACCC GGGGGCGCCT GGTCCCGGTT
CATCTCTCGC TCAGCCCGTT TACGGGCGGA GGGGTCTCTG AAATTTCTAT CGTGGTGACC
GATCTGACCA CGCGGAAACA TAACTGTGCG TTGCTTCGAT CCGAACATCT GGCCCATTCG
ATCCTCGAAC AGGCCGCCGA CCCGATCGTC GTGATCAACG CTGAAAAGGT GATCATCAGG
GCCAATACCG CGGCAGTGGA GATGGCCGGG ACCAACCCCC TTCTTGCACA GTTTGATTCG
GTCTTTCCAC TCTTCCAGGT GATTGGGGAT GAAGAAATAC CCTTCTCACC GTCCAGAATC
TCCAGTGCCG GCAAGCAGTT ACAGGGGATG GAGGTTCTCT TTCGAAGAAC GGACAGGTCA
CTCTTTTCCT TGCTGATCAG CGCGGCACCG ATAATCGGAG AATTCGGGGA ATCGCCCGGA
TGCGTGGCTG TGATGACCGA TATCACGGCC CAGAAACAGG TCGAAGGGGA ACTTCGAAGG
ACACTCGACG ACCTGGCCCA CTCCAATCAG GACCTGCAGC AGTTTGCGTA CATCGCCTCG
CATGACCTGC AGGAACCACT CAGGATGGTG GCCAGTTACC TGCAGCTCCT CGAGCGAAAG
TATCGGGACC GGCTCGATTC GGACGCCCAG GAGTTCATCG GGTTTGCGGT CGAGGGTGCG
AACCGGATGC AGCAGCAGAT CAACGACCTG CTGGCGTACT CGCGGGTCAC GAGCCGGGGC
CAGCCTCTCA AGCCGGTGAG CGCCGAAGAG GCATTGGCCT CTGCACTGAG TCACCTGGCC
CTGAAGATCG AAGAGACTGG TGCCACGGTC ACCCATGACC CACTTCCGAT GGTCAGGGCC
GATCTCCCGC AGCTGGTTCA GGTCTTTTCG AACCTTCTCG ACAACGCACT CACATTCCTC
CGTCCCCAGG TCGCCCCGGT GATCCATCTC TCGGTGGAAG ATCAGGCTGG CTGGGTGGTC
TTTTCTCTGC ACGACAATGG GATCGGGATC GACCCGGAGT TCTATCAGCG GATCTTTCAG
ATGTTCCAAC GACTTCACTC TCGCGCGGAG TATCCCGGGA CCGGTATCGG GCTTGCGATC
TGCCAGCGAA TTATCGAGCG GCACCATGGG CGGATCTGGG TCACTTCGGT TCCCGGCAGT
GGATCGACCT TCTCCTTTAC GATCCCTGGT GTTAATGGTC ACCTTCATGG TCCTGATCGC
TGA
 
Protein sequence
MAPGIEMYLL RLYVAGETQE SARAIQNLVR ICETYLVGRY DLEVIDISRH PSLSRDEQIV 
VTPTLIRKGP APERRLIGDL SDTRVVLRGL GLPEGGLQNE PNDSGSSANP QVIKLCERLR
ETGVDRMDIW TGQVDGIVVP TSDEDRLFIN PGGASPYWTF VETMNEGAVV IDRAGTILYC
NRRFAAIIEA RMEMISGSFF DSWVCSHDHL LFEALSVAGA DRQSSGDLQM VNTRGRLVPV
HLSLSPFTGG GVSEISIVVT DLTTRKHNCA LLRSEHLAHS ILEQAADPIV VINAEKVIIR
ANTAAVEMAG TNPLLAQFDS VFPLFQVIGD EEIPFSPSRI SSAGKQLQGM EVLFRRTDRS
LFSLLISAAP IIGEFGESPG CVAVMTDITA QKQVEGELRR TLDDLAHSNQ DLQQFAYIAS
HDLQEPLRMV ASYLQLLERK YRDRLDSDAQ EFIGFAVEGA NRMQQQINDL LAYSRVTSRG
QPLKPVSAEE ALASALSHLA LKIEETGATV THDPLPMVRA DLPQLVQVFS NLLDNALTFL
RPQVAPVIHL SVEDQAGWVV FSLHDNGIGI DPEFYQRIFQ MFQRLHSRAE YPGTGIGLAI
CQRIIERHHG RIWVTSVPGS GSTFSFTIPG VNGHLHGPDR