Gene Mpal_0667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0667 
Symbol 
ID7271811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp660766 
End bp663555 
Gene Length2790 bp 
Protein Length929 aa 
Translation table11 
GC content57% 
IMG OID643569312 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_002465758 
Protein GI219851326 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGTC ATCACCCGTT CGACTCCGCG ACGACGAACG TCAACCAGCG GAAGATCATC 
AGTCTCCTCA TCGTCCCTTC GTACCTCGTG ATCATCGCGC TCCTCTTCCT CACGGACAGA
ACGGCTGTCT TTGCCCAGCC CCTCCTCCTC CTGCTCCTGA ACTCGATCTT CCTCGGCCTG
ATCCCCCTGT ACGTCGCATA TGTCTCGGTT TTATCGTTTC AGCGCAGCGG TCTCGTCAAT
ATCCTGATGA TGGGCGCCGG GATGCTCTTC CTTGGCCTGG GTGCCATCGC CACAGGACTG
GTCGGGCTCC TGCCGGATTC ATCCAATATG AGCATCACTA TCTTCAACCT CAGCGCCGGA
ATCAGCGCCT TTCTCCAGTT TGCTGGCGCG CTGATAGCGC TGGTCGGCTG GACCTTCCGA
CCGGCCGGGC GCCGAACGGT GATCGCAGCA ACGGTGTACG GAGTGGTCGT CACGGGGTCC
GCCGGGTTCA TGCTCGCCGC CCTCCTGGGG CTGACCCCGA CGTTCTTTGT CCCGGGCTCC
GGCTCGACGA TCTTACGCGA CTTCGTCCTC GCTGGCGGGA TCGAATTCCT CCTGCTCGCA
TCAGGGCTGT TCTTTGTACT GTTCCGGAAG CGTGACGAGG ATTTCTTCTT CTGGTATTCC
ATCGGTATGG CGATGATCGG CCTCGGCCTC CTTGCCGCGA CCATCATCAT GGTCTTCGAC
GGTCCGCTCA ACTGGGCTGC TCGGATCGGG GAGTATCTCG GAGCCTGTTT TATTCTCGTC
GCGTTCATGG CGCTCCAGCA TCGGGCGTCG ACCGGGAACG TCTCAGACCA GGAGATGCTC
ACGAGGTTCT TCGCCGAGGC TGAAACCAGG TACCGGCAGT TGATCCAGAC AGCCGTCGAC
GCCATCATCG TCCTCGATCC TGCATTTCGG GTCATCCTCT GGAATACCGG TGCTGAACGC
CTGTACGGGT ATACATCAGA GGAAATGGTC GGCAGCCCAA TCTCCTGCAT CTTCCCCTTC
GAACAAAAGG AAGAGATACT CCACCAACTG GAAAAAATCA GACAGGGGGA GACGATCGAG
CGTGTTGAGA CCGAACGGGT CACAAAAGAC GGTTCACGCC GACTGATCTC CCTTTCACTC
TCTCCAATCC TGAGCAGTGA TGGGGATTTT ATTGGGATAT CAGACATAGC CCACGATATC
ACCGAGCGCC AGCGGTTCCA GAATGAGATC CTGAAGGCAA AGAACCGGTG GGAACGAACC
TTCGACGCCG TTCCGGATAT GATCGCCATC GTCGACAAGC ATTTCCGGAT CGTTCAGGTG
AACAGGGCGA TGGCCGACAG TATTGGTATT TCGCCCGAAG AGGTAGTGGG GATGACCTGT
TATGAAGTTA TTCATCACTC CAGCACCCCC CCGACCATAT GTCCACACCA GAAACTCCTC
CAGGACTACC AGAGTCATTC AACGGATTTG CATGAGGATA CCTCGAACAA GGACTTCTTC
CTGACCGTAT CTCCAATCCA GGATCCTCTG GGTAACGTCC GCGGAAGCGT TCATATCCTC
CGGGATATCA CCGAGCGAAA GCGTGCAGAA GAGGCTCTGA TCCGTTCAAA CGAGGAACTC
AACGAACTCA ACGAGGAGCT CACCGCCACC GAGGAGGAGA TGAAGATGAA CAACGAGGAA
CTCATGACCG CCGAAAAGAT GCTCCGCGAG AGCGAGGCGC GCCTCGCCCT TGCCCTCGAC
ATCTCCGGGA TGGGGACGTT GGACTGCGAC ATGGTCAACC ACACCTCATG GCGGTCCCTC
CGACACGACC AGATCTTCGG GTACAAGACG CCACCATCCA CCTGGAACCT GAAGATCTTC
ATCGACCACG TTCTTCCTGA AGACCAGGAG ATGGTCAGGA AGATTTTTTC CGATGCCTTC
GCAAGCCAGA GCGACTGGCG CTTCGAATGT CGGATACGAC GGACTGACGG CGAGATCCGA
TGGATCGAAA AGACAGGTCT GGGACAGTAC GACGATGCCG GGCGTCCGCT CCGGGTACTC
GGACTTCTCC TGGACATCAC CGAGCGAAAA CAGTTCGAGA CAACCCAGAA AAAGTATGCC
GAGAAACTGA TGGCAAGCAA CGAGGAACTG CAACGGTACG CCTACGTGGC GAGCCACGAT
CTCCAGGAGC CACTGCGCTC GATCGTCAGT TTCAGCCAGC TCCTCGATCG CCGATATAGG
GGGAAGCTCG ATAAGGACGC CGACGAATAC ATCAAGTTTA TCGTCGATGG CGGGGTGCGG
ATGCAGGCCC TGATCAAAGA CCTGCTCCAG GTCTCCCGGA TCGAGACACA GGCACAGCCG
CTCGCCCCCA CCGACGCGAA TGCAGTGGTC GCCGATTCCA TTCGCTCGCT TAAAACACCA
ATCCATGATA ACAATGCCAG GGTGACCGTC GACCCGCTGC CGATCGTCAT GGCCGATCCG
TCACAGCTCG AACAGATCTT CACAAACCTG ATCGGGAATG CGATCAAGTA TCATCGTCCA
GAAATGCCGC CAGCGATCAC CGTATCAGCA GAGCGACACG GCAACTGGTG GGAGTTCTCA
GTCAGGGATA ATGGGATCGG GATCAAGGCT GAGTTCTTCG ACCGAATCTT CGAGATGTTC
CGCCGGCTCC ACACGATCGA CGAGTACGAG GGGACCGGGA TCGGACTTGC GATCGTCAAG
AAGATCGTCG AACGACACGG AGGCCGGATC CGGATCGAGT CAGAGCTGGG TGAGGGGAGT
ACATTCTTCT TCACGCTGCC GGTGGTGTAA
 
Protein sequence
MARHHPFDSA TTNVNQRKII SLLIVPSYLV IIALLFLTDR TAVFAQPLLL LLLNSIFLGL 
IPLYVAYVSV LSFQRSGLVN ILMMGAGMLF LGLGAIATGL VGLLPDSSNM SITIFNLSAG
ISAFLQFAGA LIALVGWTFR PAGRRTVIAA TVYGVVVTGS AGFMLAALLG LTPTFFVPGS
GSTILRDFVL AGGIEFLLLA SGLFFVLFRK RDEDFFFWYS IGMAMIGLGL LAATIIMVFD
GPLNWAARIG EYLGACFILV AFMALQHRAS TGNVSDQEML TRFFAEAETR YRQLIQTAVD
AIIVLDPAFR VILWNTGAER LYGYTSEEMV GSPISCIFPF EQKEEILHQL EKIRQGETIE
RVETERVTKD GSRRLISLSL SPILSSDGDF IGISDIAHDI TERQRFQNEI LKAKNRWERT
FDAVPDMIAI VDKHFRIVQV NRAMADSIGI SPEEVVGMTC YEVIHHSSTP PTICPHQKLL
QDYQSHSTDL HEDTSNKDFF LTVSPIQDPL GNVRGSVHIL RDITERKRAE EALIRSNEEL
NELNEELTAT EEEMKMNNEE LMTAEKMLRE SEARLALALD ISGMGTLDCD MVNHTSWRSL
RHDQIFGYKT PPSTWNLKIF IDHVLPEDQE MVRKIFSDAF ASQSDWRFEC RIRRTDGEIR
WIEKTGLGQY DDAGRPLRVL GLLLDITERK QFETTQKKYA EKLMASNEEL QRYAYVASHD
LQEPLRSIVS FSQLLDRRYR GKLDKDADEY IKFIVDGGVR MQALIKDLLQ VSRIETQAQP
LAPTDANAVV ADSIRSLKTP IHDNNARVTV DPLPIVMADP SQLEQIFTNL IGNAIKYHRP
EMPPAITVSA ERHGNWWEFS VRDNGIGIKA EFFDRIFEMF RRLHTIDEYE GTGIGLAIVK
KIVERHGGRI RIESELGEGS TFFFTLPVV