Gene Mpal_2002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2002 
Symbol 
ID7271305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2128491 
End bp2130776 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content53% 
IMG OID643570617 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_002467028 
Protein GI219852596 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.897805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.398669 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGAGT ACAGATTTAA TCCCTGTTTG CAGGACCATG ATGGGGGTAG CCTGCAAGTA 
TCGTATAGAT CTCCCCATTC TGCTCTGATA AAAATAATCA AAAGCATATC TGGGCGTTTT
ACCGTATTAG TTAACTACTG GGATGAAAAC AAGATGGATA AGCCATTTCG CAATACTCAG
GAATATTTTC AGGCATTTAT CAGGACTGCA CAGTACATCC CGAACCTGAC CACACGGCAG
GATATCCTGA GCGAAACCGG CAGGGTACTG ATCAGGTTCT TTGGTGCTGA CCTTGTCGGA
TTCTTTGAAC CAGGGAATAA CGGCGGGATA GAAGGCCACC ACTGGATTTT ACCAGATGGT
GTGCCGGGTA CTGCAATACA GACGAGAGAA ACAGAAATGA TCATTACCGA GGTCCTGGAA
ACAGGGTTTC TTGCTGCCCA GCATATCGAT ATCCCAAAAC CGTTTGCAAC GGTTTTTTTA
CCGATCACAT GGGAGAATCA GACGGCAGCG GTAATGCTGA TCGGCCACAG GACATCCGAT
CCCATCCCAA AAGAATTACT CAATATCTAT CTCGCAGTTG CCGGCCTCGT TACCAACGCA
ATTACCGGCG CGGACGAGGA ATTCAAAAAT ATCGCAGCGA GAAAACATGC GGAGGAAGCG
TTAAAAGAGG CTCAGGCCCA GCTCGCATTC TTAGTATCCA GTACCCCAGT TGTACTCTAC
CGGTGCAGCG CATCCGGTGA CTTCAATGTG ACCTTTATTT CTGATAATGT CCAGCTGCAA
CTGGGCTATG AGGCACACGA GTTCATGGAT GATCCGGCGT TCTGGCCAGA TCATATCCAC
CCGGATGACC GAGAACACGT CTTTGAGAGA TTACATCGGC TGATCAAAGA TGGCACCGCC
ACCGCTGAAT ACCGTCTCCT TGGAAAGGAC GGGACATACC GCTGGACACA TGATGAAGCA
AGAGTGGGCC GTGATGCAAA TGATCAGCCG GTGGAATTCC TCGGGTACTG GATCGACATC
ACCAAGCGGA AGCATGCAGA ACTGGCTCTG ATTCATTCAA ACGAGGAACT CAACGGACTC
AACGAGGAAC TCAACGCGCT CAACGAAGAA CTCACCGCCG CCCAGGAGGA GATGGAAATG
AACAACGAGG AACTCATGAC CACCGAAAAG ATGCTCCGCG AGAGCGAGGC GCGCCTCGCC
CTCGCTCTCG ACATCTCCGG GATGGGGACG TTGGACTGCG ACATGGTCAA CCACACCTCA
TGGCGGTCCC TCCGACACGA CCAGATCTTC GGGTACGAAA CGCCACCATC CACCTGGAAC
CTGAAGATCT TCTTCGACCA CGTTCTTCCT GAAGACCGGG AGATGGTCAG GAAGGTTTTT
TGCGATGCCT TCGCAAGCCA GAGCAACTGG AGCTTCGAAT GTCGGATACG ACGGGCTGAC
GGCGAGATCC GATGGATAGA AAAGACTGGT CTGGGACAGT ACGACAATGC CGGGCGGCCG
CTTCGGGTAC TCGGACTTCT CCTGGACATC ACCGAGCGAA AACAGTTTGA GGCCACCCAG
GAAGAATATG CCGAAAAACT GATGGCAAGC AACGAGGAAC TGCAACGGTA TGCCTATGTG
GCGAGCCACG ATCTCCAGGA GCCACTGCGC TCGATCGTCA GTTTCAGCGA GCTCCTCAAT
CGCCGATATA GGGGGAAGCT CGATAAGGAC GCCGATGAGT ACATCCGATT CATCGTCGAT
GGCGGCGTGC GGATGCAGAA TCTGATCAAA GATCTGCTCC AGGTCTCCCG GATCGAGACA
CAGGCACAAC CGTTCGCCCC GACTGATGCT CGCACGGTGG TCGCCGATTC CATTCGCTCG
CTCGAAACCC CAATCCATGA GATCAGTGCC GTGGTGACCG TCGACCCGTT GCCGATCGTC
ATGGCTGATC CGTCGCAGCT CGAACAGGTC TTCACGAACC TGATCGGGAA TGCGATCAAG
TACCGGCGGC CGGAGGTGCC ACCAGTGATC ACTATCTCAG CCGAACGATA CGGCGACTGG
TGGGAGTTCT CAGTCAGGGA CAATGGGATC GGGATCGAAT CTGAGTTCTT CGACCGGATC
TTCGAGATGT TCCGCCGGCT CCATACGATC GACGAGTACG AAGGGACCGG GATCGGGCTT
GCAATCGTCA AGAGAATCGT CGAACGACAC GGCGGCCGGA TCCGGGTTGA GTCAAAGCCT
GGTGAGGGGA GTACATTCTT CTTCACCCTT CCCAACGAAA ATTTCAAATC AAGGGTGGAT
AAATGA
 
Protein sequence
MFEYRFNPCL QDHDGGSLQV SYRSPHSALI KIIKSISGRF TVLVNYWDEN KMDKPFRNTQ 
EYFQAFIRTA QYIPNLTTRQ DILSETGRVL IRFFGADLVG FFEPGNNGGI EGHHWILPDG
VPGTAIQTRE TEMIITEVLE TGFLAAQHID IPKPFATVFL PITWENQTAA VMLIGHRTSD
PIPKELLNIY LAVAGLVTNA ITGADEEFKN IAARKHAEEA LKEAQAQLAF LVSSTPVVLY
RCSASGDFNV TFISDNVQLQ LGYEAHEFMD DPAFWPDHIH PDDREHVFER LHRLIKDGTA
TAEYRLLGKD GTYRWTHDEA RVGRDANDQP VEFLGYWIDI TKRKHAELAL IHSNEELNGL
NEELNALNEE LTAAQEEMEM NNEELMTTEK MLRESEARLA LALDISGMGT LDCDMVNHTS
WRSLRHDQIF GYETPPSTWN LKIFFDHVLP EDREMVRKVF CDAFASQSNW SFECRIRRAD
GEIRWIEKTG LGQYDNAGRP LRVLGLLLDI TERKQFEATQ EEYAEKLMAS NEELQRYAYV
ASHDLQEPLR SIVSFSELLN RRYRGKLDKD ADEYIRFIVD GGVRMQNLIK DLLQVSRIET
QAQPFAPTDA RTVVADSIRS LETPIHEISA VVTVDPLPIV MADPSQLEQV FTNLIGNAIK
YRRPEVPPVI TISAERYGDW WEFSVRDNGI GIESEFFDRI FEMFRRLHTI DEYEGTGIGL
AIVKRIVERH GGRIRVESKP GEGSTFFFTL PNENFKSRVD K