Gene Mpe_A0812 details

Gene Information

Locus tag: Mpe_A0812
Symbol:
ID: 4784496
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 852540
End bp: 855467
Gene Length: 2928 bp
Protein Length: 975 aa
Translation table: 11
GC content: 60%
IMG OID: 640089373
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_001020009
Protein GI: 124266005
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
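
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon; neither is stated in the record, though both agree with the listed figures. The function names are hypothetical, chosen for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length_bp = gene_length(852540, 855467)
print(length_bp)                  # 2928
print(protein_length(length_bp))  # 975
```

Both results match the Gene Length and Protein Length fields in the record.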


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.817585
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAATA TCAACCGCCA GCTTTTTGCC GCCTCTGATT CGCTCTGCGT TCGCGATGTC 
ACATTGCGCG CCACGGACGA TGTGCAGACT CACCGCGAGA AGCTCGCCCG TATCGTGCTC
GACGAACTCT ACGAATTTGT CGGCCTGCTC GATGCCCACG GAACGACGCT CGAGATCAAT
CGTGCAGCAC TGGAAGGTGC CGGCATCGCG CTCGACGACA TCCAAGGCAG ACCGTTCTGG
GAGGCACGCT GGTGGGCAAC CTCTCCCGAG GTTCGGAGGG AGCAACGTGA GGTCATCCGC
CGAGCTGGCG AAGGCGAGTT CGTTCGCCGT GATTTCGAGA TCTATGGGCA ACAAGGTGGT
CAGGAGACGA TCCTGATCGA CTATTCCTTG CTGCCCATCC GAGACAACTC CGGGAAGATC
GTGTTCCTGT TGCCAGAAGG CAGAAATATC ACAGACAAGA AGCGCGCCGA AGCGGAGATA
GCGCGCAAGA ACCGCGAACT GCAAAGATTG CTCGACAAGA TCCAGCGTCT CGACGATGCC
AAGAGCGACT TCTTCGCGAA CGTCAGCCAC GAACTGCGGA CTCCGCTCGC GCTGATACTC
GGCCCGTCAG AGTCGCTACT CGCAACAAGC GAAGGGCTCA GCGACGCTCA GCGGCGCGAT
CTTCGGGTCA TCCAGCGCAA TGCGGCGATG CTGATGAAGC ATGTCAACGA CCTTCTGGAT
CTGGCGAAGT TCGATGCCGG AAAGATGGCG CTTCGTTATA CCCGCGTCGA CCTGGCTGCG
GAGGTTCGCA CTCTTGCCGC GCACTTCGAG GCCGTCGCGG CGGAGCGCTC GTTGTCTTAC
GTTGTTCAGG CTCCCGCGGC GCTGGAAGTC GAGGTCGACC AACAGATGTT CGAACGGATC
CTGCTCAATC TGCTGTCGAA CGCATTCAAG TTCACACCGG ATTTCGGCCG TATCCGCTGC
TCCCTCGAAG CCAACCCGGA CCACAGCATC CAGCTTGTCG TCGAGGACAG TGGATGCGGC
GTCAGGGCCG ATCTGCGAGA GGAGATCTTC GAGCGATTTC ATCAGGCGCA GAGTGGAACC
ACGCGAAGCT TCAGCGGGAC CGGTCTCGGC CTGGCCATTG CCAAGGAGTT CGTGGACCTT
CACACCGGAA CGATCTCGGT TTCCGATGCC ATCGGCGGAG GTGCACAGTT TCGAGTCGAG
CTCCCTTCGC GAGCGCCATT GGGCGCTTAC ATCAGGTCCG TCGACTCGCC CCTCGGGAAC
CGAAATCGCG GACAGATCGT CGGAACCATA GAGGAGCTGC AGCGGGCCGA GTTCGATGCC
GTCTCGGATC TGTCTGGATC AGAAAAGCCT CTCGTGCTGG TCGCCGAGGA CAACGCCGAC
ATGAGGCGCT TTATCGTCGA GGTCCTCTCC AGCGATTTCA GGGTCGTACA CGCTGCAGAC
GGCTTGCAAG CGCTCACCCA GGCCCGGGCG CAGGCGCCCG ATGCCATCAT CACAGATCTG
ATGATGCCGA AGCTCGGCGG CGACAAGCTG GTGTCGGAGC TCCGGTCGAC TCCAGAACTC
GCGCATATTC CCGTTCTCGT GTTGTCAGCC AAGGCGGATG AGTCCCTTCG CCTGAAGCTG
CTCTCCGACT CCGTCCAGGA CTACATCGTC AAGCCATTTT CATCGCGCGA GTTGCTCGTT
CGAGTACGAA ACATCGTCAC GATGAAGCTG GCCCGCGAGG CGCTGCAGAA GGAGCTGGCG
TCGCAGAACG AGGATCTTGC TCAGTTGACG CAGCAACTCA TTGCGAGCAA GCAGGGGCTT
CAACGGAGCC ACGATGCATT GAAGGAATCA GAGCGGCGCT GGCGGGCTGT CTACGAAAAC
ACCGCCGTCG GCGTATCACT GAGCGACCTA CAGGGAAACA TGCACGCTGC AAATCCCGCG
CTCCAGGAGA TGCTGGGATA CACCGAAAGT GAGCTGATCG GTCTCGGCAA CCTGATGACA
GATGCCGAAG CAGGCCATGA GGATCGTCGC CTTCAACTCG AACGGCTCGT CAACGGCAGC
CAGGTAGAGA TGCGACAGCA AAGGAGGTAC CGGCATCGCA ATGGCATGAC GATCCTGGCG
AACGTTCGTG AATCGCTCAT TCCCGGCACC TCGGACCTGC CTCCCACATT GATCACGGTT
GTCGAGGACA TCACAACGCA GAAGCGCGCG GAGGTGGAAC TCGCTCAGAC CAAGGATGCG
CTCGCACGGG TTTCGCGGGT GACAACGATG GGTGAACTCG CAGCTTCGAT TGCGCATGAA
GTCAACCAGC CCCTCACGGC CGTCGTTGTC AACGGCCACG CCTGCCTGCG CTGGCTCTCG
ACGGAACCAC GAAACGATCT CGAAGTTCAG GATGCGATAC AGCGGATCGT TCGAGATGCC
AATCGGGCCA GCGAGGTTAT CGCCAGGATC AGGGGTTTCC TCAAGCGGAG CAAGACAGAT
CGAACCATGG TCTGCATGGA CAACGTCGTT GAAGATGTCA TAGGCCTGGC GCGTGATTCG
CTCAGATCCG CTGGCGTCCA GTTGATCAAG CACGTCGACT CCGATCTGCC TCGCGTGTTC
GCGGACAGCG TCCAGCTCCA GCAGGTGATC CTCAATCTGA TGATGAATGG CATCGAGGCC
ATGGGCTCCT GCGCGACACT TGAGCGTCAG CTGGAGCTGC GCGTCGTGAA GCACGGTGGA
GATATCGATG TTTCGGTCAG CGACTCAGGG ACAGGACTGG TGACTGCTGA TTTCGAGCGG
ATATTCGAAG CGTTCTACAC CACCAAGCCC GACGGCATGG GTATGGGACT GGCGATTTGC
CGATCCATCG TCGAGGCACA TGGTGGACGG TTGTGGGCCC AGGCGAACAA GACGCAAGGA
TTGACGCTGC AGTTCCGTCT GCCGATCGCA GAGCACGCCG AACCATGA
 
Protein sequence
MPNINRQLFA ASDSLCVRDV TLRATDDVQT HREKLARIVL DELYEFVGLL DAHGTTLEIN 
RAALEGAGIA LDDIQGRPFW EARWWATSPE VRREQREVIR RAGEGEFVRR DFEIYGQQGG
QETILIDYSL LPIRDNSGKI VFLLPEGRNI TDKKRAEAEI ARKNRELQRL LDKIQRLDDA
KSDFFANVSH ELRTPLALIL GPSESLLATS EGLSDAQRRD LRVIQRNAAM LMKHVNDLLD
LAKFDAGKMA LRYTRVDLAA EVRTLAAHFE AVAAERSLSY VVQAPAALEV EVDQQMFERI
LLNLLSNAFK FTPDFGRIRC SLEANPDHSI QLVVEDSGCG VRADLREEIF ERFHQAQSGT
TRSFSGTGLG LAIAKEFVDL HTGTISVSDA IGGGAQFRVE LPSRAPLGAY IRSVDSPLGN
RNRGQIVGTI EELQRAEFDA VSDLSGSEKP LVLVAEDNAD MRRFIVEVLS SDFRVVHAAD
GLQALTQARA QAPDAIITDL MMPKLGGDKL VSELRSTPEL AHIPVLVLSA KADESLRLKL
LSDSVQDYIV KPFSSRELLV RVRNIVTMKL AREALQKELA SQNEDLAQLT QQLIASKQGL
QRSHDALKES ERRWRAVYEN TAVGVSLSDL QGNMHAANPA LQEMLGYTES ELIGLGNLMT
DAEAGHEDRR LQLERLVNGS QVEMRQQRRY RHRNGMTILA NVRESLIPGT SDLPPTLITV
VEDITTQKRA EVELAQTKDA LARVSRVTTM GELAASIAHE VNQPLTAVVV NGHACLRWLS
TEPRNDLEVQ DAIQRIVRDA NRASEVIARI RGFLKRSKTD RTMVCMDNVV EDVIGLARDS
LRSAGVQLIK HVDSDLPRVF ADSVQLQQVI LNLMMNGIEA MGSCATLERQ LELRVVKHGG
DIDVSVSDSG TGLVTADFER IFEAFYTTKP DGMGMGLAIC RSIVEAHGGR LWAQANKTQG
LTLQFRLPIA EHAEP
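
The sequences above are printed in 10-character blocks, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be joined, its GC fraction computed, and its translation compared with the protein sequence. The helper names are hypothetical. The codon table is the standard amino-acid assignment in NCBI's TCAG ordering, which translation table 11 shares with table 1. Table 11 also allows alternative start codons, which this sketch ignores; that does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two blocks of the gene sequence, truncated to whole codons.
first_codons = join_blocks("ATGCCCAATA TCAACCGCCA")[:18]
print(translate(first_codons))  # MPNINR
```

The printed residues match the start of the protein sequence (MPNINR). Applying join_blocks to the full gene sequence and passing the result to translate and gc_fraction would allow a comparison with the full protein sequence and the listed GC content.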