Gene Mpe_A0962 details

Gene Information

Locus tag: Mpe_A0962
Symbol:
ID: 4787108
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1020469
End bp: 1022340
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 69%
IMG OID: 640089524
Product: putative phenol-degradative gene regulator transcription regulator protein
Protein accession: YP_001020159
Protein GI: 124266155
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
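
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1
```

Under these assumptions, gene_length(1020469, 1022340) gives 1872 and protein_length(1872) gives 623, matching the Gene Length and Protein Length fields.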


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAATC CTCTCCATTC GCATGCGCAC GGGATCACGT CGATGAACGC TTCACCTGCA 
CCGAACGCCG CCGACTTTCT CCACGCGCAT TACCTGGCCA GTCGCAGCGA CCCGTCGGCC
ACGGAGCTCC GGCCCACGCA CCGCGAGTTG GCGGAGTGGC TGACCTTCGA CCCGGACACC
GGGCGCATCT GGCTCGACGA CGAGCGCATG GTGATGCTGA ACCACCAGAC GCTCGGCTCG
CTGCGCAGCG AACTGATCCA GGGCGTCGGT GTGGAGCGGG CGCGCGGCAT CCTCACCCGC
AGCGGCTACA TCGCCGGCAT GCGCGACGCG CGCTTCGTGG CCTCGCACTG GCCCAAGGAA
GACCCGGTCG GCAAGCTGCT GGCCGGCACG CGCCTGTTCG GCCTGCAGGG CCTCGCCCGT
GTCGAGGCGG TGCACTTCGA CTACGACCCC GAGCGCATCG CCTTCAGCGG CGAGTTCTTC
TGGCATCACT CGCTGGAGGA CGACGAACAC ATCAACGCCT TCGGCGTCGG CACCGACGCG
GGCTGCTGGC TGCCGATCGG CTACGCCAAC GGCTACGTCT CCACGCTGCT TGGCACGTCG
ATCATCTTCC GCGAGCTCGA GTGCCGCTCG ACCGGTGCCT CGGTCTGCCG CGTGATCGGC
AAGGCCGCCC GCGACTGGGA CAACGTGGAG GAAGACCTGC GCTACATGCA GGCCGAGGGC
TTCGTCGGCG AGCGCCCGGG TCCGGAACCG ACGCCGCGCG ACGGTCCGCG CCTCGGCGTG
CACGACGCCG CGGCGGATGC GGCGCCGCGC ATGGTGGGCG TCTCGTCGAG CTTCAATGCC
GCCTGCCACT CGCTGCGCAA GGTGGCCACC ACCACCGCGA CGGTCCTGTT CTTCGGCGAG
TCCGGGGTCG GCAAGGAGCT GTTCGCCAAG ATGCTGCACG AGATCAGCGA GCGACGCGAC
GGGCCCTTCG TGTCGGTCAA CTGCGCGGCG ATCCCGGAGA ACCTTGTCGA GGCCGAGCTG
TTCGGCGTCG AGCGCGGCGC CTACACCGGC GCCCACGCGT CACGGCCGGG CCGCTTCGAG
CGCTCGCACC GGGGCACGCT GTTCCTCGAC GAGATCTCGT CGCTGAGCCT GGTCGCGCAG
AGCAAGCTGC TGCGCGCGCT GCAGGAGAGC GAGGTCGAGC GGGTCGGTGG CACCCGCCCG
ATCCGGCTCG ACCTGCGCGT GGTGGCCGCG TGCAACGTCG ACCTGCGCGA GGAAGTCCGG
CAGGGGCGCT TCCGCGAAGA CCTGTTCTTC AGGCTCAACG TCTACCCGAT CCACCTGCCG
CCGCTGCGGG CCCGCAGCGA GGACGTGCCT CTGCTGATGA CGCACTTCCT CAAGCTCTAC
GGTGCCCGCT ACGGCCGGCA GATCCACGGC TTCACGCAGC GCGCGATGCG CGCGATGCTG
ACCTACACCT TCCCCGGCAA CGTGCGCGAA CTGCAGAACA TGATCGAGCG GGCCGTGATT
GCCGCCAGCG ACGAGCCGCA GATCGATACC GTGCACCTGT TCCGCGACGA GCTGTTCAAC
GCGAAGTCGA CCTTCGCCGT CGGCGATCAA GGCCGATTGG CCGACACCGA GGGGTCGGCG
CCCTCCGCGC CGCATCCGGA GCCCTCGGGC GACGAGACGC TGCTCGACCG GGTGCTCCGC
CTGCGCAGCG CGGTGCGGAG CCCTCAGGCG TCCGACACCT GGCTCGAAGA ACTGGAGACC
CAGCTCGTCG AGGAAGCGGT GCAACGCTGC AACGGCAACA TGGCCGCCGC GGCACGGCTG
ATCGGCATGG CGCGGACGCA GGTGGTCTAT CGCATGCGCA GGAAGAACGT GACGACTCAG
GCGAGGGCCT GA
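
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as a plain string in the space-separated 10-base blocks shown above. The helper name gc_fraction is hypothetical.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq.

    Spaces, newlines and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)
```

Calling gc_fraction on the full gene sequence and multiplying by 100 gives a percentage that can be compared with the GC content field.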
 
Protein sequence
MSNPLHSHAH GITSMNASPA PNAADFLHAH YLASRSDPSA TELRPTHREL AEWLTFDPDT 
GRIWLDDERM VMLNHQTLGS LRSELIQGVG VERARGILTR SGYIAGMRDA RFVASHWPKE
DPVGKLLAGT RLFGLQGLAR VEAVHFDYDP ERIAFSGEFF WHHSLEDDEH INAFGVGTDA
GCWLPIGYAN GYVSTLLGTS IIFRELECRS TGASVCRVIG KAARDWDNVE EDLRYMQAEG
FVGERPGPEP TPRDGPRLGV HDAAADAAPR MVGVSSSFNA ACHSLRKVAT TTATVLFFGE
SGVGKELFAK MLHEISERRD GPFVSVNCAA IPENLVEAEL FGVERGAYTG AHASRPGRFE
RSHRGTLFLD EISSLSLVAQ SKLLRALQES EVERVGGTRP IRLDLRVVAA CNVDLREEVR
QGRFREDLFF RLNVYPIHLP PLRARSEDVP LLMTHFLKLY GARYGRQIHG FTQRAMRAML
TYTFPGNVRE LQNMIERAVI AASDEPQIDT VHLFRDELFN AKSTFAVGDQ GRLADTEGSA
PSAPHPEPSG DETLLDRVLR LRSAVRSPQA SDTWLEELET QLVEEAVQRC NGNMAAAARL
IGMARTQVVY RMRRKNVTTQ ARA
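
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI codon ordering, whose amino acid assignments are shared by translation table 11. The names CODON_TABLE and translate are hypothetical. The sketch does not handle the alternative start codons that table 11 allows, which does not matter here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first; a codon with a non-ACGT character raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```

For example, the first 30 bases of the gene sequence translate to MSNPLHSHAH and the last 12 bases (GCGAGGGCCT GA) translate to ARA, matching the start and end of the protein sequence above.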