Gene Mpe_A2634 details


Gene Information

Locus tag: Mpe_A2634
Symbol:
ID: 4785859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2809717
End bp: 2810877
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 69%
IMG OID: 640091205
Product: putative dioxygenase (alpha subunit) oxidoreductase protein
Protein accession: YP_001021823
Protein GI: 124267819
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
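
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon; neither convention is stated in the record, but both are consistent with the listed lengths. The function and variable names are illustrative only.

```python
# Values copied from the record; names are hypothetical.
start_bp = 2809717
end_bp = 2810877
gene_length_bp = 1161
protein_length_aa = 386


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_bp = span_length(start_bp, end_bp)
computed_aa = coding_length_aa(computed_bp)

# Compare the computed values with the ones listed in the record.
print(computed_bp == gene_length_bp, computed_aa == protein_length_aa)
```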


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.456409
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGACC TGAGCATCAC CCGCGAAGTC CTCGGGCGCA GCCGCGCGCA GCTGCCCGTG 
TCGAGCTATT TCGACGAGGA TCTGTACCGC CGCGAGGAGG AGCTCATCAT CCGACCTGGC
CCGCGCTACG TCGGGCATGC GCTCTCGGTG CCCGAGGTCG GCGACCACCA TGCGCTGCTG
CAGGAAGGCG AAGGCCGGGC GCTGGTGCGC ACGCCGCAGG GCATCGAGCT GATCTCCAAC
GTCTGCCGCC ACCGCCAGGC GGTGATGCTG CGCGGGCGCG GCAACACGAA GAACCACATC
GTCTGCCCGC TGCACCGCTG GACCTACGAC CTGTCGGGGC AGCTGGTCGG CGCGCCGCAC
TTCCCCGACG ACCCCTGCCT GCACCTGAAC AACTACCCGG TGCAGCAGTG GAACGGGCTG
CTGTTCGAGG CGCCGCGGGA GGTGAACGGC CAACGGATCG GTCGCGACGT GCACGCCGAC
CTGGCCGGCC TCGCGCTGCC GCCGGAGTTC GACTTCTCGG GTTATGTGTT CGACCGCGTG
CACCTGCACC AGTGCGACTA CAACTGGAAG ACCTTCATCG AGGTCTACCT CGAGGACTAC
CACGTCGGCC CCTTCCACCC CGGGCTCGGC AACTTCGTCG CCTGCGACGA CCTGCGCTGG
CAGATGGGCC CCGAGCATTC GCTGCAGACC GTCGGCGTGG CGCGCCATGC CGGCGAGGCG
CTGGGCAAGC CCGGCTCCGA CGTCTATGGC CGCTGGCACC AGGCGCTGCT GAACTACCGC
CGCGACGCGC ACGACAGCGT GCCGCCGAAG CACGGTGCCA TCTGGCTGAC CTACCACCCG
ACGGTGATGC TGGAGTGGTA CCCGCATGTG CTGGTGGTCT CGACCCTGGT GCCGCAGGGG
CCGCGCAAGA CGCTCAACGT GGTCGAGTTC TTCTACCCCG AGGAGATCGC CGCGTTCGAG
CGCGAGTTCG TCGAGGCGCA GCAGGCGGCC TACATGGAGA CCTGCGTGGA GGACGACGAG
ATCGCGCTGC GCATGGACGC CGGGCGCGAG GCGCTGTGGC GACGTGGCGA CGACGAGTTC
GGCCCCTACC AGAGCCCGAT GGAAGACGGC ATGCAGCACT TCCACGAGTG GTATCGGCGG
CGCATCGCGC TGCCGCGCTG A
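
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch, assuming GC content means the percentage of G and C among the A, C, G and T bases; how the record rounds its value is not stated. Only the first row of the gene sequence is used as sample input, and the helper name is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so a sequence
    printed in space-separated blocks can be passed in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above, used as sample input only.
first_row = "ATGTCCGACC TGAGCATCAC CCGCGAAGTC CTCGGGCGCA GCCGCGCGCA GCTGCCCGTG"
print(f"{gc_percent(first_row):.1f}% GC in the first 60 bases")
```

To check the whole gene, pass all rows of the gene sequence joined into one string.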
 
Protein sequence
MSDLSITREV LGRSRAQLPV SSYFDEDLYR REEELIIRPG PRYVGHALSV PEVGDHHALL 
QEGEGRALVR TPQGIELISN VCRHRQAVML RGRGNTKNHI VCPLHRWTYD LSGQLVGAPH
FPDDPCLHLN NYPVQQWNGL LFEAPREVNG QRIGRDVHAD LAGLALPPEF DFSGYVFDRV
HLHQCDYNWK TFIEVYLEDY HVGPFHPGLG NFVACDDLRW QMGPEHSLQT VGVARHAGEA
LGKPGSDVYG RWHQALLNYR RDAHDSVPPK HGAIWLTYHP TVMLEWYPHV LVVSTLVPQG
PRKTLNVVEF FYPEEIAAFE REFVEAQQAA YMETCVEDDE IALRMDAGRE ALWRRGDDEF
GPYQSPMEDG MQHFHEWYRR RIALPR
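
The protein sequence should correspond to the gene sequence translated codon by codon. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (the tables differ in which codons may act as start codons, and this gene starts with ATG). It translates only the first 30 bases and the last row of the gene sequence as samples; all names are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for block layout and upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Samples copied from the sequences above.
gene_start = "ATGTCCGACC TGAGCATCAC CCGCGAAGTC"
gene_end = "CGCATCGCGC TGCCGCGCTG A"

print(translate(gene_start))  # compare with the first block of the protein sequence
print(translate(gene_end))    # compare with the last six residues
```

Running `translate` on the full gene sequence and comparing the result with `clean` applied to the protein sequence extends the same check to the whole record.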