Gene Mpe_B0555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_B0555 
Symbol 
ID4787398 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008826 
Strand
Start bp502407 
End bp503819 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content67% 
IMG OID640092982 
Productphthalate 4,5-dioxygenase 
Protein accessionYP_001023560 
Protein GI124263090 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.547165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.18525 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAACA GAGAGCCTTT GGCCGCGGCC GGGCAGGGCA CAGCCTACAG CGGGTACCGG 
CTGCGCGACC TGCAGAATGC CGCCCCCACG AACCTGGAAA TCCTTCGTAC GGGCCCCGGC
ACGCCGATGG GCGAGTACAT GCGCCGCTAC TGGCAGCCCG TATGCCTGTC GCAGGAACTG
ACCGACGTGC CCAAGGCGAT CCGGATCCTG CACGAGGATC TGGTGGCATT CAGGGACCGC
CAGGGCAACG TCGGCGTGCT GCACCGCAAG TGCGCCCACC GCGGGGCCTC GCTCGAGTTC
GGCATCGTGC AGGAACGCGG GATCCGCTGC TGCTACCACG GTTGGCACTT CGACGTCGAC
GGCAAACTGC TGGAGGCGCC GGCGGAACCC CCCGACACCA AGCTGAAGGA AACCGTCTGC
CAGGGCGCCT ATCCGGCCTT CGAGCGCGAC GGCCTGGTGT TCGCCTACAT GGGGCCGGCG
GATCGCAGAC CGGAGTTCCC GGTGTTCGAC GGCTACGTGT TGCCGAAGGG AACGCGGTTG
ATTCCGTTCT CCAATGTCTT CGACTGCAAC TGGCTTCAGG TCTACGAAAA CCAGATCGAC
CACTACCACA CCGCGCTGCT GCACAACAAC ATGACGGTCG CCGGCGTGGA CTCGAAGCTG
GCCGACGGCG CGACGCTGCA GGGGGGCTTC GGCGAGATGC CAATCATCGA CTGGCACCCG
ACCGACGACA ACAACGGCAT GATCTTCACC GCCGGCCGGC GCCTGTCGGA CGACGAAGTC
TGGATCCGAA TCTCGCAGAT GGGCCTGCCG AACTGGATGC AGAACGCCGC CATCGTGGCG
GCGGCGCCGC AGCGACACTC CGGCCCGGCG ATGTCGCGTT GGCAGGTGCC GGTCGACGAC
GAGCACTCGA TCGCCTTCGG CTGGCGCCAC TTCAACGACG AGGTGGACCC GGAGCACCGT
GGAAGGGAAG AGGAGTGCGG GGTCGACAAG ATCGACTTTC TGATCGGTCA GACCCGGCAT
CGGCCTTATG AAGAGAGGCA GCGGGTTCCG GGCGACTACG AAGCCATCGT CAGCCAGGGG
CCGATAGCCG TCCACGGCCT TGAGCATCCC GGCCGGTCGG ACGTGGGTGT GTACATGTGT
CGCTCGCTGC TTCGCGACGC TGTGGCCGGC AAGGCGCCGC CCGACCCGGT GCGCGTGAAG
GCTGGGTCGA CCGATGGGCA AACGCTGCCG CGATACGCGT CGGACAGTCG ACTGCGGATC
CGCCGCCGGC CGAGCCGGGA AGCGGACAGT GACGTCATCC GCAAGGCCGC GCACCAGGTT
TTCGCGATCA TGAAGGAGTG CGACGAACTG CCGGTCGTGC AGCGCAGGCC GCATGTCCTG
CGGCGCCTCG ACGAGATCGA AGCGAGCCTC TGA
 
Protein sequence
MGNREPLAAA GQGTAYSGYR LRDLQNAAPT NLEILRTGPG TPMGEYMRRY WQPVCLSQEL 
TDVPKAIRIL HEDLVAFRDR QGNVGVLHRK CAHRGASLEF GIVQERGIRC CYHGWHFDVD
GKLLEAPAEP PDTKLKETVC QGAYPAFERD GLVFAYMGPA DRRPEFPVFD GYVLPKGTRL
IPFSNVFDCN WLQVYENQID HYHTALLHNN MTVAGVDSKL ADGATLQGGF GEMPIIDWHP
TDDNNGMIFT AGRRLSDDEV WIRISQMGLP NWMQNAAIVA AAPQRHSGPA MSRWQVPVDD
EHSIAFGWRH FNDEVDPEHR GREEECGVDK IDFLIGQTRH RPYEERQRVP GDYEAIVSQG
PIAVHGLEHP GRSDVGVYMC RSLLRDAVAG KAPPDPVRVK AGSTDGQTLP RYASDSRLRI
RRRPSREADS DVIRKAAHQV FAIMKECDEL PVVQRRPHVL RRLDEIEASL