Gene Mpe_A0807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0807 
Symbol 
ID4784491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp845877 
End bp846872 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content73% 
IMG OID640089368 
Productvanillate O-demethylase oxygenase subunit 
Protein accessionYP_001020004 
Protein GI124266000 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACG AGCATCGCTG GTGGTATCCG GTCGCCACCG CTGCCGACCT GGGCGCCGGC 
CCGCTGCCGG CGGCGCTGTT CGGTGAGGAC CTGGTGCTGT GGCGCGACGA AGCCGGCACG
CCGCACGCCT TCACGGATCG CTGCCCGCAC CGCGGCACGC GGCTCTCGCT CGGCGCGGTG
CGCGTCGTCG ACGGCCGGGC GCAGCTCGAG TGTCCGTACC ACGGCTGGCG CTTCGACGGC
GGCGGGCGCT GTCTGCGCAT CCCGGCGCTG CCCGACTTCA CGCCGGCGAC TGGCCATGCG
GCGCGGGCCC ATCCGCTGCG CGAGGCGCAT GGCCTGCTGT GGGTCGTGCT CGGCGGTGAT
GCCAACCTGG AGACCGTCGC CACGGCCTGC CTGCCCGACC CGGGGCCGGT ACCGGGCCGT
GCCGTCGTCT GCGGTCACTA CGACGTGGGC ACATCGGCGC CGCGGGTGGT GGAGAACTTC
CTCGACACCT CGCACTTCGC CTTCGTGCAT GAAGGCTGGC TCGGCGACCG CGACCACACC
GAAGTGCCGA TCTACGACGT GGTGCCCGAC GCCAACGGCG CGCCTGGCGT GCCGCACTAC
CGTGCGTGGC AGCCGCAGGC CAGCGCGCAG TCGGCCGGCG GCGCCTGGGT CGACTACCGC
TACCAGGTGC TGTCTCCCTG CAGCGCCTTG CTGGTCAAGC AGGCCGGCGA CGACGCGCAG
ACGACGCAGG AGGCCTATGC GTTATGGGTT GCGCCGCTGG AACCTGAGCG CAGCCGCGTG
TGGTTCACGC TGTTCACCTG CGATACCGCC ACGCCCGACG AGACGCTGCG CGCCTTCCAG
CACGGCATCT TCACGCAGGA CCAGCCGGTG CTCGAATCGC AGCGGCCGCG CCGGCTGCCG
CTGAGCGGCA GCGAGGCGCA CTGCGCGGCC GATCGCCTGA GCACCGCCTA CCGGCGCTAC
CTGCAGGCGC AGGGCCACAC CTACGGCACC TGCTGA
 
Protein sequence
MNDEHRWWYP VATAADLGAG PLPAALFGED LVLWRDEAGT PHAFTDRCPH RGTRLSLGAV 
RVVDGRAQLE CPYHGWRFDG GGRCLRIPAL PDFTPATGHA ARAHPLREAH GLLWVVLGGD
ANLETVATAC LPDPGPVPGR AVVCGHYDVG TSAPRVVENF LDTSHFAFVH EGWLGDRDHT
EVPIYDVVPD ANGAPGVPHY RAWQPQASAQ SAGGAWVDYR YQVLSPCSAL LVKQAGDDAQ
TTQEAYALWV APLEPERSRV WFTLFTCDTA TPDETLRAFQ HGIFTQDQPV LESQRPRRLP
LSGSEAHCAA DRLSTAYRRY LQAQGHTYGT C