Gene Mpe_A0963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0963 
Symbol 
ID4787109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1022615 
End bp1023667 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content69% 
IMG OID640089525 
Productvanillate O-demethylase oxygenase subunit 
Protein accessionYP_001020160 
Protein GI124266156 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGTGA AGAACGCGTG GTACTGCGCG GGGTGGGACA AGGACCTGAG CCTGGGCCGT 
GACGGCCTGC TGGCGCGGCG CATCGCCGGC GAGTCGCTGG TGCTCTACCG TCGACCCGAT
GCCGCGGTGG TGGCGATGGA GGACCGCTGC TGCCACCGGC ACGCACCGCT GTCGCTGGGG
CGCAAGGAGG GCGACTCGAT CCGCTGCATG TACCACGGCA TGAAGTTCGG GCCCGACGGC
CGCTGCACCG AGATCCCGGG CATGAGCCGG ATCCCCGAGA AGGCCTGCGT GCGCACCTAC
CCGGTCGTCG AGCGCGACAA CTGGATCTGG GTCTGGATGG GCGAGCCCGC GAAGGCCGAC
CCGGCGTTGA TCTGCGAGGC CATCGGTCCT GGCGACCCGG CCTGGAACCT GCGGCTCGGC
TATGTGCGCG TCGACACCAA CTACCGGCAG GAGATCGCGA ACCTGGCCGA CCTGAGCCAC
GTGGCCTGGG TGCACAGCCA GACGCTGGGC GGATCGGATG CCTGGTCGAA CATCAAGCCG
CGCCATGAGC TGACCGAGCG CGGCATCGAC ACCCGCTACT GCGTGCGCCG CACGCCGCCC
CCCAGTTTCG CCAGGCACCT GTTCCCGGAG GGCGCGCTGT TCGACATCCA GGTCCATGTG
CGCATGAGCG TGCCATGCAA CTTCATCCTG CATTTCTCGG TGCACGAGGT GGGCAGCGCG
ACCGAGGGGC CGACCAACGG ACGCCTGGTG CTCGACACCT TCTCCAGCCA GGCCGTGACG
CCGCGCGACG CGCACTCCTG CGACTACTAC TACTCCTGGG GCTGCAGCCG CGCCACCGAC
ATGCCGGGCC TCACCGACCT GATGCACGAG GCCAACAACG ACGCCTTCCT CGAGGACAAG
GCGATGCTCG AAGGGCAGTA CCAGCGGATG CGCGAGCGCC CCGACGCGCC CAGCGTGGAC
ATCGTCCACG ACGCGGGGCC CGGCAAGTTG CTGTGGGTGC TGGACCGCCT GCTGAAGGCG
GAGGCGCGCG CGATCGAGAT CGTTCCGGCC TGA
 
Protein sequence
MFVKNAWYCA GWDKDLSLGR DGLLARRIAG ESLVLYRRPD AAVVAMEDRC CHRHAPLSLG 
RKEGDSIRCM YHGMKFGPDG RCTEIPGMSR IPEKACVRTY PVVERDNWIW VWMGEPAKAD
PALICEAIGP GDPAWNLRLG YVRVDTNYRQ EIANLADLSH VAWVHSQTLG GSDAWSNIKP
RHELTERGID TRYCVRRTPP PSFARHLFPE GALFDIQVHV RMSVPCNFIL HFSVHEVGSA
TEGPTNGRLV LDTFSSQAVT PRDAHSCDYY YSWGCSRATD MPGLTDLMHE ANNDAFLEDK
AMLEGQYQRM RERPDAPSVD IVHDAGPGKL LWVLDRLLKA EARAIEIVPA