Gene Mpe_A0876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0876 
Symbol 
ID4787199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp916066 
End bp917397 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content72% 
IMG OID640089437 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_001020073 
Protein GI124266069 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0480084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGGCG AGGACGGGAC TTCGATGCAC CACGCTCCCC TGCGCTACCA GAGCGGCTTC 
GCCAACCACT TCGAGAGCGA GGCGCTGCCC GGTGCGCTGC CGGTGGGCCG CAACTCGCCG
CAGCGCTGCC CCTACGGTCT GTACGCCGAG CAGTTCAGCG CCACCGCCTT CACCGCGCCG
CGCGCCGACA ACCGCCGCAG CTGGCTCTAC CGCATCCGCC CGGCCGCGAT GCACGCACCG
TTCGAGCCGT ACGACGACGG CGGCCGGTTG GTCAGCGACT TCTCCACGCT CGCCACGCCG
CCCGATCCGC TGCGCTGGAA CCCGTTGCCG CTGCCGGCGG CGCCGACCGA CTTCGTCGAC
GGCCTCGTGA CCTGGGCCGG CCACGGCGAT GCGGGCGTGC AGGCCGGCGC CGCGGTCCAT
CTCTATGCCG CCGACCGCTC GATGGAGCAG CGCAGCTTCT GCAGTGCCGA CGGCGAGCTG
CTGATCGTGC CCCAGCTCGG CCGCCACCGC TTCGTCACCG AGCTCGGCGT GCTGGAGGTC
GAGCCGCAGG AGATCGTCGT CATCCCGCGC GGCCTGCGCT TTCGCGTCGA GCTGCCCGAC
GGCGCGGGCC GCGGCTACGT GTGCGAGAAC CACGGCGCGC CGTTTCGCCT GCCCGACCTG
GGGCCGATCG GCGCCAACGG CCTGGCGCAT GCGCGCGACT TCCTCGCGCC GGTGGCGGCC
TACGAGGACA TCGACGGGCC GCACCAGCTG GTCACCAAGT TCATGGGCCG GCTGTGGTCG
GCGGCGATGG ATCACTCGCC GCTCGACGTG GTGGCCTGGC ACGGCAACTG CGCACCGTAC
AAGTACGACC TGCGGCGCTT CAATGCCATC GGCTCGATCA GCCACGACCA CCCCGATCCG
TCGATCTTCC TGGTGCTGCA TGCGGCCTCC GACACGCCGG GCACCAGCGC CATCGACTTC
GTGGTCTTCC CGCCGCGCAT CCTGGCGATG CAGGACACCT TCCGGCCGCC CTGGTTCCAC
CGCAACGTCG CCAGCGAGTT CATGGGCCTG ATCCACGGCG TGTACGACGC CAAGGCCGAA
GGCTTCCTGC CCGGCGGCGC GAGCCTGCAC AACTGCATGA CCGGCCACGG CCCCGACGCC
GAGACCTTCG AGAAGGCGAG CCGTGCCGAC CTGTCGCAGC CCGACGTGAT CCGCGACACC
ATGGCCTTCA TGTTCGAGGC GCGCCACGTC TGGCGCCCGA CACCCCGGGC GCTGGCCTCG
CCGCTGCGGC AGGCCGACTA CGCGCGCTGC TGGCAGGGCC TGCGCCGGCA CTTCGATCCC
GCACGGCGCT GA
 
Protein sequence
MLGEDGTSMH HAPLRYQSGF ANHFESEALP GALPVGRNSP QRCPYGLYAE QFSATAFTAP 
RADNRRSWLY RIRPAAMHAP FEPYDDGGRL VSDFSTLATP PDPLRWNPLP LPAAPTDFVD
GLVTWAGHGD AGVQAGAAVH LYAADRSMEQ RSFCSADGEL LIVPQLGRHR FVTELGVLEV
EPQEIVVIPR GLRFRVELPD GAGRGYVCEN HGAPFRLPDL GPIGANGLAH ARDFLAPVAA
YEDIDGPHQL VTKFMGRLWS AAMDHSPLDV VAWHGNCAPY KYDLRRFNAI GSISHDHPDP
SIFLVLHAAS DTPGTSAIDF VVFPPRILAM QDTFRPPWFH RNVASEFMGL IHGVYDAKAE
GFLPGGASLH NCMTGHGPDA ETFEKASRAD LSQPDVIRDT MAFMFEARHV WRPTPRALAS
PLRQADYARC WQGLRRHFDP ARR