Gene Mpe_A2470 details


Gene Information

Locus tag: Mpe_A2470
Symbol:
ID: 4785667
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2630709
End bp: 2632139
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 69%
IMG OID: 640091040
Product: coproporphyrinogen III oxidase
Protein accession: YP_001021660
Protein GI: 124267656
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00538] oxygen-independent coproporphyrinogen III oxidase
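
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed for Mpe_A2470.
length = gene_length(2630709, 2632139)
print(length, protein_length(length))
```

With the listed values this prints 1431 and 476, which agrees with the Gene Length and Protein Length fields.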


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.097815
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCCTC CGACCCTGAC CGACATGCTG GAACCGGCTG CGCCGAGCGA GGAACTCCTG 
AGTCGTCTGG ACGCCGGTGG GCCGCGCTAC ACCTCCTACC CGACCGCCGA CCGCTTCGTG
GAGGCCTTCG GCCCCGCTCA GTACCGGCAG GCCCTGGGCC AGCGCCGCAG CGGAGCCCCG
GCGGCCGGCG CGGTGGGCGG CGGCACACCG CTGTCGATCT ACGTTCACAT CCCATTCTGC
GAGTCGGTCT GCTACTACTG CGCCTGTAAC AAGGTGATCA CCCGGCAGCA CCGGCGCGGC
ACCGAGTACC TGGAGTGGCT GGGCCGCGAG GTGGCATTGC ACCAGGACGT GATCGGCAGC
CGCCAGCGCG TCAGTCAGCT GCACCTCGGC GGCGGTACGC CGACCTTCCT GGACGATACG
GAGCTGTCGC AGCTGATGGC CCTGCTGCGC GGTGCCTTCG ACCTGCAGCC GCAGGCCGAA
TGCTCGATCG AGGTCGACCC GCGCACCGTC AGTGCCTCGC GGCTGGCGCA CCTGAAGGCG
CTGGGTTTCA ACCGCCTGAG CTTCGGCGTG CAGGACTTCG ACCCCGAAGT GCAGCACGCG
GTGCATCGCG TGCAGCCGGC CGAGCAGGTG GAGGCACTGA TCGCCAGCGC GCGCGCATTG
GGCTTCGATT CGATCAACGT CGACCTGATC CACGGCCTGC CCAAGCAGAC CCCGGCCTCG
CTGGCCCGGA CGGTGGAGCA GGTGGCGCGG TTGCGGCCGG ACCGCGTGGC GCTCTACAGC
TATGCGCACC TGCCGCAGCG CTTCAAGCCG CAGCGGCGCA TCCATGCGGC AGACCTGCCA
GGCACCGCCC AGCGCGTGAC GATGCTGTCG AACGCGATCG CCGCCTTCCT GGCGCACGGT
TACGACTACA TCGGCATGGA CCACTTCGCC CTGCCGGGTG ACGCGCTGGC CGCGGCCAAG
CGGCAGGGCC GGCTGCACCG CAATTTCCAG GGCTACAGCA CCCAGCCCGA CTGCGACCTG
ATCGGCCTGG GCGTCTCGGC CATCGGCCGC ATCGGTGCGA CCTACAGCCA GAATGCGAAG
ACGCTGCCGG AGTACCAGGA CGCCCTGCGC TCCGGGGAAC TGCCGGTGGT GCGCGGCCTC
GTGCTGACGC GCGACGACGT GGTGCGCCGC TCGGTGATCA TGGCGCTGAT GTGCCAGGGG
CGGGTGGTGT TCGAATCCAT CGAACTGGCT CACCTGCTCG ATTTCCGCCG CTACTTCGAG
GCCGAACTGC GCCGGCTGCA GCCACTGGCC GGGCAGGGGC TGATCGAGAT CGACGACGAC
GCGATCCAGC TCACCCCGCT GGGCTGGTAC TTCGTGCGCG CCGTGGCGAT GGTGTTCGAT
CGTCACCTGC AGGCCGACCG CGATCGCGAA CGCTATTCGC GGCTGATCTG A
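
A GC content figure like the one in the gene record can be recomputed from a sequence block of this form. The sketch below is one plain way to do it, assuming the percentage is simply (G + C) / total bases, which may differ from the database's own rounding or method. `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence_block: str) -> float:
    """Percent G+C in a sequence; spaces and line breaks are ignored."""
    bases = "".join(sequence_block.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence above (14 of them are G or C).
print(gc_content("ATGCCCCCTC CGACCCTGAC"))
```

Passing the full gene sequence block, including its spaces and line breaks, works the same way.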
 
Protein sequence
MPPPTLTDML EPAAPSEELL SRLDAGGPRY TSYPTADRFV EAFGPAQYRQ ALGQRRSGAP 
AAGAVGGGTP LSIYVHIPFC ESVCYYCACN KVITRQHRRG TEYLEWLGRE VALHQDVIGS
RQRVSQLHLG GGTPTFLDDT ELSQLMALLR GAFDLQPQAE CSIEVDPRTV SASRLAHLKA
LGFNRLSFGV QDFDPEVQHA VHRVQPAEQV EALIASARAL GFDSINVDLI HGLPKQTPAS
LARTVEQVAR LRPDRVALYS YAHLPQRFKP QRRIHAADLP GTAQRVTMLS NAIAAFLAHG
YDYIGMDHFA LPGDALAAAK RQGRLHRNFQ GYSTQPDCDL IGLGVSAIGR IGATYSQNAK
TLPEYQDALR SGELPVVRGL VLTRDDVVRR SVIMALMCQG RVVFESIELA HLLDFRRYFE
AELRRLQPLA GQGLIEIDDD AIQLTPLGWY FVRAVAMVFD RHLQADRDRE RYSRLI
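
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares. It does not handle the alternative start codons that table 11 allows, which is harmless here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence_block: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(sequence_block.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence; prints the first 10 residues.
print(translate("ATGCCCCCTC CGACCCTGAC CGACATGCTG"))
```

For the first 30 bases this prints MPPPTLTDML, the first ten residues of the protein sequence above.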