Gene Mpe_B0016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_B0016 
Symbol 
ID4787619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008826 
Strand
Start bp13393 
End bp15450 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content62% 
IMG OID640092427 
Productendothelin-converting protein 1 
Protein accessionYP_001023032 
Protein GI124262562 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3590] Predicted metalloendopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00932901 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAAAAA ACCTGCTGTT TCTACCCGTC CTGCTAGCGA CGGCGCTGGC CGTGGCCGGT 
CCCAAGTCCG AAGGTCCACT GCAATCCCTT CCTTACACCC CAAGCCTCGA CCTGAGCGCG
ATAGACCGGA GCGTGAACCC CTGCGACGAC CTCTATCAAT ACGCGTGCGG CGGTTGGATC
AAGAACAACC CTATACCGTG GGACCAGGCG CGATGGGACG TGTATTCCAA GGCGACCAAC
GAGAACCAGC GCTACCTCTG GGGCATTCTG GCCGAGCTCG CCGCTGGCAG CACGGACCGC
AGCGCCACCC AGGTCAAGCT GGGCGACTAC TTCGCCGCGT GCATGGACGA GGCCGCCGTG
CAGGCCGCCG GCCTCAAACC CTTGCAGCCT TATCTCGACC GCATCGATGC GATGGCGAGC
AACCGCCAGC TCCCGGCCCT GCTTGCCGAC CTACAACTCG TCACTGGCAA CGAACGGCTG
TTTTTCGGCT TCAGCTCAGG ACAGGACTAC GCCGATGCCA CACAGCAGAT CGCCTTCGCT
GTTGCCGGCG GCATCTCGCT GCCGGACCGC GACTACTACG TCAAGAACAA CGCCAAGATC
GCCAAGATCC GCAGGCAGTA CCAGGCGCAT GTGGCGCAGA TGTTCGAGCT GCTTGGTGAC
ACGCGCGCCG CCGCCAAAGC CAACTCGGCC ACGGTGCTCG CGGTCGAGAC CGCCCTGGCC
AGATCGTCAC TCGGCACGGC CGATAAGCGC GATCCTTACA AGACCTTCCA CAAGTTCAAC
GCCCATCAGC TGCAGGCCCT CACGCCGGGC TTCAACTGGG CGGATTACCG CGCTGCACTC
GGTGTAGCCG CAGACCTCGA TGTTTACAAC GTCACCGAAC CTGCCTTCTA CAAAGCCTTC
AACGCGCTGC TGGCCCGGCT GAGCTTGGCC GAGCTCAAGA CCTATCTGCG CTGGCAGCTC
GCCAGCAGCC AAAGCGCCTA CCTCGAGCCA CGCTTCGTAC AGGCCAACTT CGATTTCTTC
GGCAAAACCC TGAACGGCGT GCCGCAGTTG AAGAGCCGCT GGAAGCGCTG TGTCGAGCTG
GTCGACCAGC AACTCGGCGA GGCCCTCGGC CAGGAGTTCG TCGCGCGCAA CTTCGCGCCC
GTGCTCAAGC AAGCTGCGCT CACCATGACG ACCGAAATCG AAGCCGCCAT GGCCCAGAGC
ATTCGGGACC TAACCTGGAT GAGCGACTCC ACCAAGGCCA AGGCCATCGC CAAGCTGAAC
ACCATCGTCA ACAAGATTGG CTACCCCGAC CGCTGGCGCG ATTACTCGGC AATGCCAGTC
ACTCGCGGTG ACTTCCTCGC CAATGTGACC GCGGGCCGGG TGTTCGAGGC CAAGCGCAAG
CTCGCCAAGA TAGGCCAGCC GCTGGACCGG GGCGAGTGGG GCATGACGCC GCAGACCGTC
AACGCTTACT ACAACCCTCA GATGAACGAT ATCAACTTCC CGGCCGGCGT GCTGCAGCCG
CCTCTCTACG ACGCCAAGAT GGACGACGCG CCCAACTACG GCAACACCGG CGGGACCATC
GGGCACGAGT TGATTCACGG TTTCGATGAC GAGGGCCGCC AGTTCGACGC TCATGGCAAC
CTCAGGAACT GGTGGACGAA AAAGGACGGC CGCGAGTTCG AGAAGCGCGC AGCATGCGTG
GGCAACCAGT ACAAGACCTA CACCATCGTC GACGAAATTA AGATCAACCC CAAGCTCACG
ATGGGAGAGG ACCTCGCAGA CTTCGGTGGA CTGGTGTTGG CCTGGGAAGC CTGGAAGGCG
CATGTGGCGG ACAAGCAGCT GGCACCCATC GACGGTCTCA CTCCGCAGCA ACGGTTCTTC
GTAGGCTTCG CGCAATGGGA CTGCAGCGAT TCCCGCCCCG AGGTCTTGCG CGTGAAGGCC
CTGACCGATC CGCATTCACC AAGCCGCTAC CGCATCAACG GCGTGGTGGT GAACATGCCT
GAGTTCGAGA ATGCCTTTGC CTGCAAGCCC ACGGCCAAGC TGGTCAAACC TGAAGCGCAA
CGCTGCAAAA TGTGGTGA
 
Protein sequence
MEKNLLFLPV LLATALAVAG PKSEGPLQSL PYTPSLDLSA IDRSVNPCDD LYQYACGGWI 
KNNPIPWDQA RWDVYSKATN ENQRYLWGIL AELAAGSTDR SATQVKLGDY FAACMDEAAV
QAAGLKPLQP YLDRIDAMAS NRQLPALLAD LQLVTGNERL FFGFSSGQDY ADATQQIAFA
VAGGISLPDR DYYVKNNAKI AKIRRQYQAH VAQMFELLGD TRAAAKANSA TVLAVETALA
RSSLGTADKR DPYKTFHKFN AHQLQALTPG FNWADYRAAL GVAADLDVYN VTEPAFYKAF
NALLARLSLA ELKTYLRWQL ASSQSAYLEP RFVQANFDFF GKTLNGVPQL KSRWKRCVEL
VDQQLGEALG QEFVARNFAP VLKQAALTMT TEIEAAMAQS IRDLTWMSDS TKAKAIAKLN
TIVNKIGYPD RWRDYSAMPV TRGDFLANVT AGRVFEAKRK LAKIGQPLDR GEWGMTPQTV
NAYYNPQMND INFPAGVLQP PLYDAKMDDA PNYGNTGGTI GHELIHGFDD EGRQFDAHGN
LRNWWTKKDG REFEKRAACV GNQYKTYTIV DEIKINPKLT MGEDLADFGG LVLAWEAWKA
HVADKQLAPI DGLTPQQRFF VGFAQWDCSD SRPEVLRVKA LTDPHSPSRY RINGVVVNMP
EFENAFACKP TAKLVKPEAQ RCKMW