Gene Mpe_A2132 details


Gene Information

Locus tag: Mpe_A2132
Symbol:
ID: 4785796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2284935
End bp: 2286968
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 70%
IMG OID: 640090700
Product: oligopeptidase A
Protein accession: YP_001021323
Protein GI: 124267319
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
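
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention, but both are consistent with the listed values. The function names are hypothetical, chosen for illustration only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2284935
end_bp = 2286968
gene_length_bp = 2034
protein_length_aa = 677


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the stop codon is not counted."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```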


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0694294
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAATC CTCTACTCGA TACCGCCGCC CTGCCGCGCT TCGGCGAGAT CCGCCCCGAG 
CATGTGGAGC CTGCGGTCGA GGCCCTGCTG GCCGAAGCGA ACGTGGCGCT GGAACGCGCC
ACGGGCGCGG ACGTGCCGGC CGACTACGAT GCCCTCTCCG CGGTGCTCGA TGTCGCAACC
GAGCGCCTCG GGCGCGCCTG GGGAGCGGTC AGCCACCTCA ACGCGGTGGC CGACACGCCC
GAGCTGCGCG CCGCCTACAC CGAGGCCCTG CCGCGCGTGA CCGAGTTCTA CACGAACCTG
GGTGCCGACG AGCGGCTCTA CGCGAAGTAC AAGGCCGTGG CCGGCAGCCC TGCGGCAGCC
GCGCTCAACG CGGCACGGCG CAAGGCGCTG TCGAACGCGA TGCGCGATTT CGTGCTGTCC
GGCGCCGAGC TGCAGGGGGC CGCCAAGGAG CGCTTCGCAG CCATCCAGGC GCGTCAGGCC
GAACTCGGCC AGGCCTTCTC CGAGCATGTG CTCGACGCGA CAGACGGCTG GAGCTTCATC
GCGAGCGAGG CCGAGCTCGC CGGCGTACCG GAGGACGTCA AGGCCGCCAC GCGCGCCGCC
GCCCGGGCCG ACGGCCAAGA CGGCCACAAG CTCAGCCTGC ACATGCCGGT CTACCTGCCG
GTGCTGCAGT ACGCGCAGGA CCGCGCGCTG CGCGAACGCG TCTACCGCGC CCACGTCACG
CGTGCCTCGG AGGCCGGCCC GACCGAGCGC GACAACAGCG CCGTGATGGG CGAGATCCTC
ACTCTGCGGC AGGAAGAGGC CGAGCTGCTG GGCTACCGCA ACTTCGCCGA GGTTTCGCTG
GTCGCCAAGA TGGCCGACTC GCCGGCGCAG GTGCAGGGAT TCCTGCGCGA CCTGGCGCGG
CGCGCACGGC CCCACGCCGA GCGCGATCTG GCCGAGCTTC GCGAGTTCGC CCGCACCGAG
CTCGGCCTGG CCGATCTGCA GGCCTGGGAC ATGGCCTACG CGGCCGAGCG CCTGAAGGAA
GCCCGTTACG CCTTCAGCGA CACCGAGGTG AAGCAGTACT TCACCGAACC CAAGGTGCTG
GCCGGCCTGT TCCGCATCAT CGAGACGCTG TTCGAGGTGG CGATCCGCCC GGACACGGCG
CCGGTCTGGA ACGAGCACGT GCGATTCTTC CGCATCGAGC GCGGCACTCA GCTGGTCGGC
CAGTTCTACC TCGACCCCTA CGCACGCCCC GGCAAGCGCC CGGGCGCCTG GATGGACGAC
GTACGGGGTC GCTGGGCCCG CCCTGAGGGC AGGGTGCAGA CGCCGGTGGC TCATCTGGTT
TGCAACTTCG CTGCGCCCGT CGGTGACCGC CCCGCGCTGC TGTCGCACGA CGACGTGACC
ACGTTGTTCC ACGAGTTCGG CCACGGCCTG CACCACATGC TCACGCAGGT CGACGAGATC
GGCGTGGCCG GCATCTCCGG CGTCGAGTGG GACGCGGTCG AGCTGCCCAG CCAGTTCATG
GAGAACTTCT GCTGGGAGTG GGAGGTGGTG AAGCACATGA CGGCGCACGT CGACAGCGGC
GAGCCGCTGC CGCGGACGCT GTTCGACAAG ATGCTCGCCG CAAAGAACTT CCAGAGTGGC
TTGCAGACGT TGCGCCAGGT GGAGTTCTCG CTGTTCGACA TGCTGATCCA CGACGGTGCG
TCGGCGGCGC CCTACGGCGC CGACGCCATC CAGCGGGTGC TCGACGGCGT GCGCCGCGAG
ATCGCGGTCA TCGTGCCGCC GGCCTTCAAC CGCTTCCAGC ACAGCTTCTC GCACATCTTC
GCCGGCGGTT ACGCAGCCGG CTACTACAGC TACAAGTGGG CCGAGGTACT GAGCGCCGAC
GCCTACGCCG CCTTCGAGGA GGAAGGGGTG TTCAACCCCG CCGTCGGCCG CCGTTACCGC
GAAGCCATCC TCGAAGCCGG TGGCAGCCGT CCGGCGATGG AGAGCTTCAA GGCCTTCCGC
GGTCGCGAGC CACGCATCGA CGCCCTGCTG CGCCACCAGG GCATGGCGGA CTAA
 
Protein sequence
MSNPLLDTAA LPRFGEIRPE HVEPAVEALL AEANVALERA TGADVPADYD ALSAVLDVAT 
ERLGRAWGAV SHLNAVADTP ELRAAYTEAL PRVTEFYTNL GADERLYAKY KAVAGSPAAA
ALNAARRKAL SNAMRDFVLS GAELQGAAKE RFAAIQARQA ELGQAFSEHV LDATDGWSFI
ASEAELAGVP EDVKAATRAA ARADGQDGHK LSLHMPVYLP VLQYAQDRAL RERVYRAHVT
RASEAGPTER DNSAVMGEIL TLRQEEAELL GYRNFAEVSL VAKMADSPAQ VQGFLRDLAR
RARPHAERDL AELREFARTE LGLADLQAWD MAYAAERLKE ARYAFSDTEV KQYFTEPKVL
AGLFRIIETL FEVAIRPDTA PVWNEHVRFF RIERGTQLVG QFYLDPYARP GKRPGAWMDD
VRGRWARPEG RVQTPVAHLV CNFAAPVGDR PALLSHDDVT TLFHEFGHGL HHMLTQVDEI
GVAGISGVEW DAVELPSQFM ENFCWEWEVV KHMTAHVDSG EPLPRTLFDK MLAAKNFQSG
LQTLRQVEFS LFDMLIHDGA SAAPYGADAI QRVLDGVRRE IAVIVPPAFN RFQHSFSHIF
AGGYAAGYYS YKWAEVLSAD AYAAFEEEGV FNPAVGRRYR EAILEAGGSR PAMESFKAFR
GREPRIDALL RHQGMAD
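
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch, the code below translates the first and last rows of the gene sequence and compares them with the matching ends of the protein sequence. It relies on the fact that table 11 assigns codons to amino acids exactly as the standard code does (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. The GC helper is included only to show how a figure like the GC content field could be computed from the full sequence; it is not run on the whole gene here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from this record.
first_row = "ATGAGCAATC CTCTACTCGA TACCGCCGCC CTGCCGCGCT TCGGCGAGAT CCGCCCCGAG"
last_row = "GGTCGCGAGC CACGCATCGA CGCCCTGCTG CGCCACCAGG GCATGGCGGA CTAA"

# They translate to the first 20 and last 17 residues of the protein sequence.
assert translate(first_row) == clean("MSNPLLDTAA LPRFGEIRPE")
assert translate(last_row) == clean("GREPRIDALL RHQGMAD")
```

Every full row of the gene sequence holds 60 bases, a multiple of 3, so each row starts in frame and can be translated on its own as done here.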