Gene Mpe_A3793 details


Gene Information

Locus tag: Mpe_A3793
Symbol:
ID: 4785962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 4011547
End bp: 4013685
Gene Length: 2139 bp
Protein Length: 712 aa
Translation table: 11
GC content: 68%
IMG OID: 640092376
Product: colicin V processing peptidase
Protein accession: YP_001022981
Protein GI: 124268977
COG category: [V] Defense mechanisms
COG ID: [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain
TIGRFAM ID:
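
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4011547, 4013685)
print(length_bp)                  # 2139
print(protein_length(length_bp))  # 712
```

Both values agree with the Gene Length and Protein Length fields of this record.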


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00145248
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGAACG CCTTGCGCAT GGGCTTCGGC GCGCCGCGCC TGCCGATGAT CCTGCAGACC 
GAGGCGGCGG AGTGCGGCCT GGCCTGCCTC GCCATGGTGG CTTCGCACCA CGGCCATCGC
AGCGACCTCC CGAGCTTGAG GCAGCGCTTC TCGCTGTCCT TGAAGGGCGT GACGCTGGCC
GACCTGGTGG CGATGGCGGG GCAACTCCAG CTCAATGCAC GGCCGCTGCG CGCGGAGCTC
GCGCATCTTT CGCAGCTGCA GTTGCCCTGC ATCCTGCACT GGGACCTGAA CCACTTCGTG
GTGCTGGCGA AGGTGCAGCG AGGCGTTGCC GTCATCCACG ATCCGGCGCA CGGGGTGCGG
CGGCTGCCGC TGGCCGAGGT CTCGCGCCAC TTCACCGGCG TGGTGCTGGA ACTGATGCCC
GGCACCGGCT TCGAGCCCCG CACCGAACGC CAGCACGTCA GCCTACGGCA GCTGCTCGGC
CCGGTGCGGG GCCTGAAGCG CTCGCTGGCG CAGGTGTTCG CGCTCGCGCT CGCGCTCGAG
GTCTTCATGC TGCTGGCGCC GTTCTTCCTG CAATGGGTGG TCGACGGCGC GCTGCTCAGC
GCCGACCGCG ACCTGCTGCT CACGCTGGTG ATCGGCTTCG GCTTGCTCGT GGTGATTCAG
GTGGCCACCG GTGCCTTGCG GTCCTGGGCG GTGCTCTACC TGTCGAGCAC GCTCAACCTG
CAATGGCTGG GCAATGTGTT CGCACACCTG ATGCGCCTGC CGGTCGAGTG GTTCGAGAAG
CGCCATGCCG GTGACGTGAT GTCGCGTTTC GGCGCGGTGC AGAAGATCCA GCAGACGCTG
ACCACCAGCT TCATCGAGGC CATGCTCGAC GGGCTGCTGG TGGTTGCCAC GCTGGTGATG
ATGTGGGTCT ACAGCGCCAC ATTGACGGCC ATCGCGATCG GCGCGGTGGC GCTCTACGCG
CTGCTGCGCT GGGCCTTCTT CACGCCGCTG CGCGACGCGA CCGAAGAGGC GATCGTGCAC
GATGCGAAGC GCTCGACGCA TTTTCTCGAA TCGCTGCGCG GCGTGCAGGC GATCAAGCTG
TTCAATCGCC AGGACGAGCG CCGCGCCCGC TTCATGAACC TGGTGGTCGA CGCGATGAAC
GCCGACATCG CGATCAAGAA GCTCGAGCTC GCCTTCGCCG TGCTCAACAA GCTGGTGTTC
GGCGTCGAGC GCATCGCCGT CATCGGCATC GGCGCGCTGC TGGTGATGGA GCAGCAGTTC
ACCGTGGGCA TGCTGTTCGC GTTCCTGGCC TTCAAGGAGC AGTTCGCGCA GCGCGTCAGC
GGTCTGATCG ACAAGGCGAT CGAGCTGAAG ATGCTGCGGC TGCAAGGCGA GCGCCTGGCC
GACATCGTGC TGGCGGCGCC CGAGGCGCAG GGCGAGGGCC TGCACGCGGC GCGCGACCTC
GCACCGCGCA TCGAGCTGCG CGACGTGAGC TTCCGCTACG CGGATACCGA GCCCGACGTG
CTGAAGGGCT GCAGCCTGCA CATCGAGCCC GGCGAGGCGG TGGCGATCGT CGGACCGTCG
GGCTGCGGCA AGACCACGCT GCTCAAGCTG ATGCTCGGCA TCCACGCGCC CGCCGCCGGC
GAGATCCGCA TCGGAGGCCT GCCGCTGTCG CAGCTGGGGC TCGGCCGCTG GCGCGCCATG
ATCGGCACCG TGATGCAGGA CGACCAGCTG TTCGCCGGGT CAATTGCCGA CAACATCTCG
TTCTTCGACG TGGATGCCGA TGCCGCCTGG GTCGAGCAGT GCGCCCGGCT GGCCTGCGTG
AACGACGAGA TCGACGCGCT GCCGATGGGT TACCACACGC TGATCGGCGA CATGGGCGCC
AGCCTGTCCG GTGGGCAGCG CCAGCGCATC CTGCTGGCCC GTGCGCTGTA CAAGCGACCT
CGCATCCTGT TTCTCGACGA GGCGACGAGC GCGCTCGATG TCGAGCGGGA GCGGCAGGTG
AACCAGGCCA TCCGCGGGCT CGACATCACG CGCGTGATCG TCGCCCACCG GCCCGAGACC
ATCGCGGCGG CGGCCCGCGT GATCGTGCTC CAGCAGGGGC GTGTCGCGCA GGACCTGCGC
AGCGTGCCGA ACACGCAGCA GTCCGAGCCG GGCAACTGA
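
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch that assumes the sequence is pasted as text in the space-separated 10-base blocks used above; `gc_content` is a hypothetical helper name, and the short input shown is only the first two blocks of the gene, not the full sequence.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces, newlines and other characters."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence, for illustration only.
first_blocks = "ATGGCGAACG CCTTGCGCAT"
print(f"{gc_content(first_blocks):.0%}")  # 60%
```

Passing the complete gene sequence to the same function should give a value comparable to the GC content field, subject to how the database rounds it.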
 
Protein sequence
MANALRMGFG APRLPMILQT EAAECGLACL AMVASHHGHR SDLPSLRQRF SLSLKGVTLA 
DLVAMAGQLQ LNARPLRAEL AHLSQLQLPC ILHWDLNHFV VLAKVQRGVA VIHDPAHGVR
RLPLAEVSRH FTGVVLELMP GTGFEPRTER QHVSLRQLLG PVRGLKRSLA QVFALALALE
VFMLLAPFFL QWVVDGALLS ADRDLLLTLV IGFGLLVVIQ VATGALRSWA VLYLSSTLNL
QWLGNVFAHL MRLPVEWFEK RHAGDVMSRF GAVQKIQQTL TTSFIEAMLD GLLVVATLVM
MWVYSATLTA IAIGAVALYA LLRWAFFTPL RDATEEAIVH DAKRSTHFLE SLRGVQAIKL
FNRQDERRAR FMNLVVDAMN ADIAIKKLEL AFAVLNKLVF GVERIAVIGI GALLVMEQQF
TVGMLFAFLA FKEQFAQRVS GLIDKAIELK MLRLQGERLA DIVLAAPEAQ GEGLHAARDL
APRIELRDVS FRYADTEPDV LKGCSLHIEP GEAVAIVGPS GCGKTTLLKL MLGIHAPAAG
EIRIGGLPLS QLGLGRWRAM IGTVMQDDQL FAGSIADNIS FFDVDADAAW VEQCARLACV
NDEIDALPMG YHTLIGDMGA SLSGGQRQRI LLARALYKRP RILFLDEATS ALDVERERQV
NQAIRGLDIT RVIVAHRPET IAAAARVIVL QQGRVAQDLR SVPNTQQSEP GN
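
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a plain-Python illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation; since this gene starts with ATG, that difference is ignored here. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(b for b in seq.upper() if b in "ACGT")
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGGCGAACG CCTTGCGCAT GGGCTTCGGC"))  # MANALRMGFG
```

The output matches the first ten residues of the protein sequence, and the last nine bases of the gene (GGCAACTGA) translate to the final residues GN followed by a stop codon.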