Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3793 |
Symbol | |
ID | 4785962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 4011547 |
End bp | 4013685 |
Gene Length | 2139 bp |
Protein Length | 712 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640092376 |
Product | colicin V processing peptidase |
Protein accession | YP_001022981 |
Protein GI | 124268977 |
COG category | [V] Defense mechanisms |
COG ID | [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00145248 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGAACG CCTTGCGCAT GGGCTTCGGC GCGCCGCGCC TGCCGATGAT CCTGCAGACC GAGGCGGCGG AGTGCGGCCT GGCCTGCCTC GCCATGGTGG CTTCGCACCA CGGCCATCGC AGCGACCTCC CGAGCTTGAG GCAGCGCTTC TCGCTGTCCT TGAAGGGCGT GACGCTGGCC GACCTGGTGG CGATGGCGGG GCAACTCCAG CTCAATGCAC GGCCGCTGCG CGCGGAGCTC GCGCATCTTT CGCAGCTGCA GTTGCCCTGC ATCCTGCACT GGGACCTGAA CCACTTCGTG GTGCTGGCGA AGGTGCAGCG AGGCGTTGCC GTCATCCACG ATCCGGCGCA CGGGGTGCGG CGGCTGCCGC TGGCCGAGGT CTCGCGCCAC TTCACCGGCG TGGTGCTGGA ACTGATGCCC GGCACCGGCT TCGAGCCCCG CACCGAACGC CAGCACGTCA GCCTACGGCA GCTGCTCGGC CCGGTGCGGG GCCTGAAGCG CTCGCTGGCG CAGGTGTTCG CGCTCGCGCT CGCGCTCGAG GTCTTCATGC TGCTGGCGCC GTTCTTCCTG CAATGGGTGG TCGACGGCGC GCTGCTCAGC GCCGACCGCG ACCTGCTGCT CACGCTGGTG ATCGGCTTCG GCTTGCTCGT GGTGATTCAG GTGGCCACCG GTGCCTTGCG GTCCTGGGCG GTGCTCTACC TGTCGAGCAC GCTCAACCTG CAATGGCTGG GCAATGTGTT CGCACACCTG ATGCGCCTGC CGGTCGAGTG GTTCGAGAAG CGCCATGCCG GTGACGTGAT GTCGCGTTTC GGCGCGGTGC AGAAGATCCA GCAGACGCTG ACCACCAGCT TCATCGAGGC CATGCTCGAC GGGCTGCTGG TGGTTGCCAC GCTGGTGATG ATGTGGGTCT ACAGCGCCAC ATTGACGGCC ATCGCGATCG GCGCGGTGGC GCTCTACGCG CTGCTGCGCT GGGCCTTCTT CACGCCGCTG CGCGACGCGA CCGAAGAGGC GATCGTGCAC GATGCGAAGC GCTCGACGCA TTTTCTCGAA TCGCTGCGCG GCGTGCAGGC GATCAAGCTG TTCAATCGCC AGGACGAGCG CCGCGCCCGC TTCATGAACC TGGTGGTCGA CGCGATGAAC GCCGACATCG CGATCAAGAA GCTCGAGCTC GCCTTCGCCG TGCTCAACAA GCTGGTGTTC GGCGTCGAGC GCATCGCCGT CATCGGCATC GGCGCGCTGC TGGTGATGGA GCAGCAGTTC ACCGTGGGCA TGCTGTTCGC GTTCCTGGCC TTCAAGGAGC AGTTCGCGCA GCGCGTCAGC GGTCTGATCG ACAAGGCGAT CGAGCTGAAG ATGCTGCGGC TGCAAGGCGA GCGCCTGGCC GACATCGTGC TGGCGGCGCC CGAGGCGCAG GGCGAGGGCC TGCACGCGGC GCGCGACCTC GCACCGCGCA TCGAGCTGCG CGACGTGAGC TTCCGCTACG CGGATACCGA GCCCGACGTG CTGAAGGGCT GCAGCCTGCA CATCGAGCCC GGCGAGGCGG TGGCGATCGT CGGACCGTCG GGCTGCGGCA AGACCACGCT GCTCAAGCTG ATGCTCGGCA TCCACGCGCC CGCCGCCGGC GAGATCCGCA TCGGAGGCCT GCCGCTGTCG CAGCTGGGGC TCGGCCGCTG GCGCGCCATG ATCGGCACCG TGATGCAGGA CGACCAGCTG TTCGCCGGGT CAATTGCCGA CAACATCTCG TTCTTCGACG TGGATGCCGA TGCCGCCTGG GTCGAGCAGT GCGCCCGGCT GGCCTGCGTG AACGACGAGA TCGACGCGCT GCCGATGGGT TACCACACGC TGATCGGCGA CATGGGCGCC AGCCTGTCCG GTGGGCAGCG CCAGCGCATC CTGCTGGCCC GTGCGCTGTA CAAGCGACCT CGCATCCTGT TTCTCGACGA GGCGACGAGC GCGCTCGATG TCGAGCGGGA GCGGCAGGTG AACCAGGCCA TCCGCGGGCT CGACATCACG CGCGTGATCG TCGCCCACCG GCCCGAGACC ATCGCGGCGG CGGCCCGCGT GATCGTGCTC CAGCAGGGGC GTGTCGCGCA GGACCTGCGC AGCGTGCCGA ACACGCAGCA GTCCGAGCCG GGCAACTGA
|
Protein sequence | MANALRMGFG APRLPMILQT EAAECGLACL AMVASHHGHR SDLPSLRQRF SLSLKGVTLA DLVAMAGQLQ LNARPLRAEL AHLSQLQLPC ILHWDLNHFV VLAKVQRGVA VIHDPAHGVR RLPLAEVSRH FTGVVLELMP GTGFEPRTER QHVSLRQLLG PVRGLKRSLA QVFALALALE VFMLLAPFFL QWVVDGALLS ADRDLLLTLV IGFGLLVVIQ VATGALRSWA VLYLSSTLNL QWLGNVFAHL MRLPVEWFEK RHAGDVMSRF GAVQKIQQTL TTSFIEAMLD GLLVVATLVM MWVYSATLTA IAIGAVALYA LLRWAFFTPL RDATEEAIVH DAKRSTHFLE SLRGVQAIKL FNRQDERRAR FMNLVVDAMN ADIAIKKLEL AFAVLNKLVF GVERIAVIGI GALLVMEQQF TVGMLFAFLA FKEQFAQRVS GLIDKAIELK MLRLQGERLA DIVLAAPEAQ GEGLHAARDL APRIELRDVS FRYADTEPDV LKGCSLHIEP GEAVAIVGPS GCGKTTLLKL MLGIHAPAAG EIRIGGLPLS QLGLGRWRAM IGTVMQDDQL FAGSIADNIS FFDVDADAAW VEQCARLACV NDEIDALPMG YHTLIGDMGA SLSGGQRQRI LLARALYKRP RILFLDEATS ALDVERERQV NQAIRGLDIT RVIVAHRPET IAAAARVIVL QQGRVAQDLR SVPNTQQSEP GN
|
| |