Gene Mpe_A0011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0011 
Symbol 
ID4784021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp12269 
End bp15454 
Gene Length3186 bp 
Protein Length1061 aa 
Translation table11 
GC content66% 
IMG OID640088558 
Producttype I site-specific deoxyribonuclease 
Protein accessionYP_001019208 
Protein GI124265204 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.386666 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGAGG ACCAGCTCGA ACAAGAGTGC CTGGCGTGGC TGGCCGACGT GGGCTGGCAA 
CACCGCTATG GGCCGGACAT CGCGCCCGAT GGCGACGCCC CCGAGCGCGA CAGCTACCGC
CAAGTGCTGC TGCTCGGGCG CTTGCGCTCC GCAGTGGCCG CGCTGAACCC GACCGTGCCC
GCGGCGGCGC GCGAGGATGC CATCCGCCAG GTGCTGGATC TCGGCACCCC GGTGCTGTTG
GCCGCCAACC GACACTTTCA CCGGCTGCTG GTGGGCGGCG TGCCGGTGCA GTACCAACAA
GACGGCGAAA CGCGCGGCGA CTTTGTGCGT TTGGTGGACT GGTCCGACCC GTCGCGCAAC
GAGTGGCTGG CCGTCAACCA GTTCTCCGTG ACCGGGCCAC ACCACACGCG CCGGCCCGAC
ATCGTGCTGT TCGTCAACGG CCTGCCGCTG GTGCTGATCG AGCTGAAGAA CCCGGCGGAC
CTGAACGCCG ACGTGTGGAA GGCCTTCCAC CAGATCCAGA CCTACAAGGC GCAGATCCCG
GACATCTTCC AGACCAACGA GGTGCTGGTG GTGTCCGACG GCAGCGAGGC GCTGCTCGGC
TCGCTCTCGG CCGACAGCGA GCGCTTCATG GCCTGGCGCA CGATCGACGG CAACACGCTC
GACCCGCTGG GCAAGTTCCA TGAGCTGCAG ACGCTGGTGC GTGGCGCGCT GGCGCCGGCC
TATCTGCTCG ACTACCTGCG CTACTTCGTG CTCTTCGAGG ACGACGGCCA ACTCGCCAAG
AAGATCGCCG GCTATCACCA GTTCCACGCG GTGCGCGCGG CCATTGCGCA GGTGGTGACC
GCGTCGCGCC CGAACAGCGA TGCGCGTTTG CGTGGCAAGG GCGGCGTGGT CTGGCACACC
CAGGGCAGCG GCAAGAGCAT CACGATGACC TGCTTCGCAG CGCGCGTGAT GCAGGAGCCG
GCGATGGAGA ACCCGACCAT CGTCGTCATC ACCGACCGTA ACGACCTGGA CGGCCAGCTC
TTCGGCGTGT TCAGCCTGGC GCAGGATCTA TTGCGCGAGC AGCCGGTGCA GGCGAACACC
CGGCAGGAAC TGCGCGCGCT GCTCGGCAAC CGCCCGAGCG GCGGCATCGT GTTCGCCACC
ATCCAGAAGT TCATGCCGGG CGAAGACGAG GACACGTTCC CGCTGCTCTC CGATCGCCAC
AACATCGTCG TGATGGCCGA CGAGGCGCAC CGCACGCAGT ACGGCTTCGA GGCCAAGCTG
AAGACGCCGA AGTCGGCGCT CAAGGCATCG AGCGAGCTGA CCACGGGCAA TGGCCAGCCA
CCCGCGCACC GGGCCGAGTT CGCGCCGAGC GCGAAGTACC AAGTGGGATA CGCCCAGCAC
CTGCGCGATG CGCTGCCCAA CGCCACCTTC GTGGCCTTCA CCGGCACGCC GGTGTCGGGC
GAAGACCGCG ACACGCGGGC GGTGTTCGGC GACTACATCA GCGTCTACGA CATGCAGCAG
GCCAAGGAAG ATGGCGCCAC GGTGGCCATC TACTACGAGA GCCGGCTCGC CAAGCTGGGC
CTCAAGGCCG ACGAGATGGC CACCATCGAC GACGAGGTCG ACGAGCTGGC CGAAGACGAG
GAAGAGAGCC AGCAGGCCAA GCTCAAGAGC CGCTGGGCGG CGCTGGAGAA GGTGGTCGGT
GCCGCGCCGC GCGTCGCCCA GGTGGCGGCG GATCTGGTGG CGCACTTCGA GGAGCGCAAC
AAGGCGCAGA CCGGCAAGGC CATGGTGGTG GCCATGAGCC GCGAGATCTG CGTGCATGTC
TACGACGAGA TCGTCAAGCT GCGACCCGAC TGGCACAGCC CCGACCCCGA GCAAGGCACG
ATCAAGATCG TGATGACGGG CTCGGCCAGC GACAAGGCAC TGCTGCGGCC CCACATCTAC
AGCGCCCAGG TAAAAAAGCG GCTGGAGAAG CGCTTCAAGA ACCCGGCCGA CCCGCTGCGC
ATGGTCATCG TGCGCGACAT GTGGCTCACG GGCTTCGACG CGCCCTGCGT GCACACGCTC
TACGTCGACA AGCCGATGAA GGGCCACAAC CTCATGCAGG CGATTGCGCG CGTGAACCGC
GTGTTCAAGG ACAAGCAGGG CGGCCTGGTG GTGGACTACA TCGGCATCGC CAACGAGTTG
AAGTCGGCGT TGAAGGAGTA CACCGCAGCA CAGGGCCGCG GCCGGCCGAC GGTGGACGCG
CACGAGGCGT ATAGCGTGTT GGCCGAGAAG CTCGACGCGC TGCGAGGCAT GCTGGCGGGA
ACGAACGGGC ACGGCTTCGA CTACAGCGGG TTCCTCACTG GGGGTCACAA GACACTGGCC
GGCGCCGCCA ACTTCGTGCT TGGGATCAAG GAAGGCAAGA AGCGCTTCGC CGACTTGGCG
CTGGCGATGA GCAAAGCCTT CACGCTCTGC TGCACGCTCG ACGAAGCCAA GGCCGTGCGC
GAGGAGGTGG CCTTCTTCCA GGCCGTGAAG GTGATCCTGA CCAAGCGGGA CATCAGCGCG
CAGAAGAAGA TGGACGAGCA ACGTGAACTG GCCATCCGGC AGATCATCAG CGCGGCCGTG
GTCTCGGAGG AGGTGGTCGA CATCTTCGAC GCCGTGGGGC TGGACAAGCC CAACATCGGC
ATCCTGGACG ACGCCTTCCT GGCCGAGGTT CGCAACCTGC CAGAGCGCAA CCTCGCGGTG
GAATTGCTGG AGCGGCTGCT CGAAGGCGAG ATCAAGTCAC GCTTCGCCGG CAACGTGGTC
CAGAACAAGA AGTTCTCGGA CATGCTGGCC GACGTGGTGC AGCGCTACCA AAACCGGTCC
ATCGAAGCTG CTCAGGTGAT GGAAGAGCTG GTGCAGATGG CCAAGAAGTT TCGCGCGGCT
GCGGCGCGCG GGGAGCAGCT TGGCCTCACC GAAGACGAAG TGCGCTTCTA TGACGCGCTG
GCCAACAACG AATCCGCCGT TCGAGAGCTG AACGATGAGA CGCTGAAGAA GATCGCCCAT
GAGCTGGCCG AGAACTTGCG CAAGAACCTC ACGGTCGATT GGTCCGCGAG AGAAAGCGTC
CAGGCCAAGC TGCGACTGAT GGTCAAGCGC ATCCTGCGCA AGTACAAGTA CCCACCGGAT
CAGCAGGACG CTGCGGTGGA GCTTGTGCTG CAGCAGGCCA AGGCGTTGGG AGAAGCGTGG
GCATGA
 
Protein sequence
MTEDQLEQEC LAWLADVGWQ HRYGPDIAPD GDAPERDSYR QVLLLGRLRS AVAALNPTVP 
AAAREDAIRQ VLDLGTPVLL AANRHFHRLL VGGVPVQYQQ DGETRGDFVR LVDWSDPSRN
EWLAVNQFSV TGPHHTRRPD IVLFVNGLPL VLIELKNPAD LNADVWKAFH QIQTYKAQIP
DIFQTNEVLV VSDGSEALLG SLSADSERFM AWRTIDGNTL DPLGKFHELQ TLVRGALAPA
YLLDYLRYFV LFEDDGQLAK KIAGYHQFHA VRAAIAQVVT ASRPNSDARL RGKGGVVWHT
QGSGKSITMT CFAARVMQEP AMENPTIVVI TDRNDLDGQL FGVFSLAQDL LREQPVQANT
RQELRALLGN RPSGGIVFAT IQKFMPGEDE DTFPLLSDRH NIVVMADEAH RTQYGFEAKL
KTPKSALKAS SELTTGNGQP PAHRAEFAPS AKYQVGYAQH LRDALPNATF VAFTGTPVSG
EDRDTRAVFG DYISVYDMQQ AKEDGATVAI YYESRLAKLG LKADEMATID DEVDELAEDE
EESQQAKLKS RWAALEKVVG AAPRVAQVAA DLVAHFEERN KAQTGKAMVV AMSREICVHV
YDEIVKLRPD WHSPDPEQGT IKIVMTGSAS DKALLRPHIY SAQVKKRLEK RFKNPADPLR
MVIVRDMWLT GFDAPCVHTL YVDKPMKGHN LMQAIARVNR VFKDKQGGLV VDYIGIANEL
KSALKEYTAA QGRGRPTVDA HEAYSVLAEK LDALRGMLAG TNGHGFDYSG FLTGGHKTLA
GAANFVLGIK EGKKRFADLA LAMSKAFTLC CTLDEAKAVR EEVAFFQAVK VILTKRDISA
QKKMDEQREL AIRQIISAAV VSEEVVDIFD AVGLDKPNIG ILDDAFLAEV RNLPERNLAV
ELLERLLEGE IKSRFAGNVV QNKKFSDMLA DVVQRYQNRS IEAAQVMEEL VQMAKKFRAA
AARGEQLGLT EDEVRFYDAL ANNESAVREL NDETLKKIAH ELAENLRKNL TVDWSARESV
QAKLRLMVKR ILRKYKYPPD QQDAAVELVL QQAKALGEAW A