Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0011 |
Symbol | |
ID | 4784021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 12269 |
End bp | 15454 |
Gene Length | 3186 bp |
Protein Length | 1061 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640088558 |
Product | type I site-specific deoxyribonuclease |
Protein accession | YP_001019208 |
Protein GI | 124265204 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.386666 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGAGG ACCAGCTCGA ACAAGAGTGC CTGGCGTGGC TGGCCGACGT GGGCTGGCAA CACCGCTATG GGCCGGACAT CGCGCCCGAT GGCGACGCCC CCGAGCGCGA CAGCTACCGC CAAGTGCTGC TGCTCGGGCG CTTGCGCTCC GCAGTGGCCG CGCTGAACCC GACCGTGCCC GCGGCGGCGC GCGAGGATGC CATCCGCCAG GTGCTGGATC TCGGCACCCC GGTGCTGTTG GCCGCCAACC GACACTTTCA CCGGCTGCTG GTGGGCGGCG TGCCGGTGCA GTACCAACAA GACGGCGAAA CGCGCGGCGA CTTTGTGCGT TTGGTGGACT GGTCCGACCC GTCGCGCAAC GAGTGGCTGG CCGTCAACCA GTTCTCCGTG ACCGGGCCAC ACCACACGCG CCGGCCCGAC ATCGTGCTGT TCGTCAACGG CCTGCCGCTG GTGCTGATCG AGCTGAAGAA CCCGGCGGAC CTGAACGCCG ACGTGTGGAA GGCCTTCCAC CAGATCCAGA CCTACAAGGC GCAGATCCCG GACATCTTCC AGACCAACGA GGTGCTGGTG GTGTCCGACG GCAGCGAGGC GCTGCTCGGC TCGCTCTCGG CCGACAGCGA GCGCTTCATG GCCTGGCGCA CGATCGACGG CAACACGCTC GACCCGCTGG GCAAGTTCCA TGAGCTGCAG ACGCTGGTGC GTGGCGCGCT GGCGCCGGCC TATCTGCTCG ACTACCTGCG CTACTTCGTG CTCTTCGAGG ACGACGGCCA ACTCGCCAAG AAGATCGCCG GCTATCACCA GTTCCACGCG GTGCGCGCGG CCATTGCGCA GGTGGTGACC GCGTCGCGCC CGAACAGCGA TGCGCGTTTG CGTGGCAAGG GCGGCGTGGT CTGGCACACC CAGGGCAGCG GCAAGAGCAT CACGATGACC TGCTTCGCAG CGCGCGTGAT GCAGGAGCCG GCGATGGAGA ACCCGACCAT CGTCGTCATC ACCGACCGTA ACGACCTGGA CGGCCAGCTC TTCGGCGTGT TCAGCCTGGC GCAGGATCTA TTGCGCGAGC AGCCGGTGCA GGCGAACACC CGGCAGGAAC TGCGCGCGCT GCTCGGCAAC CGCCCGAGCG GCGGCATCGT GTTCGCCACC ATCCAGAAGT TCATGCCGGG CGAAGACGAG GACACGTTCC CGCTGCTCTC CGATCGCCAC AACATCGTCG TGATGGCCGA CGAGGCGCAC CGCACGCAGT ACGGCTTCGA GGCCAAGCTG AAGACGCCGA AGTCGGCGCT CAAGGCATCG AGCGAGCTGA CCACGGGCAA TGGCCAGCCA CCCGCGCACC GGGCCGAGTT CGCGCCGAGC GCGAAGTACC AAGTGGGATA CGCCCAGCAC CTGCGCGATG CGCTGCCCAA CGCCACCTTC GTGGCCTTCA CCGGCACGCC GGTGTCGGGC GAAGACCGCG ACACGCGGGC GGTGTTCGGC GACTACATCA GCGTCTACGA CATGCAGCAG GCCAAGGAAG ATGGCGCCAC GGTGGCCATC TACTACGAGA GCCGGCTCGC CAAGCTGGGC CTCAAGGCCG ACGAGATGGC CACCATCGAC GACGAGGTCG ACGAGCTGGC CGAAGACGAG GAAGAGAGCC AGCAGGCCAA GCTCAAGAGC CGCTGGGCGG CGCTGGAGAA GGTGGTCGGT GCCGCGCCGC GCGTCGCCCA GGTGGCGGCG GATCTGGTGG CGCACTTCGA GGAGCGCAAC AAGGCGCAGA CCGGCAAGGC CATGGTGGTG GCCATGAGCC GCGAGATCTG CGTGCATGTC TACGACGAGA TCGTCAAGCT GCGACCCGAC TGGCACAGCC CCGACCCCGA GCAAGGCACG ATCAAGATCG TGATGACGGG CTCGGCCAGC GACAAGGCAC TGCTGCGGCC CCACATCTAC AGCGCCCAGG TAAAAAAGCG GCTGGAGAAG CGCTTCAAGA ACCCGGCCGA CCCGCTGCGC ATGGTCATCG TGCGCGACAT GTGGCTCACG GGCTTCGACG CGCCCTGCGT GCACACGCTC TACGTCGACA AGCCGATGAA GGGCCACAAC CTCATGCAGG CGATTGCGCG CGTGAACCGC GTGTTCAAGG ACAAGCAGGG CGGCCTGGTG GTGGACTACA TCGGCATCGC CAACGAGTTG AAGTCGGCGT TGAAGGAGTA CACCGCAGCA CAGGGCCGCG GCCGGCCGAC GGTGGACGCG CACGAGGCGT ATAGCGTGTT GGCCGAGAAG CTCGACGCGC TGCGAGGCAT GCTGGCGGGA ACGAACGGGC ACGGCTTCGA CTACAGCGGG TTCCTCACTG GGGGTCACAA GACACTGGCC GGCGCCGCCA ACTTCGTGCT TGGGATCAAG GAAGGCAAGA AGCGCTTCGC CGACTTGGCG CTGGCGATGA GCAAAGCCTT CACGCTCTGC TGCACGCTCG ACGAAGCCAA GGCCGTGCGC GAGGAGGTGG CCTTCTTCCA GGCCGTGAAG GTGATCCTGA CCAAGCGGGA CATCAGCGCG CAGAAGAAGA TGGACGAGCA ACGTGAACTG GCCATCCGGC AGATCATCAG CGCGGCCGTG GTCTCGGAGG AGGTGGTCGA CATCTTCGAC GCCGTGGGGC TGGACAAGCC CAACATCGGC ATCCTGGACG ACGCCTTCCT GGCCGAGGTT CGCAACCTGC CAGAGCGCAA CCTCGCGGTG GAATTGCTGG AGCGGCTGCT CGAAGGCGAG ATCAAGTCAC GCTTCGCCGG CAACGTGGTC CAGAACAAGA AGTTCTCGGA CATGCTGGCC GACGTGGTGC AGCGCTACCA AAACCGGTCC ATCGAAGCTG CTCAGGTGAT GGAAGAGCTG GTGCAGATGG CCAAGAAGTT TCGCGCGGCT GCGGCGCGCG GGGAGCAGCT TGGCCTCACC GAAGACGAAG TGCGCTTCTA TGACGCGCTG GCCAACAACG AATCCGCCGT TCGAGAGCTG AACGATGAGA CGCTGAAGAA GATCGCCCAT GAGCTGGCCG AGAACTTGCG CAAGAACCTC ACGGTCGATT GGTCCGCGAG AGAAAGCGTC CAGGCCAAGC TGCGACTGAT GGTCAAGCGC ATCCTGCGCA AGTACAAGTA CCCACCGGAT CAGCAGGACG CTGCGGTGGA GCTTGTGCTG CAGCAGGCCA AGGCGTTGGG AGAAGCGTGG GCATGA
|
Protein sequence | MTEDQLEQEC LAWLADVGWQ HRYGPDIAPD GDAPERDSYR QVLLLGRLRS AVAALNPTVP AAAREDAIRQ VLDLGTPVLL AANRHFHRLL VGGVPVQYQQ DGETRGDFVR LVDWSDPSRN EWLAVNQFSV TGPHHTRRPD IVLFVNGLPL VLIELKNPAD LNADVWKAFH QIQTYKAQIP DIFQTNEVLV VSDGSEALLG SLSADSERFM AWRTIDGNTL DPLGKFHELQ TLVRGALAPA YLLDYLRYFV LFEDDGQLAK KIAGYHQFHA VRAAIAQVVT ASRPNSDARL RGKGGVVWHT QGSGKSITMT CFAARVMQEP AMENPTIVVI TDRNDLDGQL FGVFSLAQDL LREQPVQANT RQELRALLGN RPSGGIVFAT IQKFMPGEDE DTFPLLSDRH NIVVMADEAH RTQYGFEAKL KTPKSALKAS SELTTGNGQP PAHRAEFAPS AKYQVGYAQH LRDALPNATF VAFTGTPVSG EDRDTRAVFG DYISVYDMQQ AKEDGATVAI YYESRLAKLG LKADEMATID DEVDELAEDE EESQQAKLKS RWAALEKVVG AAPRVAQVAA DLVAHFEERN KAQTGKAMVV AMSREICVHV YDEIVKLRPD WHSPDPEQGT IKIVMTGSAS DKALLRPHIY SAQVKKRLEK RFKNPADPLR MVIVRDMWLT GFDAPCVHTL YVDKPMKGHN LMQAIARVNR VFKDKQGGLV VDYIGIANEL KSALKEYTAA QGRGRPTVDA HEAYSVLAEK LDALRGMLAG TNGHGFDYSG FLTGGHKTLA GAANFVLGIK EGKKRFADLA LAMSKAFTLC CTLDEAKAVR EEVAFFQAVK VILTKRDISA QKKMDEQREL AIRQIISAAV VSEEVVDIFD AVGLDKPNIG ILDDAFLAEV RNLPERNLAV ELLERLLEGE IKSRFAGNVV QNKKFSDMLA DVVQRYQNRS IEAAQVMEEL VQMAKKFRAA AARGEQLGLT EDEVRFYDAL ANNESAVREL NDETLKKIAH ELAENLRKNL TVDWSARESV QAKLRLMVKR ILRKYKYPPD QQDAAVELVL QQAKALGEAW A
|
| |