Gene Mpe_A2995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2995 
Symbol 
ID4784684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3183560 
End bp3185179 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content64% 
IMG OID640091566 
Producttype I site-specific deoxyribonuclease 
Protein accessionYP_001022183 
Protein GI124268179 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCAG CGGGTCAGTT CTGGGCGATC GTCAACAAAC TGGCCGAAGA CGAGGAAGAA 
GGCCAGCAGG CCAAGCTCAA GAGTCGCTGG GCGGCACTGG AGAAGGTGGT GGGCGCCGCG
CCGCGCATCG CCCAGGTGGC GGCCGACCTG GTAGCGCACT TCGAGGAGCG CAACAAGGCG
CAGACCGGCA AGGCCATGGT GGTGGCCATG AGCCGCGAGA TCTGCGTGCA TGTCTACGAC
GAGATCGTCA AGCTGCGGCC CGACTGGCAC AGCCCCGACC CCGAGCAAGG CGCGATCAAG
ATTGTGATGA CGGGCTCGGC CAGCGACAAG GCACTGCTGC GGCCCCACAT CTACAGCGCC
CAGGTAAAGA AGCGGCTGGA GAAGCGCTTC AAGAACCCGG CCGACCCGCT GCGCATGGTC
ATCGTGCGCG ACATGTGGCT CACGGGCTTC GACGCGCCCT GCGTGCACAC GCTCTACGTC
GACAAGCCGA TGAAGGGCCA CAACCTCATG CAGGCGATTG CGCGCGTGAA CCGCGTGTTC
AAGGACAAGC AGGGCGGCCT GGTGGTGGAC TACATCGGCA TCGCCAACGA GTTGAAGTCG
GCGTTGAAGG AGTACACCGC AGCACAGGGC CGCGGCCGGC CGACGGTGGA CGCGCACGAG
GCCTATAGCG TGCTGGCCGA GAAGCTGGAT GCGCTGCGCG GCATGCTCGC TGGCACGAAC
GGGCATGGCT TCGACTACAG CGACTTCCTC ACCGGCGGCC ACAAGACGTT GGCCGGCGCG
GCGAACTACG TCTTGGGCCT GAAGGACGGT AAGAAGCGCT TCGCCGACCT GGCGCTGGCG
ATGAGCAAGG CCTTCACGTT GTGCTGCACG CTCGACGAGG CCAAGGCCGT GCGCGAGGAG
GTGGCCTTCT TCCAGGCCGT GAAGGTGATC CTGACCAAGC GGGACATCAG CGCGCAGAAG
AAGATGGACG AGCAACGTGA ACTGGCCATC CGGCAGATCA TCAGCGCGGC CGTGGTCTCG
GAGGAGGTGG TCGACATCTT CGACGCCGTG GGGCTGGACA AGCCCAACAT CGGCATCCTG
GACGACGCCT TCCTGGCCGA GGTTCGCAAC CTGCCAGAGC GCAACCTCGC GGTGGAATTG
CTGGAGCGGC TGCTCGAAGG CGAGATCAAG TCACGCTTCG CCAGCAACGT GGTGCAGAGC
AAGAAGTTCT CCGACATGCT GACGCACGTA GTTCAGCGCT ACCAGAACCG GTCCATCGAA
GCCGCCCAGG TGATGGAAGA GCTGGTGGAG ATAGCCAAGA AGTTCCGTGA GGCGGCGTCA
CGCGGTGAGC AACTGGGCCT GAATGAGGAC GAGGTGCGCT TCTACGACGC GCTGGCAAAC
AACGAATCGG CAGTGCGCGA ACTGACCGAC GAAACGCTGA AGAAGATCGC TCACGAACTG
GCCGAGAGCC TGCGCAAGAA CCTGACCGTG GACTGGTCTG CCCGCGAGAG CGTACAGGCC
AAGCTGCGGT TGATGGTCAA GCGCATCTTG CGCAAGTACA AGTACCCGCC GGATCAGCAG
GAGGCGGCGG TGGAACTGGT GTTGCAGCAG GCGAAGGCGT TGGGGGAGGC GTGGGCGTAG
 
Protein sequence
MIAAGQFWAI VNKLAEDEEE GQQAKLKSRW AALEKVVGAA PRIAQVAADL VAHFEERNKA 
QTGKAMVVAM SREICVHVYD EIVKLRPDWH SPDPEQGAIK IVMTGSASDK ALLRPHIYSA
QVKKRLEKRF KNPADPLRMV IVRDMWLTGF DAPCVHTLYV DKPMKGHNLM QAIARVNRVF
KDKQGGLVVD YIGIANELKS ALKEYTAAQG RGRPTVDAHE AYSVLAEKLD ALRGMLAGTN
GHGFDYSDFL TGGHKTLAGA ANYVLGLKDG KKRFADLALA MSKAFTLCCT LDEAKAVREE
VAFFQAVKVI LTKRDISAQK KMDEQRELAI RQIISAAVVS EEVVDIFDAV GLDKPNIGIL
DDAFLAEVRN LPERNLAVEL LERLLEGEIK SRFASNVVQS KKFSDMLTHV VQRYQNRSIE
AAQVMEELVE IAKKFREAAS RGEQLGLNED EVRFYDALAN NESAVRELTD ETLKKIAHEL
AESLRKNLTV DWSARESVQA KLRLMVKRIL RKYKYPPDQQ EAAVELVLQQ AKALGEAWA