Gene Mmar10_0160 details

Gene Information

Locus tag: Mmar10_0160
Symbol:
ID: 4285525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 164209
End bp: 167622
Gene Length: 3414 bp
Protein Length: 1137 aa
Translation table: 11
GC content: 59%
IMG OID: 638139626
Product: type III restriction enzyme, res subunit
Protein accession: YP_755394
Protein GI: 114568714
COG category: [V] Defense mechanisms
COG ID: [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
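
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The Python sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(164209, 167622)
print(length, protein_length(length))  # 3414 1137
```

Under these assumptions the computed values agree with the listed 3414 bp and 1137 aa.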


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0444998
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.740458
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTCCG TCAATTTCGA AATCCTGCGT AAAGACTGGC CAGAGCTCGC TCAGCTGGGC 
GCGTTTGCTG AGACATACGC GCATACAGAC GCGGCGAGCG CCCTGGTTAA GCTTCGGCTG
TTCGCGGAAA ATCTCACTAA GGATATCTAC CGCGATCTGC GCCTGCCTCG TGAAGATCGT
ATGTCCTTTG TCGACATGCT TGGTGGCAGC GCGTTTCAGT CAGCAATCCC GCGTGTTGTG
ATCGATAAGC TTCATGCGAT CCGTGTCCAC GGAAACAAGG CGGCCCATGG TGAGCAGGCG
ACCATTAAGA CGGCTCTCTT TCTGCTGAAG GAAGCGCACG GATTGGCGCG GTGGCTGCTT
GTGGCGTTTA AGCGCCAGAA AGCAGACGAT CTTCCGCTTT TCAAAGAGCT GGATAACCCT
GTCCAGGGAG CAAGTGCCAC CGAGCTTCAA CGCGAGAAAA AGCGGGCAAT GGAGCGCCTG
GCGGCCCAAG AAGCACAGAT GCAGGTGCTG CTGAAAGAGC TCGAGGCGGA GCGCGAGAAA
GCCAAGGCGG CTGAGGCGAA GACGGAGGAG CTTTCCGCTG CCCGTGAGGA AGGCGACAAG
GCGGCATCGG AGCTTGAATT CGACGAGGCG ACAACTCGAG CTCGGATCAT CGACATGATG
CTCGCCTCGG CTGGGTGGGA TGTCGCCAAC GGTCTCAAAT CGACGGAACA GGTTGCCAAG
GAGTTGGAAG TCGCCGACCA GCCGACCGAG ACCGGCAAAG GCTATGTCGA CTACGTCCTG
CGTGATGGCG ACGGTATGCC GCTCGCAGTC GTGGAGGCGA AGCGAACCTC AAAGAACGCC
GAACTGGGTC GCAAACAGGC GGAGCTCTAT GCCAATGCGC TGGAGAAGCA GTACGGTATC
CGTCCGGCGA TCTTTTACAC GAATGGCTAT GACATTTGGC TGTGGGACGA TGCGGGCGGT
TTCCCGCCGC GCAAGGTCTA CGGCTACTAC TCGCCGGATA GCCTCCAATA TCTGGTGCGG
TTCCAGCGGG CGAATAAGCA GCCGCTGACG GCGCTGCCGC CAAACAAGGA CATCGCCGGC
CGCCTCTACC AGGTTGAGAC CATCAAGCGC GTCAGTGAGC GGTTCGAAGA TAAGCACCGC
AAGGCCCTGA TCGTTCAGGC GACGGGGACC GGTAAGACGC GGGTCGCCAT CGCGCTGTCA
GAGCTGCTGA TCCGCGCCAA ATGGGCCAAG CGGGTGCTCT TTCTGTGTGA CCGGAGGGAG
CTCCGCAAAC AGGCTAAAAA CGCATTCGGA AATTTCCTCA AGGGCGAGCC GATCCGCGTC
GTCAACTCGG CCATTCGCGG GAATGCGAAC GAACGGATAT TCGTCGCGAC CTACCCGGCG
ATGCTGAAGG TCTATCAGGC GTTTGACGTC GGATTTTTCG ACCTGATCAT CGCGGACGAG
AGCCATCGCA GCATCTACAA TGTCTATGGC GACCTGTTTT ACTACTTTGA TTGTCCCCAG
GTTGGCCTGA CCGCGACCCC TGTCGATTTC GTCACGCGGT CGACCTTCGA TCTCTTCAAG
TGCGAGGGGC AGAACCCGAC CGCGAACTAT GAGCTCGAAC AGGCTATCGA AGAGGGCTAC
CTTACCCCGT TCGAAGTCTA TGAGCACACG ACCAAATTCC TGCGGGAAGG TATCCGCCTA
GAGGGTCTGA CCAAGGCCCA GCGCCAGCAG CTCGAGGATC AGGGCGAAGA CCCGGATCAA
TACGACTATT CCTCAGAAGT CGTCGACAAG GCGATCTTCA ACAAGGACAC GAACCGCGCC
GTCCTGCGCA ATCTGATGGA AAACGGCCTG CGGGACGCCA GCGGACAGAC CATCGGCAAG
ACGATCATCT TTGCGCGCAG CCATGCCCAC GCCCTCTTGC TGCGCGAGGT GTTCGATGAG
CTCTATCCGC AATATGGTGG CCGCTTCTGC CAGGTCATCG ACAACTATGA TCCGCGCGCA
GAGCAGCTGA TCGACGACTT CAAAGGCGAC GGCGCGAACA AGGACCTCAC TATCGCGATC
TCCGTGGACA TGCTGGACAC GGGCATCGAC GTTCCGGAAA TTCTCAACCT GGTCTTTGCA
AAGCCGGTCC GCTCGCCGGT GAAGTTCTGG CAAATGGTGG GGCGCGGGAC GCGTCTGTGT
GAGGATTTGT TCGGTCCCGG CCAGCACAAG ACCGTTTTCC GCATCTTCGA TCACTGGGGC
AATTTCGAGC GTTTCGAGAC CGGCTACAAA CCCGCAGAGC CACGCCCGTC CAAATCGCTT
CTGCAACAGC TCTTCGAGCA GCGGATCGAC CTGGCGGAGA CGGCCCTCCG CAAGAGTGAA
ATCCCGGTCT TCGAGACAGT CGTGCAATTA ATTGCAGCCG ACGTTTCGGC CCTGCCGGAA
AACTCCATCG CCGTTCGTGA GAAGTGGCGG GAGAAGCGGC AATTCGCCGA CCTTGAAACG
GTCAAATCCT TCGCCCCGAC AACGGTGGCG GCGCTTCGCC AGACCATCGC GCCTCTGATG
CAGTGGCGGG ATGTGCGGGG CAAAACCGAA GCCCATGCGT TTGATCTGTT GATCGCCCGG
CTCCAGACGG CGGCCCTTCA AGGGTCTGCG GAGGTCGAGG ACCTAAGAGC GGACATGATG
GGACGGCTCG ACACGCTGCA AATGCACCTG AACCCCGTCC GCGAGCGGGC GGAAATCATC
AAAGAGGTGA AGTCGGCTGC GTTCTGGTCC GACCTCAAGC CGTTGGACCT GGAGCGGGTT
CGCATCCCGC TTCGCGAGAT CATCCACCAC CATGACCGCA AGGCCGGGCC GAAAGTACCG
GCGAAGGTCA TCGATGTGGA AGAGGAGGCG TCGGAGTTTC AATACGGCCG CCGGGCCACC
TCCTACACCG CCAATGAGAT GAAGGCGTTC AAGCAGGCGG TGGATGCCGA GCTGAAGAAG
CATATCGAGA CCAACCCCAT CCTGCAGAAA ATCCGCAATG GCGAGCCGGT CACAGATGCC
GACCTGCAAG CTATCGTCTC TCTCGTCCTG ACCCAGAACC CGGATCTCAG TGAAGCCCAG
CTCAACGAAT TCTTCTCGCC AACGGCGGAG AATTTGTTGT TCAAACTGCG GGAAATGACG
GGCCTGGAGG CCACGACGGT GCGGGACCGC TTTGCGGACT TCGTGAACGC GCACCCTACA
CTGACGGCCA AGCAGACCCG CTTCCTCAAC CTGTTGCAGA ACCATCTGAT CAAGTTCGGC
GTCATCACGG TCGACCGGCT GTATGACCAG CCTTTCACGG TGGTCGATGC GGATGGACCG
GACGGGGTCT TCGACGAGCC CGATCTGATG AACGAACTCA TGACCATCAT TTCTCAATTC
GCGCCGCCCT CAACGAGCCG TGAGAGCGCG GAAGATAAAA GGACGCCCAA CTAA
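
The gene sequence above is displayed in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The Python sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and only the first and last displayed lines are used here, so the printed GC value describes that fragment and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join the whitespace-separated blocks of a displayed sequence into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last displayed lines of the gene sequence (fragments, not the full gene).
first_line = "ATGAAGTCCG TCAATTTCGA AATCCTGCGT AAAGACTGGC CAGAGCTCGC TCAGCTGGGC"
last_line = "GCGCCGCCCT CAACGAGCCG TGAGAGCGCG GAAGATAAAA GGACGCCCAA CTAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

# Block count check, start codon check, stop codon check.
print(len(head), head.startswith("ATG"), tail.endswith("TAA"))  # 60 True True
print(round(gc_fraction(head), 2))
```

Applying `gc_fraction` to the full joined sequence would give the value to compare against the GC content field in the record.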
 
Protein sequence
MKSVNFEILR KDWPELAQLG AFAETYAHTD AASALVKLRL FAENLTKDIY RDLRLPREDR 
MSFVDMLGGS AFQSAIPRVV IDKLHAIRVH GNKAAHGEQA TIKTALFLLK EAHGLARWLL
VAFKRQKADD LPLFKELDNP VQGASATELQ REKKRAMERL AAQEAQMQVL LKELEAEREK
AKAAEAKTEE LSAAREEGDK AASELEFDEA TTRARIIDMM LASAGWDVAN GLKSTEQVAK
ELEVADQPTE TGKGYVDYVL RDGDGMPLAV VEAKRTSKNA ELGRKQAELY ANALEKQYGI
RPAIFYTNGY DIWLWDDAGG FPPRKVYGYY SPDSLQYLVR FQRANKQPLT ALPPNKDIAG
RLYQVETIKR VSERFEDKHR KALIVQATGT GKTRVAIALS ELLIRAKWAK RVLFLCDRRE
LRKQAKNAFG NFLKGEPIRV VNSAIRGNAN ERIFVATYPA MLKVYQAFDV GFFDLIIADE
SHRSIYNVYG DLFYYFDCPQ VGLTATPVDF VTRSTFDLFK CEGQNPTANY ELEQAIEEGY
LTPFEVYEHT TKFLREGIRL EGLTKAQRQQ LEDQGEDPDQ YDYSSEVVDK AIFNKDTNRA
VLRNLMENGL RDASGQTIGK TIIFARSHAH ALLLREVFDE LYPQYGGRFC QVIDNYDPRA
EQLIDDFKGD GANKDLTIAI SVDMLDTGID VPEILNLVFA KPVRSPVKFW QMVGRGTRLC
EDLFGPGQHK TVFRIFDHWG NFERFETGYK PAEPRPSKSL LQQLFEQRID LAETALRKSE
IPVFETVVQL IAADVSALPE NSIAVREKWR EKRQFADLET VKSFAPTTVA ALRQTIAPLM
QWRDVRGKTE AHAFDLLIAR LQTAALQGSA EVEDLRADMM GRLDTLQMHL NPVRERAEII
KEVKSAAFWS DLKPLDLERV RIPLREIIHH HDRKAGPKVP AKVIDVEEEA SEFQYGRRAT
SYTANEMKAF KQAVDAELKK HIETNPILQK IRNGEPVTDA DLQAIVSLVL TQNPDLSEAQ
LNEFFSPTAE NLLFKLREMT GLEATTVRDR FADFVNAHPT LTAKQTRFLN LLQNHLIKFG
VITVDRLYDQ PFTVVDADGP DGVFDEPDLM NELMTIISQF APPSTSRESA EDKRTPN