Gene Mmc1_0134 details


Gene Information

Locus tag: Mmc1_0134
Symbol:
ID: 4481023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Magnetococcus sp. MC-1
Kingdom: Bacteria
Replicon accession: NC_008576
Strand:
Start bp: 141254
End bp: 144301
Gene Length: 3048 bp
Protein Length: 1015 aa
Translation table: 11
GC content: 56%
IMG OID: 639720879
Product: HsdR family type I site-specific deoxyribonuclease
Protein accession: YP_864067
Protein GI: 117923450
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
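
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS length counts one terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp < 3 or cds_length_bp % 3 != 0:
        raise ValueError("CDS length must be a positive multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(141254, 144301)
    print(length, protein_length_aa(length))  # prints "3048 1015"
```

Under these assumptions the result agrees with the Gene Length (3048 bp) and Protein Length (1015 aa) fields listed above.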


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.363012
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGGAA TAGACTCAAT GACCGAACAA CAAATAGAAC AAGATTTTAT TACTAAGCTA 
GGAGAGCTCA AATACAGCTT TCGCAAGGAT ATTCGTGATC GAGCCTCCCT TGAGAATAAT
TTTCGCGAGA AATTTAACGC GCTCAACCGT GTCCGTCTGA CCGATGCCGA GTTCACCCGG
CTGCGTGATG AGATCATCAC CGCCGATGTT TTCCAGGCCG CCAAAACCCT ACGCGAGTAT
GGCTATATCG AACGTGAAGA CGGCACCCCG CTGGACTACA TGTTGGTCAA CCTTAAGGAT
TGGTGCAAAA ACGAGTTCGA GGTCATCCAT CAGCTACGTA TTAACACCGA CAACAGTCAC
CACCGCTATG ACGTGATCCT GCTCATCAAC GGCCTGCCGC TGGTGCAAAT TGAGCTGAAG
AGCCTCGGCA TCAACCCACG CCGGGCTATG GAGCAGATTA TCGAGTACCG CAACGACTCC
GGCACCGGCT ACGCCAACAC GCTGCTCTGT TTTATGCAGC TTTTTATCGT CAGTAACCGC
GATGACACTT GGTACTTCAC CAACAACCAC AACCAGCACT TTGCCTTCAA CGCCGAAGAG
CGTTTTTTGC CCATCTACCA ATGGGCCGAT AAAGACAACC GCAAAGTGTG CCACCTAGAC
GACTTTGCTG AGAAATTTCT GGCCAAGTGT ACCCTGGGGC AGATGATCAG TCGCTACATG
GTGCTGGTCG TCAGCGAGCA GAAGCTGCTG ATCATGCGCC CCTACCAGAT CTATGCGGTC
CAGGCCATTG TCGACTGTAT CCACCAGAAC CGGGGCAACG GCTACATCTG GCACACCACC
GGCAGCGGGA AGACCCTTAC CTCCTTCAAG GCCTCTACCC TGCTCAAGGA CAACCTGGAC
ATCGAGAAAT GTCTCTTCGT GGTGGACCGC AAAGACCTGG ACCGCCAGAC CCGCATCGAA
TTCAACAGGT TTCAGGAAGG TTGCGTCGAG GAGAACACCA ACACCGAAAC CCTGGTGCGC
CGTCTGCTCT CGGAGGACTA CGCCCACAAG GTGATCGTCA CCACCATCCA GAAGCTCGGC
CTGGCCCTGG ATGAAACCGG CAATAAGGCT CAGCAATACA AGGAGAAGGG CAAGCCCACC
TTTAAGGAGC GGCTTGCTCC GCTACGCGAC CAGCGCATCA TCATCATCTT CGACGAATGC
CACCGCTCCC AGTTTGGCGA GAACCACAAG GCCATCAAGG AATTCTTCCC CAAGGCGCAA
CTGTTCGGCT TTACCGGCAC GCCCATCTTT GAGCAGAACG CCAGCTATAC CCAGGTCGAT
GGTGAACTTG CCTCCCACAA GACCACCGAG GAGATCTTTG AAAAACGTCT GCACGCCTAC
ACCATCACCC ACGCCATTGA TGACCGCAAC GTGCTGCGTT TTCACATCGA CTACTTTAAG
CCCGAGGCTG CATCACCCGC AGGCGATAAG GCCAAAGCTA ACACCACCAG TCTCACCCGG
CCCGAAACCC AGCGCCTCGT GGTGGATACC ATCCTGAAGA AGCACGACGC CGCCACCGAT
CACCGCCGCT TTAACGCCCT CCTGGCCACC GCCTCCATCA ACGACGCCAT TAATTATTAT
GAGCTGTTCA AGCAAGCCCA GGCCGAGCGC CGGGAGGAAG ACCCCGACTT CATCCCGCTT
CACATTGCCT GCGTCTTCTC GCCCCCGGCG GAGGGCAACA AGGATGTCAA ACAGCTACAA
GAAGACCTGC CCCAGGAGAA GGACGACAAC CAGCAGGAGC CGGAGCGGAA AAAGGCCGCG
CTGCAACAGA TCATTGCCGA CTACAACACC CAATATGGTA CCAACCATAA GCTTGGCGAG
TTTGATCTTT ACTATCAGGA TGTGCAACTG CGCATCAAGG ATCAGAAGTA CCCCAACAGC
GACTTCCTGC GCAAAAACAA GATCGACCTC ACCATTGTGG TGGATATGCT GCTCACCGGT
TTTGACGCCC AGTACCTCAA TACCCTGTAT GTGGACAAGA ACCTCAAGCA CCATGGTTTG
ATTCAAGCTC TCTCGCGCAC CAACCGGATG CTCAACGACA CCAAGCCCTA CGGCAATATT
CTCGATTTTC GCGCTCAGAA AGGGGCCGTT GACGAGGCCA TCGCCCTCTT CTCTGGTGAA
GATGTCACCC GCTCCCGTGA AATCTGGCTG GTGGACCCGG CCCCCAAGGT CATCGACAAG
CTCGATAGCG CCGTAAAGCG ATTGGAAGCG TTTATGCAGG CCCAGGGCCA GCCCTGCACC
CCCGCTGCGG TCAACGACCT CAAGGGCGAT GTCGCCCGTG CCGAGTTCAT TAACTGTTTC
AAGGAGGTGC AGCGCCTCAA GACCCAGCTT GACCAATACA CCGACCTGAG CGACGCGCAA
AAACAGCAGA TCGAGCAGTA CCTGCCAGCC GAGCAATTGC GTGGCTTTAA GGGCATGTAC
CTGGAGACCG CCCAGCGCCT CAAGGCGCAG CAGGGCAAGA GCGGCATAGA GGCCAGCGAA
GCCAAGGATG CCATCGAGCA GCTCGATTTT GAGTTTGTGC TCTTCTCCTC GGCCATGATC
GACTACGACT ACATCATGGG CTTGGTCTCC CGCTTTACCC AGCAGCTACC AGGCAAGCAG
AGCATGAGCC GCGACCAAAT CATCAACCTG CTCGCCACCA ACAGCAACCT GATGGCGGAG
CGCGAGGAGA TCACCGCCTA CATCAAAACC CTGGAAGAGG GCCAAGGGCG GAGTGTAGAG
GCCATTCGCG ATGGCTACCA GAGCTTCAAG GCGCACAAGG CCGAAGGGGA ACTGGCCGCG
CTGGCCCAAC GGCACGGGCT GGAACGCGAC GCCCTGCAAG GCTTGGTCGT GGGCATTCTG
GACCGCATGA TCTTCGATGG CGAGCAGCTA AACGACCTCT TCGCGCCGCT GGAACTGGGC
TGGAAGGATC GCAGCAAGGC CGAGCTGGCG TTGATGGAGG AGTTGGTCCC ATTGTTGAAG
AAACAGGCCG GCGGGCGGGA GATATCGGGG TTGGCGGCTT ATGAGTAA
 
Protein sequence
MIGIDSMTEQ QIEQDFITKL GELKYSFRKD IRDRASLENN FREKFNALNR VRLTDAEFTR 
LRDEIITADV FQAAKTLREY GYIEREDGTP LDYMLVNLKD WCKNEFEVIH QLRINTDNSH
HRYDVILLIN GLPLVQIELK SLGINPRRAM EQIIEYRNDS GTGYANTLLC FMQLFIVSNR
DDTWYFTNNH NQHFAFNAEE RFLPIYQWAD KDNRKVCHLD DFAEKFLAKC TLGQMISRYM
VLVVSEQKLL IMRPYQIYAV QAIVDCIHQN RGNGYIWHTT GSGKTLTSFK ASTLLKDNLD
IEKCLFVVDR KDLDRQTRIE FNRFQEGCVE ENTNTETLVR RLLSEDYAHK VIVTTIQKLG
LALDETGNKA QQYKEKGKPT FKERLAPLRD QRIIIIFDEC HRSQFGENHK AIKEFFPKAQ
LFGFTGTPIF EQNASYTQVD GELASHKTTE EIFEKRLHAY TITHAIDDRN VLRFHIDYFK
PEAASPAGDK AKANTTSLTR PETQRLVVDT ILKKHDAATD HRRFNALLAT ASINDAINYY
ELFKQAQAER REEDPDFIPL HIACVFSPPA EGNKDVKQLQ EDLPQEKDDN QQEPERKKAA
LQQIIADYNT QYGTNHKLGE FDLYYQDVQL RIKDQKYPNS DFLRKNKIDL TIVVDMLLTG
FDAQYLNTLY VDKNLKHHGL IQALSRTNRM LNDTKPYGNI LDFRAQKGAV DEAIALFSGE
DVTRSREIWL VDPAPKVIDK LDSAVKRLEA FMQAQGQPCT PAAVNDLKGD VARAEFINCF
KEVQRLKTQL DQYTDLSDAQ KQQIEQYLPA EQLRGFKGMY LETAQRLKAQ QGKSGIEASE
AKDAIEQLDF EFVLFSSAMI DYDYIMGLVS RFTQQLPGKQ SMSRDQIINL LATNSNLMAE
REEITAYIKT LEEGQGRSVE AIRDGYQSFK AHKAEGELAA LAQRHGLERD ALQGLVVGIL
DRMIFDGEQL NDLFAPLELG WKDRSKAELA LMEELVPLLK KQAGGREISG LAAYE