Gene Information |
Locus tag | Mmc1_0134 |
Symbol | |
ID | 4481023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Magnetococcus sp. MC-1 |
Kingdom | Bacteria |
Replicon accession | NC_008576 |
Strand | - |
Start bp | 141254 |
End bp | 144301 |
Gene Length | 3048 bp |
Protein Length | 1015 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639720879 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_864067 |
Protein GI | 117923450 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.363012 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGGAA TAGACTCAAT GACCGAACAA CAAATAGAAC AAGATTTTAT TACTAAGCTA GGAGAGCTCA AATACAGCTT TCGCAAGGAT ATTCGTGATC GAGCCTCCCT TGAGAATAAT TTTCGCGAGA AATTTAACGC GCTCAACCGT GTCCGTCTGA CCGATGCCGA GTTCACCCGG CTGCGTGATG AGATCATCAC CGCCGATGTT TTCCAGGCCG CCAAAACCCT ACGCGAGTAT GGCTATATCG AACGTGAAGA CGGCACCCCG CTGGACTACA TGTTGGTCAA CCTTAAGGAT TGGTGCAAAA ACGAGTTCGA GGTCATCCAT CAGCTACGTA TTAACACCGA CAACAGTCAC CACCGCTATG ACGTGATCCT GCTCATCAAC GGCCTGCCGC TGGTGCAAAT TGAGCTGAAG AGCCTCGGCA TCAACCCACG CCGGGCTATG GAGCAGATTA TCGAGTACCG CAACGACTCC GGCACCGGCT ACGCCAACAC GCTGCTCTGT TTTATGCAGC TTTTTATCGT CAGTAACCGC GATGACACTT GGTACTTCAC CAACAACCAC AACCAGCACT TTGCCTTCAA CGCCGAAGAG CGTTTTTTGC CCATCTACCA ATGGGCCGAT AAAGACAACC GCAAAGTGTG CCACCTAGAC GACTTTGCTG AGAAATTTCT GGCCAAGTGT ACCCTGGGGC AGATGATCAG TCGCTACATG GTGCTGGTCG TCAGCGAGCA GAAGCTGCTG ATCATGCGCC CCTACCAGAT CTATGCGGTC CAGGCCATTG TCGACTGTAT CCACCAGAAC CGGGGCAACG GCTACATCTG GCACACCACC GGCAGCGGGA AGACCCTTAC CTCCTTCAAG GCCTCTACCC TGCTCAAGGA CAACCTGGAC ATCGAGAAAT GTCTCTTCGT GGTGGACCGC AAAGACCTGG ACCGCCAGAC CCGCATCGAA TTCAACAGGT TTCAGGAAGG TTGCGTCGAG GAGAACACCA ACACCGAAAC CCTGGTGCGC CGTCTGCTCT CGGAGGACTA CGCCCACAAG GTGATCGTCA CCACCATCCA GAAGCTCGGC CTGGCCCTGG ATGAAACCGG CAATAAGGCT CAGCAATACA AGGAGAAGGG CAAGCCCACC TTTAAGGAGC GGCTTGCTCC GCTACGCGAC CAGCGCATCA TCATCATCTT CGACGAATGC CACCGCTCCC AGTTTGGCGA GAACCACAAG GCCATCAAGG AATTCTTCCC CAAGGCGCAA CTGTTCGGCT TTACCGGCAC GCCCATCTTT GAGCAGAACG CCAGCTATAC CCAGGTCGAT GGTGAACTTG CCTCCCACAA GACCACCGAG GAGATCTTTG AAAAACGTCT GCACGCCTAC ACCATCACCC ACGCCATTGA TGACCGCAAC GTGCTGCGTT TTCACATCGA CTACTTTAAG CCCGAGGCTG CATCACCCGC AGGCGATAAG GCCAAAGCTA ACACCACCAG TCTCACCCGG CCCGAAACCC AGCGCCTCGT GGTGGATACC ATCCTGAAGA AGCACGACGC CGCCACCGAT CACCGCCGCT TTAACGCCCT CCTGGCCACC GCCTCCATCA ACGACGCCAT TAATTATTAT GAGCTGTTCA AGCAAGCCCA GGCCGAGCGC CGGGAGGAAG ACCCCGACTT CATCCCGCTT CACATTGCCT GCGTCTTCTC GCCCCCGGCG GAGGGCAACA AGGATGTCAA ACAGCTACAA GAAGACCTGC CCCAGGAGAA GGACGACAAC CAGCAGGAGC CGGAGCGGAA AAAGGCCGCG 
CTGCAACAGA TCATTGCCGA CTACAACACC CAATATGGTA CCAACCATAA GCTTGGCGAG TTTGATCTTT ACTATCAGGA TGTGCAACTG CGCATCAAGG ATCAGAAGTA CCCCAACAGC GACTTCCTGC GCAAAAACAA GATCGACCTC ACCATTGTGG TGGATATGCT GCTCACCGGT TTTGACGCCC AGTACCTCAA TACCCTGTAT GTGGACAAGA ACCTCAAGCA CCATGGTTTG ATTCAAGCTC TCTCGCGCAC CAACCGGATG CTCAACGACA CCAAGCCCTA CGGCAATATT CTCGATTTTC GCGCTCAGAA AGGGGCCGTT GACGAGGCCA TCGCCCTCTT CTCTGGTGAA GATGTCACCC GCTCCCGTGA AATCTGGCTG GTGGACCCGG CCCCCAAGGT CATCGACAAG CTCGATAGCG CCGTAAAGCG ATTGGAAGCG TTTATGCAGG CCCAGGGCCA GCCCTGCACC CCCGCTGCGG TCAACGACCT CAAGGGCGAT GTCGCCCGTG CCGAGTTCAT TAACTGTTTC AAGGAGGTGC AGCGCCTCAA GACCCAGCTT GACCAATACA CCGACCTGAG CGACGCGCAA AAACAGCAGA TCGAGCAGTA CCTGCCAGCC GAGCAATTGC GTGGCTTTAA GGGCATGTAC CTGGAGACCG CCCAGCGCCT CAAGGCGCAG CAGGGCAAGA GCGGCATAGA GGCCAGCGAA GCCAAGGATG CCATCGAGCA GCTCGATTTT GAGTTTGTGC TCTTCTCCTC GGCCATGATC GACTACGACT ACATCATGGG CTTGGTCTCC CGCTTTACCC AGCAGCTACC AGGCAAGCAG AGCATGAGCC GCGACCAAAT CATCAACCTG CTCGCCACCA ACAGCAACCT GATGGCGGAG CGCGAGGAGA TCACCGCCTA CATCAAAACC CTGGAAGAGG GCCAAGGGCG GAGTGTAGAG GCCATTCGCG ATGGCTACCA GAGCTTCAAG GCGCACAAGG CCGAAGGGGA ACTGGCCGCG CTGGCCCAAC GGCACGGGCT GGAACGCGAC GCCCTGCAAG GCTTGGTCGT GGGCATTCTG GACCGCATGA TCTTCGATGG CGAGCAGCTA AACGACCTCT TCGCGCCGCT GGAACTGGGC TGGAAGGATC GCAGCAAGGC CGAGCTGGCG TTGATGGAGG AGTTGGTCCC ATTGTTGAAG AAACAGGCCG GCGGGCGGGA GATATCGGGG TTGGCGGCTT ATGAGTAA
|
Protein sequence | MIGIDSMTEQ QIEQDFITKL GELKYSFRKD IRDRASLENN FREKFNALNR VRLTDAEFTR LRDEIITADV FQAAKTLREY GYIEREDGTP LDYMLVNLKD WCKNEFEVIH QLRINTDNSH HRYDVILLIN GLPLVQIELK SLGINPRRAM EQIIEYRNDS GTGYANTLLC FMQLFIVSNR DDTWYFTNNH NQHFAFNAEE RFLPIYQWAD KDNRKVCHLD DFAEKFLAKC TLGQMISRYM VLVVSEQKLL IMRPYQIYAV QAIVDCIHQN RGNGYIWHTT GSGKTLTSFK ASTLLKDNLD IEKCLFVVDR KDLDRQTRIE FNRFQEGCVE ENTNTETLVR RLLSEDYAHK VIVTTIQKLG LALDETGNKA QQYKEKGKPT FKERLAPLRD QRIIIIFDEC HRSQFGENHK AIKEFFPKAQ LFGFTGTPIF EQNASYTQVD GELASHKTTE EIFEKRLHAY TITHAIDDRN VLRFHIDYFK PEAASPAGDK AKANTTSLTR PETQRLVVDT ILKKHDAATD HRRFNALLAT ASINDAINYY ELFKQAQAER REEDPDFIPL HIACVFSPPA EGNKDVKQLQ EDLPQEKDDN QQEPERKKAA LQQIIADYNT QYGTNHKLGE FDLYYQDVQL RIKDQKYPNS DFLRKNKIDL TIVVDMLLTG FDAQYLNTLY VDKNLKHHGL IQALSRTNRM LNDTKPYGNI LDFRAQKGAV DEAIALFSGE DVTRSREIWL VDPAPKVIDK LDSAVKRLEA FMQAQGQPCT PAAVNDLKGD VARAEFINCF KEVQRLKTQL DQYTDLSDAQ KQQIEQYLPA EQLRGFKGMY LETAQRLKAQ QGKSGIEASE AKDAIEQLDF EFVLFSSAMI DYDYIMGLVS RFTQQLPGKQ SMSRDQIINL LATNSNLMAE REEITAYIKT LEEGQGRSVE AIRDGYQSFK AHKAEGELAA LAQRHGLERD ALQGLVVGIL DRMIFDGEQL NDLFAPLELG WKDRSKAELA LMEELVPLLK KQAGGREISG LAAYE