Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmc1_0790 |
Symbol | |
ID | 4481079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Magnetococcus sp. MC-1 |
Kingdom | Bacteria |
Replicon accession | NC_008576 |
Strand | + |
Start bp | 987302 |
End bp | 990280 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639721534 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_864717 |
Protein GI | 117924100 |
COG category | [V] Defense mechanisms |
COG ID | [COG3587] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.186387 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTTC ATTTTGAACC AGACCTGGAT TATCAACTGG CGGCCATCGA GTCGGTCTGC GCCCTGTTTC GGGGCCAGGA GATTGGGCGA ACCGAGTTCA CGGTGACCTT GCCATCGGGC ACGTTGGATT ATGGAGAGGA TCGTCTTGGC ATCGGCAACC GCCTGAAGAT GCTGGAAGAT GAAGTCCACG CCAATCTGAA GGAGGCCCAA CTCCGCAACG GGCTGCGTCC GGCGTCATCT CTGGCCACCA TGGATTTTAC CGTGGAGATG GAGACCGGCA CCGGCAAGAC CTACGTCTAC CTGCGCACCA TTTTCGAGTT GCACAGGCGC TACGGTTTCA CCAAGTTCGT CATCGTCGTC CCCTCGGTGG CCATCAAGGA GGGGGTCTAC AAGTCAATTC AGATCATGGA GGAGCATTTT CGGGGCCTCT ACGCCAACGC CCCGTTCGAA TATTTCCTCT ACGACTCCGG CAAGCTGGGA CAGGTGCGCA ACTTCGCCAC CAGTCCCAAT ATCCAGATCA TGGTGGTCAC CGTCGGCGCC ATCAACAAGC AGGACGTGAA CAACCTCTAC AAGGAGAGCG AGAAGACCGG CGGCGAAAAG CCCATCGATC TGGTGCGGGC CACCCGACCC GTGCTGATCG TGGACGAGCC CCAGAGTGTG GATGGTGGCC TGACCGGCAA GGGCAAGGCC GCCCTCTCCG AGATGCACCC CCTCTGCACC CTGCGCTATT CCGCCACCCA TGTGGACAAG CACCACATGA TCTACCGCCT CGACGCCGTG GATGCCTACG AGCGCCGTCT GGTGAAACAG ATCGAGGTTG CATCCCTGGA GGTGGAAGGC GGTCACAACA AGCCGTATCT CCGGCTCCTC TCCGTCAGCA ACACCCGGGG CCGGATCGCC GCCAAGGTGG AAGTGGACGC GCTGCAAGGA AAGGCGGTGC GCCGCAAAAC GGTCACGGTG CAGGACGGGG ATGACCTGGA GCAGATCACC GAACGCAGCC TCTATGCAGA CCACCGCATC GGTGAAATCC GGGTGGCCAA AGGGGATGAA TTCCTGGAGG TGCGCATACC CGGCAGCGAA ACCTTCCTTC GTCTGGGAGA GGCCATTGGC GACGTGGACC AGGAGGAGAT GAAGCGCCAG ATGATCCGGC GCACCATCAA GGAGCACCTG GACAAGGAGA AGCGCCTGCG CCCCCTCGGC ATCAAGGTGC TGTCACTGTT CTTCATCGAC TCGGTGGCCA ACTACCGCGC CTACGATGCC GAAGGCAATC CGGCAAAGGG TCCATACGCC GAGATTTTCG AGAAGGAGTA CCGCCGTTGG ATTCGCCACC CGGACTACAA CACCCTGTTC CGCGAGGTGG ACACCGAAAC GTTGCCGGCC CAGGTTCACG ACGGTTACTT CTCCATCGAC AAGAAGGGCT CCTGGACCGA TACCGCCGAA AACAACCAGG CCAACCGGGA GAGCGCCGAA CGGGCCTATA CCCTGATCAT GAAGGACAAG GAGCGGCTTC TGGGATTCGA GACGCCGCTC AAGTTCATCT TCTCCCACTC CGCCCTGCGC GAGGGATGGG ATAACCCCAA CGTCTTTCAG ATCTGCGTCC TGCGGGACAT GGGCTCCGAA CTGGCCCGCC GCCAATCCAT CGGTCGCGGC CTGCGCCTGT GCGTCAACCA GAACGGCGAA CGCCAGCGGG GCTTCGACAT CAACACCCTG ACGGTCATCG CCACCGAGAG CTACGAGCAG TTCGCCGAGA CATTGCAGAA GGAGATCGAG GCCGACACCG GCATCCGCTT CGGGGTCGTG GAGAAGCATC AGTTTGCCAC CATCCCCGTC ACTGACGAGC ATGGCCGCCA GTCCCCCTTG GGGGTCGAGA AGTCGGAGGC ACTCTGGAAA CACCTCCATG CAACCGGCTA CGTGGATACC AAGGGCAAGG TCCAGGACGC CCTGCGTACC GCATTAAAGG AGGGAACCCT GGACGTTCCA GAAGCCTTCA ACGCCCAGGC CCCGCAGATT CAGGAGATCC TGAAGAAACT AGCGGGGCGG CTGGAGATCA AGAATGCCGA CGAACGGGAG AGCGTCAGAG TCCGCAAGGA GGTGCTGTGC AGCCCCGAGT TTCAGGCCCT GTGGGATCGC ATCAAGCACA AGACCACCTA CCGAGTAGAG TTCGACAACG ACCGTCTGCT GGAGGAGTGC GCCAAGGCCA TCGGCAGTGC GCCGCCGGTT TCCCGCGCCC GGTTGCAGAT TAGCAAGGCG GATCTGGCCA TCGGCAAGGG CGGCGTTCAA GCCAAGGAGA CGTCCAAAGC CGCCCCGATC ACCATTGACG AGCGGGACAT CGAACTGCCC GACTTGCTCT CGGTCCTGCA GGACAACACC CAGCTGACCC GCCGGAGCAT CGTTCGGATT CTGACCGGCA GCGGTAGGCT CAGCGACTTC GCCCACAACC CTCAGCAGTT TATCGAACTG GCCACCGAGG CGATCAACCG CGCCAAACGC CTAGCCCTGG TGGACGGCAT CCGCTACCAG CGTATCGGCG ACGACCAGTT CTATGCCCAG GAGCTGTTCG AGCAGGAGGA GTTGACGGGA TACCTGAAGG ACATGCTGAA GGATGCCAAG AAATCGGTTT TTGAGCACGT CATCTACGAT TCCGGCGGCG TGGAGCGGCT GTTTGCAGAG CAGTTGGAGC GGAACGAGGC CGTCAGGGTG TATGCCAAGC TGCCGGGCTG GTTCAAGGTG CCTACGCCTC TGGGAACCTA CAACCCGGAC TGGGCAGTGT TGGTGGAGAA GGACGGCGAG GAAAGGCTGT ACTTCGTGGT GGAAACCAAG GGGCGTGTTG AAGGGCAGCT GTTCGCTGAC GATCTCAGGG ATAAGGAAAA AGCCAAAATC GCCTGCGGCA AGGCTCACTT CAAGGCACTG GCGGTAGGCG AGAATCCAGC GCGCTATGTG GTGGCCACCA ACAGCGATGA CCTGATGGCG CAGTTGTAA
|
Protein sequence | MKLHFEPDLD YQLAAIESVC ALFRGQEIGR TEFTVTLPSG TLDYGEDRLG IGNRLKMLED EVHANLKEAQ LRNGLRPASS LATMDFTVEM ETGTGKTYVY LRTIFELHRR YGFTKFVIVV PSVAIKEGVY KSIQIMEEHF RGLYANAPFE YFLYDSGKLG QVRNFATSPN IQIMVVTVGA INKQDVNNLY KESEKTGGEK PIDLVRATRP VLIVDEPQSV DGGLTGKGKA ALSEMHPLCT LRYSATHVDK HHMIYRLDAV DAYERRLVKQ IEVASLEVEG GHNKPYLRLL SVSNTRGRIA AKVEVDALQG KAVRRKTVTV QDGDDLEQIT ERSLYADHRI GEIRVAKGDE FLEVRIPGSE TFLRLGEAIG DVDQEEMKRQ MIRRTIKEHL DKEKRLRPLG IKVLSLFFID SVANYRAYDA EGNPAKGPYA EIFEKEYRRW IRHPDYNTLF REVDTETLPA QVHDGYFSID KKGSWTDTAE NNQANRESAE RAYTLIMKDK ERLLGFETPL KFIFSHSALR EGWDNPNVFQ ICVLRDMGSE LARRQSIGRG LRLCVNQNGE RQRGFDINTL TVIATESYEQ FAETLQKEIE ADTGIRFGVV EKHQFATIPV TDEHGRQSPL GVEKSEALWK HLHATGYVDT KGKVQDALRT ALKEGTLDVP EAFNAQAPQI QEILKKLAGR LEIKNADERE SVRVRKEVLC SPEFQALWDR IKHKTTYRVE FDNDRLLEEC AKAIGSAPPV SRARLQISKA DLAIGKGGVQ AKETSKAAPI TIDERDIELP DLLSVLQDNT QLTRRSIVRI LTGSGRLSDF AHNPQQFIEL ATEAINRAKR LALVDGIRYQ RIGDDQFYAQ ELFEQEELTG YLKDMLKDAK KSVFEHVIYD SGGVERLFAE QLERNEAVRV YAKLPGWFKV PTPLGTYNPD WAVLVEKDGE ERLYFVVETK GRVEGQLFAD DLRDKEKAKI ACGKAHFKAL AVGENPARYV VATNSDDLMA QL
|
| |