Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MmarC5_0639 |
Symbol | |
ID | 4928565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcus maripaludis C5 |
Kingdom | Archaea |
Replicon accession | NC_009135 |
Strand | - |
Start bp | 607469 |
End bp | 610498 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640166141 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001097166 |
Protein GI | 134045680 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.28859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAATACA CAGAAAAGCG ATTTGAACAG GACATTGAAG AGTCAATGCT AACTAATGGT GGGTATATCA AAGGCGATAA GTCTAGCTAC AGTGCGGAAC GTGGAATAGA TTTATCAAAA CTTGTTGAAT TCATCAAAAC AACACAAGAA AAGGAATGGG ATCGGTATGT ACGCATTTAT GGGGATGAAG CAGAAAACAA TCTATTCAAG CGATTCAATG ATTCTGTCAA TGCGCAAGGA CTACTGTATT CACTTAGAAA GGGCATTGCT GATAGGGGTG TTAAGTTAAA GCTATGCTAC TTTAAACCCG AATCAAATTT AAACCATGAA CTCGTGAAAA AATACGAAAA TAACATATTA ACAGTTACAC GACAGTTTGC ATACTCAACG GAAAACAAAA ATACAATTGA TATGGTTTTA TCGCTTAATG GTATTCCGAT AGTTGCGCTA GAGCTTAAAA ACCAGATTAC TGGTCAATCT ATCGAAAACG GTAAAAAACA GTTCATGTAC GATAGAAACC CTCGAGAGCC ATGCTTCCAG TTTAACAAAC GATTTTTAGC ATATTTTGCT GTTGATTTGC ATGAAGTAGC AATGACTACA AAATTAGCCG GACCAAACAC TATGTTTTTA CCATTTAACC AGGGTTCTGG CGGAGCTGGA CAGGTCGGTG GTGCAGGTAA TCCTGAAAAC AAAGACGGAT ATGCTACATC ATACCTCTGG GAGCAGGTAT TGACGAAAGA TTCACTGATG AACATCATAC ACAGATTCCT CCATATAAAC GTTGAAAAGA AAAAAGTAGT TAAAAACGGA CGTGAAACTA CAAAAATCAG TCAAAAGTTA ATATTTCCTC GTTATCACCA GCTCGATGTT GTAACAAAGC TTGTAAACAA CGTAAAACAG CACGGATCTG GATGTAATTA CTTGATTCAG CATTCAGCAG GATCAGGCAA AAGTAACAGT ATCGCATGGA TTGCTTACAG ACTTGCAAGT CTTCACGATG AGGATAATAA CAGCATATTC AATTCAGTTA TCGTTGTAAC CGATAGGAAA GTGCTTGACA GTCAGCTTCA AGATACAATA TCTGGTTTCG ACCACACTTA CGGCCTCGTT GAAACAATAG GGGATAAAAA AACATCTCAA GATTTGAAAA ACGCAATTAA CGATGGTAAA AAGATTATCG TTACAACACT TCAAAAATTC CCTGTAATCT ATGAAGATGT AGAAAACAAT AAGGGTAAAC GATTCGCTGT AATTGTTGAC GAAGCGCATT CCAGTCAGAC TGGAACGAGT GCACAGAAAT TAAAGATTGC ACTTGCAGAT AACAAAGATG CACTTGAAGA ATATGCTAAA ATTGAGGGTG AAGCTGAAGA AGCTTCTGAA GATTTCGAAG ATAAGCTGGT AAGTGAGTTA AGTACACACG GGCGACACCA AAATTTATCA TTTTTCGCAT TTACTGCAAC ACCAAAACAA AAAACATTAG AAATGTTTGG AAATCGATGC ACTGATGGTT CATTCCACCC ATTCCACATA TATTCGATGA AACAGGCAAT CGAAGAAGGA TTCATTCTTG ATGTGCTTAA AAATTACATG ACTTATAAAA CATGCTTTAA AATCGCTAAA AACATACCTG ATAATCCTGA ATTACCAGAA AGTGAAGCAA AAAGGGCTAT CAGGCGTTAC GAATCACTGC ACCCATATAA CTTACAGCAA AAAACATCAG TTATGGTTGA GTACTTCCGT GAAATCACTT CCAAAAAAAT CAATGGTCAA GCAAAAGCAA TGATTATAAC ACCATCAAGA CTTCACGCAG TTAGATATTA CAAAGAATTT AAGAAATACA TTAAAAATAA AGGATACAAC GATCTTGATG TTCTTATAGC ATTTTCTGGA ACCGTAAAAG ACGGAGACGA AGAATATACT GAAAGTGGAA TGAATGTAAC AAAATCAGGT AATAAAATAA GTGAAAAGCA GCTCCCACAG GTATTTCACG GGGACGAATT TAACATGCTG ATAGTTGCTG AAAAGTATCA GACAGGCTTT GATGAACCTT TATTACACAC GATGTTTGTT GACAAGAAGC TTAATGGTGT TAAAGCTGTT CAAACACTTT CAAGACTTAA CAGAACCGCC CCTTATAAAA ACGATACATT TATACTCGAC TTTGTGAACA ATTCAGAAGA CATATTACTT TCATTTGCAC CATACTACAA AGAAACCGCC CTTGACGAAG AAATCAATGT TAATTTAATT TATGATACAA AAGCGCTTCT TAGAAATTTC AGAATATACA ATGATGACGA TGTTGAGAAA AGTACCAAGA TTTACTATAA ATCAGGTGCG CAGTCAAATA CTGCTCACGG TAAAATCACG AGTATGTTCC TGCCGATAAT TAGGGTGTAT TCCGAAATTA GCGAAGAAGA TCAGTTCAAG TTTAAAAAAG CGGTCAGAAA CTTTATAAAA TGGTATTCGT ACATCACACA AATTGAAAGA ATGTTCGATA AGGATTTACA GAAGGAATAT CGCTTCTTGC AATATCTCGA AAAAATGCTG CCTAAAAACT CAGCTGAAAA GATCGACCTC GAGGATAAAA TCAAACTCGA ATACTACAAG CTCAGCAAAA CTTTCGAAGG CGATATCAGT CTTGAAACTA AGGACGACTG CGCAATGCTT ACAAATCCTA AAACAATTGA TAATGTAATC GGAGTAACTG GTAACGATGA ATTGCTAGAT GCAATAATCT CAAAGATCAA TGAAGTTTAC GAAGGCGAGT TTAGCGATGG TGACAAGGTC ATGGTTAAAA CAATCTACGG AAAACTACTA AACGACAGTG AAAAACTTGA ACAGTACACG AAAGATTCGT TTGAGATATT TAACGATAGT TTCTTCCCAG AACTCTTCGA TAAAGCCACT CGCAGCTGCT TCTTAGAACA ATCGAATTCT TTTAAAAAGA TCTTTGAAAA CAGAACTTTT TACAACACAA TACAGCAAGA AATGGCACGA CAAGTATACC GAAATTACAG AAACCGGTAA
|
Protein sequence | MEYTEKRFEQ DIEESMLTNG GYIKGDKSSY SAERGIDLSK LVEFIKTTQE KEWDRYVRIY GDEAENNLFK RFNDSVNAQG LLYSLRKGIA DRGVKLKLCY FKPESNLNHE LVKKYENNIL TVTRQFAYST ENKNTIDMVL SLNGIPIVAL ELKNQITGQS IENGKKQFMY DRNPREPCFQ FNKRFLAYFA VDLHEVAMTT KLAGPNTMFL PFNQGSGGAG QVGGAGNPEN KDGYATSYLW EQVLTKDSLM NIIHRFLHIN VEKKKVVKNG RETTKISQKL IFPRYHQLDV VTKLVNNVKQ HGSGCNYLIQ HSAGSGKSNS IAWIAYRLAS LHDEDNNSIF NSVIVVTDRK VLDSQLQDTI SGFDHTYGLV ETIGDKKTSQ DLKNAINDGK KIIVTTLQKF PVIYEDVENN KGKRFAVIVD EAHSSQTGTS AQKLKIALAD NKDALEEYAK IEGEAEEASE DFEDKLVSEL STHGRHQNLS FFAFTATPKQ KTLEMFGNRC TDGSFHPFHI YSMKQAIEEG FILDVLKNYM TYKTCFKIAK NIPDNPELPE SEAKRAIRRY ESLHPYNLQQ KTSVMVEYFR EITSKKINGQ AKAMIITPSR LHAVRYYKEF KKYIKNKGYN DLDVLIAFSG TVKDGDEEYT ESGMNVTKSG NKISEKQLPQ VFHGDEFNML IVAEKYQTGF DEPLLHTMFV DKKLNGVKAV QTLSRLNRTA PYKNDTFILD FVNNSEDILL SFAPYYKETA LDEEINVNLI YDTKALLRNF RIYNDDDVEK STKIYYKSGA QSNTAHGKIT SMFLPIIRVY SEISEEDQFK FKKAVRNFIK WYSYITQIER MFDKDLQKEY RFLQYLEKML PKNSAEKIDL EDKIKLEYYK LSKTFEGDIS LETKDDCAML TNPKTIDNVI GVTGNDELLD AIISKINEVY EGEFSDGDKV MVKTIYGKLL NDSEKLEQYT KDSFEIFNDS FFPELFDKAT RSCFLEQSNS FKKIFENRTF YNTIQQEMAR QVYRNYRNR
|
| |