Gene Information |
Locus tag | Memar_1159 |
Symbol | |
ID | 4847183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanoculleus marisnigri JR1 |
Kingdom | Archaea |
Replicon accession | NC_009051 |
Strand | + |
Start bp | 1143895 |
End bp | 1147005 |
Gene Length | 3111 bp |
Protein Length | 1036 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640115848 |
Product | hypothetical protein |
Protein accession | YP_001047072 |
Protein GI | 126179107 |
COG category | [V] Defense mechanisms |
COG ID | [COG2810] Predicted type IV restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCTC CCCCCGAAAT AGTCGATCTG GTCAAGCGGT TCGACCAGTA TTATGCAAAG TATACGTCAC AGTCCTACAA CGAGTTCCAG GTCAGAAAAG AGTTCATCGA TCCTTTCTTC GAGGCGCTCG GCTGGGACGT GAACAACCGC TCGGGGCTCG ATGAGCGCTA CAAAGACGTG ATCCACGAGG ACACCGTCAG GGTGGAGAAG TCCACCAAAG CGCCGGACTA CTCGTTCCGG ATCGGCGGCA TGCGCAAGTT CTTCGTCGAG GCGAAGAAAC CCGCCGTGAA TCTCAAGGAG AACCCGGAGC CAGCCTACCA GGTGCGTCGG TATGCCTGGA GCGCGAAACT GCCGGTCAGC CTGCTCACGG ACTTCGAGGA GTTCATCATC TACGACTGTA CGGCGAAACC CTCACCGAAG GATAAGGCGT CGGTGAAGAG GATCGGTTAC TACCGCTACA CCGACTATGT CGAGAAATGG GACGAGATTG CGGCGATCTT CTCCAAGCAG GGCATCCTGA CCGGGGGGTA TGACGACTAC GTCGATAACC TGAAACAGAA GAAGGGCGGA AAGGGCACCG CCGGCATCGA CGATGCTTTC CTCGCGGAGA TCGAGGGGTG GCGTGATATT CTCGCGAAGA ACATCGCGCT CCGGAACAAG GATCTCTCGG TCAGGGAATT GAACGCAGCG GTCCAGAAGA CGATCGACCG GATCATCTTC CTCAGAATCT GCGAAGACCG CGGGATCGAG GAGTACGGGC AGCTGAAGAG AATCGCCGCC GGGAAAGATG TCTATGAGCA GTTGAAATTG CTCTTCCGGT ATGCAGACGA CCGGTACAAC TCCGGGCTGT TTCACTTCTC CGGGGAAGCC GGCCGGGGAG AAGAGCCGGA CAACCTGACG CTCTCCCTTG CCATCGACGA CAAGGTGCTC AAGCAGATCA TCAGCCACCT CTACTACCCG GACTCGCCAT ATGAGTTCTC GGTTTTCCCG GCGGACATCC TCGGCCAGGT CTACGAGCAG TTCCTCGGGA AGGTCATCCG CCTGACGGCC GGGCACCAGG CGAAGGTCGA GGAGAAGCCG GAGGTGAAGA AGGCGGGCGG GGTCTTCTAC ACGCCGACCT ACATCGTAGA GTATATCGTG AAGCAGACCG TCGGGAACCT GGTCGAGGGG AAGGATCCGA AGGCGGTCGC AGGCCTCCAC GTCCTCGACC CGGCCTGCGG TTCGGGGTCG TTCCTCCTCG GCGCCTACCA GTACCTCCTC GACTGGCACC TTGCGTGGTA CATGGACAAC CTGGTCCCGC TGCTCGCGAG CGGGAAGAAG ATGACCGATC CGGCGGTGCT GGAACTTCTG CCGGCCAGGC CTGCGCCGGT GAAGAACGGC CGCGGTAAGA AGAAGCGGGG GGACGAGTAC ATCCTTCCGG TCTACCAGGT GACCGATACC GACTGGCGGC TGACGACGGA GGAGAAGAAG CGGATTCTCT TAAACAACAT CTACGGTGTG GACATCGACC CGCAGGCAGT GGAGGTGACG AAACTCTCGC TCCTCCTGAA GGTGCTCGAG GGCGAGAAGT CCGAGCGGAT AGGCAAGCAG CTGACGATCA CCGAGGAGCG GGTGCTTCCC TCCCTCCATG AGAATATCAA GTGCGGGAAC TCGCTCGTTG GTCCCGACAT CTACAATGAT GTGCAGATGA CCCTCGACGA CGAGGATGCG ATCTTCCGGA TCAATGCGTT CGACTGGAAC CAGGCCTTCC CGTCTATTCT GCAGGCGGGT GGGTTTGATG CGGTGATCGG GAACCCGCCG 
TATGTGCGCC AGGAGATCCT CGGGCGGGAG TTCAAGGACT ACGCGAAAAA GCATTACGCG GTCTACCACG GCGTTGCCGA CCTCTACACC TACTTTATCG AGAAGGGGGT TTCGCTCCTC CGCCCGGGCG GGCAGTTCGC CTACATCGTG GCGAACAAGT GGATGCGGGC GAACTACGGG AAACCTCTCC GGCAGTGGCT GAAAGAGCAG CGCATCGAGG AGATCATGGA TTTCGGCGAT CTTCCCGTGT TTGAGAGCGC AACAACCTAT CCCTGCATCC TCCGGATCGG GGCCGGATCT CCCTGCGAGT GGTTCGATGC TGTGCAGGTG GAAAACCTTG ATTATCCGGA CCTCACCGAC TACGTGAAGG CTCATGCGTA TCCGGTAAAC CTGACGTATC TCGATGAGGA GGGGTGGTCT CTGGCCGACG AGAAGACGCA GGCGCTTCTT CTGAAGATCA GGGCCGCCGG GGTTCCTCTG GGGACGTATG TGGATGGGAA GATCTATAGG GGTATCCTGA CCGGACTGAA CGAGGCTTTC GTGATCGACG AGGCGACGAG AGGGCGGTTG ATCGCCGAAG ACCCCCGGAG CGCGGAGGTT ATCAAGCCGT TTCTCGCTGG CCGAGATGTC AAGCGATATC AACCATTAGA GTTGAGCAAA TATCTCATCT TTACGCGGCG TGGAATTAAC ATCGATAGTT ACCCCGCGAT TGGAAGGTAT CTGAAACAGT ATAAAGAAAA ACTCATACCC CGGCCAAGAG ATTGGCCGTC AGATAAACCA TGGCTGGGTA GAAAAGCAGG TTCCTATCAA TGGTATGAGG TTCAGGACTC AATTGATTAC TATCTGGAAT TTGAAAAGCC AAAGATCATC GTTCCCGCAA TAGTTCAGAG GGGGTCATAC ACGTATGATG AAAGGTCAAT CTACTCCAAC GATAAGACAT CCATAATCCC ATGCACTGAC ATCTATCTAC TTGGCATACT CAATTCAAAA GTTGCTGACT TTGTTGTCCA TCTCATCTCC TCGACGAAGC AAGGAGGCTA TTACGAGTAT AAACCGATGT ATATCTCTCA GATCCCAATC CACCCCATCG ACCCCTCCGA CCCCGCCGAC GTCGCCCGCC ACGACCGCAT GGTCGCCCTC GTCGAGAAGA TGCTCGACCT GAACAGGCGC CTCGCGGCGG CGAAGGCCCC GCATGAGAAG GAGGTGCTCG CCGGGATGAT TGACGCGACC GACCGGGAGA TCGATAGGCT GGTGTACGAG TTGTACGGGC TGACTGAGGA GGAGATCGCG GTTGTGGAGG GGGTTGTGTA G
|
Protein sequence | MPAPPEIVDL VKRFDQYYAK YTSQSYNEFQ VRKEFIDPFF EALGWDVNNR SGLDERYKDV IHEDTVRVEK STKAPDYSFR IGGMRKFFVE AKKPAVNLKE NPEPAYQVRR YAWSAKLPVS LLTDFEEFII YDCTAKPSPK DKASVKRIGY YRYTDYVEKW DEIAAIFSKQ GILTGGYDDY VDNLKQKKGG KGTAGIDDAF LAEIEGWRDI LAKNIALRNK DLSVRELNAA VQKTIDRIIF LRICEDRGIE EYGQLKRIAA GKDVYEQLKL LFRYADDRYN SGLFHFSGEA GRGEEPDNLT LSLAIDDKVL KQIISHLYYP DSPYEFSVFP ADILGQVYEQ FLGKVIRLTA GHQAKVEEKP EVKKAGGVFY TPTYIVEYIV KQTVGNLVEG KDPKAVAGLH VLDPACGSGS FLLGAYQYLL DWHLAWYMDN LVPLLASGKK MTDPAVLELL PARPAPVKNG RGKKKRGDEY ILPVYQVTDT DWRLTTEEKK RILLNNIYGV DIDPQAVEVT KLSLLLKVLE GEKSERIGKQ LTITEERVLP SLHENIKCGN SLVGPDIYND VQMTLDDEDA IFRINAFDWN QAFPSILQAG GFDAVIGNPP YVRQEILGRE FKDYAKKHYA VYHGVADLYT YFIEKGVSLL RPGGQFAYIV ANKWMRANYG KPLRQWLKEQ RIEEIMDFGD LPVFESATTY PCILRIGAGS PCEWFDAVQV ENLDYPDLTD YVKAHAYPVN LTYLDEEGWS LADEKTQALL LKIRAAGVPL GTYVDGKIYR GILTGLNEAF VIDEATRGRL IAEDPRSAEV IKPFLAGRDV KRYQPLELSK YLIFTRRGIN IDSYPAIGRY LKQYKEKLIP RPRDWPSDKP WLGRKAGSYQ WYEVQDSIDY YLEFEKPKII VPAIVQRGSY TYDERSIYSN DKTSIIPCTD IYLLGILNSK VADFVVHLIS STKQGGYYEY KPMYISQIPI HPIDPSDPAD VARHDRMVAL VEKMLDLNRR LAAAKAPHEK EVLAGMIDAT DREIDRLVYE LYGLTEEEIA VVEGVV
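The coordinates and lengths listed under Gene Information can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are inclusive 1-based coordinates and that Gene Length includes the terminal stop codon (the listed gene sequence ends in TAG). The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 1143895
END_BP = 1147005
GENE_LENGTH_BP = 3111
PROTEIN_LENGTH_AA = 1036


def span_length(start, end):
    """Length of a coordinate range, assuming both ends are inclusive."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one terminal stop codon.

    Assumes the CDS length counts the stop codon and is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the values reported in the record.
coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print("coordinates match gene length:", coords_ok)
print("gene length matches protein length:", protein_ok)
```

Under these assumptions both checks agree with the record: the span covers 3111 bp, and 3111 bp corresponds to 1037 codons, one of which is the stop codon.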