Gene Memar_1159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMemar_1159 
Symbol 
ID4847183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanoculleus marisnigri JR1 
KingdomArchaea 
Replicon accessionNC_009051 
Strand
Start bp1143895 
End bp1147005 
Gene Length3111 bp 
Protein Length1036 aa 
Translation table11 
GC content57% 
IMG OID640115848 
Producthypothetical protein 
Protein accessionYP_001047072 
Protein GI126179107 
COG category[V] Defense mechanisms 
COG ID[COG2810] Predicted type IV restriction endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGCTC CCCCCGAAAT AGTCGATCTG GTCAAGCGGT TCGACCAGTA TTATGCAAAG 
TATACGTCAC AGTCCTACAA CGAGTTCCAG GTCAGAAAAG AGTTCATCGA TCCTTTCTTC
GAGGCGCTCG GCTGGGACGT GAACAACCGC TCGGGGCTCG ATGAGCGCTA CAAAGACGTG
ATCCACGAGG ACACCGTCAG GGTGGAGAAG TCCACCAAAG CGCCGGACTA CTCGTTCCGG
ATCGGCGGCA TGCGCAAGTT CTTCGTCGAG GCGAAGAAAC CCGCCGTGAA TCTCAAGGAG
AACCCGGAGC CAGCCTACCA GGTGCGTCGG TATGCCTGGA GCGCGAAACT GCCGGTCAGC
CTGCTCACGG ACTTCGAGGA GTTCATCATC TACGACTGTA CGGCGAAACC CTCACCGAAG
GATAAGGCGT CGGTGAAGAG GATCGGTTAC TACCGCTACA CCGACTATGT CGAGAAATGG
GACGAGATTG CGGCGATCTT CTCCAAGCAG GGCATCCTGA CCGGGGGGTA TGACGACTAC
GTCGATAACC TGAAACAGAA GAAGGGCGGA AAGGGCACCG CCGGCATCGA CGATGCTTTC
CTCGCGGAGA TCGAGGGGTG GCGTGATATT CTCGCGAAGA ACATCGCGCT CCGGAACAAG
GATCTCTCGG TCAGGGAATT GAACGCAGCG GTCCAGAAGA CGATCGACCG GATCATCTTC
CTCAGAATCT GCGAAGACCG CGGGATCGAG GAGTACGGGC AGCTGAAGAG AATCGCCGCC
GGGAAAGATG TCTATGAGCA GTTGAAATTG CTCTTCCGGT ATGCAGACGA CCGGTACAAC
TCCGGGCTGT TTCACTTCTC CGGGGAAGCC GGCCGGGGAG AAGAGCCGGA CAACCTGACG
CTCTCCCTTG CCATCGACGA CAAGGTGCTC AAGCAGATCA TCAGCCACCT CTACTACCCG
GACTCGCCAT ATGAGTTCTC GGTTTTCCCG GCGGACATCC TCGGCCAGGT CTACGAGCAG
TTCCTCGGGA AGGTCATCCG CCTGACGGCC GGGCACCAGG CGAAGGTCGA GGAGAAGCCG
GAGGTGAAGA AGGCGGGCGG GGTCTTCTAC ACGCCGACCT ACATCGTAGA GTATATCGTG
AAGCAGACCG TCGGGAACCT GGTCGAGGGG AAGGATCCGA AGGCGGTCGC AGGCCTCCAC
GTCCTCGACC CGGCCTGCGG TTCGGGGTCG TTCCTCCTCG GCGCCTACCA GTACCTCCTC
GACTGGCACC TTGCGTGGTA CATGGACAAC CTGGTCCCGC TGCTCGCGAG CGGGAAGAAG
ATGACCGATC CGGCGGTGCT GGAACTTCTG CCGGCCAGGC CTGCGCCGGT GAAGAACGGC
CGCGGTAAGA AGAAGCGGGG GGACGAGTAC ATCCTTCCGG TCTACCAGGT GACCGATACC
GACTGGCGGC TGACGACGGA GGAGAAGAAG CGGATTCTCT TAAACAACAT CTACGGTGTG
GACATCGACC CGCAGGCAGT GGAGGTGACG AAACTCTCGC TCCTCCTGAA GGTGCTCGAG
GGCGAGAAGT CCGAGCGGAT AGGCAAGCAG CTGACGATCA CCGAGGAGCG GGTGCTTCCC
TCCCTCCATG AGAATATCAA GTGCGGGAAC TCGCTCGTTG GTCCCGACAT CTACAATGAT
GTGCAGATGA CCCTCGACGA CGAGGATGCG ATCTTCCGGA TCAATGCGTT CGACTGGAAC
CAGGCCTTCC CGTCTATTCT GCAGGCGGGT GGGTTTGATG CGGTGATCGG GAACCCGCCG
TATGTGCGCC AGGAGATCCT CGGGCGGGAG TTCAAGGACT ACGCGAAAAA GCATTACGCG
GTCTACCACG GCGTTGCCGA CCTCTACACC TACTTTATCG AGAAGGGGGT TTCGCTCCTC
CGCCCGGGCG GGCAGTTCGC CTACATCGTG GCGAACAAGT GGATGCGGGC GAACTACGGG
AAACCTCTCC GGCAGTGGCT GAAAGAGCAG CGCATCGAGG AGATCATGGA TTTCGGCGAT
CTTCCCGTGT TTGAGAGCGC AACAACCTAT CCCTGCATCC TCCGGATCGG GGCCGGATCT
CCCTGCGAGT GGTTCGATGC TGTGCAGGTG GAAAACCTTG ATTATCCGGA CCTCACCGAC
TACGTGAAGG CTCATGCGTA TCCGGTAAAC CTGACGTATC TCGATGAGGA GGGGTGGTCT
CTGGCCGACG AGAAGACGCA GGCGCTTCTT CTGAAGATCA GGGCCGCCGG GGTTCCTCTG
GGGACGTATG TGGATGGGAA GATCTATAGG GGTATCCTGA CCGGACTGAA CGAGGCTTTC
GTGATCGACG AGGCGACGAG AGGGCGGTTG ATCGCCGAAG ACCCCCGGAG CGCGGAGGTT
ATCAAGCCGT TTCTCGCTGG CCGAGATGTC AAGCGATATC AACCATTAGA GTTGAGCAAA
TATCTCATCT TTACGCGGCG TGGAATTAAC ATCGATAGTT ACCCCGCGAT TGGAAGGTAT
CTGAAACAGT ATAAAGAAAA ACTCATACCC CGGCCAAGAG ATTGGCCGTC AGATAAACCA
TGGCTGGGTA GAAAAGCAGG TTCCTATCAA TGGTATGAGG TTCAGGACTC AATTGATTAC
TATCTGGAAT TTGAAAAGCC AAAGATCATC GTTCCCGCAA TAGTTCAGAG GGGGTCATAC
ACGTATGATG AAAGGTCAAT CTACTCCAAC GATAAGACAT CCATAATCCC ATGCACTGAC
ATCTATCTAC TTGGCATACT CAATTCAAAA GTTGCTGACT TTGTTGTCCA TCTCATCTCC
TCGACGAAGC AAGGAGGCTA TTACGAGTAT AAACCGATGT ATATCTCTCA GATCCCAATC
CACCCCATCG ACCCCTCCGA CCCCGCCGAC GTCGCCCGCC ACGACCGCAT GGTCGCCCTC
GTCGAGAAGA TGCTCGACCT GAACAGGCGC CTCGCGGCGG CGAAGGCCCC GCATGAGAAG
GAGGTGCTCG CCGGGATGAT TGACGCGACC GACCGGGAGA TCGATAGGCT GGTGTACGAG
TTGTACGGGC TGACTGAGGA GGAGATCGCG GTTGTGGAGG GGGTTGTGTA G
 
Protein sequence
MPAPPEIVDL VKRFDQYYAK YTSQSYNEFQ VRKEFIDPFF EALGWDVNNR SGLDERYKDV 
IHEDTVRVEK STKAPDYSFR IGGMRKFFVE AKKPAVNLKE NPEPAYQVRR YAWSAKLPVS
LLTDFEEFII YDCTAKPSPK DKASVKRIGY YRYTDYVEKW DEIAAIFSKQ GILTGGYDDY
VDNLKQKKGG KGTAGIDDAF LAEIEGWRDI LAKNIALRNK DLSVRELNAA VQKTIDRIIF
LRICEDRGIE EYGQLKRIAA GKDVYEQLKL LFRYADDRYN SGLFHFSGEA GRGEEPDNLT
LSLAIDDKVL KQIISHLYYP DSPYEFSVFP ADILGQVYEQ FLGKVIRLTA GHQAKVEEKP
EVKKAGGVFY TPTYIVEYIV KQTVGNLVEG KDPKAVAGLH VLDPACGSGS FLLGAYQYLL
DWHLAWYMDN LVPLLASGKK MTDPAVLELL PARPAPVKNG RGKKKRGDEY ILPVYQVTDT
DWRLTTEEKK RILLNNIYGV DIDPQAVEVT KLSLLLKVLE GEKSERIGKQ LTITEERVLP
SLHENIKCGN SLVGPDIYND VQMTLDDEDA IFRINAFDWN QAFPSILQAG GFDAVIGNPP
YVRQEILGRE FKDYAKKHYA VYHGVADLYT YFIEKGVSLL RPGGQFAYIV ANKWMRANYG
KPLRQWLKEQ RIEEIMDFGD LPVFESATTY PCILRIGAGS PCEWFDAVQV ENLDYPDLTD
YVKAHAYPVN LTYLDEEGWS LADEKTQALL LKIRAAGVPL GTYVDGKIYR GILTGLNEAF
VIDEATRGRL IAEDPRSAEV IKPFLAGRDV KRYQPLELSK YLIFTRRGIN IDSYPAIGRY
LKQYKEKLIP RPRDWPSDKP WLGRKAGSYQ WYEVQDSIDY YLEFEKPKII VPAIVQRGSY
TYDERSIYSN DKTSIIPCTD IYLLGILNSK VADFVVHLIS STKQGGYYEY KPMYISQIPI
HPIDPSDPAD VARHDRMVAL VEKMLDLNRR LAAAKAPHEK EVLAGMIDAT DREIDRLVYE
LYGLTEEEIA VVEGVV