Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Memar_1094 |
Symbol | |
ID | 4847985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanoculleus marisnigri JR1 |
Kingdom | Archaea |
Replicon accession | NC_009051 |
Strand | - |
Start bp | 1085337 |
End bp | 1088174 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640115783 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001047009 |
Protein GI | 126179044 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.5514 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTTAA CCGAATTACA GACGCGGAAG AGTAAAATCG ACGTCTACCT TGCCGAGCAG GGCTGGGACG TTATGAACCG CGCTTCTGTC ATTCCCGAAG TCGATACCAA GCAATCCGAC TTTCTCGCAC GCTCCTATAA GACCGTCTCT GAGACACTGA AGAATGATCT GGAGAGCAAG TACGTCGACT ACCTCCTGCT CGATAGCCTG GGCGCGCCGC TCGCCATTAT CGAAGCCAAA CGCACCTCGA AAGATCCGCT CATCGGACAG AAGCAGGCGG AGCAGTACGC CGATGACATC AAGCGGCAGA CCGGGAGAGA CGTCTTCATC TTCCTCTCCA ACGGCTATGA GATCTGGTTC TGGGATCGGG AGCGCTACCC ACTCCGGCTG CTGAAGGGCT TCTACGCGCA GAAAGACCTG GAGCGGCTCC GTTTCCAGAT TCAGAAGATC GATCCCACCC GATCGATCGA GATCAATACC CGCATTGTCG ACCGCTCAAA GAGCATCGAG AACGTCAAGC GCGTGCTGGA GCACATCCGT AAAGGTCACC GCAAAGGGCT GATCGTCATG GCGACCGGGA CCGGCAAGAC CCGCGTGGCG ATGGCAATCA TCGATGCCCT CCTCCAGGAG AACCGGGCGC AGAAAGTGCT CTTCCTCGCC GACCGAAAGG CGCTCCGCGA TCAGGCGTGG AATAAAGGCT TCCTGGAGTT CTTCCCCCAT GAGGCCAAGG ACAAGATCCT GCACGGGATC TACAATAAGG AAAAACGGCT GTACGTCTCT ACCATCCAGA CATTTCAGGA GATATACACC CAGAAGGACA GGCACGGGCA GAACCTCATC TCCCCCGGAG AGTTCGACCT GATCTTCTCC GACGAGGCGC ACCGGAGCAT CTACAACAAG TGGCGGGATG TCTTCACCTA CCTGGACGCC ATGCAGATCG GGCTGACGGC AACACCTGCC GAACTGGTCG ACCGGGACAC CTTCCGGTTC TTCCACTGTA ACGACAATAT GCCGACCGCG CTTTACTCGT ATGACGAGGC GGTGAAGGAC GGCGTCCTGG TCGACTTCCG CAAAAGCATC ATCGGAGCGC AGACACACTT CCAGATCGAA GGTCTCCACC CTTCCGACCT GACGGAGAGC GAGCGCAACC GGCTGATCGA GCAGGGGATC GACCCGTACG AGATCAACTT TGAAGGGACA GAGCTGGAGA AGAAGGTCGC CGTCAAAGGT ACGTCCGAAG CGATCGTGCG GGAGTTCATG GAGGGATGCC AGATGGATCA GGCCGGGACA CTTCCGGCAA AGTCGATCTT CTTCGCCATC TCAAAGAAGC ACGCTCGAAG GCTGCATGAG GCGTTCGACG ACCTGTATCC CGAATATAAA GGACGGCTGG CGCGGATCAT CGTCTCGGAC GACCCCCGCG CCGAGGCCCT CATCCACGAC TTCGAACACG AGTCCTTCCC CCGCGTGGCG ATCTCGGTGG ATATGCTCGA TACCGGCATC GATGTGCCGG AGGTCTGCAA CCTGGTCTTC GCCAAACCCG TCTTCTCAAA GATCAAGTTC TGGCAGATGC TGGGGCGCGG GACGCGGTCG GATGGCGCGT GCAAACACCG GGAATGGCTG CCGGACGGGC ATAAAGAGTA CTTCAAGGTC TTCGACTTCT GGAATAACTT CGAGTACTGG AATATGAATC CCGAAGGGGT GAAGAACGAG CCGACCGAAG CCATCACGAG CCGGATCTTC TTCCTTCGGC TGAAGCAACT GGAGCGGCTC CTGGAGCAGG GCGACGACGA GCGGGCGGCG ATCGTCCGGC AGAGGCTCGA AGAGGATATC CGCTCGCTTC CGATGGACTC GGTAACCGTC CGGGAGCACG AGCGGGAGAT CGCCAAAGCC CTCTCCCCGA AGCTCTGGGA TAATGTCGGG CTCGACCCGC TGGAGAACCT GAAGACCACG ATCATGCCCC TCATGCGCTT CCAGACCGGC GTGAACCTCA AAGAGGCATC CTTCACCGTG AAAGCAGAGC GGCTGGGGCT GGCGGTGCTG GAGGGCAACG AGAAGGAGAT CGAGCGATTG GCCCCGGAGA TCGGGGAGAT GGTCGATCAC CTCCCCCGCA CGCTCAATAT TGTGAAGGAA AAAGAGGAGC GGCTGGACGA GGTACTGACT CATGCCTTCT GGAAGGGTCT GTCATTCGAG GATGCGATCG GGCTGGTGGA GGAGATCGCC CCGCTCATGA AGTACATGTC GAAGGAGGCG CATGAGCCGA TCGTCATCGA CATGGGGGAT ATCATCGAGC AGCGCACGCT CTGGACGCTC AAGGAGGATG CTCCGGAGTA CGTGGTGGCA TTCAGAGAGA AGGTCGAGAA GCGGGTGACG GAGCTGGCCG ACCATAACCC GGTTATCCAG AAGATTCTCC GGGATGACCC GATAACCGAA GCCGACCTTC ACGATCTCGA AGAGGCGCTG GCGGAGGCGG GAGTGAACGT CACCGAAGAG ATGTTGCAGA TCTCGCCGCG TCACCCGTAT GGCTCGCTGG TCGAGTTTAT CCGCTCGCTC TTCGGTCTCT ATGAAGCGCC TGATCCTAAG GAGAAGATAG CAGAGGCGTT TCAGACCTAC ATGATCGAGA GCAATAAGCA CTACACCGCC GATCAACTTC ACTTCATCCG GACGATTCAG ACCGTTTTTA TGCGTAAGAG GCGTATAGAA ATGGACGATC TATTCCTCGC TCCATTCACG AATTTTGGGA GTACTGCACC GATGCCGATG TTTGACGAGG GCGATCTGAA GGCGTTTATC GGCATCTGTC AGGGGCTGGA GCGGGAATTG TTTGCGGCGG GGGCGTAA
|
Protein sequence | MDLTELQTRK SKIDVYLAEQ GWDVMNRASV IPEVDTKQSD FLARSYKTVS ETLKNDLESK YVDYLLLDSL GAPLAIIEAK RTSKDPLIGQ KQAEQYADDI KRQTGRDVFI FLSNGYEIWF WDRERYPLRL LKGFYAQKDL ERLRFQIQKI DPTRSIEINT RIVDRSKSIE NVKRVLEHIR KGHRKGLIVM ATGTGKTRVA MAIIDALLQE NRAQKVLFLA DRKALRDQAW NKGFLEFFPH EAKDKILHGI YNKEKRLYVS TIQTFQEIYT QKDRHGQNLI SPGEFDLIFS DEAHRSIYNK WRDVFTYLDA MQIGLTATPA ELVDRDTFRF FHCNDNMPTA LYSYDEAVKD GVLVDFRKSI IGAQTHFQIE GLHPSDLTES ERNRLIEQGI DPYEINFEGT ELEKKVAVKG TSEAIVREFM EGCQMDQAGT LPAKSIFFAI SKKHARRLHE AFDDLYPEYK GRLARIIVSD DPRAEALIHD FEHESFPRVA ISVDMLDTGI DVPEVCNLVF AKPVFSKIKF WQMLGRGTRS DGACKHREWL PDGHKEYFKV FDFWNNFEYW NMNPEGVKNE PTEAITSRIF FLRLKQLERL LEQGDDERAA IVRQRLEEDI RSLPMDSVTV REHEREIAKA LSPKLWDNVG LDPLENLKTT IMPLMRFQTG VNLKEASFTV KAERLGLAVL EGNEKEIERL APEIGEMVDH LPRTLNIVKE KEERLDEVLT HAFWKGLSFE DAIGLVEEIA PLMKYMSKEA HEPIVIDMGD IIEQRTLWTL KEDAPEYVVA FREKVEKRVT ELADHNPVIQ KILRDDPITE ADLHDLEEAL AEAGVNVTEE MLQISPRHPY GSLVEFIRSL FGLYEAPDPK EKIAEAFQTY MIESNKHYTA DQLHFIRTIQ TVFMRKRRIE MDDLFLAPFT NFGSTAPMPM FDEGDLKAFI GICQGLEREL FAAGA
|
| |