Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1092 |
Symbol | |
ID | 7271009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 1128539 |
End bp | 1131436 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643569728 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_002466161 |
Protein GI | 219851729 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.20335 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.270177 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACTCG GCGGCGAACG GGGGTCGGTG CAGAATCCGT TCATCGACTA TGCAGAATCC AAAAAATGGG AATATGTCCC GAAGGACCGG GCCACAGCAA TACGGGGCGG CACCACCGGC ATCTTGTTCA AAGAGATCTT CATCGAACAG ATCTGCCGGC TCAACGACTC CTTCATGACC CGTGAGCTCG CCACCGAACT CATAAAACGG ATCGGGCGCA TCCCTCCCAC CATCGAAGGA AACCTCGTGG CATGGGAATA TCTCAAAGGA ATAAAAACAA TCTTTGTCCC TGCAGAGAAA CGGGAACGCA ATGTCCAGTT CATCGACACC AAGGATATCG AGAACAACAC CTTCCATGTC ACAGACGAGT TTTCGTTCAC CAACGGTTCT AAGACGATCC ACGAGGATGT CGTCTTCCTC GTCAACGGGA TCCCGGTCTT CTTTGTCGAG GCAAAGGCAG CCCACAAGAA GGAGGGGATC GCGGAAGCCC TCGACCAGAT CCGACGTTAC CACCGCGAAT GCCCCGAACT GCTCGCCATC CTCCAGATCT ACGCCCTCAC CCACATCATC CGGTATTATT ACAGTGCCAC CTGGAATACC TCAAAGAAGA CACTCTTCAA CTGGAAGGAT GAAGCCGGCG GGAATTTCGA GACACTGGTC AAGACCTTCT GTGACCGGAA ACGGTTCCTC ACCCTGCTCA GCGACGGGAT TCTTTTCACC AGACAGGATG AGGAGTTAAA GAAAGTCATT CTCCGCCAGC ACCAGATGCG GGCCGTCGAC AAGCTGCTCG GACGGGCACA GGATGCCTGG AAGAAACGCG GCCTTGTCTG GCACACGCAG GGATCGGGCA AGACCTACAC GATGATCGTA GCGGCACAGA AGATACTCGG TGAACCGGTC TTTGGGAACC CGACCGTGAT CATGCTCGTG GACCGGAACG AGCTCGAAAC CCAGCTCTTC GGTAACCTCA CCTCTGCCGG CATTGGAAAT GTCGAAGTCG CAGGGAGCAA AAAGGATCTG CGAGAACTCC TCGCAGCTGA TCACCGGGGC CTCATCGTCT CAATGATCCA TAAGTTTGAG GGGATGCCGG AAGAGATCAA TACCCGGGAC ACGATCTTCA TCCTCGTGGA CGAAGCCCAT CGCACCACCA CCGGCACACT CGGTAACTAC CTCATGGGTG CACTCCCGAA TGCCACCTAC ATCGGGTTCA CCGGCACCCC CATCGACAGG ACCGCATACG GACAGGGGAC CTTCATCACG TTCGGGCGAG ACGACCCACC ATACGGTTAC CTCGACAAGT ACAGTATCGC AGAGTCCATC AGTGACGGGA CCACCGTCCC GCTCCACTAC ACCCTCGCTC CCAACGATCT GCTCGTCGAC CGTCAGACCC TCGAACAGGA ATTCCTTGAC CTGGCCGAGA CGGAAGGTAT CAGCGATGTC ACGGAGCTCA ACAAAGTCCT TGAAAAAGCG GTCAACCTCC GAAATATGAT GAAGAGTCCG GAACGGGTGC CGAAAGTGGC GCAGTTCGTG GCAGATCACT TCCGGAACAA TGTGGAGCCC ATGGGATACA AGGCGTTTTT TGTCGCTGTC GACCGGGAGG CCTGTGCACT CTACAAAAAG GAACTCGACA AACATCTCCC CCTGCAGTAC TCGGAAGTCA TCTACAGTCC AAATCCCAAG GATGATGACA ATCTCCGAGC GTATTATCAT ACTGATGAGG ATGAAAAACG CATCCGTAAG GCGTTCCGGA GCCCGGAAAA AGACCCGAAG ATCCTCATTG TTACGGAAAA ACTCCTCACA GGGTTTGATG CCCCCGTTCT CTACTGTATG TATCTCGACA AACCGATGCG GGATCATGTC CTCCTGCAGG CAATTGCCCG CGTGAACAGG CCGTTTGAAG ATGAAGAAGG ACGCAAAAAA CCTTCTGGGT TCGTACTCGA TATTGTCGGG ATCTTCAATA ACCTCAAAAA AGCGCTCGCC TTTGATTCCA GTGATATCGA AGGTATCATC GATGATCTGG ACGTTCTTAA AAAACAGTTC ACCCGGCTGA TGGGACAGGA AGCACAACAG TATCTCGGGC TTGCCAGAGG AAAGAAACGC GACAAGGCAG TCGAAGCCGT GCTTGAATAT TTCCTGAACG AAGAAATCCG CCAAACATTC TACGCATTCT TCCACGAACT CTCGGATATC TACGAAATCC TCTCCCCCGA TGCGTTCCTG AGACCGTACC TCGACGATGT GGACAAACTA GCGAAAATGT ACCGCATGGT CCGGGAAAAT TTTGATCCCG GTATCTCAGT TGACCGGGAG TTCTCCCGCA AGGTGGCTCG ACTCGTACAG GACCATACCG TCAGCAGCGA GATCGGGAAC CCAGGCGGAA TCTATGAGAT TAACGACAAG GTTATCTGGT ATATCAATGA GCAACCGGAT TCAGACATCG AAAAGGTCTT CAATCTAACC AAAGGGATTT CTCATCTCGT ACAGAAACAA GCAGAAGAAT CTCCTTACCT CATCTCCATT GGTGAAAAGG CGGATGCGGT GATCCAACTC TACAAGGACC GCCAGAATAC TACGCAGGAA ACCCTCGCTG AACTCAAGAC GATCATTGAA GAGATCAACG CAGCTCGTCT CGAACAGGAG AAGCGTAATA TCCCCATGGC AGAATTCTCC ATCTTCTGGC TTCTCAACAA AGCTGGTGTT AGCGATCCGG AAACGAAAGC CCATGAAATG AAGAATATTC TGAACCACTA TCCCCACTGG AGAATCAGCG AACAACAGGC TCGCGATGTG AAACAGGAAT TGTATTCGAT AATTCTGCAT TCCGAAACCC GTGACATCAA AGAGATCAAA AAGATCATTG ACCAGATCAT GAAAGTTCTC AATCGGGTAG TTACATGA
|
Protein sequence | MTLGGERGSV QNPFIDYAES KKWEYVPKDR ATAIRGGTTG ILFKEIFIEQ ICRLNDSFMT RELATELIKR IGRIPPTIEG NLVAWEYLKG IKTIFVPAEK RERNVQFIDT KDIENNTFHV TDEFSFTNGS KTIHEDVVFL VNGIPVFFVE AKAAHKKEGI AEALDQIRRY HRECPELLAI LQIYALTHII RYYYSATWNT SKKTLFNWKD EAGGNFETLV KTFCDRKRFL TLLSDGILFT RQDEELKKVI LRQHQMRAVD KLLGRAQDAW KKRGLVWHTQ GSGKTYTMIV AAQKILGEPV FGNPTVIMLV DRNELETQLF GNLTSAGIGN VEVAGSKKDL RELLAADHRG LIVSMIHKFE GMPEEINTRD TIFILVDEAH RTTTGTLGNY LMGALPNATY IGFTGTPIDR TAYGQGTFIT FGRDDPPYGY LDKYSIAESI SDGTTVPLHY TLAPNDLLVD RQTLEQEFLD LAETEGISDV TELNKVLEKA VNLRNMMKSP ERVPKVAQFV ADHFRNNVEP MGYKAFFVAV DREACALYKK ELDKHLPLQY SEVIYSPNPK DDDNLRAYYH TDEDEKRIRK AFRSPEKDPK ILIVTEKLLT GFDAPVLYCM YLDKPMRDHV LLQAIARVNR PFEDEEGRKK PSGFVLDIVG IFNNLKKALA FDSSDIEGII DDLDVLKKQF TRLMGQEAQQ YLGLARGKKR DKAVEAVLEY FLNEEIRQTF YAFFHELSDI YEILSPDAFL RPYLDDVDKL AKMYRMVREN FDPGISVDRE FSRKVARLVQ DHTVSSEIGN PGGIYEINDK VIWYINEQPD SDIEKVFNLT KGISHLVQKQ AEESPYLISI GEKADAVIQL YKDRQNTTQE TLAELKTIIE EINAARLEQE KRNIPMAEFS IFWLLNKAGV SDPETKAHEM KNILNHYPHW RISEQQARDV KQELYSIILH SETRDIKEIK KIIDQIMKVL NRVVT
|
| |