Gene Mpal_1092 details

Gene Information

Locus tag: Mpal_1092
Symbol:
ID: 7271009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1128539
End bp: 1131436
Gene Length: 2898 bp
Protein Length: 965 aa
Translation table: 11
GC content: 52%
IMG OID: 643569728
Product: type I site-specific deoxyribonuclease, HsdR family
Protein accession: YP_002466161
Protein GI: 219851729
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
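
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes a final stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1128539
end_bp = 1131436
gene_length_bp = 2898
protein_length_aa = 965

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the CDS is made of whole codons and the last codon is a stop
# codon, which is not part of the protein.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length)     # 2898
print(computed_protein_length)  # 965
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.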


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.20335
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.270177
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACTCG GCGGCGAACG GGGGTCGGTG CAGAATCCGT TCATCGACTA TGCAGAATCC 
AAAAAATGGG AATATGTCCC GAAGGACCGG GCCACAGCAA TACGGGGCGG CACCACCGGC
ATCTTGTTCA AAGAGATCTT CATCGAACAG ATCTGCCGGC TCAACGACTC CTTCATGACC
CGTGAGCTCG CCACCGAACT CATAAAACGG ATCGGGCGCA TCCCTCCCAC CATCGAAGGA
AACCTCGTGG CATGGGAATA TCTCAAAGGA ATAAAAACAA TCTTTGTCCC TGCAGAGAAA
CGGGAACGCA ATGTCCAGTT CATCGACACC AAGGATATCG AGAACAACAC CTTCCATGTC
ACAGACGAGT TTTCGTTCAC CAACGGTTCT AAGACGATCC ACGAGGATGT CGTCTTCCTC
GTCAACGGGA TCCCGGTCTT CTTTGTCGAG GCAAAGGCAG CCCACAAGAA GGAGGGGATC
GCGGAAGCCC TCGACCAGAT CCGACGTTAC CACCGCGAAT GCCCCGAACT GCTCGCCATC
CTCCAGATCT ACGCCCTCAC CCACATCATC CGGTATTATT ACAGTGCCAC CTGGAATACC
TCAAAGAAGA CACTCTTCAA CTGGAAGGAT GAAGCCGGCG GGAATTTCGA GACACTGGTC
AAGACCTTCT GTGACCGGAA ACGGTTCCTC ACCCTGCTCA GCGACGGGAT TCTTTTCACC
AGACAGGATG AGGAGTTAAA GAAAGTCATT CTCCGCCAGC ACCAGATGCG GGCCGTCGAC
AAGCTGCTCG GACGGGCACA GGATGCCTGG AAGAAACGCG GCCTTGTCTG GCACACGCAG
GGATCGGGCA AGACCTACAC GATGATCGTA GCGGCACAGA AGATACTCGG TGAACCGGTC
TTTGGGAACC CGACCGTGAT CATGCTCGTG GACCGGAACG AGCTCGAAAC CCAGCTCTTC
GGTAACCTCA CCTCTGCCGG CATTGGAAAT GTCGAAGTCG CAGGGAGCAA AAAGGATCTG
CGAGAACTCC TCGCAGCTGA TCACCGGGGC CTCATCGTCT CAATGATCCA TAAGTTTGAG
GGGATGCCGG AAGAGATCAA TACCCGGGAC ACGATCTTCA TCCTCGTGGA CGAAGCCCAT
CGCACCACCA CCGGCACACT CGGTAACTAC CTCATGGGTG CACTCCCGAA TGCCACCTAC
ATCGGGTTCA CCGGCACCCC CATCGACAGG ACCGCATACG GACAGGGGAC CTTCATCACG
TTCGGGCGAG ACGACCCACC ATACGGTTAC CTCGACAAGT ACAGTATCGC AGAGTCCATC
AGTGACGGGA CCACCGTCCC GCTCCACTAC ACCCTCGCTC CCAACGATCT GCTCGTCGAC
CGTCAGACCC TCGAACAGGA ATTCCTTGAC CTGGCCGAGA CGGAAGGTAT CAGCGATGTC
ACGGAGCTCA ACAAAGTCCT TGAAAAAGCG GTCAACCTCC GAAATATGAT GAAGAGTCCG
GAACGGGTGC CGAAAGTGGC GCAGTTCGTG GCAGATCACT TCCGGAACAA TGTGGAGCCC
ATGGGATACA AGGCGTTTTT TGTCGCTGTC GACCGGGAGG CCTGTGCACT CTACAAAAAG
GAACTCGACA AACATCTCCC CCTGCAGTAC TCGGAAGTCA TCTACAGTCC AAATCCCAAG
GATGATGACA ATCTCCGAGC GTATTATCAT ACTGATGAGG ATGAAAAACG CATCCGTAAG
GCGTTCCGGA GCCCGGAAAA AGACCCGAAG ATCCTCATTG TTACGGAAAA ACTCCTCACA
GGGTTTGATG CCCCCGTTCT CTACTGTATG TATCTCGACA AACCGATGCG GGATCATGTC
CTCCTGCAGG CAATTGCCCG CGTGAACAGG CCGTTTGAAG ATGAAGAAGG ACGCAAAAAA
CCTTCTGGGT TCGTACTCGA TATTGTCGGG ATCTTCAATA ACCTCAAAAA AGCGCTCGCC
TTTGATTCCA GTGATATCGA AGGTATCATC GATGATCTGG ACGTTCTTAA AAAACAGTTC
ACCCGGCTGA TGGGACAGGA AGCACAACAG TATCTCGGGC TTGCCAGAGG AAAGAAACGC
GACAAGGCAG TCGAAGCCGT GCTTGAATAT TTCCTGAACG AAGAAATCCG CCAAACATTC
TACGCATTCT TCCACGAACT CTCGGATATC TACGAAATCC TCTCCCCCGA TGCGTTCCTG
AGACCGTACC TCGACGATGT GGACAAACTA GCGAAAATGT ACCGCATGGT CCGGGAAAAT
TTTGATCCCG GTATCTCAGT TGACCGGGAG TTCTCCCGCA AGGTGGCTCG ACTCGTACAG
GACCATACCG TCAGCAGCGA GATCGGGAAC CCAGGCGGAA TCTATGAGAT TAACGACAAG
GTTATCTGGT ATATCAATGA GCAACCGGAT TCAGACATCG AAAAGGTCTT CAATCTAACC
AAAGGGATTT CTCATCTCGT ACAGAAACAA GCAGAAGAAT CTCCTTACCT CATCTCCATT
GGTGAAAAGG CGGATGCGGT GATCCAACTC TACAAGGACC GCCAGAATAC TACGCAGGAA
ACCCTCGCTG AACTCAAGAC GATCATTGAA GAGATCAACG CAGCTCGTCT CGAACAGGAG
AAGCGTAATA TCCCCATGGC AGAATTCTCC ATCTTCTGGC TTCTCAACAA AGCTGGTGTT
AGCGATCCGG AAACGAAAGC CCATGAAATG AAGAATATTC TGAACCACTA TCCCCACTGG
AGAATCAGCG AACAACAGGC TCGCGATGTG AAACAGGAAT TGTATTCGAT AATTCTGCAT
TCCGAAACCC GTGACATCAA AGAGATCAAA AAGATCATTG ACCAGATCAT GAAAGTTCTC
AATCGGGTAG TTACATGA
 
Protein sequence
MTLGGERGSV QNPFIDYAES KKWEYVPKDR ATAIRGGTTG ILFKEIFIEQ ICRLNDSFMT 
RELATELIKR IGRIPPTIEG NLVAWEYLKG IKTIFVPAEK RERNVQFIDT KDIENNTFHV
TDEFSFTNGS KTIHEDVVFL VNGIPVFFVE AKAAHKKEGI AEALDQIRRY HRECPELLAI
LQIYALTHII RYYYSATWNT SKKTLFNWKD EAGGNFETLV KTFCDRKRFL TLLSDGILFT
RQDEELKKVI LRQHQMRAVD KLLGRAQDAW KKRGLVWHTQ GSGKTYTMIV AAQKILGEPV
FGNPTVIMLV DRNELETQLF GNLTSAGIGN VEVAGSKKDL RELLAADHRG LIVSMIHKFE
GMPEEINTRD TIFILVDEAH RTTTGTLGNY LMGALPNATY IGFTGTPIDR TAYGQGTFIT
FGRDDPPYGY LDKYSIAESI SDGTTVPLHY TLAPNDLLVD RQTLEQEFLD LAETEGISDV
TELNKVLEKA VNLRNMMKSP ERVPKVAQFV ADHFRNNVEP MGYKAFFVAV DREACALYKK
ELDKHLPLQY SEVIYSPNPK DDDNLRAYYH TDEDEKRIRK AFRSPEKDPK ILIVTEKLLT
GFDAPVLYCM YLDKPMRDHV LLQAIARVNR PFEDEEGRKK PSGFVLDIVG IFNNLKKALA
FDSSDIEGII DDLDVLKKQF TRLMGQEAQQ YLGLARGKKR DKAVEAVLEY FLNEEIRQTF
YAFFHELSDI YEILSPDAFL RPYLDDVDKL AKMYRMVREN FDPGISVDRE FSRKVARLVQ
DHTVSSEIGN PGGIYEINDK VIWYINEQPD SDIEKVFNLT KGISHLVQKQ AEESPYLISI
GEKADAVIQL YKDRQNTTQE TLAELKTIIE EINAARLEQE KRNIPMAEFS IFWLLNKAGV
SDPETKAHEM KNILNHYPHW RISEQQARDV KQELYSIILH SETRDIKEIK KIIDQIMKVL
NRVVT
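
Both sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below is an illustration, not part of the original record. It joins the blocks, provides a GC-fraction helper, and translates the first and last lines of the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built by hand rather than taken from a library; it relies on the fact that translation table 11 assigns the same amino acids as the standard genetic code and differs only in its permitted start codons.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same amino-acid assignments as the standard code; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Join a blocked sequence by removing all whitespace and upper-casing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a non-empty DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate whole codons starting at position 0; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = clean("ATGACACTCG GCGGCGAACG GGGGTCGGTG CAGAATCCGT TCATCGACTA TGCAGAATCC")
last_line = clean("AATCGGGTAG TTACATGA")

print(translate(first_line))  # MTLGGERGSVQNPFIDYAES
print(translate(last_line))   # NRVVT*
```

The two translations match the first twenty and last five residues of the protein sequence, and the trailing `*` corresponds to the terminal TGA stop codon. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field.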