Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_1030 |
Symbol | |
ID | 5412245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 1010105 |
End bp | 1012786 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640868256 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_001404191 |
Protein GI | 154150573 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0182402 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.455384 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAG AGCCTGAGGT AGTGACGCGG AAAAAAAGGA TCGATCCCCA GCTGAGAGCT GCGGGATGGA CCATTGCACC CTATCAGGCA GGCATGGATC TCTCCCGGTA TTCCCGGTAC GCCCTTGAGG AATTTCCAAC CACTAATGGA CCGGCCGATT ATGCACTCTG CCTTGACGGG AAGATCGTTG CCGTAATCGA GGCAAAGAAA CTCACGCTCG GCCCGCAGAA TGTTCTCACC CAGGCCGAGC GGTACGCTCA GGGGATCTCC GGCAGCCCGT TCAATTATTC CGGGTTCCGC GTCCCTTTTA TCTATTCAAC CAATGGCGAG ATCATCTGGT ACCACGATCT CCGCAACTCT TTGAACCGGT CCAGCACGGT TTCGCATTTT CATACCCCTG ACGCACTTGC TGAACGGCTG AAGGATAAAT TCGAGTCTTC ATGCCAGCAC CTTTTTGAAT GGGAAAATGT GCACCCGATG ATCCGCCCCT ACCAGGCCGA AGCGAACGCG GCAATCGAGC AGGCCATCCG GGACCGGAAG CGCCAGATGC TTGTGGCGAT GGCGACCGGG ACTGGCAAGA CCTACACGAT GGTAAACGAG ACGTTCCGGC TGATGGAATC CGGTGTGGCC AAGAGGATCC TGTTTCTTGT GGATCGCCGG GCTCTTGCTG CTCAGGCAGT CAAGGCGTTT GCGTCCTTTG AAGCACGGCC GGGGCTGAAG TTCGACAAAT CGTACGAGGT CTACAGCCAG AGGTTTTTCC GGGAGGATTT TGAGGAGGAA GAGAAGTTCG ATCCCAAAGT ACTTCCCTCG AATTATCTTC TTGAGCCAAA ACCCGGTCTT GCATTTGTTT ATGTCTGCAC GATCCAGCGG ATGACGATCA ATCTCTTCGG GCGTAATGCG GTTTTTGGTT CCAGCGATGA ACCGATCGAT GAGGATGCCG AGCAGATGGA TATCCCGATC CATGCCTTTG ATCTCATTAT TGCTGACGAG TGCCACCGGG GATATACCGC TGCGGAACAA TCGGTGTGGC GTAAAACGCT CGATCATTTC GATGCGATCA AGATCGGCCT GACGGCAACA CCGGCGGCTC ATACAATGGC ATATTTCCGG GAGATTGTGT ACAAGTATGA TTATGCCCGG GCTGTGCGGG AAGGATTCCT TGTCGATTAC GATGCAGTTG CGCTTGATTC GAACGTGCGG ATGAACGGGA TCTTCCTGCA GGCCGGCGAG CAGGTCGGGG TGATCGATGC ATCATCGGGT GCCCAATCGT TCGACAACAT GGAGGACGAG CGGCAGTTTG ATACAACTGA AGTCGAGCGT TCGATTACCT CGCCGGATTC CAACCGGAAG ATCCTTTCCG AGATCCGGAA GTACGCAGAA GAGCACGAGC AGCGGTTCGG AAGGTTCCCG AAAATCCTGA TCTTTGCCGT GAATGACCTG TCGCATACCT CTCATGCAGA CCAGCTGGTG GATATTGCCC GGGACGTGTT CGGGAAAGGT GATTCGTTTG TCCAGAAGAT CACAGGCAAG GTTGACCGGC CTCTCCAGCA TATCCGCGAG TTCAGGAACC GGCCAATGCC TGCCGTTGTG GTGACGGTGG ACATGCTCTC AACCGGTGTC GATATCCCGG ACCTGGAGTT CATTGTTTTC CTGCGGCCGG TGAAATCCCG TATCCTCTTC GAGCAGATGC TTGGCCGGGG AACACGCAGG GGCGAGCGAT GCCCGGATAA ATCCCACTTT GTTGTCTTCG ATTGTTTTGG CGGGACGCTG CTTGATTATT TCCGGCAGGC AACCGGGATC ACCGCTGAGC CTCCCGAGAA GGAGACACGA TCGATCGTGC AGGTCATCAA GGATGTCTGG GACAACCGGG ACCGGGACTA TAATATCCGG GTACTGGTCC GGAGACTCCA GCGGATCGAC AAAGAGATGA GCGGACATGC CCGCGACCTG TTCGCTGCAT ATGTTCCGCT GGGCGACCTG AAACGCTATG CAAGCGATCT TACCCATGCC CTCGGGCAGG ATTTCACCGG AACGATGACC CTTCTCCGGA ACCCGGCATG CCAGGACCTG CTGCTTCACT ATCCCCGGCC CGAGCGCTCG CTGCTGGTTG CGTACGAGAA TGTGGATACA GTCAGTTCCC GGTATTTGAT CCGGGACTCA GCGGGACACG AATACAAGCC GGAGGATTAT CTCACAGCGT TCTCAACTTT TGTTAAGGAA AACCCGGAAC ATATCGAGGC AATCCGGATC CTGCTGGACC GGCCAAAAGA CTGGGGAACG GACGCACTCT CCGAGTTAAA GCAGAAACTT GCGGCCACCC GGTACCGGTT CACGGTCGAA AACCTCCAGA TGGCACACAA GGTGCGGTAC AACAAGGCGC TCGTGGATAT CATCTCGATG GTAAAACACG CCGCCCGCGA GGAGGAGCCG CTCTGTACTG CAGAGCAGCG GATCCACCGG GTCTTCGACA AGATGTCCCT TGCCACCTCA TTGACTCCCG AGCAGCAGCA GTGGCTCGAT AAAATCCGTG AACATTTAAT TGCGAACCTC TCAATCAGTA AGGACGACTT TGATAGCATC CCGATCTTTG CGAATGTCGG CGGGTGGGGC AAAGCGAACC GGGTCTTTGA CGGGCAGCTG CCCGACCTGA TCCGGCAGTG GAATGAGGCT ATTGCAGCAT GA
|
Protein sequence | MSEEPEVVTR KKRIDPQLRA AGWTIAPYQA GMDLSRYSRY ALEEFPTTNG PADYALCLDG KIVAVIEAKK LTLGPQNVLT QAERYAQGIS GSPFNYSGFR VPFIYSTNGE IIWYHDLRNS LNRSSTVSHF HTPDALAERL KDKFESSCQH LFEWENVHPM IRPYQAEANA AIEQAIRDRK RQMLVAMATG TGKTYTMVNE TFRLMESGVA KRILFLVDRR ALAAQAVKAF ASFEARPGLK FDKSYEVYSQ RFFREDFEEE EKFDPKVLPS NYLLEPKPGL AFVYVCTIQR MTINLFGRNA VFGSSDEPID EDAEQMDIPI HAFDLIIADE CHRGYTAAEQ SVWRKTLDHF DAIKIGLTAT PAAHTMAYFR EIVYKYDYAR AVREGFLVDY DAVALDSNVR MNGIFLQAGE QVGVIDASSG AQSFDNMEDE RQFDTTEVER SITSPDSNRK ILSEIRKYAE EHEQRFGRFP KILIFAVNDL SHTSHADQLV DIARDVFGKG DSFVQKITGK VDRPLQHIRE FRNRPMPAVV VTVDMLSTGV DIPDLEFIVF LRPVKSRILF EQMLGRGTRR GERCPDKSHF VVFDCFGGTL LDYFRQATGI TAEPPEKETR SIVQVIKDVW DNRDRDYNIR VLVRRLQRID KEMSGHARDL FAAYVPLGDL KRYASDLTHA LGQDFTGTMT LLRNPACQDL LLHYPRPERS LLVAYENVDT VSSRYLIRDS AGHEYKPEDY LTAFSTFVKE NPEHIEAIRI LLDRPKDWGT DALSELKQKL AATRYRFTVE NLQMAHKVRY NKALVDIISM VKHAAREEEP LCTAEQRIHR VFDKMSLATS LTPEQQQWLD KIREHLIANL SISKDDFDSI PIFANVGGWG KANRVFDGQL PDLIRQWNEA IAA
|
| |