Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_1032 |
Symbol | |
ID | 5412247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 1014213 |
End bp | 1015586 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640868258 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_001404193 |
Protein GI | 154150575 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.41908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.322454 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCTGA CAGTTCCCGT TGCTGAAATT ATCAATAGTT CAACAAATCC TCTACTTGCG ATTGATCCCT CTTGGGAGCG CGTGCCTCTT GGAAAAATCG CAAAAGTACT GAATGGTTTT GCATTTAAAT CAGAATTGTT TAACGATAAA AAAGGTACGC CTCTTATCCG CATTCGGGAT ATCGGAAATA ACAAAACAGA GTGTTATTAT GACGGTGTAT TCGATGAAGC ATATGTCATA CATCCGGGGG ATTTGCTCGT AGGGATGGAT GGGGATTTCA ATTGTTCTAC ATGGCGAGGT CCAAAAGCCT TACTCAATCA ACGGGTTTGT AAAATTGAAG TTAATATTGA ACAATACAAC AGAAAATTTT TAGAATATGT TTTACCGGGA TATCTGAAAG CTATCAATGA AAATACCTCT TCGCAAACAG TGAAACATCT ATCGTCACGA TCAATCTCTG AAATACTTCT TCCAAATCCT CCACTAACCG AACAGCAGCG CATCGTCGCC CGTGTCGAAG CCCTCCTGTC GCACGTCAAC GCCGCCCGCG AACGGCTGAG CCGGGTGCCG TTGATCATGA AAAAGTTCCG GCAGGCAGTG CTCGCGGCGG CGTGTAGTGG AGGGTTGACG GAGGGGTGGA GAAAGGAGAA TCCGGATATT GAAGAAGCAA ATAAATTAGT CAAACGTCTA GAATCTATAA GAAAGCAATT TAAAATCCGC GAAATTTCTT CAATAGATAA TTTAGAATTA TCTGACCTGC CAGATTCTTG GACTTGGATT CGTTTAGCTA ATATTGCTAT CGTAATGGAT CCTGATCATA AAATGCCAAA AAGTTCAGAC GGTGGAATAA TCTTTATTTC TCCAAAAGAC TTCAAAGAAA ATTATCAAAT TGATATGACA AAAACAAAAC GGATATCCGA TGAAGAGTTT TTAAGATTAT CTAAAAAATT CGTCCCTAGA CCGTTGGATA TTTTATATTC AAGAATTGGC GCAGATTTGG GGAAAGCAAG AAAAGCACCC CAAGATATCA AATTTCATAT ATCATATAGT TTGGCGGTAA TCCGACAACT GGGTGAAATG GAAAATTCTG ATTATTTGTT TTGGTTATTA AATTCAATGT TTATTAGGAA TCAGGCATTC GAGAATGTGC GAAGCATCGG CGTTCCTGAT TTGGGATTAA GGGATATTGA TAATTTTATA ATCCCCCTCC CACCCCTTGC CGAGCAGTAC GAGATCGTCC GGCGTGTCGG TTTACTGTTT GAGCGTGCGG ATGCCATTGA TCGCGAGGTT GAAGCGGCGA CCCGGCGGTG CGAGCGGTTG ACGCAGGCGG TACTGGGGAA GGCGTTCAGA GGAGAATTAA CGAGGAATTT ATGA
|
Protein sequence | MSLTVPVAEI INSSTNPLLA IDPSWERVPL GKIAKVLNGF AFKSELFNDK KGTPLIRIRD IGNNKTECYY DGVFDEAYVI HPGDLLVGMD GDFNCSTWRG PKALLNQRVC KIEVNIEQYN RKFLEYVLPG YLKAINENTS SQTVKHLSSR SISEILLPNP PLTEQQRIVA RVEALLSHVN AARERLSRVP LIMKKFRQAV LAAACSGGLT EGWRKENPDI EEANKLVKRL ESIRKQFKIR EISSIDNLEL SDLPDSWTWI RLANIAIVMD PDHKMPKSSD GGIIFISPKD FKENYQIDMT KTKRISDEEF LRLSKKFVPR PLDILYSRIG ADLGKARKAP QDIKFHISYS LAVIRQLGEM ENSDYLFWLL NSMFIRNQAF ENVRSIGVPD LGLRDIDNFI IPLPPLAEQY EIVRRVGLLF ERADAIDREV EAATRRCERL TQAVLGKAFR GELTRNL
|
| |