Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mboo_1084 |
Symbol | |
ID | 5411689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 1073031 |
End bp | 1075841 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640868310 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_001404245 |
Protein GI | 154150627 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.293507 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACA TCATCATCAA AGGTGCACGC CAGCACAACC TCAAAAATAT CAATGTCGAG ATCCCGCGTG ACAAGCTTGT CGTGATAACC GGGGTTTCCG GCTCCGGCAA ATCGACGCTT GCATTCGATA CGCTCTATGC CGAAGGTCAG CGGCGGTATG TCGAATCCCT CTCGTCCTAT GCCCGGCAGT TCCTCGGGAT GATGCAGAAG CCCGATGTGG ATTCCATCGA AGGGCTCTCA CCTGCCATCT CCATCGAGCA GAAGACCACG TCCAAGAACC CCCGCTCGAC CGTGGGGACG ACGACCGAGA TCTACGATTA CCTCCGTCTG CTCTTTGCCC GGATCGGGAC GCCGTACTGC CCCGAGCACA ATATCCCGAT TGCCGCCCAG AGCCCGGACC GGATCGCCGA CCAGATCGCT GCTGAGCACC CGGGCCAGGT CACGGTCCTT GCCCCGATTG TCCGGCAGAA GAAGGGGACG TACCAGCAGC TCCTCAAAGA CCTGAATAAG GAAGGCTACG CCCGGGTCCG GCTGAACGGA AAAATAATCC GCACCGATGA AGAGATTACG CTCGACCGGT ACAAGAAGCA CGACATCGAG GTCGTGATCG ACCGGCTCGA AACCACCGAC CGGGCCCGGC TCGCTGAAGC GGTCGAGAAC ACGCTCAAAA AATCCGGTGG GCTCGTGCTC GTAGCGGACG AAGAGGGAAA GGAATCTACG TACTCCTCGC TTCTCGCCTG CCCGGTCTGC GGCCTTGCCT TTGAGGAACT CCAGCCGCGG ATGTTCTCGT TTAACAGCCC CTTTGGCGCC TGCGACGAAT GCCACGGGCT TGGCGTCAAG ATGGAGTTTG ACGCTGACCT CATTATCCCG GACAAGAACC GGTGCATAGC CGATGGTGCA GTTGCCCCGT ACCGGAACCC GATGGACGGT TTCCGGGGCC AGTACCTGGC AACGGTTGCA AAACATTTCG GTTTCTCGGT ACTTACACCC ATCAAAGATC TGACCGAAGA GCAGTACAAT GCCCTGATGT TCGGCTCGAC CGAAAAGATG CACTTCTCGA TGAGCATGAA AAACGGCGAC GCCCAGTGGT CACACAACGG TGAGTGGGAA GGGCTCCTCC CGCAGACCGC CCGGCTCTAT TCGCAGACCC AGTCCGAGTG GCGGAAGCGG GAACTTGAAG GCTACATGCG GGTCTTTCCC TGCCCGGCCT GCAAGGGAAA AAGGCTCAAG GACAAGGTGC TCGCGGTCCG GATTGATGGC AAATCGATCA TTGATGTGAC CGATCTCTCG GTCTCCGGCT GCATCGCGTA CTTCTCCGGC CTCCGGCTCA CCGAGAAGGA GGAAGGCATT GCCCGGCAGA TCATCAAGGA GATCCGGTCC CGGTTGCTCT TTTTGGAAAA AGTCGGGCTC GGATACCTCA CGCTCTCGCG GAATGCCGGG ACGCTCTCGG GCGGCGAAGC GCAGCGGATC CGGCTTGCCA CCCAGATCGG CTCGAACCTG ATGGGCGTGC TCTACGTGCT CGACGAACCG TCCATCGGGC TTCACCAGCG GGACAACCGG AAACTTATCG AGACGCTACG GACGCTCCGC GATATTGGGA ATACGCTGAT CGTGGTGGAG CACGACGAGG ACATGATTCG CTCGGCCGAT CACGTGATCG ATATCGGGCC CGGCGCCGGG CTCCACGGCG GTGAGGTGGT GGCAGAAGGC ACACCGCAGC AGATCGAGAA GAACAAAAAG TCCCTCACCG GCCTGTATCT TGCCGGGAAA AAGAAGATCG ATGTGCCGGA GAAACGCCGG AAAGCTGCGA AGTACATCAC GGTCAAAGGC TGTAAGGAGA ACAACCTGAA AAACATCGAT GTAAAGATCC CGATCGGTCT TTTTTCGGTG GTGACCGGCG TCTCCGGGTC GGGCAAGTCA ACGCTTGTCT ACGATACGCT CTACAAGGGC ATGATGCAGA AACTGTACGG CTCGCGGGAG CAGGCCGGGG CGCATAAGGA GATCGTGTTC GATTCCGAGA TCGACAAGGT GATCGTGATC GACCAGAGCC CGATCGGCCG GACACCACGC TCGAACCCGG CAACGTATAC CAAACTCTTC GATGAGATCC GCACCATTTT TGCTGATACA AAAGAAGCAA AGATGCGGGG CTACAAGGCG GGCCGTTTCT CGTTTAACCT CAAAGGCGGG CGCTGCGAGG CTTGCGAGGG CGACGGCCTC ATCAAGATCG AGATGAACTT CCTGCCAGAC GTGTATATCG AGTGCGAGGA GTGCAAGGGG AAGCGCTACA ACCGCGAGAC GCTAGAAGTG AAGTACAAGG GCAGGTCGAT CTCCGATGTG CTGGACATGA GCGTGGAGGA AGCCCTTGCC CTCTTCGAGA ACATTCCCTC GATCCAGAGC AAGCTCGAGA TGCTCACCCG GGTCGGCCTC GGGTACGTGA AGCTCGGCCA GAGTGCGACT ACACTCTCTG GCGGAGAAGC ACAGCGGATC AAGCTCACCC GGGAGCTCGC AAAGAGGGCG ACCGGGAAGA CCCTTTACCT GCTCGACGAG CCGACCACCG GGCTCCACTT CGACGACACA AAGAAACTGA TCAAGGTGCT TGACGATCTT GTGGAGAAGG GCAACACGGT CGTGGTGATC GAGCACAACC TGGACGTGAT CAAGTCGGCC GATTACCTCA TCGATATCGG CCCCGAGGGC GGGGATGCCG GCGGCGAGAT CGTGGCGACC GGGACACCGG AGAAGGTGGC TCTCGTCCAG AAGAGTTATA CGGGGCAGTT TTTGAAGGGG ATGATTGGGG GGAGGGTGTA G
|
Protein sequence | MKNIIIKGAR QHNLKNINVE IPRDKLVVIT GVSGSGKSTL AFDTLYAEGQ RRYVESLSSY ARQFLGMMQK PDVDSIEGLS PAISIEQKTT SKNPRSTVGT TTEIYDYLRL LFARIGTPYC PEHNIPIAAQ SPDRIADQIA AEHPGQVTVL APIVRQKKGT YQQLLKDLNK EGYARVRLNG KIIRTDEEIT LDRYKKHDIE VVIDRLETTD RARLAEAVEN TLKKSGGLVL VADEEGKEST YSSLLACPVC GLAFEELQPR MFSFNSPFGA CDECHGLGVK MEFDADLIIP DKNRCIADGA VAPYRNPMDG FRGQYLATVA KHFGFSVLTP IKDLTEEQYN ALMFGSTEKM HFSMSMKNGD AQWSHNGEWE GLLPQTARLY SQTQSEWRKR ELEGYMRVFP CPACKGKRLK DKVLAVRIDG KSIIDVTDLS VSGCIAYFSG LRLTEKEEGI ARQIIKEIRS RLLFLEKVGL GYLTLSRNAG TLSGGEAQRI RLATQIGSNL MGVLYVLDEP SIGLHQRDNR KLIETLRTLR DIGNTLIVVE HDEDMIRSAD HVIDIGPGAG LHGGEVVAEG TPQQIEKNKK SLTGLYLAGK KKIDVPEKRR KAAKYITVKG CKENNLKNID VKIPIGLFSV VTGVSGSGKS TLVYDTLYKG MMQKLYGSRE QAGAHKEIVF DSEIDKVIVI DQSPIGRTPR SNPATYTKLF DEIRTIFADT KEAKMRGYKA GRFSFNLKGG RCEACEGDGL IKIEMNFLPD VYIECEECKG KRYNRETLEV KYKGRSISDV LDMSVEEALA LFENIPSIQS KLEMLTRVGL GYVKLGQSAT TLSGGEAQRI KLTRELAKRA TGKTLYLLDE PTTGLHFDDT KKLIKVLDDL VEKGNTVVVI EHNLDVIKSA DYLIDIGPEG GDAGGEIVAT GTPEKVALVQ KSYTGQFLKG MIGGRV
|
| |