Gene Mboo_1084 details


Gene Information

Locus tag: Mboo_1084
Symbol:
ID: 5411689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 1073031
End bp: 1075841
Gene Length: 2811 bp
Protein Length: 936 aa
Translation table: 11
GC content: 59%
IMG OID: 640868310
Product: excinuclease ABC, A subunit
Protein accession: YP_001404245
Protein GI: 154150627
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
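
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon adds no amino acid
    return gene_len, protein_len


# Values taken from the record above (Start bp, End bp).
gene_len, protein_len = cds_lengths(1073031, 1075841)
print(gene_len, protein_len)  # 2811 936
```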


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.293507
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACA TCATCATCAA AGGTGCACGC CAGCACAACC TCAAAAATAT CAATGTCGAG 
ATCCCGCGTG ACAAGCTTGT CGTGATAACC GGGGTTTCCG GCTCCGGCAA ATCGACGCTT
GCATTCGATA CGCTCTATGC CGAAGGTCAG CGGCGGTATG TCGAATCCCT CTCGTCCTAT
GCCCGGCAGT TCCTCGGGAT GATGCAGAAG CCCGATGTGG ATTCCATCGA AGGGCTCTCA
CCTGCCATCT CCATCGAGCA GAAGACCACG TCCAAGAACC CCCGCTCGAC CGTGGGGACG
ACGACCGAGA TCTACGATTA CCTCCGTCTG CTCTTTGCCC GGATCGGGAC GCCGTACTGC
CCCGAGCACA ATATCCCGAT TGCCGCCCAG AGCCCGGACC GGATCGCCGA CCAGATCGCT
GCTGAGCACC CGGGCCAGGT CACGGTCCTT GCCCCGATTG TCCGGCAGAA GAAGGGGACG
TACCAGCAGC TCCTCAAAGA CCTGAATAAG GAAGGCTACG CCCGGGTCCG GCTGAACGGA
AAAATAATCC GCACCGATGA AGAGATTACG CTCGACCGGT ACAAGAAGCA CGACATCGAG
GTCGTGATCG ACCGGCTCGA AACCACCGAC CGGGCCCGGC TCGCTGAAGC GGTCGAGAAC
ACGCTCAAAA AATCCGGTGG GCTCGTGCTC GTAGCGGACG AAGAGGGAAA GGAATCTACG
TACTCCTCGC TTCTCGCCTG CCCGGTCTGC GGCCTTGCCT TTGAGGAACT CCAGCCGCGG
ATGTTCTCGT TTAACAGCCC CTTTGGCGCC TGCGACGAAT GCCACGGGCT TGGCGTCAAG
ATGGAGTTTG ACGCTGACCT CATTATCCCG GACAAGAACC GGTGCATAGC CGATGGTGCA
GTTGCCCCGT ACCGGAACCC GATGGACGGT TTCCGGGGCC AGTACCTGGC AACGGTTGCA
AAACATTTCG GTTTCTCGGT ACTTACACCC ATCAAAGATC TGACCGAAGA GCAGTACAAT
GCCCTGATGT TCGGCTCGAC CGAAAAGATG CACTTCTCGA TGAGCATGAA AAACGGCGAC
GCCCAGTGGT CACACAACGG TGAGTGGGAA GGGCTCCTCC CGCAGACCGC CCGGCTCTAT
TCGCAGACCC AGTCCGAGTG GCGGAAGCGG GAACTTGAAG GCTACATGCG GGTCTTTCCC
TGCCCGGCCT GCAAGGGAAA AAGGCTCAAG GACAAGGTGC TCGCGGTCCG GATTGATGGC
AAATCGATCA TTGATGTGAC CGATCTCTCG GTCTCCGGCT GCATCGCGTA CTTCTCCGGC
CTCCGGCTCA CCGAGAAGGA GGAAGGCATT GCCCGGCAGA TCATCAAGGA GATCCGGTCC
CGGTTGCTCT TTTTGGAAAA AGTCGGGCTC GGATACCTCA CGCTCTCGCG GAATGCCGGG
ACGCTCTCGG GCGGCGAAGC GCAGCGGATC CGGCTTGCCA CCCAGATCGG CTCGAACCTG
ATGGGCGTGC TCTACGTGCT CGACGAACCG TCCATCGGGC TTCACCAGCG GGACAACCGG
AAACTTATCG AGACGCTACG GACGCTCCGC GATATTGGGA ATACGCTGAT CGTGGTGGAG
CACGACGAGG ACATGATTCG CTCGGCCGAT CACGTGATCG ATATCGGGCC CGGCGCCGGG
CTCCACGGCG GTGAGGTGGT GGCAGAAGGC ACACCGCAGC AGATCGAGAA GAACAAAAAG
TCCCTCACCG GCCTGTATCT TGCCGGGAAA AAGAAGATCG ATGTGCCGGA GAAACGCCGG
AAAGCTGCGA AGTACATCAC GGTCAAAGGC TGTAAGGAGA ACAACCTGAA AAACATCGAT
GTAAAGATCC CGATCGGTCT TTTTTCGGTG GTGACCGGCG TCTCCGGGTC GGGCAAGTCA
ACGCTTGTCT ACGATACGCT CTACAAGGGC ATGATGCAGA AACTGTACGG CTCGCGGGAG
CAGGCCGGGG CGCATAAGGA GATCGTGTTC GATTCCGAGA TCGACAAGGT GATCGTGATC
GACCAGAGCC CGATCGGCCG GACACCACGC TCGAACCCGG CAACGTATAC CAAACTCTTC
GATGAGATCC GCACCATTTT TGCTGATACA AAAGAAGCAA AGATGCGGGG CTACAAGGCG
GGCCGTTTCT CGTTTAACCT CAAAGGCGGG CGCTGCGAGG CTTGCGAGGG CGACGGCCTC
ATCAAGATCG AGATGAACTT CCTGCCAGAC GTGTATATCG AGTGCGAGGA GTGCAAGGGG
AAGCGCTACA ACCGCGAGAC GCTAGAAGTG AAGTACAAGG GCAGGTCGAT CTCCGATGTG
CTGGACATGA GCGTGGAGGA AGCCCTTGCC CTCTTCGAGA ACATTCCCTC GATCCAGAGC
AAGCTCGAGA TGCTCACCCG GGTCGGCCTC GGGTACGTGA AGCTCGGCCA GAGTGCGACT
ACACTCTCTG GCGGAGAAGC ACAGCGGATC AAGCTCACCC GGGAGCTCGC AAAGAGGGCG
ACCGGGAAGA CCCTTTACCT GCTCGACGAG CCGACCACCG GGCTCCACTT CGACGACACA
AAGAAACTGA TCAAGGTGCT TGACGATCTT GTGGAGAAGG GCAACACGGT CGTGGTGATC
GAGCACAACC TGGACGTGAT CAAGTCGGCC GATTACCTCA TCGATATCGG CCCCGAGGGC
GGGGATGCCG GCGGCGAGAT CGTGGCGACC GGGACACCGG AGAAGGTGGC TCTCGTCCAG
AAGAGTTATA CGGGGCAGTT TTTGAAGGGG ATGATTGGGG GGAGGGTGTA G
 
Protein sequence
MKNIIIKGAR QHNLKNINVE IPRDKLVVIT GVSGSGKSTL AFDTLYAEGQ RRYVESLSSY 
ARQFLGMMQK PDVDSIEGLS PAISIEQKTT SKNPRSTVGT TTEIYDYLRL LFARIGTPYC
PEHNIPIAAQ SPDRIADQIA AEHPGQVTVL APIVRQKKGT YQQLLKDLNK EGYARVRLNG
KIIRTDEEIT LDRYKKHDIE VVIDRLETTD RARLAEAVEN TLKKSGGLVL VADEEGKEST
YSSLLACPVC GLAFEELQPR MFSFNSPFGA CDECHGLGVK MEFDADLIIP DKNRCIADGA
VAPYRNPMDG FRGQYLATVA KHFGFSVLTP IKDLTEEQYN ALMFGSTEKM HFSMSMKNGD
AQWSHNGEWE GLLPQTARLY SQTQSEWRKR ELEGYMRVFP CPACKGKRLK DKVLAVRIDG
KSIIDVTDLS VSGCIAYFSG LRLTEKEEGI ARQIIKEIRS RLLFLEKVGL GYLTLSRNAG
TLSGGEAQRI RLATQIGSNL MGVLYVLDEP SIGLHQRDNR KLIETLRTLR DIGNTLIVVE
HDEDMIRSAD HVIDIGPGAG LHGGEVVAEG TPQQIEKNKK SLTGLYLAGK KKIDVPEKRR
KAAKYITVKG CKENNLKNID VKIPIGLFSV VTGVSGSGKS TLVYDTLYKG MMQKLYGSRE
QAGAHKEIVF DSEIDKVIVI DQSPIGRTPR SNPATYTKLF DEIRTIFADT KEAKMRGYKA
GRFSFNLKGG RCEACEGDGL IKIEMNFLPD VYIECEECKG KRYNRETLEV KYKGRSISDV
LDMSVEEALA LFENIPSIQS KLEMLTRVGL GYVKLGQSAT TLSGGEAQRI KLTRELAKRA
TGKTLYLLDE PTTGLHFDDT KKLIKVLDDL VEKGNTVVVI EHNLDVIKSA DYLIDIGPEG
GDAGGEIVAT GTPEKVALVQ KSYTGQFLKG MIGGRV
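
The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. The following is a minimal sketch of that translation and of a GC-content calculation, written in plain Python with no external libraries. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the code assumes the input is the coding strand in frame, with the blanks and line breaks used in the listing above stripped first.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence listed above.
print(translate("ATGAAAAACA TCATCATCAA AGGTGCACGC"))  # MKNIIIKGAR
```

Applied to the first three groups of the gene sequence, this gives the first ten residues of the protein sequence, and applied to the last line of the gene sequence it gives the last sixteen residues.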