Gene Mboo_0859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_0859 
Symbol 
ID5411478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp835930 
End bp838758 
Gene Length2829 bp 
Protein Length942 aa 
Translation table11 
GC content58% 
IMG OID640868085 
ProductDNA topoisomerase 
Protein accessionYP_001404020 
Protein GI154150402 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01057] DNA topoisomerase I, archaeal 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.293507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACCTGA TCGTTGCTGA AAAGAACATC TCGGCCCACC GGATTGCCCA GATCCTGGCC 
GGAGGTACCA GGGTTATCGA GAAAAAGGAT GCCGGGGTAT CCACCTACAG TTTTGGGGAT
ACTATCACGG TGGGGCTGCG CGGCCACGTG GTGGAAGTGG ATTTCGAGCC CGGCTATGAG
AACTGGCGCA GCGAAAAGTA CACACCCCGC AGCCTGATCG ATGCAAAAAC CATCAAGGTG
CCTACCGAAA AAAGGATCGT TTCCCTTGTC CAGAAACTTG CCCGCCATGC AGACCGGGTT
ACCATTGCCA CTGATTTTGA TACCGAGGGG GAACTGATCG GGAAAGAGGC CTATGAATTG
GTGCGTGCGG TCAATAAGAA CGTAAAGATC GACCGGGCCC GGTTCTCCGC GATCACAGCG
CAGGAGATTG GTTCTGCCTT TGCAAACACA ACTGACCTTG ATTTTGCTCT TGCCGCTGCC
GGCGAAGCCC GCCAGTCCAT CGATCTCATG TGGGGCGCAT CGCTCACCCG GTTTATATCG
CTTGCCGCAA AGAGGGGAGG CAACAATATC TTATCCGTGG GCCGGGTCCA GAGCCCCACG
CTCTCCATGA TCGTTGACCG GGAAAAAGAG ATCGAGAAGT TTGTCCCGGA GAAGTACTGG
CAGCTGGGCC TTCTTACGGA AAAGGCGGGC GAGCAGATCG AGGCCCGGCA CACAAACGGC
CGCTTCAAGG ACCAGAAGGC AGCAGAACTT GCCCGTGACC GGACAAAAGA ACCGCTCGTG
GTAACCGAGG TCAGGGAAGG TACAAAACAG GACCGGTCCC CGTCCCCGTT CGACACCACC
ACCTACATTG TTGCAGCAGC CCGGCTGGGT TTTTCTGCTG CAAATGCGAT GCGGATTGCC
GAAGAACTGT ACATGAACGG GTACATCTCG TACCCCAGAA CGGACAATAC CGTATACCCG
GCATCCCTTG ACCTCAATGG CGTGCTTGCC ACGATAAGGA ACTCGCCCTT TACAAAAGAC
GTTGACTGGA TCATTTCCCA CCGGCGGGCC GAGCCTACAC GAGGGAAGAA ATCCTCTACC
GATCACCCGC CGATCTATCC CACCGGTGTT GCCACCCGGG AAGGGATCGG GGATGACGCA
TTCCGGATCT ACGAACTGGT GCTGCGCCGG TTTTTGGCAA CGCTTGCCCC GGATGCACTC
TGGAAGACCC TCAAGGTGAA CTTTGAGGCC GGTGGCGAAA CCTACACCGT GACCGGAGGG
CTCCTGCTCG AACCGGGCTG GCATGCGGTG TACCCGTTCT CCGAAGCAAA GGAGACGATC
CTCCCGGCAT TTGTTACCGG AGAGAAACTA CCGATACGAA AAGTCAACCT TGAGGAAAAG
GAGACCCTTC CCCCTGCCCG GTACACGCAG AGCAAGCTCA TCCAGAGGAT GGAAGAGCTT
GGCCTTGGCA CCAAGAGTAC GCGCCATGAG GTGATAGCAC GGCTTGTCTC CCGCAAATAT
ATCGAAGGTA ATCCCCTTCG TCCGACCCTT GTGGGCCGGG TAGTGACCGA GGCGCTGGAG
CAGAACGCCG ATACGATCAC CAAACCCGAG ATGACCCAGA CAATCGAATC GCACATGCAG
CAGATAAAGG AGAGCAAGCG GACCCGCGAC GATGTGATTG CGGAATCGCG CAGCATGCTG
CATAAGGCAT TTGACCAGCT CGAGGCAAAC GAGCAGGTGA TAGGAAACGA TATCCGGGAC
CGCACCGCAG AGGAACTCAA CCTGGGCCGC TGCCCGGCCT GCGGCGGGAC GCTTGCGATC
AAGCACATGC GGGGATCGAC CCAGTTCATT GGCTGTTCGC GGTACCCTGA GTGTACATTC
AATATCGGTC TCCCGGTCAC CCAGTGGGGC TGGGCAGTCC GCACCGACGA TATCTGCGAC
AAGCACCACC TCCACTTCGT GCGCCTTGTC AGGAAGGGCG CCCGTCCCTG GGATATCGGC
TGCCCGCTCT GCCACCATAT CAGCTCCAAT GCAGAGTCCC TCACCGAGAT CCCCTCCATG
GATGAGGCCC TTCTCGAAAA GGCGCGATCA AAGCATATCT ACACAGTCGC GGAGATCGCA
CGGAGCGAGC CGGATGCCCT TGCAAAGTCC CTTGACCTGC CCTTAGAGAA AGCAGGGAAA
CTAAAAAGTG AAGCCGGGAT TGTCCTTGAA AAGCTGCGGC GGCGCTCCGA GTGCCGCAAG
TTTTTGCGCA ACCACCTGAT CCCGCGCAAG GGGCGCAGCT ATGCAAAGAT CATGGAAGCC
CTCGGTCAGT CCGGGGTAAC AGATCTCGCA AGCCTTGCCC ATGCAACCCC GGCAACACTC
CAGCAGGCCG GTATTGGCGA AACCGAGGCC CGTGACCTTC TTACTGAGGC CCGGCTCACC
CATAACAGCC AGCTCTTAAA AGGGACCGGT ATCCCGGCCG TGAGCCTGAA AAAGTACCTG
GAAGCCGGCG TTTCCGGTCC TGAAGATTTT GTCTCCACCG GGGCAGAGAA ACTTGCGACC
TTAACCGGGA TGAGCACCGG CACGATAAAC CGGCACCTGG CCCTTGTCTG CGAGTACCTG
CACCGCCCCG TGCCGTTAGC CGTTCCCAAG GCAAAACTTG CGAAAGGGAG AAAGGAACTG
CTCTCGGTTA AGGGCCTTGC CGCAACCACG GCAGACAAGC TTATCGGGGC AGGGGTCATA
AGTGGGGATC TCCTGCTTGC AGCGGATACC AAAAAACTTG CAGCCTCAAC CGGGATTGAT
GAGGAAAAGC TCCGGGGTTA CCAGGCACTG ATGAAGAAAA AAAAGGAAAA CGCGATCATA
CGGATCTGA
 
Protein sequence
MHLIVAEKNI SAHRIAQILA GGTRVIEKKD AGVSTYSFGD TITVGLRGHV VEVDFEPGYE 
NWRSEKYTPR SLIDAKTIKV PTEKRIVSLV QKLARHADRV TIATDFDTEG ELIGKEAYEL
VRAVNKNVKI DRARFSAITA QEIGSAFANT TDLDFALAAA GEARQSIDLM WGASLTRFIS
LAAKRGGNNI LSVGRVQSPT LSMIVDREKE IEKFVPEKYW QLGLLTEKAG EQIEARHTNG
RFKDQKAAEL ARDRTKEPLV VTEVREGTKQ DRSPSPFDTT TYIVAAARLG FSAANAMRIA
EELYMNGYIS YPRTDNTVYP ASLDLNGVLA TIRNSPFTKD VDWIISHRRA EPTRGKKSST
DHPPIYPTGV ATREGIGDDA FRIYELVLRR FLATLAPDAL WKTLKVNFEA GGETYTVTGG
LLLEPGWHAV YPFSEAKETI LPAFVTGEKL PIRKVNLEEK ETLPPARYTQ SKLIQRMEEL
GLGTKSTRHE VIARLVSRKY IEGNPLRPTL VGRVVTEALE QNADTITKPE MTQTIESHMQ
QIKESKRTRD DVIAESRSML HKAFDQLEAN EQVIGNDIRD RTAEELNLGR CPACGGTLAI
KHMRGSTQFI GCSRYPECTF NIGLPVTQWG WAVRTDDICD KHHLHFVRLV RKGARPWDIG
CPLCHHISSN AESLTEIPSM DEALLEKARS KHIYTVAEIA RSEPDALAKS LDLPLEKAGK
LKSEAGIVLE KLRRRSECRK FLRNHLIPRK GRSYAKIMEA LGQSGVTDLA SLAHATPATL
QQAGIGETEA RDLLTEARLT HNSQLLKGTG IPAVSLKKYL EAGVSGPEDF VSTGAEKLAT
LTGMSTGTIN RHLALVCEYL HRPVPLAVPK AKLAKGRKEL LSVKGLAATT ADKLIGAGVI
SGDLLLAADT KKLAASTGID EEKLRGYQAL MKKKKENAII RI