Gene Information |
Locus tag | Mboo_0859 |
Symbol | |
ID | 5411478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Methanoregula boonei 6A8 |
Kingdom | Archaea |
Replicon accession | NC_009712 |
Strand | + |
Start bp | 835930 |
End bp | 838758 |
Gene Length | 2829 bp |
Protein Length | 942 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640868085 |
Product | DNA topoisomerase |
Protein accession | YP_001404020 |
Protein GI | 154150402 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01057] DNA topoisomerase I, archaeal |
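
The coordinate and length fields above are related by simple arithmetic. A minimal sketch of that check, assuming 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length (the function names are hypothetical, not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Mboo_0859.
length_bp = gene_length(835930, 838758)
print(length_bp)                  # 2829
print(protein_length(length_bp))  # 942
```

Both results agree with the Gene Length and Protein Length fields in the record.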
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.293507 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGCACCTGA TCGTTGCTGA AAAGAACATC TCGGCCCACC GGATTGCCCA GATCCTGGCC GGAGGTACCA GGGTTATCGA GAAAAAGGAT GCCGGGGTAT CCACCTACAG TTTTGGGGAT ACTATCACGG TGGGGCTGCG CGGCCACGTG GTGGAAGTGG ATTTCGAGCC CGGCTATGAG AACTGGCGCA GCGAAAAGTA CACACCCCGC AGCCTGATCG ATGCAAAAAC CATCAAGGTG CCTACCGAAA AAAGGATCGT TTCCCTTGTC CAGAAACTTG CCCGCCATGC AGACCGGGTT ACCATTGCCA CTGATTTTGA TACCGAGGGG GAACTGATCG GGAAAGAGGC CTATGAATTG GTGCGTGCGG TCAATAAGAA CGTAAAGATC GACCGGGCCC GGTTCTCCGC GATCACAGCG CAGGAGATTG GTTCTGCCTT TGCAAACACA ACTGACCTTG ATTTTGCTCT TGCCGCTGCC GGCGAAGCCC GCCAGTCCAT CGATCTCATG TGGGGCGCAT CGCTCACCCG GTTTATATCG CTTGCCGCAA AGAGGGGAGG CAACAATATC TTATCCGTGG GCCGGGTCCA GAGCCCCACG CTCTCCATGA TCGTTGACCG GGAAAAAGAG ATCGAGAAGT TTGTCCCGGA GAAGTACTGG CAGCTGGGCC TTCTTACGGA AAAGGCGGGC GAGCAGATCG AGGCCCGGCA CACAAACGGC CGCTTCAAGG ACCAGAAGGC AGCAGAACTT GCCCGTGACC GGACAAAAGA ACCGCTCGTG GTAACCGAGG TCAGGGAAGG TACAAAACAG GACCGGTCCC CGTCCCCGTT CGACACCACC ACCTACATTG TTGCAGCAGC CCGGCTGGGT TTTTCTGCTG CAAATGCGAT GCGGATTGCC GAAGAACTGT ACATGAACGG GTACATCTCG TACCCCAGAA CGGACAATAC CGTATACCCG GCATCCCTTG ACCTCAATGG CGTGCTTGCC ACGATAAGGA ACTCGCCCTT TACAAAAGAC GTTGACTGGA TCATTTCCCA CCGGCGGGCC GAGCCTACAC GAGGGAAGAA ATCCTCTACC GATCACCCGC CGATCTATCC CACCGGTGTT GCCACCCGGG AAGGGATCGG GGATGACGCA TTCCGGATCT ACGAACTGGT GCTGCGCCGG TTTTTGGCAA CGCTTGCCCC GGATGCACTC TGGAAGACCC TCAAGGTGAA CTTTGAGGCC GGTGGCGAAA CCTACACCGT GACCGGAGGG CTCCTGCTCG AACCGGGCTG GCATGCGGTG TACCCGTTCT CCGAAGCAAA GGAGACGATC CTCCCGGCAT TTGTTACCGG AGAGAAACTA CCGATACGAA AAGTCAACCT TGAGGAAAAG GAGACCCTTC CCCCTGCCCG GTACACGCAG AGCAAGCTCA TCCAGAGGAT GGAAGAGCTT GGCCTTGGCA CCAAGAGTAC GCGCCATGAG GTGATAGCAC GGCTTGTCTC CCGCAAATAT ATCGAAGGTA ATCCCCTTCG TCCGACCCTT GTGGGCCGGG TAGTGACCGA GGCGCTGGAG CAGAACGCCG ATACGATCAC CAAACCCGAG ATGACCCAGA CAATCGAATC GCACATGCAG CAGATAAAGG AGAGCAAGCG GACCCGCGAC GATGTGATTG CGGAATCGCG CAGCATGCTG CATAAGGCAT TTGACCAGCT CGAGGCAAAC GAGCAGGTGA TAGGAAACGA TATCCGGGAC CGCACCGCAG AGGAACTCAA CCTGGGCCGC TGCCCGGCCT GCGGCGGGAC GCTTGCGATC 
AAGCACATGC GGGGATCGAC CCAGTTCATT GGCTGTTCGC GGTACCCTGA GTGTACATTC AATATCGGTC TCCCGGTCAC CCAGTGGGGC TGGGCAGTCC GCACCGACGA TATCTGCGAC AAGCACCACC TCCACTTCGT GCGCCTTGTC AGGAAGGGCG CCCGTCCCTG GGATATCGGC TGCCCGCTCT GCCACCATAT CAGCTCCAAT GCAGAGTCCC TCACCGAGAT CCCCTCCATG GATGAGGCCC TTCTCGAAAA GGCGCGATCA AAGCATATCT ACACAGTCGC GGAGATCGCA CGGAGCGAGC CGGATGCCCT TGCAAAGTCC CTTGACCTGC CCTTAGAGAA AGCAGGGAAA CTAAAAAGTG AAGCCGGGAT TGTCCTTGAA AAGCTGCGGC GGCGCTCCGA GTGCCGCAAG TTTTTGCGCA ACCACCTGAT CCCGCGCAAG GGGCGCAGCT ATGCAAAGAT CATGGAAGCC CTCGGTCAGT CCGGGGTAAC AGATCTCGCA AGCCTTGCCC ATGCAACCCC GGCAACACTC CAGCAGGCCG GTATTGGCGA AACCGAGGCC CGTGACCTTC TTACTGAGGC CCGGCTCACC CATAACAGCC AGCTCTTAAA AGGGACCGGT ATCCCGGCCG TGAGCCTGAA AAAGTACCTG GAAGCCGGCG TTTCCGGTCC TGAAGATTTT GTCTCCACCG GGGCAGAGAA ACTTGCGACC TTAACCGGGA TGAGCACCGG CACGATAAAC CGGCACCTGG CCCTTGTCTG CGAGTACCTG CACCGCCCCG TGCCGTTAGC CGTTCCCAAG GCAAAACTTG CGAAAGGGAG AAAGGAACTG CTCTCGGTTA AGGGCCTTGC CGCAACCACG GCAGACAAGC TTATCGGGGC AGGGGTCATA AGTGGGGATC TCCTGCTTGC AGCGGATACC AAAAAACTTG CAGCCTCAAC CGGGATTGAT GAGGAAAAGC TCCGGGGTTA CCAGGCACTG ATGAAGAAAA AAAAGGAAAA CGCGATCATA CGGATCTGA
|
Protein sequence | MHLIVAEKNI SAHRIAQILA GGTRVIEKKD AGVSTYSFGD TITVGLRGHV VEVDFEPGYE NWRSEKYTPR SLIDAKTIKV PTEKRIVSLV QKLARHADRV TIATDFDTEG ELIGKEAYEL VRAVNKNVKI DRARFSAITA QEIGSAFANT TDLDFALAAA GEARQSIDLM WGASLTRFIS LAAKRGGNNI LSVGRVQSPT LSMIVDREKE IEKFVPEKYW QLGLLTEKAG EQIEARHTNG RFKDQKAAEL ARDRTKEPLV VTEVREGTKQ DRSPSPFDTT TYIVAAARLG FSAANAMRIA EELYMNGYIS YPRTDNTVYP ASLDLNGVLA TIRNSPFTKD VDWIISHRRA EPTRGKKSST DHPPIYPTGV ATREGIGDDA FRIYELVLRR FLATLAPDAL WKTLKVNFEA GGETYTVTGG LLLEPGWHAV YPFSEAKETI LPAFVTGEKL PIRKVNLEEK ETLPPARYTQ SKLIQRMEEL GLGTKSTRHE VIARLVSRKY IEGNPLRPTL VGRVVTEALE QNADTITKPE MTQTIESHMQ QIKESKRTRD DVIAESRSML HKAFDQLEAN EQVIGNDIRD RTAEELNLGR CPACGGTLAI KHMRGSTQFI GCSRYPECTF NIGLPVTQWG WAVRTDDICD KHHLHFVRLV RKGARPWDIG CPLCHHISSN AESLTEIPSM DEALLEKARS KHIYTVAEIA RSEPDALAKS LDLPLEKAGK LKSEAGIVLE KLRRRSECRK FLRNHLIPRK GRSYAKIMEA LGQSGVTDLA SLAHATPATL QQAGIGETEA RDLLTEARLT HNSQLLKGTG IPAVSLKKYL EAGVSGPEDF VSTGAEKLAT LTGMSTGTIN RHLALVCEYL HRPVPLAVPK AKLAKGRKEL LSVKGLAATT ADKLIGAGVI SGDLLLAADT KKLAASTGID EEKLRGYQAL MKKKKENAII RI
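
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text could be cleaned and sanity-checked, assuming plain uppercase A/C/G/T input; the helper names are hypothetical, and the start-codon set is an illustrative subset of those allowed under translation table 11, not the full list:

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for 10-character grouping and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, plausible start and stop codons."""
    starts = {"ATG", "GTG", "TTG"}  # illustrative subset for table 11
    stops = {"TAA", "TAG", "TGA"}
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# First two blocks of the gene sequence from the record.
prefix = clean_sequence("GTGCACCTGA TCGTTGCTGA")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.55

# Toy example built from the record's first two codons and its last codon.
print(looks_like_cds("GTGCACTGA"))  # True
```

Applied to the full gene sequence, `gc_fraction` would be expected to land near the 58% reported in the GC content field. The record's gene begins with GTG while its protein begins with M, which is consistent with GTG acting as an initiation codon under translation table 11.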