Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_07540 |
Symbol | |
ID | 7314741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 811024 |
End bp | 813099 |
Gene Length | 2076 bp |
Protein Length | 691 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643611185 |
Product | DNA topoisomerase I |
Protein accession | YP_002508506 |
Protein GI | 220931598 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA [COG0551] Zn-finger domain associated with topoisomerase type I |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial [TIGR01057] DNA topoisomerase I, archaeal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.00641456 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAATA AAAGTAATAA TACACTGGTA ATTGTTGAAT CTCCTGCTAA AGCTAAAACA ATATCAAAGT TTCTGGGAAA AGGATATAAA GTGGAAGCAA CAATGGGCCA TGTTATTGAT CTACCCAAGA GCAAACTGGG AATAAATATA GATAAAGGTT TTGAACCCAG GTATATAACA ATCAGGGGTA AAGGAAAAGT ATTAAAAAAA TTAAGAAAGG AAGTAAAAAA GAGTAAAGAT GTTCTCCTGG CAACTGACCC TGATAGAGAA GGGGAAGCCA TATCATGGCA TCTGACCCGG GCTTTGAAAA TAGATGAAGA TAAACCAAGG ATTGAATTTA ATGAAATAAC TAAAAGTGCT ATTAAAAATG CCCTTAAAAA CCGCAGGCCT ATAGATAAGA ATCTGGTTAA TTCCCAGCAG GCCAGACGCC TGCTTGACCG TCTTGTCGGT TACAAACTCA GTCCACTATT ATGGAAAAAG GTGAGACGGG GTTTAAGTGC CGGGCGTGTT CAGACAGTTG CTGTCAAATT GTTATGTAAT CGGGAAAAAG AGATAGAGTC TTTCGAACCT GAAGAATACT GGACTATTTC TGCTTCATTT AATAAAAAAG ATAAAGATTT TATAGCTGAT CTGTACCGAA TTTCAGGTAA AAAATTCAAA ATTAATAATG AGAAAGAAGC AAAACAAATT TTAGAAGATT TAAATAAAAG TAAATTTGTT GTAAGTGACA TAAAAGAAAA AACAAGGAAA CGTAACCCCA ATCCACCCTT TACAACAAGT ACCCTTCAAC AGAGGGCTTC TTCGATTCTG GGCTTTTCTG CTAAAAAAAC AATGTATCTG GCCCAGCAGT TATATGAAGG AATAGATATG GGTAGCGAAG GGACCACTGG TCTTATAAGT TATATACGTA CTGACAGTAC AAGAATCTCC AGAGAAGCCC AGAAACAGGC TTTAGACTAC ATTAAAGAAG CTTTTGGGGA CAAATATATT CCCGATAAAG TGAAGGTATA TAAGGCAAAA GAAGGTTCCC AGGATGCTCA TGAGGCTATT CGTCCTACTT CAGTTGACCG TACTCCAGGT AAGGTAAAAA AATATTTAAA CAAAGACCAG TACAGGTTAT ATAAGTTGAT CTGGGAAAGG TTTGTTGCCA GTCAAATGAG TCCAGCTCAG TATAAACAGG TGAAAGTATT GATAAAAGCA GGAGATAAAT ATATTTTCAG GGCAAAGGGA TCCAGAATTA TATTCCCGGG GTTTTTACGG GTTAACACAA GCAGTCAGAA AAAAGACATC ATATTACCAC CTGTTAAAAA GTCTGAAAGA CTTGATGTTA AGGAAATCAA GCCAGAACAG CACTTTACCC AGCCACCACC ACGTTATACG GAAGCTACCC TGGTTAAAAC ATTAGAAGAA GAGGGAATAG GCCGCCCCAG TACTTATGCT CCGATTATTT CAACAATAAT CTCCCGGGGC TATGTGGAGC GTCAGGGTAA GCAGTTAAAA CCGACTGAAC TGGGTTTTAT AGTTACAGAT CTGTTATCAA AATATTTTCC TGATGTAACT GACATAGAAT TTACCGCACA TATGGAAGAA AGGCTTGATA AGATAGAAGA TGGCAAAGAT GAATGGCGTA ATGTGCTTGA AGATTTTTAT TCCAATTTTT CCAGGAGACT TAAAGAGGCC AGTGAGGAAA TGGAAGAAGT TAAACTTGAA GATGAGGTAA CTGATGAAGT ATGTGAAAAG TGTGGCAGGA ATATGGTAAT AAAATATGGT CGTTATGGCA AATTTCTGGC CTGTTCTGGT TATCCTGAAT GTAAAAATAC CAAGCCTTAT GTTATTAAAA CTGGAGTTAA ATGTCCTCAG TGTAAAGAGG GAGAGCTTGT TCAAAGAAAA AGCCGTAAAG GGCGTACTTT TTACGGATGT AGTTCTTACC CTGATTGCAA ATTTGTTGTC TGGAATAAAC CAGTTAAAGA AAAATGCCCT GAATGTGGTG GCCTTATGGT AGAGAAGAAC TCAAAAAAGC AAGGCCGGTA TTATCTCTGT ATTAACAAAG AGTGTGGTTA TAAGAAAGAA GTATAA
|
Protein sequence | MGNKSNNTLV IVESPAKAKT ISKFLGKGYK VEATMGHVID LPKSKLGINI DKGFEPRYIT IRGKGKVLKK LRKEVKKSKD VLLATDPDRE GEAISWHLTR ALKIDEDKPR IEFNEITKSA IKNALKNRRP IDKNLVNSQQ ARRLLDRLVG YKLSPLLWKK VRRGLSAGRV QTVAVKLLCN REKEIESFEP EEYWTISASF NKKDKDFIAD LYRISGKKFK INNEKEAKQI LEDLNKSKFV VSDIKEKTRK RNPNPPFTTS TLQQRASSIL GFSAKKTMYL AQQLYEGIDM GSEGTTGLIS YIRTDSTRIS REAQKQALDY IKEAFGDKYI PDKVKVYKAK EGSQDAHEAI RPTSVDRTPG KVKKYLNKDQ YRLYKLIWER FVASQMSPAQ YKQVKVLIKA GDKYIFRAKG SRIIFPGFLR VNTSSQKKDI ILPPVKKSER LDVKEIKPEQ HFTQPPPRYT EATLVKTLEE EGIGRPSTYA PIISTIISRG YVERQGKQLK PTELGFIVTD LLSKYFPDVT DIEFTAHMEE RLDKIEDGKD EWRNVLEDFY SNFSRRLKEA SEEMEEVKLE DEVTDEVCEK CGRNMVIKYG RYGKFLACSG YPECKNTKPY VIKTGVKCPQ CKEGELVQRK SRKGRTFYGC SSYPDCKFVV WNKPVKEKCP ECGGLMVEKN SKKQGRYYLC INKECGYKKE V
|
| |