Gene Hore_07540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_07540 
Symbol 
ID7314741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp811024 
End bp813099 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content37% 
IMG OID643611185 
ProductDNA topoisomerase I 
Protein accessionYP_002508506 
Protein GI220931598 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial
[TIGR01057] DNA topoisomerase I, archaeal 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00641456 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAATA AAAGTAATAA TACACTGGTA ATTGTTGAAT CTCCTGCTAA AGCTAAAACA 
ATATCAAAGT TTCTGGGAAA AGGATATAAA GTGGAAGCAA CAATGGGCCA TGTTATTGAT
CTACCCAAGA GCAAACTGGG AATAAATATA GATAAAGGTT TTGAACCCAG GTATATAACA
ATCAGGGGTA AAGGAAAAGT ATTAAAAAAA TTAAGAAAGG AAGTAAAAAA GAGTAAAGAT
GTTCTCCTGG CAACTGACCC TGATAGAGAA GGGGAAGCCA TATCATGGCA TCTGACCCGG
GCTTTGAAAA TAGATGAAGA TAAACCAAGG ATTGAATTTA ATGAAATAAC TAAAAGTGCT
ATTAAAAATG CCCTTAAAAA CCGCAGGCCT ATAGATAAGA ATCTGGTTAA TTCCCAGCAG
GCCAGACGCC TGCTTGACCG TCTTGTCGGT TACAAACTCA GTCCACTATT ATGGAAAAAG
GTGAGACGGG GTTTAAGTGC CGGGCGTGTT CAGACAGTTG CTGTCAAATT GTTATGTAAT
CGGGAAAAAG AGATAGAGTC TTTCGAACCT GAAGAATACT GGACTATTTC TGCTTCATTT
AATAAAAAAG ATAAAGATTT TATAGCTGAT CTGTACCGAA TTTCAGGTAA AAAATTCAAA
ATTAATAATG AGAAAGAAGC AAAACAAATT TTAGAAGATT TAAATAAAAG TAAATTTGTT
GTAAGTGACA TAAAAGAAAA AACAAGGAAA CGTAACCCCA ATCCACCCTT TACAACAAGT
ACCCTTCAAC AGAGGGCTTC TTCGATTCTG GGCTTTTCTG CTAAAAAAAC AATGTATCTG
GCCCAGCAGT TATATGAAGG AATAGATATG GGTAGCGAAG GGACCACTGG TCTTATAAGT
TATATACGTA CTGACAGTAC AAGAATCTCC AGAGAAGCCC AGAAACAGGC TTTAGACTAC
ATTAAAGAAG CTTTTGGGGA CAAATATATT CCCGATAAAG TGAAGGTATA TAAGGCAAAA
GAAGGTTCCC AGGATGCTCA TGAGGCTATT CGTCCTACTT CAGTTGACCG TACTCCAGGT
AAGGTAAAAA AATATTTAAA CAAAGACCAG TACAGGTTAT ATAAGTTGAT CTGGGAAAGG
TTTGTTGCCA GTCAAATGAG TCCAGCTCAG TATAAACAGG TGAAAGTATT GATAAAAGCA
GGAGATAAAT ATATTTTCAG GGCAAAGGGA TCCAGAATTA TATTCCCGGG GTTTTTACGG
GTTAACACAA GCAGTCAGAA AAAAGACATC ATATTACCAC CTGTTAAAAA GTCTGAAAGA
CTTGATGTTA AGGAAATCAA GCCAGAACAG CACTTTACCC AGCCACCACC ACGTTATACG
GAAGCTACCC TGGTTAAAAC ATTAGAAGAA GAGGGAATAG GCCGCCCCAG TACTTATGCT
CCGATTATTT CAACAATAAT CTCCCGGGGC TATGTGGAGC GTCAGGGTAA GCAGTTAAAA
CCGACTGAAC TGGGTTTTAT AGTTACAGAT CTGTTATCAA AATATTTTCC TGATGTAACT
GACATAGAAT TTACCGCACA TATGGAAGAA AGGCTTGATA AGATAGAAGA TGGCAAAGAT
GAATGGCGTA ATGTGCTTGA AGATTTTTAT TCCAATTTTT CCAGGAGACT TAAAGAGGCC
AGTGAGGAAA TGGAAGAAGT TAAACTTGAA GATGAGGTAA CTGATGAAGT ATGTGAAAAG
TGTGGCAGGA ATATGGTAAT AAAATATGGT CGTTATGGCA AATTTCTGGC CTGTTCTGGT
TATCCTGAAT GTAAAAATAC CAAGCCTTAT GTTATTAAAA CTGGAGTTAA ATGTCCTCAG
TGTAAAGAGG GAGAGCTTGT TCAAAGAAAA AGCCGTAAAG GGCGTACTTT TTACGGATGT
AGTTCTTACC CTGATTGCAA ATTTGTTGTC TGGAATAAAC CAGTTAAAGA AAAATGCCCT
GAATGTGGTG GCCTTATGGT AGAGAAGAAC TCAAAAAAGC AAGGCCGGTA TTATCTCTGT
ATTAACAAAG AGTGTGGTTA TAAGAAAGAA GTATAA
 
Protein sequence
MGNKSNNTLV IVESPAKAKT ISKFLGKGYK VEATMGHVID LPKSKLGINI DKGFEPRYIT 
IRGKGKVLKK LRKEVKKSKD VLLATDPDRE GEAISWHLTR ALKIDEDKPR IEFNEITKSA
IKNALKNRRP IDKNLVNSQQ ARRLLDRLVG YKLSPLLWKK VRRGLSAGRV QTVAVKLLCN
REKEIESFEP EEYWTISASF NKKDKDFIAD LYRISGKKFK INNEKEAKQI LEDLNKSKFV
VSDIKEKTRK RNPNPPFTTS TLQQRASSIL GFSAKKTMYL AQQLYEGIDM GSEGTTGLIS
YIRTDSTRIS REAQKQALDY IKEAFGDKYI PDKVKVYKAK EGSQDAHEAI RPTSVDRTPG
KVKKYLNKDQ YRLYKLIWER FVASQMSPAQ YKQVKVLIKA GDKYIFRAKG SRIIFPGFLR
VNTSSQKKDI ILPPVKKSER LDVKEIKPEQ HFTQPPPRYT EATLVKTLEE EGIGRPSTYA
PIISTIISRG YVERQGKQLK PTELGFIVTD LLSKYFPDVT DIEFTAHMEE RLDKIEDGKD
EWRNVLEDFY SNFSRRLKEA SEEMEEVKLE DEVTDEVCEK CGRNMVIKYG RYGKFLACSG
YPECKNTKPY VIKTGVKCPQ CKEGELVQRK SRKGRTFYGC SSYPDCKFVV WNKPVKEKCP
ECGGLMVEKN SKKQGRYYLC INKECGYKKE V