Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1310 |
Symbol | topB |
ID | 4240821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1499439 |
End bp | 1501337 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638104883 |
Product | DNA topoisomerase III |
Protein accession | YP_719522 |
Protein GI | 113461453 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00570993 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATTAT TTATAGCGGA AAAACCCAGC CTTGCTCGAG CTATTGCTGA CGTATTGCCT AAACCGCATC AACGTGGACA AGGCTTTGTC AAATGTGGCG AACAAGATTA TGTAACTTGG TGTGTAGGAC ATTTATTGGA ACAAGCTGCT CCTGATGTTT ACAATCCAAT GTTCAAACAA TGGCGTTTAG AACATTTACC TATTATTCCG CAAAAATGGC TATTACTTCC CAGACAAGAA GTAAAAAGTC AGCTGGACAT TGTACTTAAA TTAATTCATC AAGCAGACAT TATAATTAAT GCTGGGGATC CGGATCGAGA AGGGCAATTA TTAGTTGATG AAGTTTTCAG TTATGCCAAG TTGCCTGCTA CAAAATTAGC CGAAATTCAA CGTTGTCTAA TTAACGATCT CAATCCAAGT GCGGTAGAAA AAGCAATAAA AAAATTACAG CCTAACAAAA ATTTTATCCC CCTTGCTACA TCAGCATTAG CACGTTCAAG AGCAGATTGG TTATATGGCA TTAATATGAC CCGAGCCTAC ACTATTCGTG GTCGTCAATC AGGCTATAAC GGTGTTCTAT CCGTTGGCCG AGTGCAGACA CCTGTCTTGG GATTAATTGT ACGTCGTGAT TTAGAAATTG AGCATTTTCA ACCGAAAGAT TTTTTTGAGG TTCAAGCCTT TATTGCAACA AAAGAAAAAA TGCCCTCAAC ATTTACCGCA CTTTGGCAAC CGAGTAAAGC CTGTGAAGAT TATCAAGATG AAGAAGGACG GGTGTTATCC AACGCTTTAG CTAAAAATGT AGTAAAACGC ATTACCGCAC AGCCAGCAAC GGTGACAGAA TATATTGATA AAAGAGAAAA AGAAACCGCC CCTTTGCCTT ATTCACTTTC TGCATTACAA ATAGATGCAG CGAAACGTTA TGCCATGTCT GCACAAGAGG TTTTAGATGT TTGTCAAAAA TTATACGAAA CCCATAAATT AATCACTTAC CCACGTTCCG ATTGTCGCTA TTTGCCTGAA GAACATTTTT CCGCACGACA TACTATTTTA CGTGCAATTT CCACACACAG TATTTTATAT AAAGAAATTC CCGATATAGT TAATACTGAG CTGAAAAATC GTTGTTGGAA CGATAAAAAA GTAGAAGCAC ATCATGCCAT TATCCCAACG GCAAAAAATA AAACTGTGTC TTTAAGCCAA AATGAACAAC AAATCTATGA TTTAATTGCT CGCCAATACT TAATGCAATT TTGTCAAGAG GCTGAATATC GAAAAAGTAA AATTACATTA GATATTGCTG GGGGAACTTT TATTGCTCAA GCGAGAAATT TACAAATTGC GGGCTGGAAA CAACTTTGGG GAAAAGAGGA TGAAGATGAA CAACAAGAAC CTTTGCTTCC TATTGTGAAA AAAGGTGATG AATTATTTTG TGAAAAAGGT GATGTAATTA GCAAAAAAAC ACAACCGCCA AAACCTTTTA CCGATGCAAC CTTACTTTCG GCAATGACGG GAATCGCACG TTTTGTACAA AATAAAGAAT TGAAAAAAAT CTTACGTGAA ACAGATGGTC TAGGTACGGA AGCAACAAGA GCAGGCATTA TTGAATTATT ATTTAAGCGA GGATTCATCT ATAAAAAAGG GCGGAATATT CACAGTACAG AAACAGGCAG AACACTCATT CAAGCCTTAC CAGAAATCGC AACTCAGCCG GATATGACCG CACATTGGGA AGCTCAACTG GACAGCATTA GCCGCAGGCA AGGATCCTAT CAACAATTTA TGCAAACGCT CAGTGAACTT TTACCTGAAT TATTAGGTTA CTTTAATTTT TCCGCACTGC GGAAACTAGG ACTTCAAACG AAAGAGAATA AAAAAGCAGA TTTGAAAAAA AATGAGTAA
|
Protein sequence | MRLFIAEKPS LARAIADVLP KPHQRGQGFV KCGEQDYVTW CVGHLLEQAA PDVYNPMFKQ WRLEHLPIIP QKWLLLPRQE VKSQLDIVLK LIHQADIIIN AGDPDREGQL LVDEVFSYAK LPATKLAEIQ RCLINDLNPS AVEKAIKKLQ PNKNFIPLAT SALARSRADW LYGINMTRAY TIRGRQSGYN GVLSVGRVQT PVLGLIVRRD LEIEHFQPKD FFEVQAFIAT KEKMPSTFTA LWQPSKACED YQDEEGRVLS NALAKNVVKR ITAQPATVTE YIDKREKETA PLPYSLSALQ IDAAKRYAMS AQEVLDVCQK LYETHKLITY PRSDCRYLPE EHFSARHTIL RAISTHSILY KEIPDIVNTE LKNRCWNDKK VEAHHAIIPT AKNKTVSLSQ NEQQIYDLIA RQYLMQFCQE AEYRKSKITL DIAGGTFIAQ ARNLQIAGWK QLWGKEDEDE QQEPLLPIVK KGDELFCEKG DVISKKTQPP KPFTDATLLS AMTGIARFVQ NKELKKILRE TDGLGTEATR AGIIELLFKR GFIYKKGRNI HSTETGRTLI QALPEIATQP DMTAHWEAQL DSISRRQGSY QQFMQTLSEL LPELLGYFNF SALRKLGLQT KENKKADLKK NE
|
| |