Gene HS_1310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1310 
SymboltopB 
ID4240821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1499439 
End bp1501337 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content38% 
IMG OID638104883 
ProductDNA topoisomerase III 
Protein accessionYP_719522 
Protein GI113461453 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00570993 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATTAT TTATAGCGGA AAAACCCAGC CTTGCTCGAG CTATTGCTGA CGTATTGCCT 
AAACCGCATC AACGTGGACA AGGCTTTGTC AAATGTGGCG AACAAGATTA TGTAACTTGG
TGTGTAGGAC ATTTATTGGA ACAAGCTGCT CCTGATGTTT ACAATCCAAT GTTCAAACAA
TGGCGTTTAG AACATTTACC TATTATTCCG CAAAAATGGC TATTACTTCC CAGACAAGAA
GTAAAAAGTC AGCTGGACAT TGTACTTAAA TTAATTCATC AAGCAGACAT TATAATTAAT
GCTGGGGATC CGGATCGAGA AGGGCAATTA TTAGTTGATG AAGTTTTCAG TTATGCCAAG
TTGCCTGCTA CAAAATTAGC CGAAATTCAA CGTTGTCTAA TTAACGATCT CAATCCAAGT
GCGGTAGAAA AAGCAATAAA AAAATTACAG CCTAACAAAA ATTTTATCCC CCTTGCTACA
TCAGCATTAG CACGTTCAAG AGCAGATTGG TTATATGGCA TTAATATGAC CCGAGCCTAC
ACTATTCGTG GTCGTCAATC AGGCTATAAC GGTGTTCTAT CCGTTGGCCG AGTGCAGACA
CCTGTCTTGG GATTAATTGT ACGTCGTGAT TTAGAAATTG AGCATTTTCA ACCGAAAGAT
TTTTTTGAGG TTCAAGCCTT TATTGCAACA AAAGAAAAAA TGCCCTCAAC ATTTACCGCA
CTTTGGCAAC CGAGTAAAGC CTGTGAAGAT TATCAAGATG AAGAAGGACG GGTGTTATCC
AACGCTTTAG CTAAAAATGT AGTAAAACGC ATTACCGCAC AGCCAGCAAC GGTGACAGAA
TATATTGATA AAAGAGAAAA AGAAACCGCC CCTTTGCCTT ATTCACTTTC TGCATTACAA
ATAGATGCAG CGAAACGTTA TGCCATGTCT GCACAAGAGG TTTTAGATGT TTGTCAAAAA
TTATACGAAA CCCATAAATT AATCACTTAC CCACGTTCCG ATTGTCGCTA TTTGCCTGAA
GAACATTTTT CCGCACGACA TACTATTTTA CGTGCAATTT CCACACACAG TATTTTATAT
AAAGAAATTC CCGATATAGT TAATACTGAG CTGAAAAATC GTTGTTGGAA CGATAAAAAA
GTAGAAGCAC ATCATGCCAT TATCCCAACG GCAAAAAATA AAACTGTGTC TTTAAGCCAA
AATGAACAAC AAATCTATGA TTTAATTGCT CGCCAATACT TAATGCAATT TTGTCAAGAG
GCTGAATATC GAAAAAGTAA AATTACATTA GATATTGCTG GGGGAACTTT TATTGCTCAA
GCGAGAAATT TACAAATTGC GGGCTGGAAA CAACTTTGGG GAAAAGAGGA TGAAGATGAA
CAACAAGAAC CTTTGCTTCC TATTGTGAAA AAAGGTGATG AATTATTTTG TGAAAAAGGT
GATGTAATTA GCAAAAAAAC ACAACCGCCA AAACCTTTTA CCGATGCAAC CTTACTTTCG
GCAATGACGG GAATCGCACG TTTTGTACAA AATAAAGAAT TGAAAAAAAT CTTACGTGAA
ACAGATGGTC TAGGTACGGA AGCAACAAGA GCAGGCATTA TTGAATTATT ATTTAAGCGA
GGATTCATCT ATAAAAAAGG GCGGAATATT CACAGTACAG AAACAGGCAG AACACTCATT
CAAGCCTTAC CAGAAATCGC AACTCAGCCG GATATGACCG CACATTGGGA AGCTCAACTG
GACAGCATTA GCCGCAGGCA AGGATCCTAT CAACAATTTA TGCAAACGCT CAGTGAACTT
TTACCTGAAT TATTAGGTTA CTTTAATTTT TCCGCACTGC GGAAACTAGG ACTTCAAACG
AAAGAGAATA AAAAAGCAGA TTTGAAAAAA AATGAGTAA
 
Protein sequence
MRLFIAEKPS LARAIADVLP KPHQRGQGFV KCGEQDYVTW CVGHLLEQAA PDVYNPMFKQ 
WRLEHLPIIP QKWLLLPRQE VKSQLDIVLK LIHQADIIIN AGDPDREGQL LVDEVFSYAK
LPATKLAEIQ RCLINDLNPS AVEKAIKKLQ PNKNFIPLAT SALARSRADW LYGINMTRAY
TIRGRQSGYN GVLSVGRVQT PVLGLIVRRD LEIEHFQPKD FFEVQAFIAT KEKMPSTFTA
LWQPSKACED YQDEEGRVLS NALAKNVVKR ITAQPATVTE YIDKREKETA PLPYSLSALQ
IDAAKRYAMS AQEVLDVCQK LYETHKLITY PRSDCRYLPE EHFSARHTIL RAISTHSILY
KEIPDIVNTE LKNRCWNDKK VEAHHAIIPT AKNKTVSLSQ NEQQIYDLIA RQYLMQFCQE
AEYRKSKITL DIAGGTFIAQ ARNLQIAGWK QLWGKEDEDE QQEPLLPIVK KGDELFCEKG
DVISKKTQPP KPFTDATLLS AMTGIARFVQ NKELKKILRE TDGLGTEATR AGIIELLFKR
GFIYKKGRNI HSTETGRTLI QALPEIATQP DMTAHWEAQL DSISRRQGSY QQFMQTLSEL
LPELLGYFNF SALRKLGLQT KENKKADLKK NE