Gene HS_1737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1737 
SymboltopA 
ID4241271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1966695 
End bp1968725 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content43% 
IMG OID638105330 
ProductDNA topoisomerase III 
Protein accessionYP_719942 
Protein GI113461873 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCT TTCTATGCGA AAAACCCAGT CAGGGACGGG ATATTGCAAA AGTGCTGGGG 
GCAACACAAA AAGGGGAGGG TTATTTATCA ACCGCTGATG GCACAATTGT TGTTACTTGG
GCGAGAGGGC ATTTAGTTGA GCATTTCTCG CCGGAACAAT ACGATCCCGC ACTTAAGGCA
TGGCGATTAG ATACTCTCCC GATTATCCCT TCCCAATGGC AAGTTTCCCC TAAACCTGAT
GCAAAAAAAG AATATAAAAC CGTGATGACA TTGCTGAAAA AAGCACGCAC GGTAGTGATT
TCAACTGATA TTGATCGAGA AGGGGAAACT ATTGCCCGTG AATTATTAGA CCTTGCCGGT
TATCGTGGAC ATATTCAGCG GCTGTGGATC ACCGCACTTG ATGAAATCAG TGTGCGTAAA
GCACTCAGCC GGCTTAAAAC CAATGAAGAA ACCTTGCCTC TTTATTATGC CGGGCTTGCT
CGTAGTCGGG CAGATTGGTT AATCGGAATG AATTTTAGTC GAATGTTTAC TCTTTTGGCA
CAACAACAAG GTTATCAAGG TCCGCCACTT TCGGTAGGGC GAGTGCAAAG CCCCACATTG
GCACTGGTGG TGAATCGAGA TAAGGAAATT GCTCATTTTG TTCCAACCTA TCATTATGCA
TTGATGGTAC AAGTCTGTGG AACCAATGCT CAGTCTTTTG AGGCAAGCCT TTTGGTTCCT
GAACAATATT GTGATGAAGA GGGGCGTTGT TTAGAGATAA AGGTGCTTCA GCAAGCGGAA
CAACAGATTC GTCAGGCGGG ACAAGTGTGT GTGCAGAATG TAGAAACGAA GCGAGAAAAA
GAAAGTCCGC CGTTACTTTT TGCATTAAGT GATTTGCAGG CGGAATGTAA TCGCCTCTAT
GATTTAGGTA CACAACAAGT GCTTGATATT GCACAATCCC TCTATGAAAA ACACAAAGCA
ACTACCTATC CGCGTACGGA TTGCGGTTAT CTTCCTGAAG CACAATTACT TGAAGTGCCT
CAGGTCATTA ATAGTCTGAT GCAGTCAGAT GCCAAGTTAC AGCCTTTGCG AGTACAGCTT
AATTTGAGCC AAAAATCCCG TGCGTGGAAT GATAAAAAAA TTACCGCTCA CCATGGGATT
ATTCCCACAA CACAGCCGGT TGATATTGCT AAAATGAACG AGAATGAGTT TAAAGTTTAT
GACTTAATTC GTCGGCGTTA TCTCGCACAA TTTCTACCGC ACTTTGAAGT GGATAAAACC
CGAGTTAGGT TATCTTGTGG AGAGCATCAA TGCATTGCAA AAGGTCAAGT GATTGTGAGT
GCAGGTTGGA AAGCCTTGTT TCGAGCGTCT AAAGAAGAGC AGGGAGAGAA ACAAGGTTTG
CCAATACTGA ATAGTGGGCA AATGTTGAAG GTGATGAATA CGGAAATAAA ATCACTTAAA
ACCACACCGC CTGCACACTA TACAGAAGGT ACACTGCTCA CGGCAATGGT GAATGTCGCC
CGTTTTGTTA CAGATGAACG GCTAAAGAAA CAATTACGTG AAACGGAAGG ACTGGGGACA
GAGGCTACTC GGGCAAGTAT TATGAAAACC CTTTATGACC GCGGTTATAT CAAGAAAAAA
GGGAAATCCA TTGTGGCGAC AGACGCAGGC GTGATGTTGA TTGATAATTT ACCGACGGTA
TTAAAAGATC CGGGGTTAAC GGCATTATGG GAACAAGCAT TAAATCAAAT TGCCGAGAAC
CAAATGAGCT TGCAGGAATT TATGCAAAAG CAAGAACAGT TTGTTCTCCA TCTTATTCAA
ACCTGTGGAC ATCAAGGTAT CAAAATGGGC AATATAGATA TTAAAAAATG CCCACAATGC
GGTAAGCCAC TTCGAAAAAT GAAAGGGAAA AATGGAGATT TCCTAGGTTG TACGGGCTAT
CCGAAATGCA AATATATTGA ATCCGGTGCG AAAAGAAAAA CATCTGCCGG AAAAGTTACC
CCTATCAACC TTTCTCAACA ATTTGCAAAT TTGCGTCAAG CCGTCAAATA A
 
Protein sequence
MKLFLCEKPS QGRDIAKVLG ATQKGEGYLS TADGTIVVTW ARGHLVEHFS PEQYDPALKA 
WRLDTLPIIP SQWQVSPKPD AKKEYKTVMT LLKKARTVVI STDIDREGET IARELLDLAG
YRGHIQRLWI TALDEISVRK ALSRLKTNEE TLPLYYAGLA RSRADWLIGM NFSRMFTLLA
QQQGYQGPPL SVGRVQSPTL ALVVNRDKEI AHFVPTYHYA LMVQVCGTNA QSFEASLLVP
EQYCDEEGRC LEIKVLQQAE QQIRQAGQVC VQNVETKREK ESPPLLFALS DLQAECNRLY
DLGTQQVLDI AQSLYEKHKA TTYPRTDCGY LPEAQLLEVP QVINSLMQSD AKLQPLRVQL
NLSQKSRAWN DKKITAHHGI IPTTQPVDIA KMNENEFKVY DLIRRRYLAQ FLPHFEVDKT
RVRLSCGEHQ CIAKGQVIVS AGWKALFRAS KEEQGEKQGL PILNSGQMLK VMNTEIKSLK
TTPPAHYTEG TLLTAMVNVA RFVTDERLKK QLRETEGLGT EATRASIMKT LYDRGYIKKK
GKSIVATDAG VMLIDNLPTV LKDPGLTALW EQALNQIAEN QMSLQEFMQK QEQFVLHLIQ
TCGHQGIKMG NIDIKKCPQC GKPLRKMKGK NGDFLGCTGY PKCKYIESGA KRKTSAGKVT
PINLSQQFAN LRQAVK