Gene HS_0599 details


Gene Information

Locus tag: HS_0599
Symbol: parE
ID: 4240083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 640056
End bp: 641948
Gene Length: 1893 bp
Protein Length: 630 aa
Translation table: 11
GC content: 39%
IMG OID: 638104149
Product: DNA topoisomerase IV subunit B
Protein accession: YP_718811
Protein GI: 113460744
COG category: [L] Replication, recombination and repair
COG ID: [COG0187] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), B subunit
TIGRFAM ID: [TIGR01055] DNA topoisomerase IV, B subunit, proteobacterial
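
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below checks that relationship, assuming 1-based inclusive coordinates and a final codon that is a stop codon and is not counted in the protein length. Both are assumptions about how this record was generated, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 640056
end_bp = 641948

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1893
print(protein_length)   # 630
```

Under these assumptions the computed values agree with the listed Gene Length (1893 bp) and Protein Length (630 aa).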


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAATT ATTCAGCACA AGAAATAACC GTTTTAAAAG ATCTTGAACC CGTTCAACTT 
CGCCCCGGTA TGTACACGGA TACGTCTCGC CCTAATCATT TAGGGCAAGA AGTTATTGAT
AATAGTGTTG ATGAAGCACT TGCAGGGCAC GCAACCAAAA TTGAAGTTAT TTTGTATAAA
GACCAATCTC TAGAGGTGAT TGATAATGGG CGTGGAATGC CTGTCGATAT TCACCCGACA
GAAAAAGTAT CCGGTGTTGA AGTTATTTTA ACTAAACTTC ACGCTGGCGG AAAATTTTCC
AATAAGCACT ATGAATTTGC AGGTGGTTTG CATGGTGTGG GTATTTCTGT AGTAAATGCC
TTATCTGAGC GTGTAGATAT TCAGGTAAAA CGTAACGGTG AAATCTATAA AATTGCCTTC
GAAAACGGCG TTAAGGTAGA AGAATTAGAG ATTATCGGCA CTTGCGGAAA GCGTACAACC
GGGACCTCCG TTCACTTCAA ACCTAATCCG AAATATTTTG ATAGTGCTAA TTTTTCAGTC
AGTCGCTTAC GTCATTTATT GCGTGCTAAA GCGGTTTTAT GTTCAGGATT GGAAATTAAA
TTTATCGATA AAATCAATAA CACTGAGGAT ACTTGGTTAT ACCAAGACGG TTTATCTGAT
TATTTAATGG AAAATGTTAA TGGTTATAAC CTCTTGCCAC AAACCCCTTT TGTCGGTGAT
TTTACGACGA ATAAAGAAGC GGCAAGTTGG GCATTATTAT GGTTACCCGA AGGAGGCGAA
TTACTTAACG AAAGCTATGT CAACCTTATT CCAACCATTC AGGGAGGTAC TCATGTCAAT
GGCTTACGTC AAGGTTTGCT TGATGCTATG CGTGAATTTT GTGAGTTTCG TAACCTATTA
CCCAAAAATG TCAAATTAAC CGCTGACGAT ATTTGGGATC GCTGTGCTTA TGTGTTATCA
GTAAAAATGC ACGATGCACA ATTTGCAGGT CAAACTAAAG AACGACTTTC CTCTCGTCAA
AGTGCGGTTT TTGTTGGCGG CGTTGTAAAA GATAGTTTCA GTTTATGGCT AAATCAAAAT
ATTCAAGATG CCGAAGAACT TGCAAAAATG GTGATAAGTT CAGCGGAAAG ACGTTTACGT
GCAGCAAAAA AAGTTGTGCG TAAAAAATTA GTCAGTGGTC CTGCTTTACC CGGAAAATTA
GCGGATTGTA GTGAACAAGA TTTAAGTAAA ACGGAGCTTT TTTTAGTAGA AGGAGATTCT
GCCGGTGGCT CAGCCAAACA AGCTCGCAAC CGTGAACATC AAGCAATTCT CCCATTACGG
GGAAAAATTT TAAACACATG GGAAGTTTCA CCGGATCAAG TGTTACGCTC ACAAGAAGTT
CATGATATTG CTATTGCACT GGGAATTGAT CCTGACAACG AAGATTTATC TCAATTACGC
TATGGTAAAG TTTGTATATT AGCCGATGCC GACTCTGACG GATTACATAT TGCCACATTA
CTTTGTGCTT TATTTTTACG TCATTTTCCA AAATTAGTTC AACAGGGCCA TGTTTATGTC
GCAATGCCTC CACTTTATCG TATAGATTTA GGTAATGAAG TTTTTTATGC TCTTGATGAA
AATGAAAAAG AGAGCATTTT AGCACGCTTA AAAAATAAGA GAGGCAAACT CAATGTTCAG
CGTTTCAAAG GGCTAGGTGA AATGAATCCT AATCAGCTAC GTGAAACTAC AATGGATCCA
AACACTCGCC GCTTGGTGCA ATTAACCTAC AAACCCCACG AAGAAGATGT ATCCACGTTA
GAATTGATGG ATATGTTATT AGCTAAAAAA CGTTCCGAAG ATCGGAAAAT TTGGTTGCAA
AATAACGGTG ATCAAGTGGA TGTAAATGCT TAA
 
Protein sequence
MTNYSAQEIT VLKDLEPVQL RPGMYTDTSR PNHLGQEVID NSVDEALAGH ATKIEVILYK 
DQSLEVIDNG RGMPVDIHPT EKVSGVEVIL TKLHAGGKFS NKHYEFAGGL HGVGISVVNA
LSERVDIQVK RNGEIYKIAF ENGVKVEELE IIGTCGKRTT GTSVHFKPNP KYFDSANFSV
SRLRHLLRAK AVLCSGLEIK FIDKINNTED TWLYQDGLSD YLMENVNGYN LLPQTPFVGD
FTTNKEAASW ALLWLPEGGE LLNESYVNLI PTIQGGTHVN GLRQGLLDAM REFCEFRNLL
PKNVKLTADD IWDRCAYVLS VKMHDAQFAG QTKERLSSRQ SAVFVGGVVK DSFSLWLNQN
IQDAEELAKM VISSAERRLR AAKKVVRKKL VSGPALPGKL ADCSEQDLSK TELFLVEGDS
AGGSAKQARN REHQAILPLR GKILNTWEVS PDQVLRSQEV HDIAIALGID PDNEDLSQLR
YGKVCILADA DSDGLHIATL LCALFLRHFP KLVQQGHVYV AMPPLYRIDL GNEVFYALDE
NEKESILARL KNKRGKLNVQ RFKGLGEMNP NQLRETTMDP NTRRLVQLTY KPHEEDVSTL
ELMDMLLAKK RSEDRKIWLQ NNGDQVDVNA
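
As a small consistency check between the gene and protein sequences above, the sketch below translates the first and last rows of the gene sequence and compares them with the corresponding ends of the protein sequence. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits). The helper `translate` and the names `CODON_TABLE`, `first_block` and `last_block` are illustrative, not part of the record.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last rows of the gene sequence, copied from the record above.
first_block = "ATGACAAATT ATTCAGCACA AGAAATAACC GTTTTAAAAG ATCTTGAACC CGTTCAACTT"
last_block = "AATAACGGTG ATCAAGTGGA TGTAAATGCT TAA"

print(translate(first_block))  # MTNYSAQEITVLKDLEPVQL
print(translate(last_block))   # NNGDQVDVNA
```

The last row can be translated on its own because the preceding rows hold 60 bases each, a multiple of three, so the reading frame is preserved at the row boundary.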