Gene HS_0559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0559 
SymbolhsdR 
ID4240042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp593223 
End bp596423 
Gene Length3201 bp 
Protein Length1066 aa 
Translation table11 
GC content40% 
IMG OID638104108 
Producttype I site-specific deoxyribonuclease, restriction subunit 
Protein accessionYP_718770 
Protein GI113460704 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.193671 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAAGA AAATACCCGA ATTAAAATTT GAAAAAACCT TAATTGAATA TCTTTGTACA 
GGAACAATTA CGGCAACGGA TAATATGATT GGCGAGCCGC CTGCGGATTA TGTAGTGCGT
ACCAAATTGT GGCAATATGA GCCAAGCATT AAAACGACGG AAGATTTATG GAAAAATTTT
AAAAGAATTT TAGAAAAGCT CAATCAAAAA ACCTTAGATC GTCCTTTAAG TGCGGTGGAA
TTTGCACAAG TTAAACGTAC TATTTCTAAT TTATCCACGC CTTATGCCGC AGGGCAGTTT
CTTTATGGCA CAAACGGTGT GTCACAAATT GAAATTGATT TAGATGATGA TCGTCATGTG
TTTTTAACCG TATTTGACCA AAAACAAATC GGTGCGGGCG ATACGGTGTA CCAAATTGTG
AATCAAATTA AACGACCGGC GATTATTGCA GGAAAAGAAA ATCGACAATT TGATACTACG
TTGTTGATTA ATGGCTTGCC CATTATTCAA ATTGAAGAAA AGAAAGAAAC GCATGATATT
AACGAAGCCT TAAATCAAAT GCACCAATAT ATCGCGGAAA ATCAATACAG CGATATTTTT
TCTACGTTAC AAATTCTGAT TGCGATTACG CCAAACAATG TGAAATATAT GGCGAATACC
ACAGCAGATC GCTTTAATAA AGATTTTGCC TTCAACTGGC AACGTCAATC GGACAATAAA
CCGGTTTATC ACTGGAAAGC CTTTGCCGAT TCCATGTTGA GTATTCCCAT GGCACATCAA
ATGGCGACTA ACTATATGAT TTTAGATGGT ACGCCGAATA AGCAAACCCT AAAAGTAATG
CGACCTTATC AAGTTTATGC TACGCGTTGT GTGATGGATA AACTCAAACG GGCGGATTTC
GATCAAGCGT TGAATAAACT CGGTTATATT TGGCATACCA CAGGTTCAGG CAAAACTATT
ACGAGCTTTA AAACCGCTTG GCTGGCAAGT CGTTCGCCGA AAGTGGATAA AGTGGTTTTT
TTAGTGGATC GTATTGCACT GACTAACCAA ACTACCGAAA ACTATCGAGC TTATGATCCT
GATGCCACAG AAGATAGTCT AGGTAGTGTA CTGGATACCA ACAACACCAC AGATTTGAAC
CGCAAATTAC ACAGCAAAGG CAGTCAAATT ATTATTACCT CTGTGCAAAA ACTGGATACC
TTGGTTAAAC GCACTTCTTT CAAGGCACCC GAAAAAAACA TTGTGTTTAT TGTGGATGAA
GCTCATCGTT CTACAGGCAG TGATAATTTT GCTGCGATTC AAAAAAAATT TAAACATTCT
GCGTGGATTG GTTACACCGG CACGCCAATG TTTGATGACA CTACCAAAGG CTTACGAACT
GAAGATATTT TCGGTGAACT CTTGCACGCC TATACCATTC GAGAAGCAAT TGCTGATCGT
AATGTGTTGG GTTTTAAAGT GGATTTTGAA ACCACGATTA GCGAAAAAAG TATGAAAGAA
GCACATCTAC CGAAATTCTA TCGCCTTAAA TATCCTCATT GGACGGAAGA ACAAATTGCC
TTGAAGATTG CCAATCTCAC GCAACAAGAT ATGGACGACA GCATCGAACC GAGTTTTTAT
GACGAAAATG AAGAACACGT CAAATTAGTG GTGGAAGATA TTTTCAAACA TTGGAAAAAT
CGCTCCAATT GGGATAACAA ACGTAATCGT GGACGATATA ACGCCTTGCT GACCACGCAC
GTGGGTGGCG GCAAAGCCAG TACACCAATG GCAATGATGT ATTTCCGTGA ATTCCAACGC
ATCAATAAAG TGCGGTCAGA AAAGGGGGAG TTTTTGCTAA AAGTTGCTGT GACCTTTAGC
CAAAATACCA GCAATAACGA CACCATGCTT GAAACCAACC AAGGATTATT CGAGGCAATT
CAACATTACA ACAGCGAATT TGGCACTAAC TTCACTATGT CTGATGTATC GGCTTATACT
CAAGATGTCA CCAGCCGTTT AAATCGCACC GCTGTGGACA AAAAATATTT AGACCTCGTG
ATTGTCGTAG ATCAGTTACT AACAGGATTT GACGCCCCTG AATTAAATAC GCTTTATGTT
GACCGCACTT TAAAAGGTGC AGGGCTTATT CAAGCCTATT CCAGAACCAA TCGTGTAGCG
GATATGCGTG AAAAACCTTG GGGGCGGATT ATCAATTACC GCTGGCCTGC CCATAATGAA
AAGTTAATGA ATGAGGCGTT GTCCGTTTAC GCCAACAAAG ATTCCGCCAA ATTATCCGAT
GAAGAGCGAA AAGTGAATAA TACCAAGGAT AAGATTACTG CACCAAAATA CGAAGAATTG
CTCGATGACA CCAAACAGAT TGTGGAAAAT CTGCGAGAAA TAACGGATAA CTTTAGCCAA
TTGCCACCGT CTGAAGCGAA AAAATGGGAA ATGTTGGAAC AATTACGCAC CTATACCGCC
AATATTGCCA AAATCAAGCA ATATTCTGCG GAAGATGAAA ATGGTGAATT TATCGGTTTT
GATTATGAAA ATCCGGATAA ATTGATTCAA GAACTCGGCA TGACCAGTGA AGAAGAAAGT
ATGCTCACCA ACACATTGTC CAACGAACTC AAACTGCATT TAGCAAAAAG CCGAAATGTG
CCAGTGATAC AAATTGAGCT ACAAATGACC CACGTTAAAG ACGTGGAAAT TAATTACGAT
TACTTAACTG AATTAATTTC CAGCTTGTTG GCACAAGTAG AGGCAGGTGA AACGGAAAAA
GCCAAAGAAA CGGAAAAAGA ATTGGAAAAA TTTGCTAACG GCTTGGAAGA TCGCAGTTTT
GCACAACGCA TTAACCGTGC GGCAAAAGCG ATTATGAAAG GTGTGTTCAA ATTAAAAGAT
GGGTTTAAAA AAGACGGCAT TTCTATTGTG GAACAAGCGA ATAACGCCAC CCAAGATCAA
CAATTTTTGG ATTTTCGTCT CCAATGGGGA TTAACGGAGG TGATTACCAA CGCCCAACTT
CGACAGCTTT TTGCCAAACA TCAATTTCAA CAAAAAGATT TAGATGACAA TAATCAGCTT
ACGGATTTAA TCAAACAAGC AGGTCTCACT TATCCCGAAC TTGCACAAAA TGATGAAGTA
AAAAAACTGA CAAAAATGAA ATACCGCAAC CGGTTGCGTC ATACCATTTA CGAACTTGCA
GATGAGTTTG TGGGGGAATA G
 
Protein sequence
MTKKIPELKF EKTLIEYLCT GTITATDNMI GEPPADYVVR TKLWQYEPSI KTTEDLWKNF 
KRILEKLNQK TLDRPLSAVE FAQVKRTISN LSTPYAAGQF LYGTNGVSQI EIDLDDDRHV
FLTVFDQKQI GAGDTVYQIV NQIKRPAIIA GKENRQFDTT LLINGLPIIQ IEEKKETHDI
NEALNQMHQY IAENQYSDIF STLQILIAIT PNNVKYMANT TADRFNKDFA FNWQRQSDNK
PVYHWKAFAD SMLSIPMAHQ MATNYMILDG TPNKQTLKVM RPYQVYATRC VMDKLKRADF
DQALNKLGYI WHTTGSGKTI TSFKTAWLAS RSPKVDKVVF LVDRIALTNQ TTENYRAYDP
DATEDSLGSV LDTNNTTDLN RKLHSKGSQI IITSVQKLDT LVKRTSFKAP EKNIVFIVDE
AHRSTGSDNF AAIQKKFKHS AWIGYTGTPM FDDTTKGLRT EDIFGELLHA YTIREAIADR
NVLGFKVDFE TTISEKSMKE AHLPKFYRLK YPHWTEEQIA LKIANLTQQD MDDSIEPSFY
DENEEHVKLV VEDIFKHWKN RSNWDNKRNR GRYNALLTTH VGGGKASTPM AMMYFREFQR
INKVRSEKGE FLLKVAVTFS QNTSNNDTML ETNQGLFEAI QHYNSEFGTN FTMSDVSAYT
QDVTSRLNRT AVDKKYLDLV IVVDQLLTGF DAPELNTLYV DRTLKGAGLI QAYSRTNRVA
DMREKPWGRI INYRWPAHNE KLMNEALSVY ANKDSAKLSD EERKVNNTKD KITAPKYEEL
LDDTKQIVEN LREITDNFSQ LPPSEAKKWE MLEQLRTYTA NIAKIKQYSA EDENGEFIGF
DYENPDKLIQ ELGMTSEEES MLTNTLSNEL KLHLAKSRNV PVIQIELQMT HVKDVEINYD
YLTELISSLL AQVEAGETEK AKETEKELEK FANGLEDRSF AQRINRAAKA IMKGVFKLKD
GFKKDGISIV EQANNATQDQ QFLDFRLQWG LTEVITNAQL RQLFAKHQFQ QKDLDDNNQL
TDLIKQAGLT YPELAQNDEV KKLTKMKYRN RLRHTIYELA DEFVGE