Gene HS_1516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1516 
SymbolhepA 
ID4241036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1710458 
End bp1713358 
Gene Length2901 bp 
Protein Length966 aa 
Translation table11 
GC content39% 
IMG OID638105097 
ProductATP-dependent helicase HepA 
Protein accessionYP_719726 
Protein GI113461657 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTATTTG CAATTGGTCA GCGTTGGATA AGTGAAAGTG AAAATAGTCT TGGTTTGGGT 
ATTATTACGG GACAAGATAA TCGTACCGTT ACCATTTCTT TTCCCGCCTC GGATGAAACA
CGTATTTATG CTTTAGCAAG TGCACCTTTA ACTCGAGTGT TGTTTCAAAA AGGTGATGAA
ATTACACATC AATTAGGTTG GAAAGCAAGA GTTGTTGATG TAATGATGCG TAATGAGTTG
GCTTTTTATC TTGTGCAGCG ACTTGATAAT AACGAAGAGA TTGTTGTACA AGAAATGGAA
CTAGCTCACC AAATTTCTTT TAGTAAACCG CAGGATCGCT TATTTAGTAC ACAAATTGAC
CGCAATGAGC ATTTTGTATT GCGTTACAAG GCACTAAAAC ACCAACAAGA ACAATTTCAG
TCCTCTTTAA GAGGATTAAG AGGAAATCGA GCTGGTCTGA TTCCTCATCA ATTACATATT
GCACAAGAGG TAGGGCGACG TATCGCACCA AGGGTATTAT TAGCCGATGA AGTCGGTTTG
GGGAAAACGA TTGAAGCGGG TATGATTTTA CAGCAACAAT TGCTTGCTGA AAAAGTTCAG
CGAGTGCTAA TTTTGGTGCC TGAAACTTTA CAGCATCAGT GGCTTGTTGA AATGCTACGT
CGTTTTAACT TGCATTTCTC TTTATTTGAT GAAGAACGGT GTGCTGATTT TGATAATCCG
GAAGATCATG TTGAAGCAAA TCCTTTTGTT GCTGAAAACC TAATTATTTG TGCTTTAGAT
TGGCTTGTTC AACAACCGAA ACGTGCAAAA CAAGCCTTGG CGGGTGAGTT TGATTTATTG
ATTGTTGATG AGGCTCATCA TCTCACTTGG TCGGAAGATT CGCCAAGCAT AGCTTATGAA
TTGGTTATGC AATTAAGTGC GGTGATTCCT GCCGTATTAT TATTAACGGC GACACCGGAG
CAATTGGGGC AACAAAGTCA TTTTGCCCGT TTGCATTTAT TGGATCCTAA TCGTTTTTAT
AGTTATCGGG CTTTTGAAAA AGAACAGCAA CAATATCAAC CTGTTGCTAA AGCAGTACAA
AGTCTGTTAT CAGAGCATAT TTTAACAGTT GAAGAACAAA ATCATTTAGC AGAATTACTT
AGCGAGCAAG ATATCGAGCC AATGCTCAAA GTGATTAATT CTCAAGCTGA TACGGAACAA
AAACAGGTTG CTCGAGGAGA ATTAATGAGT AATCTAATTG ATCGTCATGG AACCGGTCGC
TTGTTATTTC GCAATACAAG ACAAGGTGTG CAAGGGTTTC CTCATCGTAT TTATCATCAA
ATCAAATTAG ATTTACCAAC ACAATATCAA AATGCGATTA ATGTGTTGAA TATGTTGGGG
GAAATAAAAG ATCCAGAGTT ATTTTATCCT GAGCAAATTT TCCAAAAAAT GAATGCGGAT
GCAACTTGGT GGCGTTTTGA TCCTAGAGTG GATTGGTTAA TTAATTTAGT TAAGAGTTTA
CGTGAAGAAA AAATCTTAGT GATTTGTCAA GATGCGATAA CAGCCATACA ATTGGAGCAG
GCATTGCGAG AGAAGGAAGG AATTCGTAGT GCGGTATTTC ATGAGAATAT GTCGATTATT
GAACGAGATC GTGCCTCTGC ATATTTTGCA CAGCAAGAAG AGGGTGCACA GGTATTACTA
AGTTCTTCAA TCGGTTCGGA AGGTCGAAAT TTTCAGTTTG CTTGTCATTT AGTGTTATTC
CATTTGCCGA ATAATCCCGA TTTATTGGAA CAATGTATTG GGCGGTTGGA TCGTATAGGT
CAACGTCGAG ATATTCAAAT TTATGTCCCT TGTTTTGCTG ATACACCGCA AATTCGCTTG
GCTCAATGGT ACCATGAAGG TTTGAACGCA TTTGAGGAAA CCTGTCCAAT GGGGGCTATT
TTGCATGAAA AGTGCGGTGC AAAATTGACA GAATTTTTAA CTTCAGACAC AACGGACGAT
TTTCAGGCTT TTATTCAACA GACTCATCAG CAACAATTAC AGCTAAAATC CGAATTGGAG
CAAGGTCGAG ATCGTTTATT GGAGCTGAAC TCAAATGGCG GTGAGCAGGC TCAACAATTA
GCGAATGACA TTGCGATACA AGACGGTTCA ACAGAACTCA TTGATTTCAC ATTGAACTTA
TTCGATATTA TTGGTGTTGA GCAAGAAGAT CTTGGCGAAA AATCTATTGT CATTAGCCCG
ACAGGAACGA TGCTTGTCCC TGATTTTCCG GGTTTAAAAG AAGAAGGGGT TACTGTTACT
TTTGATCGTG ATTTAGCGTT GGCTCGTGAA GATCTTGAGT TTCTTACTTG GGATCACCCA
ATTGTGCGTA ATGGGATTGA TTTAATCGTC AGTGGTGATA TAGGTAAAAG TGCGGTGGCT
TTATTGGTAA ATAAACAATT ACCGACCGGT ACATTGCTAC TTGAATTAGT TTATATAATT
GAGAGCCAAT CACCACGTGG ATTACAGTTG ACCCGCTTCT TACCGCCAAC ACCGCTACGT
TTACTTCTTG ATATTAAAGG GAATGATTTA AGTCATCAAA TTTCTTTTCA GGGCTTGCAA
AAACAACTGA AACCAATGGG TAAAAATATG GCAACAAAAG TCATTAAAAT AATGCGTCCA
GCGATTGAGC AATTAATTAA GCAAAGTGCA AAAAATGTAG TTGAGCCAGC TAAAATGATA
ATTGAACAAG CTAAACAACT GGCGGATCAA TCATTGAGTG CGGAAATAAA TCGATTATAT
GCGTTGCAGG CGGTCAATAA AAATATTCGT CCGGAGGAAA TTGAACAGTT AGAAAGTCAG
CGTACGTTAT CTCTTGAATT GCTTAATCAA GCAAATTGGC GTTTAGACAG TTTACGGGTA
ATTGTGAGTA ATAAGGAATA A
 
Protein sequence
MLFAIGQRWI SESENSLGLG IITGQDNRTV TISFPASDET RIYALASAPL TRVLFQKGDE 
ITHQLGWKAR VVDVMMRNEL AFYLVQRLDN NEEIVVQEME LAHQISFSKP QDRLFSTQID
RNEHFVLRYK ALKHQQEQFQ SSLRGLRGNR AGLIPHQLHI AQEVGRRIAP RVLLADEVGL
GKTIEAGMIL QQQLLAEKVQ RVLILVPETL QHQWLVEMLR RFNLHFSLFD EERCADFDNP
EDHVEANPFV AENLIICALD WLVQQPKRAK QALAGEFDLL IVDEAHHLTW SEDSPSIAYE
LVMQLSAVIP AVLLLTATPE QLGQQSHFAR LHLLDPNRFY SYRAFEKEQQ QYQPVAKAVQ
SLLSEHILTV EEQNHLAELL SEQDIEPMLK VINSQADTEQ KQVARGELMS NLIDRHGTGR
LLFRNTRQGV QGFPHRIYHQ IKLDLPTQYQ NAINVLNMLG EIKDPELFYP EQIFQKMNAD
ATWWRFDPRV DWLINLVKSL REEKILVICQ DAITAIQLEQ ALREKEGIRS AVFHENMSII
ERDRASAYFA QQEEGAQVLL SSSIGSEGRN FQFACHLVLF HLPNNPDLLE QCIGRLDRIG
QRRDIQIYVP CFADTPQIRL AQWYHEGLNA FEETCPMGAI LHEKCGAKLT EFLTSDTTDD
FQAFIQQTHQ QQLQLKSELE QGRDRLLELN SNGGEQAQQL ANDIAIQDGS TELIDFTLNL
FDIIGVEQED LGEKSIVISP TGTMLVPDFP GLKEEGVTVT FDRDLALARE DLEFLTWDHP
IVRNGIDLIV SGDIGKSAVA LLVNKQLPTG TLLLELVYII ESQSPRGLQL TRFLPPTPLR
LLLDIKGNDL SHQISFQGLQ KQLKPMGKNM ATKVIKIMRP AIEQLIKQSA KNVVEPAKMI
IEQAKQLADQ SLSAEINRLY ALQAVNKNIR PEEIEQLESQ RTLSLELLNQ ANWRLDSLRV
IVSNKE