Gene Information |
Locus tag | HS_1516 |
Symbol | hepA |
ID | 4241036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 1710458 |
End bp | 1713358 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638105097 |
Product | ATP-dependent helicase HepA |
Protein accession | YP_719726 |
Protein GI | 113461657 |
COG category | [K] Transcription; [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
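The length fields above are mutually consistent, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Coordinates taken from the record for HS_1516 (hepA).
length = gene_length(1710458, 1713358)
print(length)                  # 2901
print(protein_length(length))  # 966
```

Both values match the Gene Length (2901 bp) and Protein Length (966 aa) rows of the record.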
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTATTTG CAATTGGTCA GCGTTGGATA AGTGAAAGTG AAAATAGTCT TGGTTTGGGT ATTATTACGG GACAAGATAA TCGTACCGTT ACCATTTCTT TTCCCGCCTC GGATGAAACA CGTATTTATG CTTTAGCAAG TGCACCTTTA ACTCGAGTGT TGTTTCAAAA AGGTGATGAA ATTACACATC AATTAGGTTG GAAAGCAAGA GTTGTTGATG TAATGATGCG TAATGAGTTG GCTTTTTATC TTGTGCAGCG ACTTGATAAT AACGAAGAGA TTGTTGTACA AGAAATGGAA CTAGCTCACC AAATTTCTTT TAGTAAACCG CAGGATCGCT TATTTAGTAC ACAAATTGAC CGCAATGAGC ATTTTGTATT GCGTTACAAG GCACTAAAAC ACCAACAAGA ACAATTTCAG TCCTCTTTAA GAGGATTAAG AGGAAATCGA GCTGGTCTGA TTCCTCATCA ATTACATATT GCACAAGAGG TAGGGCGACG TATCGCACCA AGGGTATTAT TAGCCGATGA AGTCGGTTTG GGGAAAACGA TTGAAGCGGG TATGATTTTA CAGCAACAAT TGCTTGCTGA AAAAGTTCAG CGAGTGCTAA TTTTGGTGCC TGAAACTTTA CAGCATCAGT GGCTTGTTGA AATGCTACGT CGTTTTAACT TGCATTTCTC TTTATTTGAT GAAGAACGGT GTGCTGATTT TGATAATCCG GAAGATCATG TTGAAGCAAA TCCTTTTGTT GCTGAAAACC TAATTATTTG TGCTTTAGAT TGGCTTGTTC AACAACCGAA ACGTGCAAAA CAAGCCTTGG CGGGTGAGTT TGATTTATTG ATTGTTGATG AGGCTCATCA TCTCACTTGG TCGGAAGATT CGCCAAGCAT AGCTTATGAA TTGGTTATGC AATTAAGTGC GGTGATTCCT GCCGTATTAT TATTAACGGC GACACCGGAG CAATTGGGGC AACAAAGTCA TTTTGCCCGT TTGCATTTAT TGGATCCTAA TCGTTTTTAT AGTTATCGGG CTTTTGAAAA AGAACAGCAA CAATATCAAC CTGTTGCTAA AGCAGTACAA AGTCTGTTAT CAGAGCATAT TTTAACAGTT GAAGAACAAA ATCATTTAGC AGAATTACTT AGCGAGCAAG ATATCGAGCC AATGCTCAAA GTGATTAATT CTCAAGCTGA TACGGAACAA AAACAGGTTG CTCGAGGAGA ATTAATGAGT AATCTAATTG ATCGTCATGG AACCGGTCGC TTGTTATTTC GCAATACAAG ACAAGGTGTG CAAGGGTTTC CTCATCGTAT TTATCATCAA ATCAAATTAG ATTTACCAAC ACAATATCAA AATGCGATTA ATGTGTTGAA TATGTTGGGG GAAATAAAAG ATCCAGAGTT ATTTTATCCT GAGCAAATTT TCCAAAAAAT GAATGCGGAT GCAACTTGGT GGCGTTTTGA TCCTAGAGTG GATTGGTTAA TTAATTTAGT TAAGAGTTTA CGTGAAGAAA AAATCTTAGT GATTTGTCAA GATGCGATAA CAGCCATACA ATTGGAGCAG GCATTGCGAG AGAAGGAAGG AATTCGTAGT GCGGTATTTC ATGAGAATAT GTCGATTATT GAACGAGATC GTGCCTCTGC ATATTTTGCA CAGCAAGAAG AGGGTGCACA GGTATTACTA AGTTCTTCAA TCGGTTCGGA AGGTCGAAAT TTTCAGTTTG CTTGTCATTT AGTGTTATTC CATTTGCCGA ATAATCCCGA TTTATTGGAA CAATGTATTG GGCGGTTGGA TCGTATAGGT 
CAACGTCGAG ATATTCAAAT TTATGTCCCT TGTTTTGCTG ATACACCGCA AATTCGCTTG GCTCAATGGT ACCATGAAGG TTTGAACGCA TTTGAGGAAA CCTGTCCAAT GGGGGCTATT TTGCATGAAA AGTGCGGTGC AAAATTGACA GAATTTTTAA CTTCAGACAC AACGGACGAT TTTCAGGCTT TTATTCAACA GACTCATCAG CAACAATTAC AGCTAAAATC CGAATTGGAG CAAGGTCGAG ATCGTTTATT GGAGCTGAAC TCAAATGGCG GTGAGCAGGC TCAACAATTA GCGAATGACA TTGCGATACA AGACGGTTCA ACAGAACTCA TTGATTTCAC ATTGAACTTA TTCGATATTA TTGGTGTTGA GCAAGAAGAT CTTGGCGAAA AATCTATTGT CATTAGCCCG ACAGGAACGA TGCTTGTCCC TGATTTTCCG GGTTTAAAAG AAGAAGGGGT TACTGTTACT TTTGATCGTG ATTTAGCGTT GGCTCGTGAA GATCTTGAGT TTCTTACTTG GGATCACCCA ATTGTGCGTA ATGGGATTGA TTTAATCGTC AGTGGTGATA TAGGTAAAAG TGCGGTGGCT TTATTGGTAA ATAAACAATT ACCGACCGGT ACATTGCTAC TTGAATTAGT TTATATAATT GAGAGCCAAT CACCACGTGG ATTACAGTTG ACCCGCTTCT TACCGCCAAC ACCGCTACGT TTACTTCTTG ATATTAAAGG GAATGATTTA AGTCATCAAA TTTCTTTTCA GGGCTTGCAA AAACAACTGA AACCAATGGG TAAAAATATG GCAACAAAAG TCATTAAAAT AATGCGTCCA GCGATTGAGC AATTAATTAA GCAAAGTGCA AAAAATGTAG TTGAGCCAGC TAAAATGATA ATTGAACAAG CTAAACAACT GGCGGATCAA TCATTGAGTG CGGAAATAAA TCGATTATAT GCGTTGCAGG CGGTCAATAA AAATATTCGT CCGGAGGAAA TTGAACAGTT AGAAAGTCAG CGTACGTTAT CTCTTGAATT GCTTAATCAA GCAAATTGGC GTTTAGACAG TTTACGGGTA ATTGTGAGTA ATAAGGAATA A
|
Protein sequence | MLFAIGQRWI SESENSLGLG IITGQDNRTV TISFPASDET RIYALASAPL TRVLFQKGDE ITHQLGWKAR VVDVMMRNEL AFYLVQRLDN NEEIVVQEME LAHQISFSKP QDRLFSTQID RNEHFVLRYK ALKHQQEQFQ SSLRGLRGNR AGLIPHQLHI AQEVGRRIAP RVLLADEVGL GKTIEAGMIL QQQLLAEKVQ RVLILVPETL QHQWLVEMLR RFNLHFSLFD EERCADFDNP EDHVEANPFV AENLIICALD WLVQQPKRAK QALAGEFDLL IVDEAHHLTW SEDSPSIAYE LVMQLSAVIP AVLLLTATPE QLGQQSHFAR LHLLDPNRFY SYRAFEKEQQ QYQPVAKAVQ SLLSEHILTV EEQNHLAELL SEQDIEPMLK VINSQADTEQ KQVARGELMS NLIDRHGTGR LLFRNTRQGV QGFPHRIYHQ IKLDLPTQYQ NAINVLNMLG EIKDPELFYP EQIFQKMNAD ATWWRFDPRV DWLINLVKSL REEKILVICQ DAITAIQLEQ ALREKEGIRS AVFHENMSII ERDRASAYFA QQEEGAQVLL SSSIGSEGRN FQFACHLVLF HLPNNPDLLE QCIGRLDRIG QRRDIQIYVP CFADTPQIRL AQWYHEGLNA FEETCPMGAI LHEKCGAKLT EFLTSDTTDD FQAFIQQTHQ QQLQLKSELE QGRDRLLELN SNGGEQAQQL ANDIAIQDGS TELIDFTLNL FDIIGVEQED LGEKSIVISP TGTMLVPDFP GLKEEGVTVT FDRDLALARE DLEFLTWDHP IVRNGIDLIV SGDIGKSAVA LLVNKQLPTG TLLLELVYII ESQSPRGLQL TRFLPPTPLR LLLDIKGNDL SHQISFQGLQ KQLKPMGKNM ATKVIKIMRP AIEQLIKQSA KNVVEPAKMI IEQAKQLADQ SLSAEINRLY ALQAVNKNIR PEEIEQLESQ RTLSLELLNQ ANWRLDSLRV IVSNKE
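The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any length or composition check. The sketch below is one possible way to do that, not the database's own method. The helper names are hypothetical, the stop codon set is the standard one used by translation table 11 (TAA, TAG, TGA), and the input in the usage lines is a toy sequence rather than the hepA gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove whitespace between display blocks and convert to uppercase."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons ending in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input in the same blocked layout (not the hepA sequence).
toy = clean_sequence("ATGGCCAAAT AA")
print(toy)                  # ATGGCCAAATAA
print(ends_with_stop(toy))  # True
print(round(gc_fraction(toy), 3))
```

Applied to the Gene sequence and Protein sequence fields, `len(clean_sequence(...))` should give 2901 and 966 if the record is internally consistent, and `gc_fraction` on the gene sequence can be compared with the reported GC content of 39%.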