Gene HS_0858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0858 
Symbol 
ID4240350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp933476 
End bp935287 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content35% 
IMG OID638104413 
ProductATP-dependent Lon protease 
Protein accessionYP_719068 
Protein GI113461001 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000838018 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACAATTG AGCAAAAAAT AAAAAAAAGA GCCGACGAAT GCTTGTCAGA ACAGTTAATT 
TCTTGGCAAA ATTTATTACC CATGTTACAA CTTGATAATG ATTTTTTAGA CGCTTCATTA
TCTGAGATTG ATTTTTTTAC TTTGCAACCA AGAGCTAAAA GTGCGGTTGA TCTTTTCCTA
AAAAATCCTC AACGATCATT GCTTGTTTTA AAAGCAGATG ATCAAGCTGA ATATGCAACT
TTACTACAAG AATATATAGA ACAAAAAATT CCTGTTTCTG AAACAATTAG TGGTGTTCAG
TATATTATTG AACAGGGAGA CAGTTTTTCT TTTCCACAAA TCAGTCTAGC ACAGGCAGCA
TCAATGGAAG ACAACTTTGC AGGAAAACAA AAAGTGGCGA GTGCTTTATT TTTCGATCAA
AGTCAGTTAT TCGGATCAGT TGTAATTCAC CCCAGTTCTC ATGATATTCA ACTTAATTCA
GGATTAGTAC ATCAAGTAAA TAATGGTGTA TTGATTTTGA GTGCCGGTGC ATTGTTGGAG
CAATTTGAGA TATGGCATCG TTTAAAACAC TTATTACTAA CAGGTATTTT CAGTTGGTAT
TCTCTTAATC CTTTGAAAAC ATTACCTTGC ACAATTCCTA ATTATCCCCT AAATCTAAAA
GTTATCATTT TAGGTAATCG TTCAGAGTTG GCAACCCTAG GTGAATTGGA GGAGGAACTG
TATCATTTCG CAGATTATGC AGAAATTGAA AGTTATTTTT CGCTTAATGA TGCACAATCT
CATCAAAAGT GGGCAAGTTA TGTTCATACG TTGGCAAAAA AACAAGGAAT TGAGCTGAGT
ATTGAAGGAA TAAATGCACT TTATCAACTT TTTGTGAGAG AAAGTGAAGA TCGTTACTTA
ATTAGTATCT CACCATTAAA ATTAAAAGGG ATCTTATTTG AAACACAAAT TTTAAGTCAA
AGAAAACATT TAAGTGCGGT AGATTTTCAA TTATTTTTTC AACAAAAAGA ACAACAGTAC
TGTTTCTTAC GTGAGCAGGC GTATAAAGGC ATTTTACAGG AGCAAATTTT TATTGCTACA
GATGGGGAAA TAATAGGACA AATTAACGGA TTATCAGTCA TTGAATATCC CGGTACACCG
GTTTCTTTTG GCGAGCCTTC AAGAATAAGT TGCATTGTTC AATTTGGTGA TGGTGAAGTA
ATTGATATTG AAAGAAAAAG TGATCTGGCA GGGAATATCC ATGGCAAAAG CATTATGATT
GCAGAAACTT GTCTTGCCGG CATTTTAGAT CTTCCTTCTC AATTGCCTTT TTCAGCCTCA
ATTGCATTTG AACAATCTTA TGGTGATATT GATGGTGATA GTGCGTCTTT GGCTGTTTTC
TGCTCGTTAT TAAGTGCGTT GGCTGATTTA CCGTTACCAC AAAATATTGC CGTAACAGGT
AGTATTGATC AATTTGGTTT AGTGCATGCT GTTGGTGGTG TTAACGATAA AATTGAAGGC
TTTTTTGAAG TTTGTCAGCG TCGTAGATTA ACGGGAAAGC AAGGTGTAAT TATTCCAAGT
GCGGTATTAA ATCAACTCAG TTTATCCAGT AAAGTAATCG AAGCTGTTCA ACAAGAAAAA
TTCTTTATTT GGGCAGTTGA CGATATTTTT CAGACCACTG AAATCCTATT TAAGCGATAT
TTAGTAAGCG AACAGGATGC TGGATTGGAA AAAAATCTTC CCCTCGTAGA TGTGATTCGA
CAACGATTAG AAGAGAGATC TGAACAACAG CATAAAGGTC GTTTTTGGAA CTTTTTCTTT
AATCGCCATT AA
 
Protein sequence
MTIEQKIKKR ADECLSEQLI SWQNLLPMLQ LDNDFLDASL SEIDFFTLQP RAKSAVDLFL 
KNPQRSLLVL KADDQAEYAT LLQEYIEQKI PVSETISGVQ YIIEQGDSFS FPQISLAQAA
SMEDNFAGKQ KVASALFFDQ SQLFGSVVIH PSSHDIQLNS GLVHQVNNGV LILSAGALLE
QFEIWHRLKH LLLTGIFSWY SLNPLKTLPC TIPNYPLNLK VIILGNRSEL ATLGELEEEL
YHFADYAEIE SYFSLNDAQS HQKWASYVHT LAKKQGIELS IEGINALYQL FVRESEDRYL
ISISPLKLKG ILFETQILSQ RKHLSAVDFQ LFFQQKEQQY CFLREQAYKG ILQEQIFIAT
DGEIIGQING LSVIEYPGTP VSFGEPSRIS CIVQFGDGEV IDIERKSDLA GNIHGKSIMI
AETCLAGILD LPSQLPFSAS IAFEQSYGDI DGDSASLAVF CSLLSALADL PLPQNIAVTG
SIDQFGLVHA VGGVNDKIEG FFEVCQRRRL TGKQGVIIPS AVLNQLSLSS KVIEAVQQEK
FFIWAVDDIF QTTEILFKRY LVSEQDAGLE KNLPLVDVIR QRLEERSEQQ HKGRFWNFFF
NRH