Gene HS_0489 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_0489 
Symbollon 
ID4239971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp529557 
End bp531968 
Gene Length2412 bp 
Protein Length803 aa 
Translation table11 
GC content37% 
IMG OID638104037 
ProductLon-A peptidase 
Protein accessionYP_718700 
Protein GI113460634 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCAA AACGAACTAA GCTAGAGCAT CTTCCGGTTC TACCATTGCG TGATGTGGTA 
GTTTTTCCTT ATATGGTAAT GCCATTGTTT GTTGGTCGTC CCAAGTCTAT TCGTAGTTTA
GAGGAGGCGA TGGAAAATAA TAAGCAATTA TTATTGGTTT CACAAAGAAA ACCTGACATT
GAAGAACCTA AGATCGCTGA TCTTTATAAG ATTGGTACAT TAGTCAATAT TATTCAATTG
TTAAAATTGC CGGATGGTAC TGTAAAAGTT CTTGTTGAAG GACAACAAAG AACTAAACTT
ATTGATTTAC AAGATAATGG GGAATTCTTT TTAGCGTCTC ACGAATTAAT TGAAACACAA
TGGAGTGATG AGAAAGAATT AAGTGTATTG AAGAAAATTA CTTTATCCGA ATTTGAAAAA
TATGCGAATT TAAATAAAAA AATTCCCGCA GATATTATTT CTGCATTGCG ACGTATTAAT
GATATAGAGA GATTAAGTGA TACGGTTGCA GCTCATCTTC CGGTATCTAT TAATGAAAAG
CAAAATATCC TAGAAATAGG AGATTTGTCG GCACGGTTTG AATATTTATT AGGATTAATG
GTAAGCGAAG CCGATATATT GCAAGTTGAA CAACGTGTGC GTGGCAGAGT TAAAAAACAG
ATAGAAAAAA ATCAACGTGA TTACTATCTG AATGAACAAA TTAAGGCATT GCAAAAAGAG
CTGAATGATG ATGAAAACAC AGTTGATGAA GTTGAGCAAT TACGCAAGAA AATAGAAGAG
GCTAAGATGC CGATAGAGGC TCGTGAAAAA GTGTTTGCCG AATTGCAAAA ATTAAAGATG
ATGTCGCCAA TGTCTTCCGA AGCAACGGTT TTGCGTAGTT ATATTGACTG GATGGTTCAA
GTTCCTTGGC ATAAGCGAAC TAAAGTTAAA AAAGATCTTG CCAAAGCACA GGAAACCTTA
GATGCGGATC ACTATGGCTT AGAACGTGTT AAAGAGCGTA TATTGGAGTA TTTAGCGGTA
CAAAGTCGCT TAAATCAATT AAAAGGCCCT ATTTTATGTT TAGTTGGCCC TCCGGGTGTG
GGAAAAACAT CGCTAGGGCA TTCTATTGCC AACGCAACGG GGCGTAAATA TGTACGCATG
GCATTAGGCG GTGTGCGAGA TGAAGCAGAG ATTCGTGGAC ATCGTAAAAC GTATATAGGT
TCTTTACCCG GCAAATTAAT TCAAAAAATG GCAAAAGTGG GGGTGAAAAA TCCGCTGTTT
TTACTTGATG AAATTGACAA AATGGCAATG GATTATCGAG GTGATCCGGC ATCTGCATTA
TTGGAAGTGC TTGATCCTGA ACAAAATTCA CATTTTAATG ATCATTATCT TGAAGTCGAT
TATGATTTAT CTGATGTAAT GTTTGTTGCT ACGTCCAACT CAATGAATAT TCCAGCACCT
TTACTGGATC GTATGGAAGT CATTCGTCTC TCCGGTTATA CGGAAGATGA AAAACTCAAT
ATTGCGACAC GTCACTTATT GAATAAACAA ATTGAGCGTA ACGGATTGAA GACTGATGAG
TTGGTTATCA ATGAAGAGGC TATTTTAGAT ATTATTCGCT ATTATACTCG AGAAGCCGGT
GTTCGTTCTT TAGAGAGAGA GATTTCTAAA ATTTGCCGCA AAGCAGTGAA AAATCTGCTA
TTAGATAAAA GTTTGAAATC TATTCAAGTG AATTCTAACA ATTTGCAAGA GTATCTTGGG
GTTAGACGCT TTGAATTTGG TCGAGCAGAT ACACAAAACC GCATTGGTGA AGTGACAGGA
TTAGCTTGGA CCGAAGTTGG CGGTGATTTA TTAACAATAG AAACGGCATC TGTAATCGGT
AAAGGTAAAT TGATTTATAC CGGTTCTTTG GGCGATGTGA TGAAAGAAAG TATTCAAGCT
GCGATGACTG TTGTAAGAAC TCGAGCTGAA AAGTTAGGTA TTGCTAATGA CTTTCATGAA
AAACGTGATA TTCACATTCA TGTACCGGAC GGTGCGACTC CGAAAGATGG ACCAAGTGCG
GGTATTGCTA TGTGTACAGC GTTGGTTTCT TGTTTAACCG GTAATCCGGT AAAATCTGAA
GTGGCAATGA CGGGGGAAAT TAGTTTACGT GGCAAAGTAT TACCGATTGG TGGGTTGAAG
GAAAAATTAT TAGCAGCTCA TCGAGGTGGT ATTAAAACGG TGATTATACC CAAAGAAAAT
GTAAAAGATT TGGAAGAAAT CCCTGAAAAT GTGAAAAATA ATTTAACTAT TCATGCGGTT
GACACTATTG ATGAAGTCTT AACAATTGCA TTAGAAAATC CACCGGAAGG AGTTGATTTT
GTGAAGCTTT CTCCAATTCA TAAAATTAAA TCTTCTCGTA AGCGTTCTTC TCGAACAAAA
AGTTTGAATT AA
 
Protein sequence
MNAKRTKLEH LPVLPLRDVV VFPYMVMPLF VGRPKSIRSL EEAMENNKQL LLVSQRKPDI 
EEPKIADLYK IGTLVNIIQL LKLPDGTVKV LVEGQQRTKL IDLQDNGEFF LASHELIETQ
WSDEKELSVL KKITLSEFEK YANLNKKIPA DIISALRRIN DIERLSDTVA AHLPVSINEK
QNILEIGDLS ARFEYLLGLM VSEADILQVE QRVRGRVKKQ IEKNQRDYYL NEQIKALQKE
LNDDENTVDE VEQLRKKIEE AKMPIEAREK VFAELQKLKM MSPMSSEATV LRSYIDWMVQ
VPWHKRTKVK KDLAKAQETL DADHYGLERV KERILEYLAV QSRLNQLKGP ILCLVGPPGV
GKTSLGHSIA NATGRKYVRM ALGGVRDEAE IRGHRKTYIG SLPGKLIQKM AKVGVKNPLF
LLDEIDKMAM DYRGDPASAL LEVLDPEQNS HFNDHYLEVD YDLSDVMFVA TSNSMNIPAP
LLDRMEVIRL SGYTEDEKLN IATRHLLNKQ IERNGLKTDE LVINEEAILD IIRYYTREAG
VRSLEREISK ICRKAVKNLL LDKSLKSIQV NSNNLQEYLG VRRFEFGRAD TQNRIGEVTG
LAWTEVGGDL LTIETASVIG KGKLIYTGSL GDVMKESIQA AMTVVRTRAE KLGIANDFHE
KRDIHIHVPD GATPKDGPSA GIAMCTALVS CLTGNPVKSE VAMTGEISLR GKVLPIGGLK
EKLLAAHRGG IKTVIIPKEN VKDLEEIPEN VKNNLTIHAV DTIDEVLTIA LENPPEGVDF
VKLSPIHKIK SSRKRSSRTK SLN