Gene HS_1049 details


Gene Information

Locus tag: HS_1049
Symbol: degS
ID: 4240547
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1156362
End bp: 1157408
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 35%
IMG OID: 638104610
Product: DegS serine peptidase
Protein accession: YP_719261
Protein GI: 113461192
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02038] periplasmic serine peptidase DegS
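
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1156362, 1157408)
print(length)                  # 1047
print(protein_length(length))  # 348
```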


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0146114
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAAAA AATTAATTCA ATCAATAATC ACTGGGTTAG CTGCCGCAGC ATTGGTGTTA 
CTTATATTGC CTGTTTTCAA GGGAAATGGG TATTTGACAA ATATTTTTTT TATACAAAAA
GATATTCTTT CTTATAAAGA TGCTGTACGT ATTGCTTCAC CGGCGGTAGT TAATGTTTAT
AATCAAGCTT TTGTATTTAC AACCAATAAT TACCAACCAC AAATTAATAA TTTGGGTTCC
GGTGTTATTA TGTCAAAAGA TGGTTATATA TTGACTAATG AGCATGTTGT TCAAAATGCG
GATCAAATTG TGGTGGCATT ACAAAATGGA CGTATTTTTG AAGCGAATTT AGTGGGGTCG
GATCGCTTGA CAGATTTAGC GGTTTTAAAA ATTCATGCAG ATAATCTGGC AACCATTCCA
CAAAATCCAA AACGTCAAGC TCATGTCGGT GATGTCGTTT TATCTATTGG GAATCCCTAT
AACCTAGGTC AAAGTGTGTC GCAAGGTATT ATTAGTGCGT TGGGTCGTAA TGCTGTTGGC
GATTTCATTG GACGACAAAA TTTTATTCAA ACGGATGCCC CTCTTAATCG TGGTAATTCC
GGTGGGGCAC TCATTAATTC TGCTGGTGAA TTGATTGGTA TAAGTACGTT AAGTATTGGT
AAGAATGCCA ACGAAATTGC GGAAGGATTA AATTTTGCCA TTCCTATTGA ATTAGCTAAT
GATGTTATGC AAAAAATCAT TCGTGATGGT CGAGTTATTC GTGGTTACTT AGGGGTGCAA
AGTGATATTC TATTTAGCAA TGGAAAGGGT TTAAGAGATA AAGGAATTTT AATTACATCA
ATATTACAAG GTAGCCCTGC ACATAAAGCT GGTATTCAGC CTGGTGATGT GATTGTTAGT
TTTGATGGTA TTGATGCTGT TTCTCCTGCT CAAATGATGG AAGCGATTAG TAATACTAAA
CCTAATACCA CAATAAATAT GGTCATACAG CGTTTAGATA AAACCTTGAC TTTACCTGTT
GTGATTGAAG AATATAAAGC GAATTAA
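
The gene sequence is displayed in blocks of ten bases separated by spaces, so the whitespace has to be removed before any computation. As a rough sketch (the helper names are hypothetical), the GC fraction of a sequence could be computed in Python as shown below. The example uses only the first displayed line, so its result should not be expected to match the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence only, not the full gene.
first_line = "ATGATAAAAA AATTAATTCA ATCAATAATC ACTGGGTTAG CTGCCGCAGC ATTGGTGTTA"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.1%}")   # GC percentage of this 60-base fragment
```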
 
Protein sequence
MIKKLIQSII TGLAAAALVL LILPVFKGNG YLTNIFFIQK DILSYKDAVR IASPAVVNVY 
NQAFVFTTNN YQPQINNLGS GVIMSKDGYI LTNEHVVQNA DQIVVALQNG RIFEANLVGS
DRLTDLAVLK IHADNLATIP QNPKRQAHVG DVVLSIGNPY NLGQSVSQGI ISALGRNAVG
DFIGRQNFIQ TDAPLNRGNS GGALINSAGE LIGISTLSIG KNANEIAEGL NFAIPIELAN
DVMQKIIRDG RVIRGYLGVQ SDILFSNGKG LRDKGILITS ILQGSPAHKA GIQPGDVIVS
FDGIDAVSPA QMMEAISNTK PNTTINMVIQ RLDKTLTLPV VIEEYKAN
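
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this in Python under stated assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (table 11 differs in its permitted start codons, and this gene begins with ATG), it stops at the first stop codon, and it writes unrecognized codons as X. The function and constant names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last displayed lines of the gene sequence.
print(translate("ATGATAAAAA AATTAATTCA ATCAATAATC ACTGGGTTAG CTGCCGCAGC ATTGGTGTTA"))
# MIKKLIQSIITGLAAAALVL
print(translate("GTGATTGAAG AATATAAAGC GAATTAA"))
# VIEEYKAN
```

The last displayed line can be translated on its own because the 17 preceding lines hold 1020 bases, a multiple of three, so it starts on a codon boundary.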