Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1049 |
Symbol | degS |
ID | 4240547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 1156362 |
End bp | 1157408 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638104610 |
Product | DegS serine peptidase |
Protein accession | YP_719261 |
Protein GI | 113461192 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02038] periplasmic serine pepetdase DegS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0146114 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAA AATTAATTCA ATCAATAATC ACTGGGTTAG CTGCCGCAGC ATTGGTGTTA CTTATATTGC CTGTTTTCAA GGGAAATGGG TATTTGACAA ATATTTTTTT TATACAAAAA GATATTCTTT CTTATAAAGA TGCTGTACGT ATTGCTTCAC CGGCGGTAGT TAATGTTTAT AATCAAGCTT TTGTATTTAC AACCAATAAT TACCAACCAC AAATTAATAA TTTGGGTTCC GGTGTTATTA TGTCAAAAGA TGGTTATATA TTGACTAATG AGCATGTTGT TCAAAATGCG GATCAAATTG TGGTGGCATT ACAAAATGGA CGTATTTTTG AAGCGAATTT AGTGGGGTCG GATCGCTTGA CAGATTTAGC GGTTTTAAAA ATTCATGCAG ATAATCTGGC AACCATTCCA CAAAATCCAA AACGTCAAGC TCATGTCGGT GATGTCGTTT TATCTATTGG GAATCCCTAT AACCTAGGTC AAAGTGTGTC GCAAGGTATT ATTAGTGCGT TGGGTCGTAA TGCTGTTGGC GATTTCATTG GACGACAAAA TTTTATTCAA ACGGATGCCC CTCTTAATCG TGGTAATTCC GGTGGGGCAC TCATTAATTC TGCTGGTGAA TTGATTGGTA TAAGTACGTT AAGTATTGGT AAGAATGCCA ACGAAATTGC GGAAGGATTA AATTTTGCCA TTCCTATTGA ATTAGCTAAT GATGTTATGC AAAAAATCAT TCGTGATGGT CGAGTTATTC GTGGTTACTT AGGGGTGCAA AGTGATATTC TATTTAGCAA TGGAAAGGGT TTAAGAGATA AAGGAATTTT AATTACATCA ATATTACAAG GTAGCCCTGC ACATAAAGCT GGTATTCAGC CTGGTGATGT GATTGTTAGT TTTGATGGTA TTGATGCTGT TTCTCCTGCT CAAATGATGG AAGCGATTAG TAATACTAAA CCTAATACCA CAATAAATAT GGTCATACAG CGTTTAGATA AAACCTTGAC TTTACCTGTT GTGATTGAAG AATATAAAGC GAATTAA
|
Protein sequence | MIKKLIQSII TGLAAAALVL LILPVFKGNG YLTNIFFIQK DILSYKDAVR IASPAVVNVY NQAFVFTTNN YQPQINNLGS GVIMSKDGYI LTNEHVVQNA DQIVVALQNG RIFEANLVGS DRLTDLAVLK IHADNLATIP QNPKRQAHVG DVVLSIGNPY NLGQSVSQGI ISALGRNAVG DFIGRQNFIQ TDAPLNRGNS GGALINSAGE LIGISTLSIG KNANEIAEGL NFAIPIELAN DVMQKIIRDG RVIRGYLGVQ SDILFSNGKG LRDKGILITS ILQGSPAHKA GIQPGDVIVS FDGIDAVSPA QMMEAISNTK PNTTINMVIQ RLDKTLTLPV VIEEYKAN
|
| |