Gene Information |
Locus tag | HS_0908 |
Symbol | sppA |
ID | 4240400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | - |
Start bp | 1000290 |
End bp | 1002155 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638104463 |
Product | protease IV family protein |
Protein accession | YP_719118 |
Protein GI | 113461051 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type; [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
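The start, end, gene length, and protein length values in the table above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Both are assumptions about how the record was produced, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1000290, 1002155)
print(length)                  # 1866
print(protein_length(length))  # 621
```

Under these assumptions the coordinates reproduce the listed gene length of 1866 bp, and 1866 / 3 = 622 codons minus one stop codon gives the listed 621 aa.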
Plasmid Coverage Information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.378235 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGTGA TGTTGACCTT TCTCAAATTG TGCTGGCAAA TGCTCAATTT TGCTCGTCGT GTTGTAATGA ATATAGTATT TCTTTTTTTC GTTTTATTGA TTGTAGCAGT GTTTTCGGTT AATTCGAGTA TGTCGAATAA AGTAGATTTA ACTCATTTTA AAGGGGCGTT GCTATTAAAT TTGGACGGCT ATTTGGCAGA TAATCGTAGC GATGATACGG CATGGCAAGA GCTTCTTTTG GAATTAGGTA ATCAACATGT ACCGAGAAAA ATCTCAACGT TTGATGTTGT TAATGCCATT CAAGCGGCTA AAAAAGATCC AAAAATTACC GCACTTGTTC TTGATCTCAA TTATTTTGAT GGGAAAGATA TTCCTGCGTT AACTTACATT GGGAAAGCTA TTCAAGCCTT TAAAGCCAGC AAGAAACCAG TGATTGCTTA TGCGGATAAC TATACGCAAA GTCAATATTT ATTGGCAAGT TATGCGGATG TTATTTTGTT AAATCCGCAA GGTGAAGTGG CGATAGAAGG TATGGTTGCA GAGAATTTGT ATTTTAAATC GCTATTTAAT AAATTGGAAA TTACACCGCA TATTTTTCGT GTAGGTACTT ATAAATCTGC GGTAGAACCG TTTATGCTCG ATAAAATGTC AGAGAAAAGT CGTGAAAATA CAAGCCGTTG GCTTAACCAA TTATGGAAAA GTTATCAGCA AATTGTGGCT GAAAATCGTG ATATTCCATT AGCACAGGTG TTACCGGATA GTAAAACATA CCTTAGCGAA TTAAAAGCAC TGAACGGAAA TCAAACTGAA TATGCGAAAA AGCGTGGTTT AATAACAGAA TTAGCGGTAA CGCAAGAACG AGAAAAAATC ATTAAACGTT GGATAGTTAA TTCTGATGAT AAGTTGGATT TTGTTGAATT TGAGGATTAT TTGGCAATAT TGAAAAATCG ATTTGCACAA CCTACACAAC CTGCTATTGC CGTGGTAAAT GTTGAAGGGG CTATTGTCGA TGGTGAAAGT GATGAACAAA ATGTGGGCGG AGATAGCATA GCACAATTAT TACGTGAGGC GAATGATGAG CCAAATATAA AAGCGGTTGT ATTGCGAGTA AATAGTCCGG GTGGAAGTGC GTTCGCTTCA GAAATTATTC GCCAAGAAGT AGATAATTTA CAAAAATCAG GAAAGCCAGT CGTTGTTTCG ATGGGGGCAA TTGCGGCTTC CGGCGGTTAT TGGATTTCCT CAACGGCAGA TTATATTGTT GCTGACCCTA ATACAATTAC AGGGTCTATC GGTATTTTTG CCATGTTTCC AACGTTTGAA AAATCTATGC AAAAAATTGG TGTGAATGCC GATGGTGTTG CGACAACTGA TGTGGTGATG AAATCACATT TCAGCCCGTT ATCTAAAATC AGCAGTGAGA TTATTCAATT GGAAATAGAG CATGGTTATG ATCAATTTTT GGATGTTGTT AGCCGTGGGC GTAATTTATC AAAAACACAA GTTGATAAAA TTGCTCAAGG TCAAGTTTGG TCAGGTTTTG ATGCCTATAC GTATAAACTG GTAGATCAAC TGGGCAGTTT TGATGATGCA GTCGAAAAAG CTAGAGAATT AGTGATACAA AAATCTTCAG AAGAAATAAA AGATTTTTCT GTTGTTTGGC TAACGGAAAA AGAACCGTCA TTATTGGGGG AGTTGATGAA AAATGCTAAA CAACATTCAG AACATAATCT TCGTCAGTAT ATTGCCCATT TATTCGGCTT TTCTACATCA ATGCAGAAAA TGACTGAGCA ATTAGGTATT 
TTAAATAAAT TTAATGATCC CAAAGGGCAA TACTTATATT GTTTGAATTG TGGTGAATTG AAGTAA
|
Protein sequence | MKVMLTFLKL CWQMLNFARR VVMNIVFLFF VLLIVAVFSV NSSMSNKVDL THFKGALLLN LDGYLADNRS DDTAWQELLL ELGNQHVPRK ISTFDVVNAI QAAKKDPKIT ALVLDLNYFD GKDIPALTYI GKAIQAFKAS KKPVIAYADN YTQSQYLLAS YADVILLNPQ GEVAIEGMVA ENLYFKSLFN KLEITPHIFR VGTYKSAVEP FMLDKMSEKS RENTSRWLNQ LWKSYQQIVA ENRDIPLAQV LPDSKTYLSE LKALNGNQTE YAKKRGLITE LAVTQEREKI IKRWIVNSDD KLDFVEFEDY LAILKNRFAQ PTQPAIAVVN VEGAIVDGES DEQNVGGDSI AQLLREANDE PNIKAVVLRV NSPGGSAFAS EIIRQEVDNL QKSGKPVVVS MGAIAASGGY WISSTADYIV ADPNTITGSI GIFAMFPTFE KSMQKIGVNA DGVATTDVVM KSHFSPLSKI SSEIIQLEIE HGYDQFLDVV SRGRNLSKTQ VDKIAQGQVW SGFDAYTYKL VDQLGSFDDA VEKARELVIQ KSSEEIKDFS VVWLTEKEPS LLGELMKNAK QHSEHNLRQY IAHLFGFSTS MQKMTEQLGI LNKFNDPKGQ YLYCLNCGEL K
|
| |
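The sequence fields above are displayed in whitespace-separated blocks of 10 characters. The following is a minimal Python sketch of how such a field might be cleaned, translated, and checked for GC content. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TAA, which is consistent with that) and that translation table 11 uses the standard codon-to-amino-acid assignments. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments;
# assumed here to apply to translation table 11 as well).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = clean("ATGAAAGTGA TGTTGACCTT TCTCAAATTG")
print(translate(prefix))  # MKVMLTFLKL
```

The translated prefix matches the first block of the listed protein sequence. Passing the full cleaned gene sequence to `gc_percent` and `translate` would be one way to compare against the GC content and protein sequence fields of the record.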