Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C3647 |
Symbol | degS |
ID | 6489349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 3531290 |
End bp | 3532360 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642743765 |
Product | serine endoprotease |
Protein accession | YP_002047377 |
Protein GI | 194447948 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02038] periplasmic serine pepetdase DegS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 87 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGTGA AGCTCTTACG TTCGGTCGCA ATAGGTTTAA TTGTCGGCGC TATTCTGTTG GCCGTCATGC CTTCTTTGCG CAAAATTAAT CCTATCGCCG TCCCGCAATT CGACAGTACC GATGAGACGC CAGCCAGTTA TAATTTTGCG GTTCGCCGCG CCGCGCCTGC CGTCGTCAAC GTCTATAACC GCAGTATGAA CAGTACCGCG CATAATCAAC TGGAGATCCG CACGCTGGGT TCCGGCGTGA TCATGGATCA ACGCGGTTAT ATTATTACCA ACAAGCACGT GATTAACGAT GCCGATCAGA TTATCGTCGC GCTACAGGAT GGCCGCGTCT TTGAAGCGCT ACTGGTTGGC TCCGATTCGC TTACCGATCT GGCGGTGCTG AAGATCAACG CCACTGGCGG GCTGCCTACC ATCCCGATTA ATACAAAGCG TACACCGCAT ATTGGCGACG TCGTACTGGC TATCGGCAAC CCATATAATC TGGGACAGAC CATTACCCAG GGGATCATCA GCGCAACGGG TCGTATCGGC CTGAACCCGA CGGGGCGACA GAATTTTCTC CAGACCGACG CCTCGATTAA CCACGGTAAT TCCGGCGGCG CGCTGGTCAA CTCGTTAGGC GAACTGATGG GGATCAACAC CCTCTCTTTT GATAAGAGTA ACGATGGTGA AACGCCGGAA GGCCTTGGTT TTGCGATTCC CTTCCAGCTA GCCACGAAAA TTATGGATAA GCTTATCCGC GACGGTCGTG TGATTCGCGG CTATATCGGT ATTGGCGGAC GAGAAATCGC GCCGCTGCAC GCGCAGCAGG GTAGCGGCAT GGACCCGATT CAGGGCATTG TCGTTAATGA AGTGACGCCA AACGGCCCCG CCGCGCTTGC CGGTATTCAG GTTAATGATT TGATTATTTC GGTCAATAAT AAACCCGCCG TGTCCGCGCT GGAGACAATG GATCAGGTGG CGGAAATCCG CCCGGGCTCC GTCATTCCGG TCGTGGTAAT GCGGGATGAT AAGCAACTCA CGTTCCAGGT GACGGTGCAG GAATACCCGG CGTCGAACTA A
|
Protein sequence | MFVKLLRSVA IGLIVGAILL AVMPSLRKIN PIAVPQFDST DETPASYNFA VRRAAPAVVN VYNRSMNSTA HNQLEIRTLG SGVIMDQRGY IITNKHVIND ADQIIVALQD GRVFEALLVG SDSLTDLAVL KINATGGLPT IPINTKRTPH IGDVVLAIGN PYNLGQTITQ GIISATGRIG LNPTGRQNFL QTDASINHGN SGGALVNSLG ELMGINTLSF DKSNDGETPE GLGFAIPFQL ATKIMDKLIR DGRVIRGYIG IGGREIAPLH AQQGSGMDPI QGIVVNEVTP NGPAALAGIQ VNDLIISVNN KPAVSALETM DQVAEIRPGS VIPVVVMRDD KQLTFQVTVQ EYPASN
|
| |