Gene Information |
Locus tag | SeD_A3709 |
Symbol | degS |
ID | 6873248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 3557710 |
End bp | 3558780 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642786684 |
Product | serine endoprotease |
Protein accession | YP_002217318 |
Protein GI | 198242513 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02038] periplasmic serine peptidase DegS |
|
|
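The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this explicitly) and that the final codon is a stop codon that is not counted in the protein length. Variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3557710
end_bp = 3558780
reported_gene_length = 1071     # bp
reported_protein_length = 356   # aa

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.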
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.650789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGTGA AGCTCTTACG TTCGGTCGCA ATAGGTTTAA TTGTCGGCGC TATTCTGTTG GCCGTCATGC CTTCTTTGCG CAAAATTAAT CCTATCGCCG TCCCGCAATT CGACAGTACC GATGAGACGC CAGCCAGTTA TAATTTTGCG GTTCGCCGCG CCGCGCCTGC CGTCGTCAAC GTCTATAACC GCAGTATGAA CAGTACCGCG CATAATCAAC TGGAGATCCG CACGCTAGGT TCCGGCGTGA TCATGGATCA ACGCGGTTAT ATTATTACCA ACAAGCACGT GATTAACGAT GCCGATCAGA TTATCGTCGC GCTACAGGAT GGCCGCGTCT TTGAAGCGCT ACTGGTTGGC TCCGATTCGC TTACCGATCT GGCGGTGCTG AAGATCAACG CCACTGGCGG GCTGCCTACC ATCCCGATTA ATACAAAGCG TACACCGCAT ATTGGCGACG TCGTACTGGC TATCGGCAAC CCATATAATC TGGGACAGAC CATCACCCAG GGGATCATCA GCGCAACGGG TCGTATCGGC CTGAACCCGA CGGGGCGACA GAATTTTCTC CAGACCGACG CCTCGATTAA CCACGGTAAT TCCGGCGGCG CGCTGGTCAA CTCGTTAGGC GAACTGATGG GGATCAACAC CCTCTCTTTT GATAAGAGTA ACGATGGCGA AACGCCGGAA GGCCTTGGTT TTGCGATTCC CTTCCAGCTA GCCACGAAAA TTATGGATAA GCTTATCCGC GACGGTCGCG TGATTCGCGG CTATATCGGT ATTGGCGGAC GAGAAATCGC GCCGCTGCAC GCGCAGCAGG GTAGCGGCAT GGACCCGATT CAGGGCATTG TCGTTAATGA AGTGACGCCA AACGGCCCCG CCGCGCTTGC CGGTATTCAG GTTAATGATT TGATTATTTC GGTCAATAAT AAACCCGCTG TGTCCGCGCT GGAGACGATG GATCAGGTGG CGGAAATCCG CCCGGGCTCC GTCATTCCGG TCGTGGTAAT GCGGGATGAT AAGCAGCTCA CGTTCCAGGT GACGGTGCAG GAATACCCGG CGTCGAACTA A
|
Protein sequence | MFVKLLRSVA IGLIVGAILL AVMPSLRKIN PIAVPQFDST DETPASYNFA VRRAAPAVVN VYNRSMNSTA HNQLEIRTLG SGVIMDQRGY IITNKHVIND ADQIIVALQD GRVFEALLVG SDSLTDLAVL KINATGGLPT IPINTKRTPH IGDVVLAIGN PYNLGQTITQ GIISATGRIG LNPTGRQNFL QTDASINHGN SGGALVNSLG ELMGINTLSF DKSNDGETPE GLGFAIPFQL ATKIMDKLIR DGRVIRGYIG IGGREIAPLH AQQGSGMDPI QGIVVNEVTP NGPAALAGIQ VNDLIISVNN KPAVSALETM DQVAEIRPGS VIPVVVMRDD KQLTFQVTVQ EYPASN
|
| |
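The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the standard codon assignments, which translation table 11 shares for elongation codons; table 11 differs mainly in which codons may act as start codons, and that does not matter here because this gene starts with ATG. The function name `translate` and the table construction are illustrative assumptions, not part of the record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first, so the space-separated 10-base blocks
    used in the record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks of the gene sequence above.
print(translate("ATGTTTGTGA AGCTCTTACG"))  # MFVKLL
```

The output matches the first six residues of the listed protein sequence. Passing the full gene sequence to the same function should yield the full protein sequence, ending before the terminal TAA stop codon.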