Gene Information |
Locus tag | SeD_A0229 |
Symbol | degP |
ID | 6871230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 242145 |
End bp | 243572 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642783475 |
Product | serine endoprotease |
Protein accession | YP_002214169 |
Protein GI | 198246202 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
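The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that does not count toward the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 242145
end_bp = 243572
gene_length_bp = 1428
protein_length_aa = 475

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(computed_gene_length, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
computed_protein_length = codons - 1

print(computed_gene_length, remainder, computed_protein_length)
# prints: 1428 0 475
```

Both computed values agree with the Gene Length and Protein Length fields.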
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA CCACATTAGC AATGAGTGCA CTGGCTCTGA GTTTAGGTTT GGCATTGTCG CCTCTGTCTG CCACGGCGGC TGAAACGTCC TCTTCAGCAA TGACTGCCCA GCAGATGCCA AGCCTGGCAC CGATGCTCGA AAAAGTGATG CCATCGGTGG TCAGTATTAA TGTTGAAGGT AGCACCACGG TGAATACGCC GCGTATGCCG CGTAATTTCC AGCAGTTCTT TGGCGATGAC TCCCCGTTCT GCCAGGACGG TTCTCCGTTC CAGAATTCTC CGTTCTGCCA GGGCGGCGGT AACGGCGGCA ACGGCGGTCA GCAACAGAAA TTCATGGCGC TGGGCTCCGG CGTAATTATT GACGCCGCGA AGGGCTACGT CGTCACCAAC AACCACGTGG TTGATAACGC CAGCGTGATT AAAGTACAGC TTAGCGATGG ACGTAAATTC GATGCTAAAG TGGTAGGCAA AGATCCGCGT TCTGATATCG CGCTGATTCA AATTCAGAAT CCGAAGAACC TGACGGCGAT TAAGCTGGCG GACTCCGACG CGCTGCGCGT GGGGGATTAT ACCGTCGCTA TTGGTAACCC GTTTGGTCTG GGCGAAACGG TGACGTCAGG TATCGTTTCG GCGCTGGGGC GTAGCGGCCT GAACGTAGAA AATTACGAGA ATTTTATTCA GACCGACGCC GCGATTAACC GCGGTAACTC CGGCGGCGCG CTGGTGAACC TGAACGGTGA GCTGATCGGT ATTAACACCG CGATTCTGGC GCCGGACGGC GGCAACATCG GTATCGGCTT CGCTATCCCC AGTAACATGG TGAAAAACCT GACGTCGCAG ATGGTGGAAT ACGGCCAGGT GAAACGCGGC GAACTGGGGA TCATGGGGAC TGAGCTGAAT TCCGAATTGG CGAAAGCGAT GAAAGTCGAC GCCCAGCGTG GCGCGTTCGT CAGCCAGGTG ATGCCGAATT CGTCCGCGGC GAAAGCGGGT ATCAAAGCCG GGGATGTCAT TACCTCGCTG AACGGTAAAC CGATCAGCAG CTTTGCGGCG CTGCGCGCTC AGGTCGGCAC TATGCCGGTC GGCAGCAAAA TCAGCCTCGG TCTGCTGCGT GAAGGTAAAG CGATTACGGT TAATCTGGAA CTGCAGCAGA GCAGCCAGAG TCAGGTTGAT TCCAGCACCA TCTTCAGCGG GATTGAAGGC GCTGAAATGA GCAATAAAGG CCAGGATAAA GGCGTTGTGG TGAGCAGCGT GAAAGCGAAC TCACCCGCCG CGCAAATTGG CCTCAAAAAA GGCGATGTGA TTATCGGCGC TAACCAGCAG CCGGTGAAAA ATATCGCCGA GCTGCGTAAG ATTCTCGACA GCAAGCCGTC GGTACTGGCG CTGAATATTC AGCGTGGTGA TAGTTCTATT TATTTGCTGA TGCAGTAA
|
Protein sequence | MKKTTLAMSA LALSLGLALS PLSATAAETS SSAMTAQQMP SLAPMLEKVM PSVVSINVEG STTVNTPRMP RNFQQFFGDD SPFCQDGSPF QNSPFCQGGG NGGNGGQQQK FMALGSGVII DAAKGYVVTN NHVVDNASVI KVQLSDGRKF DAKVVGKDPR SDIALIQIQN PKNLTAIKLA DSDALRVGDY TVAIGNPFGL GETVTSGIVS ALGRSGLNVE NYENFIQTDA AINRGNSGGA LVNLNGELIG INTAILAPDG GNIGIGFAIP SNMVKNLTSQ MVEYGQVKRG ELGIMGTELN SELAKAMKVD AQRGAFVSQV MPNSSAAKAG IKAGDVITSL NGKPISSFAA LRAQVGTMPV GSKISLGLLR EGKAITVNLE LQQSSQSQVD SSTIFSGIEG AEMSNKGQDK GVVVSSVKAN SPAAQIGLKK GDVIIGANQQ PVKNIAELRK ILDSKPSVLA LNIQRGDSSI YLLMQ
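As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first 30 and last 18 nucleotides of the gene sequence above. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons), so a standard codon table is built by hand. The function names `translate` and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases; spaces between blocks are ignored."""
    dna = dna.replace(" ", "").upper()
    return sum(base in "GC" for base in dna) / len(dna)


# First three and last two 10-base blocks of the gene sequence above.
first_blocks = "ATGAAAAAAA CCACATTAGC AATGAGTGCA"
last_blocks = "TATTTGCTGA TGCAGTAA"

print(translate(first_blocks))  # prints: MKKTTLAMSA
print(translate(last_blocks))   # prints: YLLMQ
```

The two outputs match the first and last residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.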
|
| |