Gene Information |
Locus tag | SeHA_C0246 |
Symbol | degP |
ID | 6489978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 252228 |
End bp | 253655 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642740525 |
Product | serine endoprotease |
Protein accession | YP_002044199 |
Protein GI | 194450352 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
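The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 252228
END_BP = 253655


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp):
    """Number of codons in the CDS minus one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1428, matches "Gene Length"
print(expected_protein_length(length_bp))  # 475, matches "Protein Length"
```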
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.561258 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 78 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA CCACATTAGC AATGAGTGCA CTGGCTCTGA GTTTAGGTTT GGCATTGTCG CCTCTGTCTG CCACGGCGGC TGAAACGTCT TCTTCAGCAA TGACTGCCCA GCAGATGCCA AGCCTGGCAC CGATGCTCGA AAAAGTGATG CCATCGGTGG TCAGTATTAA TGTAGAAGGT AGCACCACGG TGAATACGCC GCGTATGCCG CGTAATTTCC AGCAGTTCTT TGGCGATGAC TCCCCGTTCT GCCAGGACGG TTCTCCGTTC CAGAATTCTC CCTTCTGCCA GGGCGGCGGT AACGGCGGCA ACGGCGGTCA ACAACAGAAA TTCATGGCGC TGGGCTCCGG CGTAATTATT GACGCCGCGA AGGGCTACGT CGTCACCAAC AACCACGTGG TTGATAACGC CAGCGTGATT AAAGTACAGC TTAGCGATGG GCGTAAATTC GATGCTAAAG TGGTGGGCAA AGATCCGCGT TCTGATATCG CGCTGATTCA AATTCAGAAT CCGAAGAACC TGACGGCGAT TAAGCTGGCG GACTCCGACG CGCTGCGCGT GGGGGATTAT ACCGTCGCTA TTGGTAACCC GTTTGGTCTG GGCGAAACGG TGACGTCAGG TATCGTTTCG GCGCTGGGGC GTAGCGGCCT GAACGTAGAA AATTACGAGA ACTTTATTCA GACCGACGCC GCGATTAACC GCGGTAACTC CGGCGGCGCG CTGGTGAACC TGAACGGTGA GCTGATCGGT ATTAACACCG CGATTCTGGC GCCGGACGGC GGCAACATCG GTATCGGCTT CGCTATCCCC AGTAACATGG TGAAAAACCT GACGTCGCAG ATGGTGGAAT ACGGCCAGGT GAAACGCGGC GAACTGGGGA TCATGGGGAC TGAGCTGAAT TCCGAATTGG CGAAAGCGAT GAAAGTCGAC GCCCAGCGAG GCGCGTTCGT CAGCCAGGTG ATGCCGAATT CGTCCGCGGC GAAAGCGGGT ATCAAAGCCG GGGATGTCAT TACCTCGCTG AACGGTAAAC CGATCAGCAG CTTTGCGGCG CTGCGCGCTC AGGTCGGCAC TATGCCGGTC GGCAGCAAAA TCAGCCTCGG TCTGCTGCGT GAAGGTAAAG CGATTACGGT GAATCTGGAA CTGCAGCAGA GCAGCCAGAG TCAGGTTGAT TCCAGCACCA TCTTCAGCGG GATTGAAGGC GCTGAAATGA GCAATAAAGG CCAGGATAAA GGCGTTGTGG TGAGCAGCGT GAAAGCGAAC TCACCCGCCG CGCAAATTGG CCTCAAAAAA GGCGATGTGA TTATCGGCGC TAACCAGCAG CCGGTGAAAA ATATCGCCGA GCTGCGTAAG ATTCTCGACA GCAAGCCGTC GGTTCTGGCG CTGAATATTC AGCGTGGTGA TAGTTCTATT TATTTGCTGA TGCAGTAA
|
Protein sequence | MKKTTLAMSA LALSLGLALS PLSATAAETS SSAMTAQQMP SLAPMLEKVM PSVVSINVEG STTVNTPRMP RNFQQFFGDD SPFCQDGSPF QNSPFCQGGG NGGNGGQQQK FMALGSGVII DAAKGYVVTN NHVVDNASVI KVQLSDGRKF DAKVVGKDPR SDIALIQIQN PKNLTAIKLA DSDALRVGDY TVAIGNPFGL GETVTSGIVS ALGRSGLNVE NYENFIQTDA AINRGNSGGA LVNLNGELIG INTAILAPDG GNIGIGFAIP SNMVKNLTSQ MVEYGQVKRG ELGIMGTELN SELAKAMKVD AQRGAFVSQV MPNSSAAKAG IKAGDVITSL NGKPISSFAA LRAQVGTMPV GSKISLGLLR EGKAITVNLE LQQSSQSQVD SSTIFSGIEG AEMSNKGQDK GVVVSSVKAN SPAAQIGLKK GDVIIGANQQ PVKNIAELRK ILDSKPSVLA LNIQRGDSSI YLLMQ
|
| |
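The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be stripped before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC percentage computed, and its translation compared with the listed protein. The helper names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for internal codons; table 11 differs mainly in the set of permitted start codons, which does not matter here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGAAAAAAA CCACATTAGC AATGAGTGCA"
print(translate(prefix))  # MKKTTLAMSA, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives an end-to-end check of the record, and `gc_percent` on the full gene sequence can be compared with the listed GC content.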