Gene Information |
Locus tag | SeD_A2051 |
Symbol | sppA |
ID | 6873195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 1982861 |
End bp | 1984717 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642785165 |
Product | protease 4 |
Protein accession | YP_002215831 |
Protein GI | 198245686 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type; [TIGR00706] signal peptide peptidase SppA, 36K type |
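The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information rows above.
start_bp = 1982861
end_bp = 1984717
gene_length_bp = 1857
protein_length_aa = 618

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is the stop codon and adds no residue.
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks print `True`, which is consistent with an unspliced CDS on a single replicon.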
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.278981 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAACCC TGTGGCGATT TATTGCCGGA TTTTTTAAAT GGACGTGGCG AGTGTTGAAC TTCGTCCGTG AAATGGTACT CAACCTGTTC TTTATTTTTC TGGTCCTGGT GGGCGTTGGG ATCTGGATGC AGATCGGTAA CGGCAACAAC AGTGAGCAAA CGGCGCGCGG CGCTCTGCTG CTGGATATTT CCGGCGTCAT TGTGGATAAA CCCTCCACCA ATCACCGTCT GGGCGCGCTG GGTCGCCAGT TATTTGGCGC CAGCTCCGAC CGTCTGCAGG AAAACTCCCT GTTTGACATC GTCAACGCTA TTCGTCAGGC GAAAGATGAC CGTAACATCA CGGGTATCGT TCTGGATCTG AAAAACTTCA CCGGCGCCGA TCAGCCGTCA ATGCGCTATA TCGGTAAAGC GCTGCGCGAA TTCCGCGACA GCGGCAAACC GGTTTTCGCC GTGGGTGAAA ACTACAGCCA GGGTCAGTAT TACCTCGCCA GTTTCGCCAA TAAAATTTGG CTTTCCCCAC AGGGTCAGGT AGATCTTCAC GGCTTCGCTA CGAATGGCCT GTACTACAAA ACGCTGCTGG ATAAGCTGAA AGTCTCTACC CACGTTTTCC GGGTCGGCAC CTATAAATCC GCCGTCGAGC CGTTTATCCG CGACGATATG TCGCCCGCCG CCCGCGAGGC CGACAGCCGC TGGATAGGCG AACTGTGGCA GAACTACCTG CATACCGTTT CCGCCAATCG CCAGATTTCG CCGCAACAAC TCTTCCCCGG CGCGCAGGCT ATTATCGACG GGTTAACTAG CGTGGGCGGC GACACCGCCA AATATGCGCT CGACCATAAA CTGGTGGACG CCCTCGCCTC CAGCGCAGAT GTTGAAAAAG CGCTGACGAA GCAGTTTGGC TGGAGCAAAA CCGAAAATAA CTATCGCGCG ATCAGTTATT ACGATTATTC GCTGAAAACG CCTGCGGATA CCGGCGGTAC TATTGCGGTT ATTTTCGCCA ATGGCGCGAT TATGGATGGC GAAGAAACAC CAGGGAATGT CGGCGGCGAC ACTACGGCAT CGCAGATCCG CGACGCACGC CTTGATCCTA AAGTGAAAGC GATTGTGCTG CGCATCAATA GCCCAGGCGG TAGCGTCAAC GCCTCCGAAG TTATCCGCGC CGAACTGGCG GCGGCAAGAG CGGCTGGCAA ACCGGTGGTG GTCTCAATGG GCGGTATGGC GGCCTCCGGC GGTTACTGGA TCTCTACGCC GGCAAACTAT ATCGTGGCCA GCCCCAGCAC GCTGACGGGT TCAATTGGCA TCTTCGGCGT CATCAATACG GTAGAAAACA GCCTGTCGTC GATTGGCGTA CACAGCGACG GCGTTTCCAC CTCGCCGCTG GCGGATATTT CGATGACCAA AGCGCTGTCA CCGGAAGTGC AGCAGATGAT GCAACTCAGT ATTGAGTACG GCTACAAACG CTTTATCACG CTGGTGGCAG ACGCGCGTAA GCGTACGCCG GAGCAGATTG ATAAAATCGC GCAAGGCCAT GTCTGGACCG GAGAAGACGC GAAAGCCAAT GGTCTGGTGG ACAGTCTCGG CGACTTTGAC GACGCCGTCG CCAAAGCGGC GGAGCTGGCG AAACTGAAAC AGTGGCATCT TGATTACTAT CAGGACGAAC CGACGGTCCT TGATATGGTC ATGGACAGTA TGACCGGATC AGTACGCGCC ATGCTGCCGG AGGCCATTCA GGCGATGCTC CCGGCGCCGC TCGTTTCCGC CGCCAATACG GTGAAGGCCG AGGGGGATAA ACTGGCGGCA 
TTTAACGATC CGCAAAACCG TTATGCGTTC TGTTTGACTT GCGCGAACGT TCGCTAA
|
Protein sequence | MRTLWRFIAG FFKWTWRVLN FVREMVLNLF FIFLVLVGVG IWMQIGNGNN SEQTARGALL LDISGVIVDK PSTNHRLGAL GRQLFGASSD RLQENSLFDI VNAIRQAKDD RNITGIVLDL KNFTGADQPS MRYIGKALRE FRDSGKPVFA VGENYSQGQY YLASFANKIW LSPQGQVDLH GFATNGLYYK TLLDKLKVST HVFRVGTYKS AVEPFIRDDM SPAAREADSR WIGELWQNYL HTVSANRQIS PQQLFPGAQA IIDGLTSVGG DTAKYALDHK LVDALASSAD VEKALTKQFG WSKTENNYRA ISYYDYSLKT PADTGGTIAV IFANGAIMDG EETPGNVGGD TTASQIRDAR LDPKVKAIVL RINSPGGSVN ASEVIRAELA AARAAGKPVV VSMGGMAASG GYWISTPANY IVASPSTLTG SIGIFGVINT VENSLSSIGV HSDGVSTSPL ADISMTKALS PEVQQMMQLS IEYGYKRFIT LVADARKRTP EQIDKIAQGH VWTGEDAKAN GLVDSLGDFD DAVAKAAELA KLKQWHLDYY QDEPTVLDMV MDSMTGSVRA MLPEAIQAML PAPLVSAANT VKAEGDKLAA FNDPQNRYAF CLTCANVR
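The gene and protein sequences above are related through translation table 11 (listed under Gene Information). The sketch below shows one way to check that relationship and to compute GC content from a pasted sequence. It is not the database's own method: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the example only translates the first three 10-base blocks and the final line of the gene sequence rather than the full 1857 bp.

```python
from itertools import product

# Codon assignments for translation table 11, listed in T, C, A, G order.
# Table 11 differs from the standard code in its permitted start codons,
# which does not matter here because this gene starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and the final line of the gene sequence, copied from above.
first_blocks = "ATGCGAACCC TGTGGCGATT TATTGCCGGA"
last_line = "TTTAACGATC CGCAAAACCG TTATGCGTTC TGTTTGACTT GCGCGAACGT TCGCTAA"

print(translate(first_blocks))  # MRTLWRFIAG
print(translate(last_line))     # FNDPQNRYAFCLTCANVR
```

The two printed fragments match the first 10 and last 18 residues of the protein sequence above. Passing the full gene sequence to `gc_percent` should give a value comparable to the GC content field, subject to how the database rounds it.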