Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B1880 |
Symbol | sppA |
ID | 6792596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 1838514 |
End bp | 1840370 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642776110 |
Product | protease 4 |
Protein accession | YP_002146744 |
Protein GI | 197250444 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000366121 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAACCC TGTGGCGATT TATTGCCGGA TTTTTTAAAT GGACGTGGCG AGTGTTGAAC TTCGTCCGTG AAATGGTACT CAACCTGTTC TTTATTTTTC TGGTCCTGGT GGGCGTTGGG ATCTGGATGC AGATCGGTAA CGGCAGCAAC AGTGAGCAAA CGGCGCGCGG CGCTTTGCTG CTGGATATTT CCGGCGTCAT TGTGGATAAA CCCTCCACCA ATCACCGTCT GGGCGCGCTG GGACGCCAGT TATTTGGCGC CAGCTCCGAC CGTCTGCAGG AAAACTCCCT GTTTGACATC GTCAACGCTA TTCGTCAGGC GAAAGATGAC CGTAACATCA CTGGTATCGT TCTGGATCTG AAAAACTTCA CCGGTGCCGA TCAGCCGTCA ATGCGCTATA TCGGTAAAGC GCTGCGCGAA TTCCGCGACA GCGGCAAACC GGTTTTCGCC GTGGGTGAAA ACTACAGCCA GGGTCAGTAT TACCTCGCCA GTTTCGCCAA TAAAATTTGG CTTTCCCCAC AGGGTCAGGT AGACCTTCAC GGCTTCGCTA CGAATGGCCT GTACTACAAA ACGCTGCTGG ATAAACTGAA AGTCTCTACC CACGTTTTCC GGGTCGGCAC CTATAAATCC GCCGTCGAGC CGTTTATCCG CGACGATATG TCGCCCGCCG CCCGCGAGGC CGACAGCCGC TGGATAGGCG AACTGTGGCA GAACTACCTG CATACCGTTT CCGCCAATCG CCAGATTTCG CCGCAACAAC TCTTCCCCGG CGCGCAGGCT ATTATCGACG GGTTAACCAG CGTGGGCGGC GACACCGCCA AATATGCGCT CGACCATAAA CTGGTGGACG CCCTCGCCTC CAGCGCAGAT GTTGAAAAAG CGCTGACGAA GCAGTTTGGC TGGAGCAAAA CCGAAAATAA CTATCGCGCG ATCAGCTATT ACGATTATTC GCTGAAAACG CCTGCGGATA CCGGCGGTAC TATTGCGGTT ATTTTCGCCA ATGGCGCGAT TATGGATGGC GAAGAAACGC CAGGGAATGT CGGCGGCGAC ACTACGGCAT CGCAGATCCG CGACGCACGC CTTGATCCTA AAGTGAAAGC GATTGTGCTG CGCGTCAATA GCCCAGGCGG TAGCGTCAAC GCCTCCGAAG TTATCCGCGC CGAACTGGCG GCGGCAAGAG CGGCTGGCAA ACCGGTGGTG GTCTCAATGG GCGGTATGGC GGCCTCCGGC GGTTACTGGA TCTCTACGCC GGCAAACTAT ATCGTGGCCA GCCCCAGCAC GCTGACGGGT TCAATTGGCA TCTTCGGCGT CATCAATACG GTGGAAAACA GCCTGTCGTC GATTGGCGTA CACAGCGACG GCGTTTCCAC CTCGCCGCTG GCGGATATTT CGATGACCAA AGCGCTGTCA CCGGAAGTGC AGCAGATGAT GCAACTCAGT ATTGAGTACG GCTACAAACG CTTTATCACG CTGGTGGCAG ACGCGCGTAA GCGTACGCCG GAGCAGATTG ATAAAATCGC ACAAGGCCAT GTCTGGACCG GAGAAGACGC GAAAGCCAAT GGTCTGGTGG ACAGTCTTGG CGACTTTGAC GACGCCGTCG CCAAAGCGGC GGAGCTGGCG AAACTGAAAC AGTGGCATCT TGATTACTAT CAGGACGAAC CGACGGTCCT TGATATGGTC ATGGACAGTA TGACCGGATC AGTACGCGCC ATGCTGCCGG AGGCCATTCA GGCGATGCTC CCGGCGCCGC TCGTTTCCGC CGCCAACACG GTGAAGGCCG AGGGGGATAA ACTGGCGGCA TTTAACGATC CGCAAAACCG TTATGCGTTC TGTTTGACTT GCGCGAACGT TCGCTAA
|
Protein sequence | MRTLWRFIAG FFKWTWRVLN FVREMVLNLF FIFLVLVGVG IWMQIGNGSN SEQTARGALL LDISGVIVDK PSTNHRLGAL GRQLFGASSD RLQENSLFDI VNAIRQAKDD RNITGIVLDL KNFTGADQPS MRYIGKALRE FRDSGKPVFA VGENYSQGQY YLASFANKIW LSPQGQVDLH GFATNGLYYK TLLDKLKVST HVFRVGTYKS AVEPFIRDDM SPAAREADSR WIGELWQNYL HTVSANRQIS PQQLFPGAQA IIDGLTSVGG DTAKYALDHK LVDALASSAD VEKALTKQFG WSKTENNYRA ISYYDYSLKT PADTGGTIAV IFANGAIMDG EETPGNVGGD TTASQIRDAR LDPKVKAIVL RVNSPGGSVN ASEVIRAELA AARAAGKPVV VSMGGMAASG GYWISTPANY IVASPSTLTG SIGIFGVINT VENSLSSIGV HSDGVSTSPL ADISMTKALS PEVQQMMQLS IEYGYKRFIT LVADARKRTP EQIDKIAQGH VWTGEDAKAN GLVDSLGDFD DAVAKAAELA KLKQWHLDYY QDEPTVLDMV MDSMTGSVRA MLPEAIQAML PAPLVSAANT VKAEGDKLAA FNDPQNRYAF CLTCANVR
|
| |