Gene SeD_A2051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2051 
SymbolsppA 
ID6873195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1982861 
End bp1984717 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content56% 
IMG OID642785165 
Productprotease 4 
Protein accessionYP_002215831 
Protein GI198245686 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00705] signal peptide peptidase SppA, 67K type
[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.278981 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACCC TGTGGCGATT TATTGCCGGA TTTTTTAAAT GGACGTGGCG AGTGTTGAAC 
TTCGTCCGTG AAATGGTACT CAACCTGTTC TTTATTTTTC TGGTCCTGGT GGGCGTTGGG
ATCTGGATGC AGATCGGTAA CGGCAACAAC AGTGAGCAAA CGGCGCGCGG CGCTCTGCTG
CTGGATATTT CCGGCGTCAT TGTGGATAAA CCCTCCACCA ATCACCGTCT GGGCGCGCTG
GGTCGCCAGT TATTTGGCGC CAGCTCCGAC CGTCTGCAGG AAAACTCCCT GTTTGACATC
GTCAACGCTA TTCGTCAGGC GAAAGATGAC CGTAACATCA CGGGTATCGT TCTGGATCTG
AAAAACTTCA CCGGCGCCGA TCAGCCGTCA ATGCGCTATA TCGGTAAAGC GCTGCGCGAA
TTCCGCGACA GCGGCAAACC GGTTTTCGCC GTGGGTGAAA ACTACAGCCA GGGTCAGTAT
TACCTCGCCA GTTTCGCCAA TAAAATTTGG CTTTCCCCAC AGGGTCAGGT AGATCTTCAC
GGCTTCGCTA CGAATGGCCT GTACTACAAA ACGCTGCTGG ATAAGCTGAA AGTCTCTACC
CACGTTTTCC GGGTCGGCAC CTATAAATCC GCCGTCGAGC CGTTTATCCG CGACGATATG
TCGCCCGCCG CCCGCGAGGC CGACAGCCGC TGGATAGGCG AACTGTGGCA GAACTACCTG
CATACCGTTT CCGCCAATCG CCAGATTTCG CCGCAACAAC TCTTCCCCGG CGCGCAGGCT
ATTATCGACG GGTTAACTAG CGTGGGCGGC GACACCGCCA AATATGCGCT CGACCATAAA
CTGGTGGACG CCCTCGCCTC CAGCGCAGAT GTTGAAAAAG CGCTGACGAA GCAGTTTGGC
TGGAGCAAAA CCGAAAATAA CTATCGCGCG ATCAGTTATT ACGATTATTC GCTGAAAACG
CCTGCGGATA CCGGCGGTAC TATTGCGGTT ATTTTCGCCA ATGGCGCGAT TATGGATGGC
GAAGAAACAC CAGGGAATGT CGGCGGCGAC ACTACGGCAT CGCAGATCCG CGACGCACGC
CTTGATCCTA AAGTGAAAGC GATTGTGCTG CGCATCAATA GCCCAGGCGG TAGCGTCAAC
GCCTCCGAAG TTATCCGCGC CGAACTGGCG GCGGCAAGAG CGGCTGGCAA ACCGGTGGTG
GTCTCAATGG GCGGTATGGC GGCCTCCGGC GGTTACTGGA TCTCTACGCC GGCAAACTAT
ATCGTGGCCA GCCCCAGCAC GCTGACGGGT TCAATTGGCA TCTTCGGCGT CATCAATACG
GTAGAAAACA GCCTGTCGTC GATTGGCGTA CACAGCGACG GCGTTTCCAC CTCGCCGCTG
GCGGATATTT CGATGACCAA AGCGCTGTCA CCGGAAGTGC AGCAGATGAT GCAACTCAGT
ATTGAGTACG GCTACAAACG CTTTATCACG CTGGTGGCAG ACGCGCGTAA GCGTACGCCG
GAGCAGATTG ATAAAATCGC GCAAGGCCAT GTCTGGACCG GAGAAGACGC GAAAGCCAAT
GGTCTGGTGG ACAGTCTCGG CGACTTTGAC GACGCCGTCG CCAAAGCGGC GGAGCTGGCG
AAACTGAAAC AGTGGCATCT TGATTACTAT CAGGACGAAC CGACGGTCCT TGATATGGTC
ATGGACAGTA TGACCGGATC AGTACGCGCC ATGCTGCCGG AGGCCATTCA GGCGATGCTC
CCGGCGCCGC TCGTTTCCGC CGCCAATACG GTGAAGGCCG AGGGGGATAA ACTGGCGGCA
TTTAACGATC CGCAAAACCG TTATGCGTTC TGTTTGACTT GCGCGAACGT TCGCTAA
 
Protein sequence
MRTLWRFIAG FFKWTWRVLN FVREMVLNLF FIFLVLVGVG IWMQIGNGNN SEQTARGALL 
LDISGVIVDK PSTNHRLGAL GRQLFGASSD RLQENSLFDI VNAIRQAKDD RNITGIVLDL
KNFTGADQPS MRYIGKALRE FRDSGKPVFA VGENYSQGQY YLASFANKIW LSPQGQVDLH
GFATNGLYYK TLLDKLKVST HVFRVGTYKS AVEPFIRDDM SPAAREADSR WIGELWQNYL
HTVSANRQIS PQQLFPGAQA IIDGLTSVGG DTAKYALDHK LVDALASSAD VEKALTKQFG
WSKTENNYRA ISYYDYSLKT PADTGGTIAV IFANGAIMDG EETPGNVGGD TTASQIRDAR
LDPKVKAIVL RINSPGGSVN ASEVIRAELA AARAAGKPVV VSMGGMAASG GYWISTPANY
IVASPSTLTG SIGIFGVINT VENSLSSIGV HSDGVSTSPL ADISMTKALS PEVQQMMQLS
IEYGYKRFIT LVADARKRTP EQIDKIAQGH VWTGEDAKAN GLVDSLGDFD DAVAKAAELA
KLKQWHLDYY QDEPTVLDMV MDSMTGSVRA MLPEAIQAML PAPLVSAANT VKAEGDKLAA
FNDPQNRYAF CLTCANVR