Gene SeSA_A3540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A3540 
Symbol 
ID6517354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp3413647 
End bp3415002 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content54% 
IMG OID642748526 
Productserine endoprotease 
Protein accessionYP_002116296 
Protein GI194735465 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC ACACCCAGCT GTTAAGTGCA TTAGCGTTAA GTGTCGGGTT AACTCTTTCG 
GCGCCGTTTC CAGCCCTTGC ATCGATACCA GGCCAGGCGA CGCTGCCAAG CCTTGCCCCT
ATGCTGGAGA AAGTGCTGCC TGCTGTCGTC AGCGTAAAAG TCGAGGGAAC CGCCGCCCAG
AGCCAAAAAG TGCCGGAGGA GTTTAAAAAA TTCTTTGGCG AGGATCTGCC AGACCAGCCG
TCCCAGCCGT TTGAAGGACT CGGTTCGGGG GTGATTATCG ATGCCGCGAA AGGCTATGTA
TTAACCAATA ATCATGTGAT TAATCAGGCA CAAAAGATCA GCATTCAACT GAATGACGGA
CGCGAATTCG ACGCGAAGCT GATCGGCGGC GACGACCAGA GCGATATCGC TCTGTTACAA
ATTCAGAATC CCAGCAAGTT AACGCAAATT GCCATCGCCG ATTCCGACAA ACTCCGCGTC
GGCGATTTCG CCGTGGCGGT CGGTAATCCG TTTGGTCTTG GACAAACCGC CACCTCCGGG
ATTATTTCAG CGCTGGGACG CAGCGGGCTT AATCTGGAAG GGCTTGAGAA CTTTATTCAA
ACCGATGCCT CTATTAACCG CGGCAACTCC GGCGGCGCGC TGCTTAACCT GAACGGCGAG
CTGATCGGGA TTAATACCGC GATCCTCGCG CCAGGTGGCG GGAGTATCGG CATTGGCTTT
GCTATTCCTT CCAATATGGC GCAGACGCTG GCGCAGCAGT TGATTCAGTT CGGCGAAATC
AAACGCGGAT TGCTGGGAAT TAAAGGCACT GAAATGACCG CTGATATCGC CAAGGCATTC
AAACTGAACG TTCAGCGTGG CGCTTTTGTC AGCGAGGTTT TACCCAATTC AGGTTCGGCG
AAGGCCGGGG TGAAATCCGG AGACGTGATT ATCAGTCTTA ACGGTAAGCC GCTGAATAGC
TTTGCCGAAC TGCGTTCACG TATCGCCACC ACCGAACCGG GCACGAAAGT GAAGCTGGGC
CTGCTGCGCG ATGGTAAGCC GCTGGAGGTG GAAGTCACGC TGGATTCCAA TACCTCTTCT
TCCGCCAGCG CCGAAATGAT CGCCCCGGCG TTGCAAGGCG CGACGTTGAG CGACGGCCAG
CTGAAAGACG GGACGAAAGG CGTTAAGGTT GATAGCGTCG AAAAAAGCAG TCCTGCCGCG
CAGGCCGGTT TGCAAAAAGA TGATGTTATC ATCGGCGTTA ACCGCGATCG TATCAGTTCT
ATCGCCGAAA TGCGCAAAGT GATGGCGGCA AAACCGTCCA TCATTGCTCT TCAGGTAGTA
CGCGGCAACG AGAACATTTA TCTATTGCTA CGCTAA
 
Protein sequence
MKKHTQLLSA LALSVGLTLS APFPALASIP GQATLPSLAP MLEKVLPAVV SVKVEGTAAQ 
SQKVPEEFKK FFGEDLPDQP SQPFEGLGSG VIIDAAKGYV LTNNHVINQA QKISIQLNDG
REFDAKLIGG DDQSDIALLQ IQNPSKLTQI AIADSDKLRV GDFAVAVGNP FGLGQTATSG
IISALGRSGL NLEGLENFIQ TDASINRGNS GGALLNLNGE LIGINTAILA PGGGSIGIGF
AIPSNMAQTL AQQLIQFGEI KRGLLGIKGT EMTADIAKAF KLNVQRGAFV SEVLPNSGSA
KAGVKSGDVI ISLNGKPLNS FAELRSRIAT TEPGTKVKLG LLRDGKPLEV EVTLDSNTSS
SASAEMIAPA LQGATLSDGQ LKDGTKGVKV DSVEKSSPAA QAGLQKDDVI IGVNRDRISS
IAEMRKVMAA KPSIIALQVV RGNENIYLLL R