Gene SNSL254_A0230 details

Gene Information

Locus tag: SNSL254_A0230
Symbol: degP
ID: 6485532
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 246642
End bp: 248078
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 55%
IMG OID: 642735667
Product: serine endoprotease
Protein accession: YP_002039449
Protein GI: 194443684
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
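
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS whose length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(246642, 248078)
print(length_bp)                  # 1437
print(protein_length(length_bp))  # 478
```

Both values agree with the Gene Length and Protein Length fields of this record.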


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAACACA TGAAAAAAAC CACATTAGCA ATGAGTGCAC TGGCTCTGAG TTTAGGTTTG 
GCATTGTCGC CTCTGTCTGC CACGGCGGCT GAAACGTCCT CTTCAGCAAT GACTGCCCAG
CAGATGCCAA GCCTGGCACC GATGCTCGAA AAAGTGATGC CATCGGTGGT CAGTATTAAT
GTTGAAGGTA GCACCACGGT GAATACGCCG CGTATGCCGC GTAATTTCCA GCAGTTCTTT
GGCGATGACT CCCCGTTCTG CCAGGACGGT TCTCCGTTCC AGAATTCTCC GTTCTGCCAG
GGCGGCGGTA ACGGCGGCAA CGGCGGTCAA CAACAGAAAT TCATGGCGCT GGGCTCCGGC
GTAATTATTG ACGCCGCGAA GGGCTACGTC GTCACCAACA ACCACGTGGT TGATAACGCC
AGCGTGATTA AAGTACAGCT TAGCGATGGG CGTAAATTCG ATGCTAAAGT GGTGGGCAAA
GATCCGCGTT CTGATATCGC GCTGATTCAA ATTCAGAATC CGAAGAACCT GACGGCGATT
AAGCTGGCGG ACTCCGACGC GCTGCGCGTG GGGGATTATA CCGTCGCTAT TGGTAACCCG
TTTGGTCTGG GCGAAACGGT GACGTCAGGT ATCGTTTCGG CGCTGGGGCG TAGCGGCCTG
AACGTAGAAA ATTACGAGAA CTTTATTCAG ACCGACGCCG CGATTAACCG CGGTAACTCC
GGCGGCGCGC TGGTGAACCT GAACGGTGAG CTGATCGGTA TTAACACCGC GATTCTGGCG
CCGGACGGCG GCAACATCGG TATCGGCTTC GCTATCCCCA GTAACATGGT GAAAAACCTG
ACGTCGCAGA TGGTGGAATA CGGCCAGGTG AAACGCGGCG AACTGGGGAT CATGGGGACT
GAGCTGAATT CCGAATTGGC GAAAGCGATG AAAGTCGACG CCCAGCGAGG CGCGTTCGTC
AGCCAGGTGA TGCCGAATTC GTCCGCGGCG AAAGCGGGTA TCAAAGCCGG GGATGTCATT
ACCTCGCTGA ACGGTAAACC GATCAGCAGC TTTGCGGCGC TGCGCGCTCA GGTCGGCACT
ATGCCGGTCG GCAGCAAAAT CAGCCTCGGT CTGCTGCGTG AAGGTAAAGC GATTACGGTG
AATCTGGAAC TGCAGCAGAG CAGCCAGAGT CAGGTTGATT CCAGCACCAT CTTCAGCGGG
ATTGAAGGCG CTGAAATGAG CAATAAAGGC CAGGATAAAG GCGTTGTGGT GAGCAGCGTG
AAAGCGAACT CACCCGCCGC GCAAATTGGC CTCAAAAAAG GCGATGTGAT TATCGGCGCT
AACCAGCAGC CGGTGAAAAA TATCGCCGAG CTGCGTAAGA TTCTCGACAG CAAGCCGTCG
GTTCTGGCGC TGAATATTCA GCGTGGTGAT AGTTCTATTT ATTTGCTGAT GCAGTAA
 
Protein sequence
MKHMKKTTLA MSALALSLGL ALSPLSATAA ETSSSAMTAQ QMPSLAPMLE KVMPSVVSIN 
VEGSTTVNTP RMPRNFQQFF GDDSPFCQDG SPFQNSPFCQ GGGNGGNGGQ QQKFMALGSG
VIIDAAKGYV VTNNHVVDNA SVIKVQLSDG RKFDAKVVGK DPRSDIALIQ IQNPKNLTAI
KLADSDALRV GDYTVAIGNP FGLGETVTSG IVSALGRSGL NVENYENFIQ TDAAINRGNS
GGALVNLNGE LIGINTAILA PDGGNIGIGF AIPSNMVKNL TSQMVEYGQV KRGELGIMGT
ELNSELAKAM KVDAQRGAFV SQVMPNSSAA KAGIKAGDVI TSLNGKPISS FAALRAQVGT
MPVGSKISLG LLREGKAITV NLELQQSSQS QVDSSTIFSG IEGAEMSNKG QDKGVVVSSV
KANSPAAQIG LKKGDVIIGA NQQPVKNIAE LRKILDSKPS VLALNIQRGD SSIYLLMQ
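
The gene sequence starts with TTG, while the protein sequence starts with M. This is expected under translation table 11, where TTG is one of the alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation; the helper name `translate_cds` and the hard-coded start-codon set are illustrative assumptions, not code from the database that produced this record.

```python
# Codon table in NCBI order (T, C, A, G). Table 11 uses the same amino-acid
# assignments as the standard code and differs in its permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Start codons listed for translation table 11 (assumed set).
START_CODONS = {"ATG", "TTG", "CTG", "GTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace is ignored and translation stops at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence above, spacing as in the record.
print(translate_cds("TTGAAACACA TGAAAAAAAC C"))  # MKHMKKT
```

The printed residues match the first seven residues of the protein sequence above.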