Gene SNSL254_A3611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3611 
Symbol 
ID6482231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3497474 
End bp3498841 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content54% 
IMG OID642738886 
Productserine endoprotease 
Protein accessionYP_002042603 
Protein GI194445588 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.40714 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones83 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC ACACCCAGCT GTTAAGTGCA TTAGCGTTAA GTGTCGGGTT AACTCTTTCG 
GCGCCGTTTC CAGCCCTTGC ATCGATACCA GGCCAGGTGC CAGGCCAGGC GACGCTGCCA
AGCCTTGCCC CTATGCTGGA GAAAGTGCTG CCTGCTGTCG TCAGCGTAAA AGTCGAGGGA
ACCGCCGCCC AGAGCCAAAA AGTGCCGGAG GAGTTTAAAA AATTCTTTGG CGAGGATCTG
CCAGACCAGC CGTCCCAGCC GTTTGAAGGA CTCGGTTCGG GGGTGATTAT CGATGCCGCG
AAAGGCTATG TATTAACCAA TAATCATGTG ATTAATCAGG CACAGAAGAT CAGTATTCAA
CTGAATGACG GACGCGAATT CGACGCGAAG CTGATCGGCG GCGACGACCA GAGCGATATC
GCTCTGTTAC AAATTCAGAA TCCCAGCAAG TTAACGCAAA TTGCCATCGC CGATTCCGAC
AAACTCCGCG TCGGCGATTT CGCCGTGGCG GTCGGTAATC CGTTTGGTCT TGGACAAACC
GCCACCTCCG GGATTATTTC AGCGCTGGGA CGCAGCGGGC TTAATCTGGA AGGGCTTGAG
AACTTTATTC AAACCGATGC CTCTATTAAC CGCGGCAACT CCGGCGGCGC GCTGCTTAAC
CTGAACGGCG AGCTGATCGG GATTAATACC GCAATCCTCG CGCCAGGGGG CGGGAGCATC
GGCATTGGCT TTGCTATTCC TTCCAATATG GCGCAGACGC TGGCGCAGCA GTTGATTCAG
TTCGGCGAAA TCAAACGCGG ATTGCTGGGA ATTAAAGGCA CTGAAATGAC CGCTGATATC
GCTAAGGCAT TCAAACTGAA CGTTCAGCGT GGCGCTTTTG TCAGCGAGGT TTTACCCAAT
TCAGGTTCGG CGAAGGCCGG GGTGAAATCC GGAGACGTGA TTATCAGTCT TAACGGTAAG
CCGCTGAATA GCTTTGCCGA ACTGCGTTCA CGTATCGCCA CCACCGAACC GGGCACGAAA
GTGAAGCTGG GCCTGCTGCG CGATGGTAAG CCGCTGGAGG TGGAAGTCAC GCTGGATTCC
AATACCTCTT CTTCCGCCAG TGCCGAAATG ATCGCCCCGG CGTTGCAAGG CGCGACGTTG
AGCGACGGCC AACTGAAAGA CGGGACGAAA GGCGTTAAGG TTGATAGCGT CGAAAAAAGC
AGTCCTGCCG CGCAGGCCGG TTTGCAAAAA GATGATGTTA TCATCGGCGT TAACCGCGAT
CGTATCAGTT CTATCGCCGA AATGCGTAAA GTGATGGCGG CAAAACCGTC CATCATTGCT
CTTCAGGTAG TACGCGGCAA CGAGAACATT TATCTATTGC TGCGCTAA
 
Protein sequence
MKKHTQLLSA LALSVGLTLS APFPALASIP GQVPGQATLP SLAPMLEKVL PAVVSVKVEG 
TAAQSQKVPE EFKKFFGEDL PDQPSQPFEG LGSGVIIDAA KGYVLTNNHV INQAQKISIQ
LNDGREFDAK LIGGDDQSDI ALLQIQNPSK LTQIAIADSD KLRVGDFAVA VGNPFGLGQT
ATSGIISALG RSGLNLEGLE NFIQTDASIN RGNSGGALLN LNGELIGINT AILAPGGGSI
GIGFAIPSNM AQTLAQQLIQ FGEIKRGLLG IKGTEMTADI AKAFKLNVQR GAFVSEVLPN
SGSAKAGVKS GDVIISLNGK PLNSFAELRS RIATTEPGTK VKLGLLRDGK PLEVEVTLDS
NTSSSASAEM IAPALQGATL SDGQLKDGTK GVKVDSVEKS SPAAQAGLQK DDVIIGVNRD
RISSIAEMRK VMAAKPSIIA LQVVRGNENI YLLLR