Gene SeD_A3708 details

Gene Information

Locus tag: SeD_A3708
Symbol:
ID: 6872852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3556250
End bp: 3557617
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 54%
IMG OID: 642786683
Product: serine endoprotease
Protein accession: YP_002217317
Protein GI: 198243897
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
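
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which the short Python sketch below checks. It assumes the coordinates are 1-based and inclusive (the record does not say so, but the listed length only fits that convention) and that the final codon is a stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the Gene Information table.
START_BP = 3556250
END_BP = 3557617


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1368 455
```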


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0987273
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 77
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC ACACCCAGCT GTTAAGTGCA TTAGCGTTAA GTGTCGGGTT AACTCTTTCG 
GCGCCGTTTC CAGCCCTTGC ATCGATACCA AGCCAGGTGC CAGGCCAGGC GACGCTGCCA
AGCCTTGCCC CTATGCTGGA GAAAGTGCTG CCTGCTGTCG TCAGCGTAAA AGTCGAGGGA
ACCGCCGCCC AGAGCCAAAA AGTGCCGGAG GAGTTTAAAA AATTCTTTGG CGAGGATCTG
CCAGACCAGC CGTCCCAGCC GTTTGAAGGA CTCGGTTCGG GGGTGATTAT CGATGCCGCG
AAAGGCTATG TATTAACCAA TAATCATGTG ATTAATCAGG CACAGAAGAT CAGTATTCAA
CTGAATGACG GACGCGAATT CGACGCGAAG CTGATCGGCG GCGACGACCA GAGCGATATC
GCTCTGTTAC AAATTCAGAA TCCCAGCAAG TTAACGCAAA TTGCCATCGC CGATTCCGAC
AAACTCCGCG TCGGCGATTT CGCCGTGGCG GTCGGTAATC CGTTTGGTCT AGGACAAACC
GCCACCTCCG GGATTATTTC AGCGCTGGGA CGCAGCGGGC TTAATCTGGA AGGGCTTGAG
AACTTTATTC AAACCGATGC CTCTATTAAC CGCGGCAACT CCGGCGGCGC GCTGCTTAAC
CTGAACGGCG AGCTGATCGG GATTAATACC GCGATCCTCG CGCCAGGGGG CGGGAGCATC
GGCATTGGCT TTGCTATTCC TTCCAATATG GCGCAGACGC TGGCGCAGCA GTTGATTCAG
TTCGGCGAAA TCAAACGCGG ATTGCTGGGA ATTAAAGGCA CTGAAATGAC CGCTGATATT
GCTAAGGCAT TCAAACTGAA CGTTCAGCGT GGCGCTTTTG TCAGCGAGGT TTTACCCAAT
TCAGGTTCGG CGAAGGCCGG GGTGAAATCC GGAGACGTGA TTATCAGTCT TAACGGTAAG
CCGCTGAATA GCTTTGCCGA ACTGCGTTCG CGTATCGCCA CCACCGAACC GGGCACGAAA
GTGAAGCTGG GCCTGCTGCG CGATGGTAAG CCGCTGGAGG TGGAAGTCAC GCTGGATTCC
AATACCTCTT CTTCCGCCAG CGCCGAAATG ATCGCCCCGG CGTTGCAAGG CGCGACGTTG
AGCGACGGCC AGCTGAAAGA CGGGACGAAA GGCGTTAAGG TTGATAGCGT CGAAAAAAGC
AGTCCTGCCG CGCAGGCCGG TTTGCAAAAA GATGATGTTA TCATCGGCGT TAACCGCGAT
CGCATCAGTT CTATCGCCGA AATGCGCAAA GTGATGGCGG CAAAACCGTC CATCATTGCT
CTTCAGGTAG TACGCGGCAA CGAGAACATT TATCTATTGC TGCGCTAA
 
Protein sequence
MKKHTQLLSA LALSVGLTLS APFPALASIP SQVPGQATLP SLAPMLEKVL PAVVSVKVEG 
TAAQSQKVPE EFKKFFGEDL PDQPSQPFEG LGSGVIIDAA KGYVLTNNHV INQAQKISIQ
LNDGREFDAK LIGGDDQSDI ALLQIQNPSK LTQIAIADSD KLRVGDFAVA VGNPFGLGQT
ATSGIISALG RSGLNLEGLE NFIQTDASIN RGNSGGALLN LNGELIGINT AILAPGGGSI
GIGFAIPSNM AQTLAQQLIQ FGEIKRGLLG IKGTEMTADI AKAFKLNVQR GAFVSEVLPN
SGSAKAGVKS GDVIISLNGK PLNSFAELRS RIATTEPGTK VKLGLLRDGK PLEVEVTLDS
NTSSSASAEM IAPALQGATL SDGQLKDGTK GVKVDSVEKS SPAAQAGLQK DDVIIGVNRD
RISSIAEMRK VMAAKPSIIA LQVVRGNENI YLLLR
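
As a rough illustration of how the protein sequence relates to the gene sequence, the Python sketch below translates a nucleotide string codon by codon and computes its GC fraction. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons). The helper names `clean`, `translate` and `gc_fraction` are hypothetical. Only the first 60-base row of the gene sequence is used here; pasting the full sequence in its place would be the obvious next step.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence shown above.
first_row = "ATGAAAAAAC ACACCCAGCT GTTAAGTGCA TTAGCGTTAA GTGTCGGGTT AACTCTTTCG"
print(translate(first_row))  # MKKHTQLLSALALSVGLTLS
```

The printed residues match the first 20 residues of the protein sequence listed above.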