Gene YpsIP31758_0455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_0455 
SymboldegS 
ID5384836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp529156 
End bp530244 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content48% 
IMG OID640863424 
Productserine endoprotease 
Protein accessionYP_001399448 
Protein GI153949989 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02038] periplasmic serine pepetdase DegS 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.000102083 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCTTA AGCTATTGCG TTCTATTATT TTGGGGCTAA TTGTTGCCGG TATTCTGCTG 
GTTGCCCTAC CCATGCTCCG CAGCCCAGGT TATTTATTCT CTGGAAAAAG CAATAACGTA
AATGAAGAGG TTCCTACCAG TTATAACCAA GCAGTACGTC GTGCCGCACC GGCGGTGGTC
AATGTCTATA ACCGGAGCCT GAGCGCTACA CAGCAGGGGT TAGCCATCCG TACGCTGGGC
TCGGGTGTGA TCATGAGCGA TAAGGGCTAT ATCCTTACCA ATAAACACGT TATCAATGAT
GCAGAACAGA TCATTGTCGC CATGCAAAAT GGCCGTATCT CAGAAGCTTT ATTGGTCGGT
TCAGATAATC TGACAGATTT AGCCGTCCTA AAGATTGACG CAACAAACCT GCCGGTGATC
CCCATTAATA TTAACCGCAC ACCACATATT GGTGACGTCG TGTTGGCAAT TGGTAACCCT
TATAACCTTG GGCAGACAGT AACGCAGGGG ATTATCAGTG CAACCGGGCG TATTGGTTTA
AGCTCTTCCG GGCGGCAAAA TTTCCTGCAA ACAGATGCAT CAATTAATCA GGGTAATTCC
GGCGGTGCGC TGGTCAACAC CCTTGGCGAG CTAATGGGGA TCAACACGCT CTCATTTGAT
AAAAGCAATA ATGGCGAAAC ACCGGAAGGC ATCGGCTTTG CGATCCCAAC AGCACTGGCA
ACGAAAGTGA TGGAAAAACT GATCCGTGAT GGGCGGGTGA TCCGTGGTTA TATCGGTATT
ACCGGCGAGG AGTACCCACC GTTTAATGCT AACGATAATG GCTCAGATCG GGTACACGGT
ATTAAGGTCA AAAAAGTTTC ACCAGACGGC CCAGCGGCCC AGGCAGGAAT ACACGTTGGC
GATATCATTC TTAACGTGAA TAATAAACCG GCAACCTCCG TGATCGAAAC TATGGATCAG
GTCGCAGAAG TCCGCCCTGG TACGACCATT CCTGTTTTAC TATTACGTAA TGGTCAGCAG
ATAGCGGTTC AAATCACCAT CACTGAACTC GATCAGAATG AGATGCTGAC CACCCAAGCA
GCAGATTAA
 
Protein sequence
MFLKLLRSII LGLIVAGILL VALPMLRSPG YLFSGKSNNV NEEVPTSYNQ AVRRAAPAVV 
NVYNRSLSAT QQGLAIRTLG SGVIMSDKGY ILTNKHVIND AEQIIVAMQN GRISEALLVG
SDNLTDLAVL KIDATNLPVI PININRTPHI GDVVLAIGNP YNLGQTVTQG IISATGRIGL
SSSGRQNFLQ TDASINQGNS GGALVNTLGE LMGINTLSFD KSNNGETPEG IGFAIPTALA
TKVMEKLIRD GRVIRGYIGI TGEEYPPFNA NDNGSDRVHG IKVKKVSPDG PAAQAGIHVG
DIILNVNNKP ATSVIETMDQ VAEVRPGTTI PVLLLRNGQQ IAVQITITEL DQNEMLTTQA
AD