Gene YpsIP31758_3323 details

Gene Information

Locus tag: YpsIP31758_3323
Symbol: degP
ID: 5388171
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 3734727
End bp: 3736172
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 50%
IMG OID: 640866338
Product: serine endoprotease
Protein accession: YP_001402280
Protein GI: 153950722
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
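
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the final codon is a stop codon excluded from the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3734727
end_bp = 3736172

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1446 0 481
```

Under these assumptions the result agrees with the listed gene length (1446 bp) and protein length (481 aa).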


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.00403874
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA CAACTTTAGT ATTAAGTGCA TTGGCATTGA GCATTGGTTT CGCCATGGGC 
CCGGTTTCTT CCGTCGTTGC GGCAGAGACG GCAGCATCGA GTAGCCAGCA GCTCCCTAGC
CTGGCGCCAA TGCTAGAGAA AGTAATGCCT TCAGTGGTCA GTATCAACGT TGAAGGTAGT
GCGCCTGTAA GCAGTGCTGG TGCACGCGGT ATGCCACCAC AATTCCAGCA GTTTTTTGGT
GATAACTCGC CATTCTGTCA GGACGGTTCA CCGTTCCAAG GCTCGCCAAT GTGTCAAGGG
GATCTGGGCG GACTAGGGCA GGGAATGCCA AGTAAGCGGG AATTCCGTTC GCTTGGTTCA
GGTGTCATTA TTGATGCGGG CAAGGGGTAT GTCGTTACCA ATAACCACGT GGTCGATAAT
GCGAACAAGA TCAGCGTAAA ACTGAGCGAT GGCCGCAGTT TTGATGCCAA GGTGATCGGT
AAAGATCCAC GTACCGATAT CGCACTGTTA CAACTGAAAG ACGCTAAAAA TCTGACTGCG
ATTAAGATTG CCAATTCGGA TCAACTGCGT GTCGGTGATT ATACCGTCGC TATCGGGAAC
CCGTATGGCT TGGGTGAAAC CGTGACATCC GGTATTGTCT CTGCTTTAGG GCGCAGTGGT
TTGAATGTAG AAAACTATGA AAACTTTATC CAGACTGATG CGGCGATTAA CCGCGGTAAT
TCCGGCGGCG CATTAATCAA CCTGAACGGT GAGTTGATTG GTATTAACAC CGCTATTCTG
GCACCGGATG GCGGTAACAT TGGTATTGGC TTTGCTATCC CAAGCAACAT GGTGAAGAAC
CTGACATCAC AGATGGTTGA GTTTGGTCAG GTAAAACGCG GTGAACTGGG CATTATGGGG
ACCGAGCTAA ACTCTGAACT GGCAAAAGCC ATGAAGGTTG ATGCGCAGAA AGGTGCCTTT
ATCAGCCAGG TCGTGCCTAA ATCTGCTGCG GCAAAAGCGG GTATCAAAGC GGGCGATATC
ATTGTCAGTA TGAATGGGAA AGCCATCAAT AGTTTTGCAG GGTTCCGCGC CGAGATCGGC
ACGTTACCTG TTGGCAGCAA AATGACCTTG GGTCTGCTGC GTGATGGCAA ACCGATCAAT
GTGAATGTCG TCCTGGAGCA GAGCAGCCAC AGTCAGGTGG AATCCGGTAA TCTCTACACC
GGTATTGAGG GGGCTGAACT GAGTAACAGC GACGTTAGCG GCAAGAAAGG GGTGAAAGTT
GATAGCGTAA AACCAGGCAC TGCTGCGGCG CGTATCGGCC TGAAAAAAGG TGATATCATC
ATGGGGATTA ACCAGCAACC AGTCCAGAAC CTAGGTGAGC TGCGGAAAAT CCTCGATGCT
AAACCACCGG TATTGGCGTT GAATATTCAA CGTGGTGATA CTTCACTCTA TTTATTGATG
CAGTAA
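
The record lists a GC content of 50% for this gene. The sketch below shows one way such a figure could be recomputed from a sequence laid out in space-separated blocks as above. The function name `gc_fraction` is hypothetical, and the demonstration uses only the first line of the gene sequence, so its result is not expected to match the whole-gene value.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

# First line of the gene sequence only (60 bases); paste the full sequence
# here to recompute the figure for the whole gene.
first_line = "ATGAAAAAAA CAACTTTAGT ATTAAGTGCA TTGGCATTGA GCATTGGTTT CGCCATGGGC"
print(f"{gc_fraction(first_line):.1%}")
```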
 
Protein sequence
MKKTTLVLSA LALSIGFAMG PVSSVVAAET AASSSQQLPS LAPMLEKVMP SVVSINVEGS 
APVSSAGARG MPPQFQQFFG DNSPFCQDGS PFQGSPMCQG DLGGLGQGMP SKREFRSLGS
GVIIDAGKGY VVTNNHVVDN ANKISVKLSD GRSFDAKVIG KDPRTDIALL QLKDAKNLTA
IKIANSDQLR VGDYTVAIGN PYGLGETVTS GIVSALGRSG LNVENYENFI QTDAAINRGN
SGGALINLNG ELIGINTAIL APDGGNIGIG FAIPSNMVKN LTSQMVEFGQ VKRGELGIMG
TELNSELAKA MKVDAQKGAF ISQVVPKSAA AKAGIKAGDI IVSMNGKAIN SFAGFRAEIG
TLPVGSKMTL GLLRDGKPIN VNVVLEQSSH SQVESGNLYT GIEGAELSNS DVSGKKGVKV
DSVKPGTAAA RIGLKKGDII MGINQQPVQN LGELRKILDA KPPVLALNIQ RGDTSLYLLM
Q
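
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed in the gene information. The following is a minimal sketch of that translation, not the database's own pipeline: the `translate` function and the table construction are illustrative. Table 11 assigns the same amino acids to codons as the standard genetic code, so the sketch builds the standard codon table and stops at the first stop codon.

```python
from itertools import product

# Bases and amino acids in the conventional NCBI order (TTT, TTC, TTA, TTG, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGAAAAAAA CAACTTTAGT ATTAAGTGCA TTGGCATTGA GCATTGGTTT CGCCATGGGC"
print(translate(first_line))  # MKKTTLVLSALALSIGFAMG
```

The output matches the first two blocks of the protein sequence (MKKTTLVLSA LALSIGFAMG). Applied to the final bases of the gene, `translate("CAGTAA")` returns `Q`, the last residue listed.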