Gene SbBS512_E3542 details

Gene Information

Locus tag: SbBS512_E3542
Symbol: degQ
ID: 6271575
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3293932
End bp: 3295299
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 52%
IMG OID: 641727413
Product: serine endoprotease
Protein accession: YP_001881859
Protein GI: 187731297
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
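
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that Gene Length includes the stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3293932
end_bp = 3295299
gene_length_bp = 1368
protein_length_aa = 455

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree with one another: the span is 1368 bp, which is 456 codons, one of which is the stop codon.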


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0199857
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAC AAACCCAGCT GTTGAGTGCA TTAGCGTTAA GTGTCGGGTT AACTCTCTCG 
GCGTCATTTC AGGCCGTCGC GTCGATTCCA GGCCAGGTTG CCGATCAGGC CCCTCTCCCC
AGTCTGGCTC CAATGCTGGA AAAAGTGCTT CCGGCAGTGG TGAGCGTACG GGTGGAAGGA
ACGGCCAGTC AGGGACAGAA AATCCCGGAA GAATTCAAAA AGTTTTTTGG TGATGATTTA
CCGGATCAAC CTGCACAACC CTTCGAAGGT TTAGGTTCCG GTGTCATCAT CAACGCCAGT
AAAGGCTATG TGCTGACCAA CAACCATGTG ATTAATCAGG CACAGAAAAT CAGTATTCAG
CTCAATGATG GGCGCGAGTT TGATGCCAAA CTGATTGGTA GCGATGACCA GAGCGATATC
GCCCTGTTAC AAATTCAAAA CCCGAGCAAA TTAACGCAAA TCGCTATTGC CGACTCCGAT
AAATTGCGCG TCGGTGATTT TGCCGTAGCG GTCGGTAACC CATTTGGCCT TGGGCAAACC
GCCACCTCTG GCATTGTTTC CGCATTAGGC CGCAGCGGGT TGAATCTTGA AGGTCTGGAA
AACTTTATCC AGACAGATGC TTCCATTAAC CGCGGTAACT CCGGCGGTGC ACTATTAAAC
CTTAACGGTG AGTTAATTGG CATCAACACT GCAATCCTTG CGCCTGGCGG CGGGAGCGTC
GGGATTGGAT TTGCCATCCC CAGTAATATG GCGCGAACAC TGGCGCAGCA GCTTATCGAC
TTTGGTGAAA TCAAACGCGG TTTGTTAGGC ATCAAAGGCA CCGAGATGAG TGCCGATATC
GCCAAAGCCT TCAACCTTGA CGTGCAGCGT GGCGCGTTTG TCAGCGAAGT GTTGCCAGGT
TCTGGCTCGG CAAAAGCGGG CGTCAAAGCG GGCGATATTA TTACCAGCCT CAACGGCAAA
CCGCTGAATA GCTTTGCTGA GTTGCGCTCT CGTATCGCGA CTACAGAGCC GGGCACGAAA
GTGAAGCTTG GCCTGCTGCG TAACGGCAAA CCACTGGAAG TAGAAGTGAC GCTTGATACC
AGCACCTCTT CGTCGGCCAG CGCTGAAATG ATCACGCCAG CGCTGGAAGG TGCAACGTTG
AGCGATGGTC AGCTAAAAGA TGGCGGCAAA GGTATTAAGA TCGATGAAGT TGTCAAAGGA
AGCCCAGCTG CTCAGGCAGG CTTGCAAAAA GACGATGTGA TCATTGGCGT CAACCGCGAT
CGGGTGAACT CGATTGCTGA AATGCGTAAA GTGCTGGCGG CAAAACCGGC CATCATCGCC
CTGCAAATTG TACGCGGCAA TGAAAGCATC TATCTGCTGA TGCGTTAA
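
The gene sequence is printed in blocks of ten bases, so it needs to be joined before its length or GC content can be recomputed. The following is a minimal sketch of that clean-up, not the tool that produced this record; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as a demonstration input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGAAAAAAC AAACCCAGCT GTTGAGTGCA TTAGCGTTAA GTGTCGGGTT AACTCTCTCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence to the same two functions allows the result to be compared with the Gene Length and GC content fields listed in this record.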
 
Protein sequence
MKKQTQLLSA LALSVGLTLS ASFQAVASIP GQVADQAPLP SLAPMLEKVL PAVVSVRVEG 
TASQGQKIPE EFKKFFGDDL PDQPAQPFEG LGSGVIINAS KGYVLTNNHV INQAQKISIQ
LNDGREFDAK LIGSDDQSDI ALLQIQNPSK LTQIAIADSD KLRVGDFAVA VGNPFGLGQT
ATSGIVSALG RSGLNLEGLE NFIQTDASIN RGNSGGALLN LNGELIGINT AILAPGGGSV
GIGFAIPSNM ARTLAQQLID FGEIKRGLLG IKGTEMSADI AKAFNLDVQR GAFVSEVLPG
SGSAKAGVKA GDIITSLNGK PLNSFAELRS RIATTEPGTK VKLGLLRNGK PLEVEVTLDT
STSSSASAEM ITPALEGATL SDGQLKDGGK GIKIDEVVKG SPAAQAGLQK DDVIIGVNRD
RVNSIAEMRK VLAAKPAIIA LQIVRGNESI YLLMR
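
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is one way to do that under stated assumptions: it uses the standard amino acid assignments, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise here). `translate` is a hypothetical helper, not part of any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the first 20 residues of the protein sequence.
gene_start = "ATGAAAAAAC AAACCCAGCT GTTGAGTGCA TTAGCGTTAA GTGTCGGGTT AACTCTCTCG"
protein_start = "MKKQTQLLSA LALSVGLTLS".replace(" ", "")
print(translate(gene_start) == protein_start)
```

The same function can be applied to the full gene sequence and the result compared with the complete protein sequence, whose length should equal the Protein Length field.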