Gene SbBS512_E2361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2361 
Symbol 
ID6270636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2148808 
End bp2150568 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content52% 
IMG OID641726365 
Productpeptidase, S16 (lon protease) family 
Protein accessionYP_001880847 
Protein GI187732130 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000000251656 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGACCATTA CGAAACTTGC ATGGCGTGAC CTGGTTCCTG ATACCGATAG CTATCAGGAA 
ATATTTGCTC AGCCACATTT GATTGACGAA AACGATCCTT TATTCAGTGA TACTCAACCG
CGGCTGCAAT TTGCGCTGGA GCAGTTGCTG CATACGCGAG CATCCTCCTC TTTTATGCTG
GCTAAGGCCC CGGAAGAGTC TGAGTATCTG AATCTTATTG CCGATGCCGC GCGTACGCTA
CAAAGCGATG CAGGCCAACT GGTGGGCGAT CACTATGAGG TTTCCGGCCA CTCCATCCGC
TTACGTCACG CAGTGAGTGC AGATGATAAT TTTGCGACTT TAACGCAAGT TGTCGCTGCC
GACTGGGTAG AAGCGGAGCA ACTCTTTGGC TGCCTGCGCC AGTTTAATGG CGACATTACC
CTGCAGCCTG GTCTGGTGCA TCAGGCAAAT GGCGGTATTC TCATCATCTC TTTGCGTACA
CTGCTGGCGC AACCTCTGCT GTGGATGCGG CTGAAAAATA TCGTTAACCG CGAGCGTTTT
GACTGGGTTG CGTTTGATGA GTCGCGCCCT CTCCCCGTCT CTGTGCCTTC GATGCCATTG
AAGCTGAAAG TCATTCTGGT AGGCGAACGC GAATCATTGG CTGATTTCCA GGAGATGGAG
CCAGAGCTTT CAGAGCAGGC TATTTATAGC GAATTTGAAG ATACTCTGCA GATTGTCGAT
GCGGAGTCAG TAAGCCAGTG GTGTCGCTGG GTGACATTTA CCGCCAGACA TAATCACTTA
CCTGCCCCGG GAGCGGATGC CTGGCCGGTA CTTATCCGCG AAGCAGCACG CTACACCGGT
GAACAAGAAA CACTTCCGCT TAGCCCGCAG TGGATCCTCC GCCAGTGTAA AGAGGTCGCC
TCCCTGTGTG ATGGCGACAC CTTCTCCGGC GAGCAGCTAA ACTTAATGCT GCAGCAGCGT
GAATGGCGCG AAGGTTTCCT CGCTGAACGT ATGCAGGATG AGATCCTTCA GGAGCAAATC
CTGATTGAAA CCGAAGGCGA ACGCATCGGG CAAATTAACG CCCTTTCGGT CATTGAATTT
CCGGGTCATC CACGCGCTTT TGGCGAACCT TCTCGCATTA GCTGCGTTGT GCATATTGGC
GATGGTGAAT TCACCGACAT CGAACGCAAA GCGGAACTTG GCGGCAATAT CCATGCGAAA
GGGATGATGA TCATGCAAGC GTTCCTGATG TCGGAACTAC AGCTTGAGCA ACAGATCCCC
TTCTCAGCAT CGCTGACATT TGAGCAGTCA TACAGTGAAG TTGATGGAGA TAGTGCCTCG
ATGGCTGAAC TCTGCGCCCT GATAAGCGCC CTCGCCGATG TGCCGGTGAA TCAGAGTATC
GCTATCACAG GTTCAGTCGA TCAGTTCGGT CGCGCCCAGC CGGTCGGTGG TTTAAATGAG
AAAATCGAAG GCTTCTTTGC TATTTGCCAG CAACGTGAGT TAACCGGGAA ACAAGGTGTC
ATTATCCCCA CAGCTAACGT TCGCCATTTA AGTCTTCACA GTGAACTGGT GAAAGCGGTA
GAAGAAGGCA AATTCACCAT CTGGGCAGTA GACGATGTGA CTGACGCACT GCCGTTATTA
TTAAATCTGG TGTGGGATGG CGAAGGCCAA ACGACGCTGA TGCAAACCAT CCAGGAACGT
ATCGCGCAAG CATCGCAACA GGAAGGACGT CACCGTTTTC CATGGCCATT ACGTTGGCTG
AACTGGTTTA TTCCGAACTG A
 
Protein sequence
MTITKLAWRD LVPDTDSYQE IFAQPHLIDE NDPLFSDTQP RLQFALEQLL HTRASSSFML 
AKAPEESEYL NLIADAARTL QSDAGQLVGD HYEVSGHSIR LRHAVSADDN FATLTQVVAA
DWVEAEQLFG CLRQFNGDIT LQPGLVHQAN GGILIISLRT LLAQPLLWMR LKNIVNRERF
DWVAFDESRP LPVSVPSMPL KLKVILVGER ESLADFQEME PELSEQAIYS EFEDTLQIVD
AESVSQWCRW VTFTARHNHL PAPGADAWPV LIREAARYTG EQETLPLSPQ WILRQCKEVA
SLCDGDTFSG EQLNLMLQQR EWREGFLAER MQDEILQEQI LIETEGERIG QINALSVIEF
PGHPRAFGEP SRISCVVHIG DGEFTDIERK AELGGNIHAK GMMIMQAFLM SELQLEQQIP
FSASLTFEQS YSEVDGDSAS MAELCALISA LADVPVNQSI AITGSVDQFG RAQPVGGLNE
KIEGFFAICQ QRELTGKQGV IIPTANVRHL SLHSELVKAV EEGKFTIWAV DDVTDALPLL
LNLVWDGEGQ TTLMQTIQER IAQASQQEGR HRFPWPLRWL NWFIPN