Gene EcSMS35_2165 details


Gene Information

Locus tag: EcSMS35_2165
Symbol:
ID: 6145397
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2169295
End bp: 2171055
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 52%
IMG OID: 641617041
Product: S16 family peptidase
Protein accession: YP_001744215
Protein GI: 170679870
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1067] Predicted ATP-dependent protease
TIGRFAM ID:
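
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2169295
end_bp = 2171055
gene_length_bp = 1761
protein_length_aa = 586


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coordinates_agree = span_length(start_bp, end_bp) == gene_length_bp
protein_agrees = expected_protein_length(gene_length_bp) == protein_length_aa
print(coordinates_agree, protein_agrees)  # prints: True True
```

Under those assumptions, the coordinates, gene length, and protein length reported here are mutually consistent.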


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000228937
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.352298
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCATTA CGAAACTTGC ATGGCGTGAC CTGGTTCCTG ATACCGATAG CTATCAGGAA 
ATATTTGCTC AGCCACATTT GATTGACGAA AACGATCCTT TATTCAGTGA TACTCAACCG
CGACTGCAAT TTGCGCTGGA GCAGTTGCTG CATACGCGAG CATCCTCCTC TTTTATGCTG
GCGAAGGCCC CGGAAGAGTC TGAGTATCTG AATCTTATTG CCGATGCCGC GCGTGCGCTA
CAAAGCGATG CAGGCCAACT GGTGGGCTGT CACTATGAGG TTTCCGGGCA CACCATCCGC
TTACGTAACG CAGTGAGTGC AGATGATAAT TTTGCGACTT TAACGCAAGT TGTCGCTGCC
GACTGGGTAG AAGCGGAACA ACTCTTTGGC TGCCTGCGCC AGTTTAATGG CGACATTACC
CTGCAGCCTG GTCTGGTGCA TCAGGCAAAT GGCGGTATTC TCATCATCTC TTTGCGTACA
CTGCTGGCGC AACCTCTGCT GTGGATGCGG CTGAAAAATA TCGTTAACCG CGAGCGTTTT
GACTGGGTTG CGTTTGATGA GTCGCGCCCT CTCCCCGTCT CTGTGCCTTC GATGCCATTG
AAGCTGAAAG TCATTCTGGT AGGCGAACGT GAATCATTGG CTGATTTCCA GGAGATGGAA
CCAGAGCTTT CAGAGCAGGC TATTTATAGC GAATTTGAAG ATACTCTGCA GATTGTCGAT
GCGGAGTCAG TAAGCCAGTG GTGTCGCTGG GTAACATTGA CCGCAAGACA TAATCACTTA
CCTGCACCGG GAGCGGATGC CTGGCCAGTA CTTATCCGCG AAGCAGCCCG CTACACCGGT
GAACAAGAAA CACTTCCGCT TAGCCCGCAG TGGATCCTCC GCCAGTGTAA AGAGGTCGCC
TCCCTGTGCG ATGGCGACAC CTTCTCCGGC GAGCAGCTAA ACTTAATGCT GCAGCAGCGT
GAATGGCGTG AAGGTTTCCT CGCTGAACGC ATGCAGGATG AGATCCTTCA GGAGCAAATC
CTGATTGAAA CCGAAGGCGA ACGCATCGGG CAAATTAACG CCCTTTCGGT CATTGAATTT
CCGGGTCATC CACGCGCTTT TGGCGAACCT TCTCGCATTA GCTGCGTTGT GCATATTGGC
GATGGTGAAT TCACCGACAT CGAACGCAAA GCGGAGCTTG GCGGCAATAT CCATGCGAAA
GGGATGATGA TCATGCAAGC GTTCCTGATG TCGGAACTAC AGCTTGAGCA ACAGATCCCC
TTCTCAGCAT CGCTGACATT TGAGCAGTCA TACAGTGAAG TGGATGGCGA TAGTGCCTCG
ATGGCTGAAC TCTGCGCCCT GATCAGCGCC CTCGCCGATG TGCCGGTGAA TCAGAGTATC
GCTATCACAG GTTCAGTCGA TCAGTTCGGT CGCGCCCAGC CAGTCGGTGG TTTAAATGAG
AAAATCGAAG GCTTCTTTGC TATTTGCCAG CAACGTGAGT TAACGGGGAA ACAAGGTGTC
ATTATCCCCA CTGCTAACGT TCGCCATTTA AGTCTTCACA GTGAACTGGT GAAAGCGGTA
GAAGAAGGCA AATTCACCAT CTGGGCAGTA GACGATGTGA CTGACGCACT GCCGTTATTA
TTAAATCTGG TGTGGGATGG CGAAGGCCAA ACGACGCTGA TGCAAACCAT CCAGGAACGT
ATCGCACAAG CATCGCAACA GGAAGGACGT CACCGTTTTC CATGGCCATT ACGTTGGCTG
AACTGGTTTA TTCCGAACTG A
 
Protein sequence
MTITKLAWRD LVPDTDSYQE IFAQPHLIDE NDPLFSDTQP RLQFALEQLL HTRASSSFML 
AKAPEESEYL NLIADAARAL QSDAGQLVGC HYEVSGHTIR LRNAVSADDN FATLTQVVAA
DWVEAEQLFG CLRQFNGDIT LQPGLVHQAN GGILIISLRT LLAQPLLWMR LKNIVNRERF
DWVAFDESRP LPVSVPSMPL KLKVILVGER ESLADFQEME PELSEQAIYS EFEDTLQIVD
AESVSQWCRW VTLTARHNHL PAPGADAWPV LIREAARYTG EQETLPLSPQ WILRQCKEVA
SLCDGDTFSG EQLNLMLQQR EWREGFLAER MQDEILQEQI LIETEGERIG QINALSVIEF
PGHPRAFGEP SRISCVVHIG DGEFTDIERK AELGGNIHAK GMMIMQAFLM SELQLEQQIP
FSASLTFEQS YSEVDGDSAS MAELCALISA LADVPVNQSI AITGSVDQFG RAQPVGGLNE
KIEGFFAICQ QRELTGKQGV IIPTANVRHL SLHSELVKAV EEGKFTIWAV DDVTDALPLL
LNLVWDGEGQ TTLMQTIQER IAQASQQEGR HRFPWPLRWL NWFIPN
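
The gene sequence begins with TTG, while the protein begins with M. This fits translation table 11, in which TTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates the idea on the first 30 and last 21 nucleotides of the gene sequence. It is an illustration only: `translate_cds` is a hypothetical helper, and `ASSUMED_START_CODONS` is a simplified subset of the table 11 initiation codons chosen for this example.

```python
# Codon-to-amino-acid mapping in the conventional TCAG ordering.
# Table 11 shares these assignments with the standard code; it differs
# in which codons are accepted as translation starts.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: a simplified subset of table 11 start codons.
ASSUMED_START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        residue = CODON_TABLE[codon]
        if i == 0 and codon in ASSUMED_START_CODONS:
            residue = "M"  # an initiation codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_30_nt = "TTGACCATTA CGAAACTTGC ATGGCGTGAC"
last_21_nt = "AACTGGTTTA TTCCGAACTG A"  # starts at position 1741, so it is in frame

print(translate_cds(first_30_nt))  # prints: MTITKLAWRD
print(translate_cds(last_21_nt))   # prints: NWFIPN
```

Both fragments match the start and end of the protein sequence listed above, and the final TGA codon accounts for the three nucleotides not represented in the 586-residue protein.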