Gene Information |
Locus tag | EcSMS35_2165 |
Symbol | |
ID | 6145397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2169295 |
End bp | 2171055 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617041 |
Product | S16 family peptidase |
Protein accession | YP_001744215 |
Protein GI | 170679870 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | |
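The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS with one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag EcSMS35_2165.
length_bp = gene_length(2169295, 2171055)
print(length_bp)                  # 1761, matching "Gene Length"
print(protein_length(length_bp))  # 586, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.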
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000228937 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.352298 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCATTA CGAAACTTGC ATGGCGTGAC CTGGTTCCTG ATACCGATAG CTATCAGGAA ATATTTGCTC AGCCACATTT GATTGACGAA AACGATCCTT TATTCAGTGA TACTCAACCG CGACTGCAAT TTGCGCTGGA GCAGTTGCTG CATACGCGAG CATCCTCCTC TTTTATGCTG GCGAAGGCCC CGGAAGAGTC TGAGTATCTG AATCTTATTG CCGATGCCGC GCGTGCGCTA CAAAGCGATG CAGGCCAACT GGTGGGCTGT CACTATGAGG TTTCCGGGCA CACCATCCGC TTACGTAACG CAGTGAGTGC AGATGATAAT TTTGCGACTT TAACGCAAGT TGTCGCTGCC GACTGGGTAG AAGCGGAACA ACTCTTTGGC TGCCTGCGCC AGTTTAATGG CGACATTACC CTGCAGCCTG GTCTGGTGCA TCAGGCAAAT GGCGGTATTC TCATCATCTC TTTGCGTACA CTGCTGGCGC AACCTCTGCT GTGGATGCGG CTGAAAAATA TCGTTAACCG CGAGCGTTTT GACTGGGTTG CGTTTGATGA GTCGCGCCCT CTCCCCGTCT CTGTGCCTTC GATGCCATTG AAGCTGAAAG TCATTCTGGT AGGCGAACGT GAATCATTGG CTGATTTCCA GGAGATGGAA CCAGAGCTTT CAGAGCAGGC TATTTATAGC GAATTTGAAG ATACTCTGCA GATTGTCGAT GCGGAGTCAG TAAGCCAGTG GTGTCGCTGG GTAACATTGA CCGCAAGACA TAATCACTTA CCTGCACCGG GAGCGGATGC CTGGCCAGTA CTTATCCGCG AAGCAGCCCG CTACACCGGT GAACAAGAAA CACTTCCGCT TAGCCCGCAG TGGATCCTCC GCCAGTGTAA AGAGGTCGCC TCCCTGTGCG ATGGCGACAC CTTCTCCGGC GAGCAGCTAA ACTTAATGCT GCAGCAGCGT GAATGGCGTG AAGGTTTCCT CGCTGAACGC ATGCAGGATG AGATCCTTCA GGAGCAAATC CTGATTGAAA CCGAAGGCGA ACGCATCGGG CAAATTAACG CCCTTTCGGT CATTGAATTT CCGGGTCATC CACGCGCTTT TGGCGAACCT TCTCGCATTA GCTGCGTTGT GCATATTGGC GATGGTGAAT TCACCGACAT CGAACGCAAA GCGGAGCTTG GCGGCAATAT CCATGCGAAA GGGATGATGA TCATGCAAGC GTTCCTGATG TCGGAACTAC AGCTTGAGCA ACAGATCCCC TTCTCAGCAT CGCTGACATT TGAGCAGTCA TACAGTGAAG TGGATGGCGA TAGTGCCTCG ATGGCTGAAC TCTGCGCCCT GATCAGCGCC CTCGCCGATG TGCCGGTGAA TCAGAGTATC GCTATCACAG GTTCAGTCGA TCAGTTCGGT CGCGCCCAGC CAGTCGGTGG TTTAAATGAG AAAATCGAAG GCTTCTTTGC TATTTGCCAG CAACGTGAGT TAACGGGGAA ACAAGGTGTC ATTATCCCCA CTGCTAACGT TCGCCATTTA AGTCTTCACA GTGAACTGGT GAAAGCGGTA GAAGAAGGCA AATTCACCAT CTGGGCAGTA GACGATGTGA CTGACGCACT GCCGTTATTA TTAAATCTGG TGTGGGATGG CGAAGGCCAA ACGACGCTGA TGCAAACCAT CCAGGAACGT ATCGCACAAG CATCGCAACA GGAAGGACGT CACCGTTTTC CATGGCCATT ACGTTGGCTG AACTGGTTTA TTCCGAACTG A
|
Protein sequence | MTITKLAWRD LVPDTDSYQE IFAQPHLIDE NDPLFSDTQP RLQFALEQLL HTRASSSFML AKAPEESEYL NLIADAARAL QSDAGQLVGC HYEVSGHTIR LRNAVSADDN FATLTQVVAA DWVEAEQLFG CLRQFNGDIT LQPGLVHQAN GGILIISLRT LLAQPLLWMR LKNIVNRERF DWVAFDESRP LPVSVPSMPL KLKVILVGER ESLADFQEME PELSEQAIYS EFEDTLQIVD AESVSQWCRW VTLTARHNHL PAPGADAWPV LIREAARYTG EQETLPLSPQ WILRQCKEVA SLCDGDTFSG EQLNLMLQQR EWREGFLAER MQDEILQEQI LIETEGERIG QINALSVIEF PGHPRAFGEP SRISCVVHIG DGEFTDIERK AELGGNIHAK GMMIMQAFLM SELQLEQQIP FSASLTFEQS YSEVDGDSAS MAELCALISA LADVPVNQSI AITGSVDQFG RAQPVGGLNE KIEGFFAICQ QRELTGKQGV IIPTANVRHL SLHSELVKAV EEGKFTIWAV DDVTDALPLL LNLVWDGEGQ TTLMQTIQER IAQASQQEGR HRFPWPLRWL NWFIPN