Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1904 |
Symbol | sohB |
ID | 6970466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1797053 |
End bp | 1798102 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643385837 |
Product | putative periplasmic protease |
Protein accession | YP_002270326 |
Protein GI | 209397450 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0000000180069 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGAATTGT TGTCTGAATA TGGTTTGTTT TTGGCGAAAA TCGTTACCGT TGTGCTAGCG ATTGCGGCGA TTGCCGCCAT TATTGTCAAT GTTGCTCAAC GTAATAAACG CCAGCGTGGC GAGTTACGGG TCAACAATCT CAGCGAACAG TATAAGGAGA TGAAAGAAGA ACTGGCCGCG GCGCTGATGG ACTCACATCA GCAAAAACAG TGGCACAAAG CGCAGAAGAA AAAGCACAAG CAAGAAGCGA AAGCAGCAAA AGCGAAAGCC AAACTGGGCG AGGTGTCAAC TGACAGTAAA CCCCGCGTCT GGGTGCTGGA CTTTAAAGGC AGCATGGACG CCCATGAAGT GAACTCGCTA CGTGAAGAGA TAACGGCGGT ACTTGCAGCA TTCAAACCGC AGGATCAGGT TGTGCTCCGT CTGGAAAGCC CTGGTGGCAT GGTGCATGGT TACGGGTTGG CGGCTTCGCA GCTGCAGCGT CTGCGTGATA AAAACATTCC ATTAACTGTT ACGGTAGACA AAGTTGCTGC CAGCGGTGGT TACATGATGG CCTGTGTGGC GGACAAAATT GTTTCCGCAC CGTTTGCTAT TGTGGGTTCC ATTGGGGTGG TGGCGCAAAT GCCCAACTTT AACCGCTTCC TGAAAAGCAA AGATATTGAT ATCGAACTGC ACACCGCCGG GCAGTATAAG CGTACGCTGA CCTTGCTGGG TGAAAATACC GAAGAAGGGC GGGAGAAATT CCGCGAAGAG CTGAACGAAA CGCATCAGTT ATTTAAAGAT TTTGTGAAGC GTATGCGTCC GTCTCTGGAT ATTGAACAGG TGGCAACGGG TGAACACTGG TACGGACAAC AGGCGGTAGA GAAGGGCCTG GTTGATGAAA TCAACACCAG TGATGAAGTT ATTCTTAGCC TGATGGAAGG CCGAGAAGTG GTCAATGTAC GCTATATGCA GCGTAAACGA CTCATTGACC GATTCACCGG CAGCGCGGCA GAGAGCGCCG ATCGATTATT GCTACGCTGG TGGCAGCGTG GGCAAAAGCC ATTGATGTAA
|
Protein sequence | MELLSEYGLF LAKIVTVVLA IAAIAAIIVN VAQRNKRQRG ELRVNNLSEQ YKEMKEELAA ALMDSHQQKQ WHKAQKKKHK QEAKAAKAKA KLGEVSTDSK PRVWVLDFKG SMDAHEVNSL REEITAVLAA FKPQDQVVLR LESPGGMVHG YGLAASQLQR LRDKNIPLTV TVDKVAASGG YMMACVADKI VSAPFAIVGS IGVVAQMPNF NRFLKSKDID IELHTAGQYK RTLTLLGENT EEGREKFREE LNETHQLFKD FVKRMRPSLD IEQVATGEHW YGQQAVEKGL VDEINTSDEV ILSLMEGREV VNVRYMQRKR LIDRFTGSAA ESADRLLLRW WQRGQKPLM
|
| |