Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_1471 |
Symbol | sohB |
ID | 5587058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 1461061 |
End bp | 1462110 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640925163 |
Product | putative periplasmic protease |
Protein accession | YP_001462568 |
Protein GI | 157157433 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.251936 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAATTGT TGTCTGAATA TGGTTTGTTT TTGGCGAAAA TCGTTACCGT TGTGCTAGCG ATTGCGGCGA TTGCCGCCAT TATTGTCAAT GTTGCTCAAC GTAATAAACG CCAGCGTGGC GAGTTACGGG TCAACAATCT CAGCGAACAG TATAAGGAGA TGAAAGAAGA ACTGGCCGCG GCGCTGATGG ACTCACATCA GCAAAAACAG TGGCACAAAG CGCAGAAGAA AAAGCACAAG CAAGAAGCGA AAGCAGCAAA AGCGAAAGCC AAACTGGGCG AGGTGGCAAC TGACAGTAAA CCTCGCGTCT GGGTGCTGGA TTTTAAAGGC AGCATGGACG CCCATGAAGT GAACTCGCTA CGTGAAGAGA TAACGGCTGT ACTCGCAGCA TTCAAACCGC AGGATCAGGT TGTGCTACGT CTGGAAAGCC CTGGTGGCAT GGTGCATGGT TACGGGTTGG CGGCTTCGCA GCTGCAGCGT CTGCGTGATA AAAACATTCC TTTAACTGTT ACGGTAGACA AAGTCGCTGC CAGCGGCGGT TACATGATGG CCTGTGTGGC GGACAAAATT GTTTCCGCAC CGTTTGCTAT TGTGGGTTCC ATTGGGGTGG TGGCGCAAAT GCCCAACTTT AACCGCTTCC TGAAAAGCAA AGATATTGAT ATCGAACTGC ACACTGCCGG GCAGTATAAG CGTACTCTGA CGTTGCTGGG TGACAATACC GAAGAAGGGC GGGAGAAATT CCGCGAAGAG TTGAACGAAA CGCATCAGTT GTTTAAAGAT TTTGTGAAGC GTATGCGTCC GTCTCTGGAT ATTGAACAGG TGGCAACGGG TGAACACTGG TACGGACAAC AGGCGGTAGA GAAAGGCCTG GTTGATGAAA TCAACACCAG TGATGAAGTT ATTCTTAGCC TGATGGAAGG CCGTGAAGTG GTTAATGTAC GCTATATGCA GCGTAAACGA CTCATTGACC GATTCACCGG CAGCGCGGCA GAGAGCGCCG ATCGATTGTT GCTACGCTGG TGGCAGCGGG GTCAAAAGCC ATTGATGTAA
|
Protein sequence | MELLSEYGLF LAKIVTVVLA IAAIAAIIVN VAQRNKRQRG ELRVNNLSEQ YKEMKEELAA ALMDSHQQKQ WHKAQKKKHK QEAKAAKAKA KLGEVATDSK PRVWVLDFKG SMDAHEVNSL REEITAVLAA FKPQDQVVLR LESPGGMVHG YGLAASQLQR LRDKNIPLTV TVDKVAASGG YMMACVADKI VSAPFAIVGS IGVVAQMPNF NRFLKSKDID IELHTAGQYK RTLTLLGDNT EEGREKFREE LNETHQLFKD FVKRMRPSLD IEQVATGEHW YGQQAVEKGL VDEINTSDEV ILSLMEGREV VNVRYMQRKR LIDRFTGSAA ESADRLLLRW WQRGQKPLM
|
| |