Gene ECH74115_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1904 
SymbolsohB 
ID6970466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1797053 
End bp1798102 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content50% 
IMG OID643385837 
Productputative periplasmic protease 
Protein accessionYP_002270326 
Protein GI209397450 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0000000180069 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGAATTGT TGTCTGAATA TGGTTTGTTT TTGGCGAAAA TCGTTACCGT TGTGCTAGCG 
ATTGCGGCGA TTGCCGCCAT TATTGTCAAT GTTGCTCAAC GTAATAAACG CCAGCGTGGC
GAGTTACGGG TCAACAATCT CAGCGAACAG TATAAGGAGA TGAAAGAAGA ACTGGCCGCG
GCGCTGATGG ACTCACATCA GCAAAAACAG TGGCACAAAG CGCAGAAGAA AAAGCACAAG
CAAGAAGCGA AAGCAGCAAA AGCGAAAGCC AAACTGGGCG AGGTGTCAAC TGACAGTAAA
CCCCGCGTCT GGGTGCTGGA CTTTAAAGGC AGCATGGACG CCCATGAAGT GAACTCGCTA
CGTGAAGAGA TAACGGCGGT ACTTGCAGCA TTCAAACCGC AGGATCAGGT TGTGCTCCGT
CTGGAAAGCC CTGGTGGCAT GGTGCATGGT TACGGGTTGG CGGCTTCGCA GCTGCAGCGT
CTGCGTGATA AAAACATTCC ATTAACTGTT ACGGTAGACA AAGTTGCTGC CAGCGGTGGT
TACATGATGG CCTGTGTGGC GGACAAAATT GTTTCCGCAC CGTTTGCTAT TGTGGGTTCC
ATTGGGGTGG TGGCGCAAAT GCCCAACTTT AACCGCTTCC TGAAAAGCAA AGATATTGAT
ATCGAACTGC ACACCGCCGG GCAGTATAAG CGTACGCTGA CCTTGCTGGG TGAAAATACC
GAAGAAGGGC GGGAGAAATT CCGCGAAGAG CTGAACGAAA CGCATCAGTT ATTTAAAGAT
TTTGTGAAGC GTATGCGTCC GTCTCTGGAT ATTGAACAGG TGGCAACGGG TGAACACTGG
TACGGACAAC AGGCGGTAGA GAAGGGCCTG GTTGATGAAA TCAACACCAG TGATGAAGTT
ATTCTTAGCC TGATGGAAGG CCGAGAAGTG GTCAATGTAC GCTATATGCA GCGTAAACGA
CTCATTGACC GATTCACCGG CAGCGCGGCA GAGAGCGCCG ATCGATTATT GCTACGCTGG
TGGCAGCGTG GGCAAAAGCC ATTGATGTAA
 
Protein sequence
MELLSEYGLF LAKIVTVVLA IAAIAAIIVN VAQRNKRQRG ELRVNNLSEQ YKEMKEELAA 
ALMDSHQQKQ WHKAQKKKHK QEAKAAKAKA KLGEVSTDSK PRVWVLDFKG SMDAHEVNSL
REEITAVLAA FKPQDQVVLR LESPGGMVHG YGLAASQLQR LRDKNIPLTV TVDKVAASGG
YMMACVADKI VSAPFAIVGS IGVVAQMPNF NRFLKSKDID IELHTAGQYK RTLTLLGENT
EEGREKFREE LNETHQLFKD FVKRMRPSLD IEQVATGEHW YGQQAVEKGL VDEINTSDEV
ILSLMEGREV VNVRYMQRKR LIDRFTGSAA ESADRLLLRW WQRGQKPLM