Gene EcHS_A1381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1381 
SymbolsohB 
ID5594982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1376389 
End bp1377438 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content50% 
IMG OID640920536 
Productputative periplasmic protease 
Protein accessionYP_001458095 
Protein GI157160777 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAATTGT TGTCTGAATA TGGTTTGTTT TTGGCGAAAA TCGTTACCGT TGTGCTAGCG 
ATTGCGGCGA TTGCCGCCAT TATTGTCAAT GTTGCTCAAC GTAATAAACG CCAGCGTGGC
GAGTTACGGG TCAACAATCT CAGCGAACAG TATAAGGAGA TGAAAGAAGA ACTGGCCGCG
GCGCTGATGG ACTCACATCA GCAAAAACAG TGGCACAAAG CGCAGAAGAA AAAGCATAAG
CAAGAAGCGA AAGCAGCAAA AGCGAAAGCC AAACTGGGAG AGGTGGCAAC TGACAGTAAA
CCCCGCGTCT GGGTGCTGGA TTTTAAAGGC AGCATGGACG CCCATGAAGT GAACTCGCTA
CGTGAAGAGA TAACGGCTGT ACTCGCAGCA TTCAAACCGC AGGATCAGGT TGTACTACGT
CTGGAAAGCC CTGGTGGCAT GGTGCATGGT TACGGGTTGG CGGCTTCGCA GCTGCAGCGT
CTGCGTGATA AAAACATTCC TTTAACTGTT ACGGTAGACA AAGTCGCTGC CAGCGGCGGT
TACATGATGG CCTGTGTGGC GGACAAAATT GTTTCCGCAC CGTTTGCTAT TGTGGGTTCC
ATTGGGGTGG TGGCGCAAAT GCCCAACTTT AACCGCTTCC TGAAAAGCAA AGATATTGAT
ATCGAACTTC ACACCGCCGG GCAGTATAAG CGTACGCTGA CCTTGCTGGG TGAAAATACC
GAAGAAGGGC GGGAGAAATT CCGCGAAGAG CTGAACGAAA CGCATCAGTT ATTTAAAGAT
TTTGTGAAGC GTATGCGTCC GTCTCTGGAT ATTGAACAGG TGGCAACGGG TGAACACTGG
TACGGACAAC AGGCGGTAGA GAAAGGCCTG GTTGATGAAA TCAACACCAG TGATGAAGTT
ATTCTTAGCC TGATGGAAGG CCGTGAAGTG GTCAATGTAC GCTATATGCA GCGTAAACGA
CTCATTGACC GACTCACCGG CAGCGCGGCA GAGAGCGCCG ATCGATTGTT GTTACGCTGG
TGGCAGCGGG GGCAAAAGCC ATTGATGTAA
 
Protein sequence
MELLSEYGLF LAKIVTVVLA IAAIAAIIVN VAQRNKRQRG ELRVNNLSEQ YKEMKEELAA 
ALMDSHQQKQ WHKAQKKKHK QEAKAAKAKA KLGEVATDSK PRVWVLDFKG SMDAHEVNSL
REEITAVLAA FKPQDQVVLR LESPGGMVHG YGLAASQLQR LRDKNIPLTV TVDKVAASGG
YMMACVADKI VSAPFAIVGS IGVVAQMPNF NRFLKSKDID IELHTAGQYK RTLTLLGENT
EEGREKFREE LNETHQLFKD FVKRMRPSLD IEQVATGEHW YGQQAVEKGL VDEINTSDEV
ILSLMEGREV VNVRYMQRKR LIDRLTGSAA ESADRLLLRW WQRGQKPLM