Gene EcE24377A_1471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1471 
SymbolsohB 
ID5587058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1461061 
End bp1462110 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content50% 
IMG OID640925163 
Productputative periplasmic protease 
Protein accessionYP_001462568 
Protein GI157157433 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.251936 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAATTGT TGTCTGAATA TGGTTTGTTT TTGGCGAAAA TCGTTACCGT TGTGCTAGCG 
ATTGCGGCGA TTGCCGCCAT TATTGTCAAT GTTGCTCAAC GTAATAAACG CCAGCGTGGC
GAGTTACGGG TCAACAATCT CAGCGAACAG TATAAGGAGA TGAAAGAAGA ACTGGCCGCG
GCGCTGATGG ACTCACATCA GCAAAAACAG TGGCACAAAG CGCAGAAGAA AAAGCACAAG
CAAGAAGCGA AAGCAGCAAA AGCGAAAGCC AAACTGGGCG AGGTGGCAAC TGACAGTAAA
CCTCGCGTCT GGGTGCTGGA TTTTAAAGGC AGCATGGACG CCCATGAAGT GAACTCGCTA
CGTGAAGAGA TAACGGCTGT ACTCGCAGCA TTCAAACCGC AGGATCAGGT TGTGCTACGT
CTGGAAAGCC CTGGTGGCAT GGTGCATGGT TACGGGTTGG CGGCTTCGCA GCTGCAGCGT
CTGCGTGATA AAAACATTCC TTTAACTGTT ACGGTAGACA AAGTCGCTGC CAGCGGCGGT
TACATGATGG CCTGTGTGGC GGACAAAATT GTTTCCGCAC CGTTTGCTAT TGTGGGTTCC
ATTGGGGTGG TGGCGCAAAT GCCCAACTTT AACCGCTTCC TGAAAAGCAA AGATATTGAT
ATCGAACTGC ACACTGCCGG GCAGTATAAG CGTACTCTGA CGTTGCTGGG TGACAATACC
GAAGAAGGGC GGGAGAAATT CCGCGAAGAG TTGAACGAAA CGCATCAGTT GTTTAAAGAT
TTTGTGAAGC GTATGCGTCC GTCTCTGGAT ATTGAACAGG TGGCAACGGG TGAACACTGG
TACGGACAAC AGGCGGTAGA GAAAGGCCTG GTTGATGAAA TCAACACCAG TGATGAAGTT
ATTCTTAGCC TGATGGAAGG CCGTGAAGTG GTTAATGTAC GCTATATGCA GCGTAAACGA
CTCATTGACC GATTCACCGG CAGCGCGGCA GAGAGCGCCG ATCGATTGTT GCTACGCTGG
TGGCAGCGGG GTCAAAAGCC ATTGATGTAA
 
Protein sequence
MELLSEYGLF LAKIVTVVLA IAAIAAIIVN VAQRNKRQRG ELRVNNLSEQ YKEMKEELAA 
ALMDSHQQKQ WHKAQKKKHK QEAKAAKAKA KLGEVATDSK PRVWVLDFKG SMDAHEVNSL
REEITAVLAA FKPQDQVVLR LESPGGMVHG YGLAASQLQR LRDKNIPLTV TVDKVAASGG
YMMACVADKI VSAPFAIVGS IGVVAQMPNF NRFLKSKDID IELHTAGQYK RTLTLLGDNT
EEGREKFREE LNETHQLFKD FVKRMRPSLD IEQVATGEHW YGQQAVEKGL VDEINTSDEV
ILSLMEGREV VNVRYMQRKR LIDRFTGSAA ESADRLLLRW WQRGQKPLM