Gene ECH74115_2486 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2486 
SymbolsppA 
ID6966725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2354643 
End bp2356499 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content52% 
IMG OID643386355 
Productprotease 4 
Protein accessionYP_002270837 
Protein GI209400845 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00705] signal peptide peptidase SppA, 67K type
[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.944254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.176457 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACCC TTTGGCGATT TATTGCCGGA TTTTTTAAAT GGACGTGGCG TCTGCTGAAT 
TTCGTCCGTG AAATGGTACT TAACCTGTTC TTTATTTTTC TCGTGCTGGT TGGTGTGGGG
ATCTGGATGC AGGTCAGTGG TGGTGATTCG AAAGAAACGG CCAGTCGTGG TGCACTGCTG
CTGGACATTT CTGGTGTGAT CGTCGATAAA CCCGACAGTT CTCAGCGGTT TAGTAAATTA
AGCCGCCAAC TACTTGGTGC CAGTTCCGAT CGTCTGCAGG AAAACTCACT GTTTGATATC
GTCAATACCA TTCGCCAGGC GAAGGATGAC CGCAATATCA CCGGTATCGT GATGGATCTG
AAAAACTTCG CAGGCGGCGA CCAACCGTCT ATGCAGTACA TCGGCAAAGC CCTGAAAGAG
TTTCGTGACA GCGGGAAACC GGTTTATGCC GTTGGCGAGA ACTACAGCCA GGGGCAATAT
TATCTCGCCA GTTTCGCCAA TAAAATTTGG CTGTCACCGC AAGGCGTGGT GGATTTGCAC
GGTTTTGCTA CCAACGGTCT GTACTACAAA TCGTTGCTGG ATAAGCTGAA AGTTTCCACC
CATGTGTTCC GCGTGGGTAC GTATAAATCT GCCGTTGAAC CGTTTATTCG TGATGACATG
TCACCGGCAG CCCGCGAAGC TGACAGCCGC TGGATTGGTG AGCTGTGGCA AAACTATCTG
AATACTGTTG CCGCTAACCG GCAGATCCCT GCTCAGCAGG TATTCCCTGG CGCGCAAGGG
TTGCTTGAGG GTTTAACCAA AACCGGTGGC GATACCGCGA AATATGCACT GGAAAACAAG
CTGGTCGATG CACTGGCATC GAGTGCGGAA ATCGAAAAAG CACTGACCAA AGAGTTCGGC
TGGAGTAAGG CTGATAAAAA TTATCGCGCC ATCAGTTATT ACGATTACGC ATTGAAAACG
CCGGCAGATA CCGGTGACAG CATCGGTGTC GTCTTCGCTA ATGGCGCAAT TATGGATGGC
GAGGAAACTC AGGGGAATGT TGGCGGTGAT ACCACTGCGG CACAAATCCG CGACGCTCGC
CTTGACCCGA AAGTGAAAGC GATTGTCCTG CGTGTTAATA GCCCAGGCGG CAGCGTTACC
GCGTCTGAAG TGATTCGCGC TGAACTGGCA GCAGCCCGGG CAGCGGGTAA GCCTGTGGTT
GTATCGATGG GCGGCATGGC GGCATCTGGT GGTTACTGGA TTTCCACGCC AGCTAATTAC
ATTGTGGCTA ACCCCAGCAC CCTGACCGGT TCTATTGGTA TCTTCGGCGT GATCACCACC
GTAGAAAATA GTCTGGATTC GATTGGTGTT CATACTGATG GTGTCTCAAC TTCACCGCTG
GCGGATGTTT CTATCACCAG GGCACTGCCG CCGGAAGCGC AGCAGATGAT GCAGTTAAGC
ATTGAGAATG GCTATAAACG CTTTATCACG CTGGTTGCTG ATGCGCGTCA TTCGACGCCG
GAGCAGATTG ATAAAATCGC CCAGGGCCAC GTCTGGACCG GTCAGGACGC AAAAGCTAAC
GGGCTGGTCG ATAGTCTCGG GGATTTCGAT GATGCGGTCG CCAAAGCAGC AGAGCTGGCA
AAAGCGAAAC AGTGGCATCT GGAATACTAC GTTGATGAAC CGACCTTCTT CGACAAAGTG
ATGGACAACA TGTCTGGTTC TGTCCGGGCA ATGTTGCCAG ATGCGTTCCA GGCCATGTTA
CCTGCACCGC TGGCCTCGGT AGCCTCTACC GTTAAAAGTG AAAGCGACAA GCTGGCGGCG
TTTAACGACC CACAAAACCG TTATGCGTTT TGCCTGACCT GCGCCAACGT GCGTTAA
 
Protein sequence
MRTLWRFIAG FFKWTWRLLN FVREMVLNLF FIFLVLVGVG IWMQVSGGDS KETASRGALL 
LDISGVIVDK PDSSQRFSKL SRQLLGASSD RLQENSLFDI VNTIRQAKDD RNITGIVMDL
KNFAGGDQPS MQYIGKALKE FRDSGKPVYA VGENYSQGQY YLASFANKIW LSPQGVVDLH
GFATNGLYYK SLLDKLKVST HVFRVGTYKS AVEPFIRDDM SPAAREADSR WIGELWQNYL
NTVAANRQIP AQQVFPGAQG LLEGLTKTGG DTAKYALENK LVDALASSAE IEKALTKEFG
WSKADKNYRA ISYYDYALKT PADTGDSIGV VFANGAIMDG EETQGNVGGD TTAAQIRDAR
LDPKVKAIVL RVNSPGGSVT ASEVIRAELA AARAAGKPVV VSMGGMAASG GYWISTPANY
IVANPSTLTG SIGIFGVITT VENSLDSIGV HTDGVSTSPL ADVSITRALP PEAQQMMQLS
IENGYKRFIT LVADARHSTP EQIDKIAQGH VWTGQDAKAN GLVDSLGDFD DAVAKAAELA
KAKQWHLEYY VDEPTFFDKV MDNMSGSVRA MLPDAFQAML PAPLASVAST VKSESDKLAA
FNDPQNRYAF CLTCANVR