Gene ECH74115_2563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2563 
Symbolprc 
ID6971820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2420892 
End bp2422934 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content50% 
IMG OID643386430 
Productcarboxy-terminal protease 
Protein accessionYP_002270912 
Protein GI209399862 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000036056 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTTTA GGCTTACCGC GTTAGCTGGC CTGCTTGCAA TAGCAGGCCA GACCTTCGCT 
GTAGAAGATA TCACGCGTGC TGATCAAATT CCGGTATTAA AGGAAGAGAC GCAGCATGCG
ACGGTGAGTG AGCGCGTAAC GTCGCGCTTC ACCCGTTCTC ATTATCGCCA GTTCGACCTC
GATCAGGCAT TTTCGGCCAA AATCTTTGAC CGCTACCTGA ATCTGCTCGA TTACAGCCAC
AACGTGCTGC TGGCAAGCGA TGTTGAACAG TTCGCGAAAA AGAAAACCGA GTTAGGCGAT
GAACTGCGTT CAGGCAAACT CGACGTTTTC TACGATCTCT ACAATCTGGC GCAAAAGCGT
CGTTTTGAAC GTTACCAATA CGCTTTGTCG GTACTGGAAA AGCCGATGGA TTTCACCGGC
AACGACACTT ATAACCTTGA CCGCAGCAAA GCGCCCTGGC CGAAAAACGA GGCTGAGTTG
AACGCGCTGT GGGACAGTAA AGTCAAATTC GACGAGTTAA GCCTGAAGCT GGCAGGAAAA
ACGGATAAAG AAATTCGTGA AACCCTGACT CGCCGCTACA AATTTGCCAT TCGTCGTCTG
GCGCAAACCA ACAGCGAAGA TGTTTTCTCG CTGGCAATGA CGGCGTTTGC GCGTGAAATC
GACCCGCATA CCAACTATCT TTCCCCGCGT AATACCGAAC AGTTCAACAC TGAAATGAGT
TTGTCGCTGG AAGGTATTGG CGCAGTGCTG CAAATGGATG ATGACTACAC CGTTATCAAT
TCGATGGTGG CAGGTGGTCC GGCAGCGAAG AGTAAAGCTA TCAGCGTTGG TGACAAAATT
GTCGGTGTTG GTCAAACAGG CAAGCCGATG GTTGACGTGA TTGGCTGGCG TCTTGATGAT
GTGGTTGCCT TAATTAAAGG GCCGAAGGGC AGTAAAGTTC GTCTGGAAAT TTTACCTGCT
GGTAAAGGGA CCAAGACCCG CACTGTAACA TTGACCCGTG AACGTATTCG TCTCGAAGAC
CGCGCGGTTA AAATGTCGGT GAAGACCGTC GGTAAAGAGA AAGTCGGCGT GCTGGATATT
CCTGGCTTCT ATGTGGGTTT GACAGACGAT GTCAAAGTGC AACTGCAGAA ACTGGAAAAA
CAGAATGTCA GCAGTGTGAT CATCGATCTG CGTAGCAATG GTGGTGGGGC GCTGACCGAA
GCGGTATCGC TCTCAGGTCT GTTTATTCCT GCGGGTCCTA TTGTTCAGGT CCGTGATAAC
AACGGTAAAG TTCGTGAAGA CAGCGATACC GACGGACAGG TTTTCTATAA AGGCCCGCTG
GTGGTACTGG TTGACCGCTT CAGTGCTTCG GCTTCAGAAA TCTTTGCCGC GGCAATGCAG
GATTACGGTC GTGCACTGGT TGTGGGTGAA CCGACGTTTG GTAAAGGCAC CGTTCAGCAA
TACCGTTCAT TGAACCGTAT TTACGATCAG ATGTTACGTC CTGAATGGCC AGCGCTGGGT
TCTGTGCAGT ACACGATCCA GAAATTCTAT CGCGTTAACG GCGGCAGTAC GCAACGTAAA
GGCGTAACGC CAGACATCAT CATGCCGACG GGTAATGAAG AAACGGAAAC GGGTGAGAAA
TTCGAAGATA ACGCGCTGCC GTGGGATAGC ATTGATGCCG CGACTTATGT GAAATCAGGC
GATTTAACGG CCTTTGAACC GGAGCTACTG AAGGAACATA ATGCGCGTAT CGCGAAAGAT
CCTGAGTTCC AGAACATCAT GAAGGATATC GCGCGCTTCA ACGCTATGAA GGACAAGCGC
AATATCGTTT CTCTGAATTA CGCTGTGCGT GAGAAAGAGA ATAATGAAGA TGATGCGACG
CGTTTGGCGC GTTTGAACGA ACGCTTTAAA CGCGAAGGTA AACCGGAGTT GAAGAAACTG
GATGATCTAC CGAAAGATTA CCAGGAGCCG GATCCTTATC TGGATGAGAC GGTGAATATC
GCACTCGATC TGGCGAAGCT TGAAAAAGCC AGACCCGCGG AACAACCCGC ACCCGTCAAG
TAA
 
Protein sequence
MFFRLTALAG LLAIAGQTFA VEDITRADQI PVLKEETQHA TVSERVTSRF TRSHYRQFDL 
DQAFSAKIFD RYLNLLDYSH NVLLASDVEQ FAKKKTELGD ELRSGKLDVF YDLYNLAQKR
RFERYQYALS VLEKPMDFTG NDTYNLDRSK APWPKNEAEL NALWDSKVKF DELSLKLAGK
TDKEIRETLT RRYKFAIRRL AQTNSEDVFS LAMTAFAREI DPHTNYLSPR NTEQFNTEMS
LSLEGIGAVL QMDDDYTVIN SMVAGGPAAK SKAISVGDKI VGVGQTGKPM VDVIGWRLDD
VVALIKGPKG SKVRLEILPA GKGTKTRTVT LTRERIRLED RAVKMSVKTV GKEKVGVLDI
PGFYVGLTDD VKVQLQKLEK QNVSSVIIDL RSNGGGALTE AVSLSGLFIP AGPIVQVRDN
NGKVREDSDT DGQVFYKGPL VVLVDRFSAS ASEIFAAAMQ DYGRALVVGE PTFGKGTVQQ
YRSLNRIYDQ MLRPEWPALG SVQYTIQKFY RVNGGSTQRK GVTPDIIMPT GNEETETGEK
FEDNALPWDS IDAATYVKSG DLTAFEPELL KEHNARIAKD PEFQNIMKDI ARFNAMKDKR
NIVSLNYAVR EKENNEDDAT RLARLNERFK REGKPELKKL DDLPKDYQEP DPYLDETVNI
ALDLAKLEKA RPAEQPAPVK