Gene EcSMS35_1357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1357 
Symbolprc 
ID6147108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1345283 
End bp1347325 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content50% 
IMG OID641616235 
Productcarboxy-terminal protease 
Protein accessionYP_001743415 
Protein GI170681936 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000834708 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.174046 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTTTA GGCTTACCGC GTTAGCTGGC CTGCTTGCAA TAGCAGGCCA GACCTTCGCT 
GTAGAAGATA TCACGCGTGC TGATCAAATT CCGGTATTAA AGGAAGAGAC GCAGCATGCG
ACGGTAAGTG AGCGCGTAAC GTCGCGCTTC ACCCGTTCTC ATTATCGCCA GTTCGACCTC
GATCAGGCAT TTTCGGCCAA AATCTTTGAC CGCTACCTGA ATCTGCTCGA TTACAGCCAC
AACGTGCTGT TGGCAAGCGA TGTTGAACAG TTCGCGAAAA AGAAAACCGA GTTAGGCGAT
GAACTGCGTT CAGGCAAACT CGACGTTTTC TACGATCTCT ACAATCTGGC GCAAAAGCGT
CGTTTTGAAC GTTACCAGTA CGCTTTGTCG GTACTGGAAA AGCCGATGGA TTTCACCGGC
AACGACACTT ATAACCTTGA CCGCAGCAAA GCGCCCTGGC CGAAAAACGA GGCTGAGTTG
AACGCGCTGT GGGACAGTAA AGTCAAATTC GACGAGTTAA GCCTGAAGCT GGCAGGAAAA
ACGGATAAAG AAATTCGTGA AACCCTGACT CGCCGCTACA AATTTGCCAT TCGTCGTCTG
GCGCAAACCA ACAGCGAAGA TGTTTTCTCG CTGGCAATGA CGGCGTTTGC GCGTGAAATC
GACCCGCATA CCAACTATCT TTCCCCGCGA AATACCGAAC AGTTCAACAC TGAAATGAGT
TTGTCGCTGG AAGGTATTGG CGCAGTGCTG CAAATGGATG ATGACTACAC CGTTATCAAT
TCGATGGTGG CAGGTGGCCC GGCAGCGAAG AGTAAAGCTA TCAGCGTTGG TGACAAAATT
GTCGGTGTTG GTCAAACAGG CAAGCCGATG GTTGACGTGA TTGGTTGGCG TCTTGATGAT
GTGGTTGCCT TAATTAAAGG GCCGAAGGGC AGTAAAGTTC GTCTGGAAAT TTTACCTGCT
GGTAAAGGGA CCAAGACCCG TACTGTAACG TTGACCCGTG AACGTATTCG TCTCGAAGAC
CGCGCGGTTA AAATGTCGGT GAAGACCGTC GGTAAAGAGA AAGTCGGCGT GCTGGATATT
CCGGGCTTCT ATGTGGGTTT GACAGATGAT GTCAAAGTAC AGCTGCAGAA ACTGGAAAAA
CAGAATGTCA GCAGCGTCAT CATCGACCTG CGTAGCAATG GTGGTGGGGC GCTGACCGAA
GCGGTATCGC TCTCCGGTCT GTTTATTCCA TCCGGCCCGA TTGTTCAGGT CCGTGATAAC
AACGGTAAAG TTCGTGAAGA CAGCGATACC GACGGACAGG TCTTCTATAA AGGCCCGCTG
GTGGTACTGG TTGACCGTTT CAGTGCTTCG GCTTCAGAAA TCTTTGCCGC GGCAATGCAG
GATTACGGTC GTGCGCTGGT TGTGGGTGAA CCGACGTTTG GTAAAGGCAC CGTTCAGCAA
TACCGTTCAT TGAACCGTAT CTACGATCAG ATGTTACGTC CTGAATGGCC AGCGCTGGGA
TCTGTGCAGT ACACGATCCA GAAATTCTAT CGCGTTAACG GCGGCAGTAC GCAACGTAAA
GGCGTAACGC CAGACATCAT CATGCCGACG GGTAATGAAG AAACGGAAAC GGGTGAGAAA
TTCGAAGATA ACGCGCTGCC GTGGGATAGC ATTGATGCCG CGACTTATGT GAAATCAGGA
GATTTAACGG CCTTTGAACC GGAGCTGCTG AAGGAACATA ATGCGCGTAT CGCGAAAGAT
CCTGAGTTCC AGAACATCAT GAAGGATATC GCACGCTTCA ACGCTATGAA GGACAAGCGC
AATATCGTTT CTCTGAATTA CGCTGTGCGT GAGAAAGAGA ATAATGAAGA TGATGCGACG
CGTCTGGCGC GTTTGAACGA ACGCTTTAAA CGCGAAGGTA AACCGGAGTT GAAGAAACTG
GATGATCTAC CGAAAGATTA CCAGGAGCCA GATCCTTATC TGGATGAGAC GGTGAATATC
GCACTCGATC TGGCGAAGCT TGAAAAAGCC AGACCCGCGG AACAACCCGC TCCCGTCAAG
TAA
 
Protein sequence
MFFRLTALAG LLAIAGQTFA VEDITRADQI PVLKEETQHA TVSERVTSRF TRSHYRQFDL 
DQAFSAKIFD RYLNLLDYSH NVLLASDVEQ FAKKKTELGD ELRSGKLDVF YDLYNLAQKR
RFERYQYALS VLEKPMDFTG NDTYNLDRSK APWPKNEAEL NALWDSKVKF DELSLKLAGK
TDKEIRETLT RRYKFAIRRL AQTNSEDVFS LAMTAFAREI DPHTNYLSPR NTEQFNTEMS
LSLEGIGAVL QMDDDYTVIN SMVAGGPAAK SKAISVGDKI VGVGQTGKPM VDVIGWRLDD
VVALIKGPKG SKVRLEILPA GKGTKTRTVT LTRERIRLED RAVKMSVKTV GKEKVGVLDI
PGFYVGLTDD VKVQLQKLEK QNVSSVIIDL RSNGGGALTE AVSLSGLFIP SGPIVQVRDN
NGKVREDSDT DGQVFYKGPL VVLVDRFSAS ASEIFAAAMQ DYGRALVVGE PTFGKGTVQQ
YRSLNRIYDQ MLRPEWPALG SVQYTIQKFY RVNGGSTQRK GVTPDIIMPT GNEETETGEK
FEDNALPWDS IDAATYVKSG DLTAFEPELL KEHNARIAKD PEFQNIMKDI ARFNAMKDKR
NIVSLNYAVR EKENNEDDAT RLARLNERFK REGKPELKKL DDLPKDYQEP DPYLDETVNI
ALDLAKLEKA RPAEQPAPVK