Gene EcSMS35_1425 details


Gene Information

Locus tag: EcSMS35_1425
Symbol: sppA
ID: 6146703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1409620
End bp: 1411488
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 52%
IMG OID: 641616303
Product: protease 4
Protein accession: YP_001743483
Protein GI: 170683467
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00705] signal peptide peptidase SppA, 67K type
            [TIGR00706] signal peptide peptidase SppA, 36K type
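
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that adds no residue. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1409620
end_bp = 1411488

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1869 622
```

Under those assumptions the result agrees with the listed gene length (1869 bp) and protein length (622 aa).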


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.107145
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGGAGAAT ACATGCGAAC CCTTTGGCGA TTTATTGCCG GATTTTTTAA ATGGACGTGG 
CGTCTGCTGA ATTTCGTCCG TGAAATGGTA CTTAACCTGT TCTTTATTTT TCTCGTGCTG
GTTGGTGTGG GGATCTGGAT GCAGGTCAGT GGTGGTGATT CGAAAGAAAC GGCCAGTCGT
GGCGCACTGC TGCTGGACAT TTCTGGTGTG ATCGTCGATA AACCCGACAG TTCTCAGCGG
TTTAGTAAAT TAAGCCGCCA GCTGCTTGGT GCCAGTTCCG ATCGTCTGCA GGAAAACTCA
CTGTTTGATA TCGTCAACAC CATTCGCCAG GCGAAGGACG ACCGCAATAT CACCGGTATC
GTGATGGATC TGAAAAACTT CGCAGGCGGC GACCAGCCGT CTATGCAGTA CATCGGCAAA
GCCCTGAAAG AGTTTCGTGA CAGCGGGAAA CCGGTTTATG CCGTTGGCGA GAACTACAGC
CAGGGGCAAT ATTATCTCGC CAGTTTCGCC AATAAAATTT GGCTGTCACC GCAAGGCGTG
GTGGATCTGC ACGGTTTTGC CACCAACGGT CTGTACTACA AATCGTTACT GGATAAGCTG
AAAGTTTCCA CCCATGTGTT CCGCGTGGGT ACGTATAAAT CTGCCGTTGA ACCGTTTATT
CGTGATGATA TGTCACCGGC AGCCCGCGAA GCTGACAGCC GCTGGATTGG TGAGCTGTGG
CAAAACTATC TGAATACTGT TGCCGCTAAC CGGCAGATCC CTGCTCAGCA GGTATTCCCT
GGCGCGCAAG GGTTGCTTGA GGGTTTAACC AAAACCGGTG GCGATACCGC GAAATATGCG
CTGGAAAACA AGCTGGTCGA TGCACTGGCA TCGAGTGCCG AAATCGAAAA AGCTCTGACC
AAAGAGTTCG GCTGGAGTAA GACTGATAAA AATTATCGCG CCATCAGTTA TTACGATTAC
GCATTGAAAA CGCCGGCAGA TACCGGTGAC AGCATCGGAG TCGTCTTCGC TAATGGCGCA
ATTATGGATG GCGAGGAAAC TCAGGGGAAT GTTGGCGGTG ATACCACTGC GGCACAAATC
CGCGACGCTC GCCTTGACCC GAAAGTGAAA GCGATTGTTC TGCGTGTTAA TAGTCCCGGC
GGCAGCGTTA CTGCGTCTGA AGTGATTCGC GCTGAACTGG CAGCAGCCCG GGCAGCGGGT
AAGCCTGTGG TTGTGTCGAT GGGCGGTATG GCGGCATCTG GTGGTTACTG GATTTCCACA
CCAGCTAATT ACATTGTGGC TAACCCCAGC ACCCTGACCG GTTCTATCGG TATCTTCGGC
GTGATCACCA CCGTAGAAAA TAGTCTGGAT TCGATTGGTG TTCATACTGA TGGTGTCTCA
ACTTCACCGC TGGCGGATGT TTCTATCACC AGGGCACTGC CGCCGGAAGC ACAGCAGATG
ATGCAGTTAA GCATTGAGAA TGGCTATAAA CGCTTTATCA CGCTGGTTGC TGATGCGCGT
CATTCAACGC CAGAGCAGAT TGATAAAATC GCCCAGGGCC ACGTCTGGAC CGGCCAGGAT
GCAAAAGCTA ACGGGCTGGT CGATAGTCTC GGGGATTTCG ATGATGCGGT CGCCAAAGCA
GCAGAGCTGG CAAAAGTGAA ACAGTGGCAT CTGGAATACT ACGTTGATGA ACCGACCTTC
TTCGACAAAG TGATGGACAA CATGTCTGGT TCTGTCCGGG CAATGTTGCC AGATGCGTTC
CAGGCCATGT TACCTGCACC GCTGGCCTCG GTAGCCTCTA CCGTTAAAAG TGAAAACGAC
AAGCTGGCCG CGTTTAACGA CCCACAAAAC CGTTATGCGT TTTGCCTGAC CTGCGCCAAC
GTGCGTTAA
 
Protein sequence
MGEYMRTLWR FIAGFFKWTW RLLNFVREMV LNLFFIFLVL VGVGIWMQVS GGDSKETASR 
GALLLDISGV IVDKPDSSQR FSKLSRQLLG ASSDRLQENS LFDIVNTIRQ AKDDRNITGI
VMDLKNFAGG DQPSMQYIGK ALKEFRDSGK PVYAVGENYS QGQYYLASFA NKIWLSPQGV
VDLHGFATNG LYYKSLLDKL KVSTHVFRVG TYKSAVEPFI RDDMSPAARE ADSRWIGELW
QNYLNTVAAN RQIPAQQVFP GAQGLLEGLT KTGGDTAKYA LENKLVDALA SSAEIEKALT
KEFGWSKTDK NYRAISYYDY ALKTPADTGD SIGVVFANGA IMDGEETQGN VGGDTTAAQI
RDARLDPKVK AIVLRVNSPG GSVTASEVIR AELAAARAAG KPVVVSMGGM AASGGYWIST
PANYIVANPS TLTGSIGIFG VITTVENSLD SIGVHTDGVS TSPLADVSIT RALPPEAQQM
MQLSIENGYK RFITLVADAR HSTPEQIDKI AQGHVWTGQD AKANGLVDSL GDFDDAVAKA
AELAKVKQWH LEYYVDEPTF FDKVMDNMSG SVRAMLPDAF QAMLPAPLAS VASTVKSEND
KLAAFNDPQN RYAFCLTCAN VR
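
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the pipeline used to produce this record. It assumes NCBI translation table 11, whose amino acid assignments match the standard code and which allows alternative start codons such as TTG. It also assumes the first codon of the CDS is read as methionine whenever it is a permitted start codon. The function name `translate_cds` and the constants are hypothetical.

```python
from itertools import product

# Standard codon order (T, C, A, G) with the amino acid assignments
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            # Assumption: an alternative start codon is read as methionine.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence above.
first_block = "TTGGGAGAAT ACATGCGAAC CCTTTGGCGA TTTATTGCCG GATTTTTTAA ATGGACGTGG"
print(translate_cds(first_block))  # prints: MGEYMRTLWRFIAGFFKWTW
```

The output matches the first 20 residues of the protein sequence. The gene begins with TTG, which is why the protein starts with M even though TTG encodes leucine at internal positions.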