Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1425 |
Symbol | sppA |
ID | 6146703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1409620 |
End bp | 1411488 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616303 |
Product | protease 4 |
Protein accession | YP_001743483 |
Protein GI | 170683467 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.107145 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGGAGAAT ACATGCGAAC CCTTTGGCGA TTTATTGCCG GATTTTTTAA ATGGACGTGG CGTCTGCTGA ATTTCGTCCG TGAAATGGTA CTTAACCTGT TCTTTATTTT TCTCGTGCTG GTTGGTGTGG GGATCTGGAT GCAGGTCAGT GGTGGTGATT CGAAAGAAAC GGCCAGTCGT GGCGCACTGC TGCTGGACAT TTCTGGTGTG ATCGTCGATA AACCCGACAG TTCTCAGCGG TTTAGTAAAT TAAGCCGCCA GCTGCTTGGT GCCAGTTCCG ATCGTCTGCA GGAAAACTCA CTGTTTGATA TCGTCAACAC CATTCGCCAG GCGAAGGACG ACCGCAATAT CACCGGTATC GTGATGGATC TGAAAAACTT CGCAGGCGGC GACCAGCCGT CTATGCAGTA CATCGGCAAA GCCCTGAAAG AGTTTCGTGA CAGCGGGAAA CCGGTTTATG CCGTTGGCGA GAACTACAGC CAGGGGCAAT ATTATCTCGC CAGTTTCGCC AATAAAATTT GGCTGTCACC GCAAGGCGTG GTGGATCTGC ACGGTTTTGC CACCAACGGT CTGTACTACA AATCGTTACT GGATAAGCTG AAAGTTTCCA CCCATGTGTT CCGCGTGGGT ACGTATAAAT CTGCCGTTGA ACCGTTTATT CGTGATGATA TGTCACCGGC AGCCCGCGAA GCTGACAGCC GCTGGATTGG TGAGCTGTGG CAAAACTATC TGAATACTGT TGCCGCTAAC CGGCAGATCC CTGCTCAGCA GGTATTCCCT GGCGCGCAAG GGTTGCTTGA GGGTTTAACC AAAACCGGTG GCGATACCGC GAAATATGCG CTGGAAAACA AGCTGGTCGA TGCACTGGCA TCGAGTGCCG AAATCGAAAA AGCTCTGACC AAAGAGTTCG GCTGGAGTAA GACTGATAAA AATTATCGCG CCATCAGTTA TTACGATTAC GCATTGAAAA CGCCGGCAGA TACCGGTGAC AGCATCGGAG TCGTCTTCGC TAATGGCGCA ATTATGGATG GCGAGGAAAC TCAGGGGAAT GTTGGCGGTG ATACCACTGC GGCACAAATC CGCGACGCTC GCCTTGACCC GAAAGTGAAA GCGATTGTTC TGCGTGTTAA TAGTCCCGGC GGCAGCGTTA CTGCGTCTGA AGTGATTCGC GCTGAACTGG CAGCAGCCCG GGCAGCGGGT AAGCCTGTGG TTGTGTCGAT GGGCGGTATG GCGGCATCTG GTGGTTACTG GATTTCCACA CCAGCTAATT ACATTGTGGC TAACCCCAGC ACCCTGACCG GTTCTATCGG TATCTTCGGC GTGATCACCA CCGTAGAAAA TAGTCTGGAT TCGATTGGTG TTCATACTGA TGGTGTCTCA ACTTCACCGC TGGCGGATGT TTCTATCACC AGGGCACTGC CGCCGGAAGC ACAGCAGATG ATGCAGTTAA GCATTGAGAA TGGCTATAAA CGCTTTATCA CGCTGGTTGC TGATGCGCGT CATTCAACGC CAGAGCAGAT TGATAAAATC GCCCAGGGCC ACGTCTGGAC CGGCCAGGAT GCAAAAGCTA ACGGGCTGGT CGATAGTCTC GGGGATTTCG ATGATGCGGT CGCCAAAGCA GCAGAGCTGG CAAAAGTGAA ACAGTGGCAT CTGGAATACT ACGTTGATGA ACCGACCTTC TTCGACAAAG TGATGGACAA CATGTCTGGT TCTGTCCGGG CAATGTTGCC AGATGCGTTC CAGGCCATGT TACCTGCACC GCTGGCCTCG GTAGCCTCTA CCGTTAAAAG TGAAAACGAC AAGCTGGCCG CGTTTAACGA CCCACAAAAC CGTTATGCGT TTTGCCTGAC CTGCGCCAAC GTGCGTTAA
|
Protein sequence | MGEYMRTLWR FIAGFFKWTW RLLNFVREMV LNLFFIFLVL VGVGIWMQVS GGDSKETASR GALLLDISGV IVDKPDSSQR FSKLSRQLLG ASSDRLQENS LFDIVNTIRQ AKDDRNITGI VMDLKNFAGG DQPSMQYIGK ALKEFRDSGK PVYAVGENYS QGQYYLASFA NKIWLSPQGV VDLHGFATNG LYYKSLLDKL KVSTHVFRVG TYKSAVEPFI RDDMSPAARE ADSRWIGELW QNYLNTVAAN RQIPAQQVFP GAQGLLEGLT KTGGDTAKYA LENKLVDALA SSAEIEKALT KEFGWSKTDK NYRAISYYDY ALKTPADTGD SIGVVFANGA IMDGEETQGN VGGDTTAAQI RDARLDPKVK AIVLRVNSPG GSVTASEVIR AELAAARAAG KPVVVSMGGM AASGGYWIST PANYIVANPS TLTGSIGIFG VITTVENSLD SIGVHTDGVS TSPLADVSIT RALPPEAQQM MQLSIENGYK RFITLVADAR HSTPEQIDKI AQGHVWTGQD AKANGLVDSL GDFDDAVAKA AELAKVKQWH LEYYVDEPTF FDKVMDNMSG SVRAMLPDAF QAMLPAPLAS VASTVKSEND KLAAFNDPQN RYAFCLTCAN VR
|
| |