Gene Information |
Locus tag | ECH74115_2486 |
Symbol | sppA |
ID | 6966725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2354643 |
End bp | 2356499 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643386355 |
Product | protease 4 |
Protein accession | YP_002270837 |
Protein GI | 209400845 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type; [TIGR00706] signal peptide peptidase SppA, 36K type |
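
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2354643
end_bp = 2356499

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1857
print(protein_length)  # 618
```

Both results agree with the Gene Length (1857 bp) and Protein Length (618 aa) fields listed above.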
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.944254 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.176457 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAACCC TTTGGCGATT TATTGCCGGA TTTTTTAAAT GGACGTGGCG TCTGCTGAAT TTCGTCCGTG AAATGGTACT TAACCTGTTC TTTATTTTTC TCGTGCTGGT TGGTGTGGGG ATCTGGATGC AGGTCAGTGG TGGTGATTCG AAAGAAACGG CCAGTCGTGG TGCACTGCTG CTGGACATTT CTGGTGTGAT CGTCGATAAA CCCGACAGTT CTCAGCGGTT TAGTAAATTA AGCCGCCAAC TACTTGGTGC CAGTTCCGAT CGTCTGCAGG AAAACTCACT GTTTGATATC GTCAATACCA TTCGCCAGGC GAAGGATGAC CGCAATATCA CCGGTATCGT GATGGATCTG AAAAACTTCG CAGGCGGCGA CCAACCGTCT ATGCAGTACA TCGGCAAAGC CCTGAAAGAG TTTCGTGACA GCGGGAAACC GGTTTATGCC GTTGGCGAGA ACTACAGCCA GGGGCAATAT TATCTCGCCA GTTTCGCCAA TAAAATTTGG CTGTCACCGC AAGGCGTGGT GGATTTGCAC GGTTTTGCTA CCAACGGTCT GTACTACAAA TCGTTGCTGG ATAAGCTGAA AGTTTCCACC CATGTGTTCC GCGTGGGTAC GTATAAATCT GCCGTTGAAC CGTTTATTCG TGATGACATG TCACCGGCAG CCCGCGAAGC TGACAGCCGC TGGATTGGTG AGCTGTGGCA AAACTATCTG AATACTGTTG CCGCTAACCG GCAGATCCCT GCTCAGCAGG TATTCCCTGG CGCGCAAGGG TTGCTTGAGG GTTTAACCAA AACCGGTGGC GATACCGCGA AATATGCACT GGAAAACAAG CTGGTCGATG CACTGGCATC GAGTGCGGAA ATCGAAAAAG CACTGACCAA AGAGTTCGGC TGGAGTAAGG CTGATAAAAA TTATCGCGCC ATCAGTTATT ACGATTACGC ATTGAAAACG CCGGCAGATA CCGGTGACAG CATCGGTGTC GTCTTCGCTA ATGGCGCAAT TATGGATGGC GAGGAAACTC AGGGGAATGT TGGCGGTGAT ACCACTGCGG CACAAATCCG CGACGCTCGC CTTGACCCGA AAGTGAAAGC GATTGTCCTG CGTGTTAATA GCCCAGGCGG CAGCGTTACC GCGTCTGAAG TGATTCGCGC TGAACTGGCA GCAGCCCGGG CAGCGGGTAA GCCTGTGGTT GTATCGATGG GCGGCATGGC GGCATCTGGT GGTTACTGGA TTTCCACGCC AGCTAATTAC ATTGTGGCTA ACCCCAGCAC CCTGACCGGT TCTATTGGTA TCTTCGGCGT GATCACCACC GTAGAAAATA GTCTGGATTC GATTGGTGTT CATACTGATG GTGTCTCAAC TTCACCGCTG GCGGATGTTT CTATCACCAG GGCACTGCCG CCGGAAGCGC AGCAGATGAT GCAGTTAAGC ATTGAGAATG GCTATAAACG CTTTATCACG CTGGTTGCTG ATGCGCGTCA TTCGACGCCG GAGCAGATTG ATAAAATCGC CCAGGGCCAC GTCTGGACCG GTCAGGACGC AAAAGCTAAC GGGCTGGTCG ATAGTCTCGG GGATTTCGAT GATGCGGTCG CCAAAGCAGC AGAGCTGGCA AAAGCGAAAC AGTGGCATCT GGAATACTAC GTTGATGAAC CGACCTTCTT CGACAAAGTG ATGGACAACA TGTCTGGTTC TGTCCGGGCA ATGTTGCCAG ATGCGTTCCA GGCCATGTTA CCTGCACCGC TGGCCTCGGT AGCCTCTACC GTTAAAAGTG AAAGCGACAA GCTGGCGGCG 
TTTAACGACC CACAAAACCG TTATGCGTTT TGCCTGACCT GCGCCAACGT GCGTTAA
|
Protein sequence | MRTLWRFIAG FFKWTWRLLN FVREMVLNLF FIFLVLVGVG IWMQVSGGDS KETASRGALL LDISGVIVDK PDSSQRFSKL SRQLLGASSD RLQENSLFDI VNTIRQAKDD RNITGIVMDL KNFAGGDQPS MQYIGKALKE FRDSGKPVYA VGENYSQGQY YLASFANKIW LSPQGVVDLH GFATNGLYYK SLLDKLKVST HVFRVGTYKS AVEPFIRDDM SPAAREADSR WIGELWQNYL NTVAANRQIP AQQVFPGAQG LLEGLTKTGG DTAKYALENK LVDALASSAE IEKALTKEFG WSKADKNYRA ISYYDYALKT PADTGDSIGV VFANGAIMDG EETQGNVGGD TTAAQIRDAR LDPKVKAIVL RVNSPGGSVT ASEVIRAELA AARAAGKPVV VSMGGMAASG GYWISTPANY IVANPSTLTG SIGIFGVITT VENSLDSIGV HTDGVSTSPL ADVSITRALP PEAQQMMQLS IENGYKRFIT LVADARHSTP EQIDKIAQGH VWTGQDAKAN GLVDSLGDFD DAVAKAAELA KAKQWHLEYY VDEPTFFDKV MDNMSGSVRA MLPDAFQAML PAPLASVAST VKSESDKLAA FNDPQNRYAF CLTCANVR
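
The sequences above are shown in space-separated blocks of ten, so they need to be normalized before any computation. The sketch below is one possible way to do that in Python and to run basic sanity checks against the fields of this record. The function names are hypothetical, and the start-codon check is a simplification: it accepts only ATG, although translation table 11 also permits alternative initiation codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize_sequence(raw: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, stop codon at end."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Usage with the first two blocks of the gene sequence from this record.
prefix = normalize_sequence("ATGCGAACCC TTTGGCGATT")
print(len(prefix))
print(prefix.startswith("ATG"))
```

Applied to the full Gene sequence field, the normalized length and GC percentage can be compared with the Gene Length and GC content fields of the record.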