Gene Information |
Locus tag | EcE24377A_1990 |
Symbol | sppA |
ID | 5587437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 1974459 |
End bp | 1976315 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640925662 |
Product | protease 4 |
Protein accession | YP_001463065 |
Protein GI | 157158760 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
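The coordinate and length fields in the table above can be cross-checked against each other. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes the stop codon; the function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1974459, 1976315)
print(length, protein_length(length))  # 1857 618
```

Both values agree with the Gene Length and Protein Length rows for EcE24377A_1990.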
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0152485 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAACCC TTTGGCGATT TATTGCCGGA TTTTTTAAAT GGACGTGGCG TCTGCTGAAT TTCGTCCGTG AAATGGTACT TAACCTGTTC TTTATTTTCC TCGTACTGGT TGGTGTGGGG ATCTGGATGC AGGTCAGTGG TGGTGATTCG AAAGAAACGG CCAGTCGTGG CGCACTGCTG CTGGACATTT CTGGTGTGAT CGTCGATAAA CCCGACAGTT CTCAGCGGTT TAGTAAATTA AGCCGCCAGC TGCTTGGTGC CAGTTCCGAT CGTCTGCAGG AAAACTCACT GTTTGATATC GTCAACACCA TTCGCCAGGC GAAGGACGAC CGTAATATCA CCGGTATCGT GATGGATCTG AAAAACTTCG CAGGCGGCGA CCAGCCGTCT ATGCAGTACA TCGGCAAAGC CCTGAAAGAG TTTCGTGACA GCGGGAAACC GGTTTATGCC GTTGGCGAGA ACTACAGCCA GGGGCAATAT TATCTCGCCA GTTTCGCCAA TAAAATTTGG CTGTCACCGC AAGGCGTGGT TGATCTGCAC GGTTTTGCTA CCAACGGTCT GTACTACAAA TCGTTGCTGG ATAAGCTGAA AGTTTCCACC CATGTGTTCC GCGTGGGTAC GTATAAATCT GCCGTTGAAC CGTTTATTCG TGATGATATG TCACCGGCAG CCCGCGAAGC TGACAGCCGC TGGATTGGTG AGCTGTGGCA AAACTATCTG AATACTGTTG CCGCTAACCG GCAGATCCCT GCTCAGCAGG TATTCCCTGG CGCGCAAGGG TTGCTTGAGG GTTTAACCAA AACCGGTGGC GATACCGCGA AATATGCACT GGAAAACAAG CTGGTCGATG CACTGGCATC GAGTGCCGAA ATCGAAAAAG CTCTGACCAA AGAGTTCGGC TGGAGTAAGA CTGATAAAAA TTATCGCGCC ATCAGTTATT ACGATTACGC ATTGAAAACG CCGGCAGATA CCGGTGACAG CATCGGTGTC GTCTTCGCTA ATGGCGCAAT TATGGATGGC GAGGAAACTC AGGGGAATGT TGGCGGTGAT ACCACTGCGG CACAAATCCG CGACGCTCGC CTTGACCCGA AAGTGAAAGC GATTGTCCTG CGTGTTAATA GCCCAGGCGG CAGCGTTACC GCGTCTGAAG TGATTCGCGC TGAACTGGCA GCAGCCCGGG CAGCGGGTAA GCTTGTGGTT GTATCGATGG GCGGCATGGC GGCATCTGGT GGTTACTGGA TTTCCACGCC AGCTAATTAC ATTGTGGCTA ACCCCAGCAC CCTGACCGGT TCTATCGGTA TCTTCGGCGT GATCACCACC GTAGAAAATA GTCTGGATTC GATTGGTGTT CATACTGATG GTGTCTCAAC TTCACCGCTG GCGGATGTTT CTATCACCAG GGCACTGCCG CCGGAAGCGC AGCAGATGAT GCAATTAAGC ATTGAGAATG GCTATAAACG CTTTATCACG CTGGTTGCTG ATGCGCGTCA TTCGACGCCG GAGCAAATTG ATAAAATCGC CCAGGGCCAC GTCTGGACCG GTCAGGATGC AAAAGCTAAC GGGCTGGTCG ATAGTCTCGG GGATTTCGAT GATGCGGTTG CCAAAGCAGC AGAGCTGGCA AAAGTGAAAC AGTGGCATCT GGAATACTAC GTTGATGAAC CGACCTTCTT CGACAAAGTG ATGGACAACA TGTCTGGTTC TGTCCGGGCA ATGTTGCCAG ATGCGTTCCA GGCCATGTTA CCTGCACCGC TTGCCTCGGT AGCCTCTACC GTTAAAAGTG AAAGCGACAA GCTGGCCGCG 
TTTAACGACC CACAAAACCG TTATGCGTTT TGCCTGACCT GCGCCAACGT GCGTTAA
|
Protein sequence | MRTLWRFIAG FFKWTWRLLN FVREMVLNLF FIFLVLVGVG IWMQVSGGDS KETASRGALL LDISGVIVDK PDSSQRFSKL SRQLLGASSD RLQENSLFDI VNTIRQAKDD RNITGIVMDL KNFAGGDQPS MQYIGKALKE FRDSGKPVYA VGENYSQGQY YLASFANKIW LSPQGVVDLH GFATNGLYYK SLLDKLKVST HVFRVGTYKS AVEPFIRDDM SPAAREADSR WIGELWQNYL NTVAANRQIP AQQVFPGAQG LLEGLTKTGG DTAKYALENK LVDALASSAE IEKALTKEFG WSKTDKNYRA ISYYDYALKT PADTGDSIGV VFANGAIMDG EETQGNVGGD TTAAQIRDAR LDPKVKAIVL RVNSPGGSVT ASEVIRAELA AARAAGKLVV VSMGGMAASG GYWISTPANY IVANPSTLTG SIGIFGVITT VENSLDSIGV HTDGVSTSPL ADVSITRALP PEAQQMMQLS IENGYKRFIT LVADARHSTP EQIDKIAQGH VWTGQDAKAN GLVDSLGDFD DAVAKAAELA KVKQWHLEYY VDEPTFFDKV MDNMSGSVRA MLPDAFQAML PAPLASVAST VKSESDKLAA FNDPQNRYAF CLTCANVR
|
| |
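The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch, not the database's own pipeline: it assumes the sequence is pasted with its space-separated 10-base blocks, and it uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in its permitted start codons; this gene starts with ATG, so that difference does not matter here). `translate` and `gc_fraction` are hypothetical helper names.

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, keeping '*' for stops."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First seven codons of the gene sequence listed above.
print(translate("ATGCGAACCC TTTGGCGATT T"))  # MRTLWRF
# Last five codons, ending in the TAA stop.
print(translate("GCCAACGTGC GTTAA"))  # ANVR*
```

Passing the full gene sequence to `translate` and dropping the trailing `*` should give a string that can be compared directly with the protein sequence after its spaces are removed; `gc_fraction` on the same input can be compared with the GC content row.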