Gene Information |
Locus tag | EcE24377A_2225 |
Symbol | |
ID | 5587059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 2187783 |
End bp | 2189093 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640925893 |
Product | S49 family peptidase |
Protein accession | YP_001463293 |
Protein GI | 157157447 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
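The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2187783
END_BP = 2189093


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1311
print(protein_length_aa(length))  # 436
```

Both results agree with the Gene Length (1311 bp) and Protein Length (436 aa) fields.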
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0230467 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTGGA ACAATTTTCC GCACCTCGCC GCTAGGGCGT TCAATCAACC GCTTTTGCTG GAGCCCGCCT ACGCGCGGGT ATTTTTTTCT GCGCTGAGCG ACCGGTTCGG TACCGGGCGA CTGATTGATA CAGCGTCAGG AGAGGTAATG AACAGCGACG AAATGAACGC GCTCGCGATG GGGTGGGACA GCAGCGAGCG AACACGCCAG AAATCGTATC GCGTGGAGCG TGGTATAGCC GTTCTGCCGG TTACCGGGAC GCTGGTTCAT AAATTGGGTT ATATCAATCC GGTCAGCGGG ATGAGTGGTT ACGACGGAAT CGCAAAACGC CTGCAGCAGG CGATTTCTGA TCCCGATGTT AAGGGGATCC TGCTGGATAT TGATTCCCCT GGCGGTGAGG TCGCCGGCGC GTTTGATACC GCTGATTTAA TCGCCCGGGC GCGAGAGCAA AAACCGGTGT GGGCGCTGGC CAGCGATACG GCCTGCAGCG CCGCATATTT GCTGGCGTCA GCGTGTTCGC GCCGGCTGAT AACGCAGACC GGCACGGTTG GTTCAATCGG TGTCCTGATG GCTCACCGCT GCGTCGAAAA GGCGCTGGAG ATTGCCGGCG TTGACGTGAC GCTGATTTAC GCCGGCGCGC ACAAAGTCGA CGGGAACCCG TATTCCCAGC TGCCCGACGA CGTTCGCGAC GAATTCCAGC TGAGTATTAA CAGCACACGC GAGCAGTTCG CGCAAAAAGT CTCGGATTAT ACCGGGCTGA AAAAATCCAG GGTGCTGGCC ACAGAGGCCG CAGTATTTAT CGGCGCGGAC GCGATTAAAT CTGGTCTCGC TGATCAACTC GTTAATTACG CGGACGCTAT CGCAGTGATG GCCGACGCAC TGAAACCAAA AACGGAGCGA TTTATGCCAG GTACAACAGA AACCACGGCG GAGACCACGA CCACAGAACA AACCGCGGCT ACGACTACGG TCGCGCCGGT TGAGTCCAAC GCGGAGCAGA TTCGCGCGGA CGCCGCATCG AGCGAACTGG CACGCGTGAT GGCCATCATC AACTGTCCCG AAGCTGTTGG GCGCGAGGCG CAGGCAAAAG CGCTCGCTGC CGTCCCCGGG ATGACGGTCG GGCAGGCGCA GGCTGTCCTC GCGGCAGCAC CGCAAACAGC GCAGGCGCGG ACAGAAACGG CGCTCGATAC ACTCATGAGC ACTGAATCAC CGGAAACTAT TCAGGATGCC GGCAGCACCA CGGCAACAGG AACAACCGCA AACGTCTCGA TGCTGGTCGC GGCAGGGCGT TCAATTTTAG GGGATGAATA A
Protein sequence | MPWNNFPHLA ARAFNQPLLL EPAYARVFFS ALSDRFGTGR LIDTASGEVM NSDEMNALAM GWDSSERTRQ KSYRVERGIA VLPVTGTLVH KLGYINPVSG MSGYDGIAKR LQQAISDPDV KGILLDIDSP GGEVAGAFDT ADLIARAREQ KPVWALASDT ACSAAYLLAS ACSRRLITQT GTVGSIGVLM AHRCVEKALE IAGVDVTLIY AGAHKVDGNP YSQLPDDVRD EFQLSINSTR EQFAQKVSDY TGLKKSRVLA TEAAVFIGAD AIKSGLADQL VNYADAIAVM ADALKPKTER FMPGTTETTA ETTTTEQTAA TTTVAPVESN AEQIRADAAS SELARVMAII NCPEAVGREA QAKALAAVPG MTVGQAQAVL AAAPQTAQAR TETALDTLMS TESPETIQDA GSTTATGTTA NVSMLVAAGR SILGDE
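The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. As a minimal sketch of how the protein sequence relates to the gene sequence, the helper below translates a nucleotide string using the amino acid assignments of translation table 11 (the same assignments as the standard code; table 11 differs mainly in its permitted start codons, which does not matter here because the gene starts with ATG). It also computes a GC fraction. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only short excerpts of the record's sequence are used.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCCGTGGA ACAATTTTCC GCACCTCGCC"))  # MPWNNFPHLA
# Last 12 bases of the gene sequence, including the TAA stop codon.
print(translate("GGGGATGAAT AA"))                     # GDE
```

The two translated excerpts match the first block and the last three residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.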