Gene Information |
Locus tag | EcolC_2356 |
Symbol | |
ID | 6064794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2596246 |
End bp | 2597295 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641601759 |
Product | putative periplasmic protease |
Protein accession | YP_001725318 |
Protein GI | 170020364 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
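The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 2596246
END_BP = 2597295


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1050 349
```

Under these assumptions the result agrees with the reported Gene Length (1050 bp) and Protein Length (349 aa).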
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.203735 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000000387336 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGAATTGT TGTCTGAATA TGGTTTGTTT TTGGCGAAAA TCGTTACCGT TGTGCTAGCG ATTGCGGCGA TTGCCGCCAT TATTGTCAAT GTTGCTCAAC GTAATAAACG CCAGCGTGGC GAGTTACGGG TCAACAATCT CAGCGAACAG TATAAGGAGA TGAAAGAAGA ACTGGCCGCG GCGCTGATGG ACTCACATCA GCAAAAACAG TGGCACAAAG CGCAGAAGAA AAAGCATAAG CAAGAAGCGA AAGCAGCAAA AGCGAAAGCC AAACTGGGAG AGGTGGCAAC TGACAGTAAA CCCCGCGTCT GGGTGCTGGA TTTTAAAGGC AGCATGGACG CCCATGAAGT GAACTCGCTA CGTGAAGAGA TAACGGCTGT ACTCGCAGCA TTCAAACCGC AGGATCAGGT TGTACTACGT CTGGAAAGCC CTGGTGGCAT GGTGCATGGT TACGGGTTGG CGGCTTCGCA GCTGCAGCGT CTGCGTGATA AAAACATTCC TTTAACTGTT ACGGTAGACA AAGTCGCTGC CAGCGGCGGT TACATGATGG CCTGTGTGGC GGACAAAATT GTTTCCGCAC CGTTTGCTAT TGTGGGTTCC ATTGGGGTGG TGGCGCAAAT GCCCAACTTT AACCGCTTCC TGAAAAGCAA AGATATTGAT ATCGAACTTC ACACCGCCGG GCAGTATAAG CGTACGCTGA CCTTGCTGGG TGAAAATACC GAAGAAGGGC GGGAGAAATT CCGCGAAGAG CTGAACGAAA CGCATCAGTT ATTTAAAGAT TTTGTGAAGC GTATGCGTCC GTCTCTGGAT ATTGAACAGG TGGCAACGGG TGAACACTGG TACGGACAAC AGGCGGTAGA GAAAGGCCTG GTTGATGAAA TCAACACCAG TGATGAAGTT ATTCTTAGCC TGATGGAAGG CCGTGAAGTG GTCAATGTAC GCTATATGCA GCGTAAACGA CTCATTGACC GACTCACCGG CAGCGCGGCA GAGAGCGCCG ATCGATTGTT GTTACGCTGG TGGCAGCGGG GGCAAAAGCC ATTGATGTAA
|
Protein sequence | MELLSEYGLF LAKIVTVVLA IAAIAAIIVN VAQRNKRQRG ELRVNNLSEQ YKEMKEELAA ALMDSHQQKQ WHKAQKKKHK QEAKAAKAKA KLGEVATDSK PRVWVLDFKG SMDAHEVNSL REEITAVLAA FKPQDQVVLR LESPGGMVHG YGLAASQLQR LRDKNIPLTV TVDKVAASGG YMMACVADKI VSAPFAIVGS IGVVAQMPNF NRFLKSKDID IELHTAGQYK RTLTLLGENT EEGREKFREE LNETHQLFKD FVKRMRPSLD IEQVATGEHW YGQQAVEKGL VDEINTSDEV ILSLMEGREV VNVRYMQRKR LIDRLTGSAA ESADRLLLRW WQRGQKPLM
|
| |
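The relationship between the gene sequence, translation table 11, and the protein sequence can be sketched in plain Python. This is a minimal illustration rather than the database's own pipeline. It assumes the gene sequence is already shown in the coding orientation (the gene lies on the minus strand, but the sequence begins with GTG and ends with TAA), that the space-separated 10-base blocks are only display formatting, and that an initiator codon is translated as M. The START_CODONS set is a subset of the initiators table 11 allows, and all function names are hypothetical.

```python
from itertools import product

# Standard codon assignments, shared by translation tables 1 and 11,
# listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    codons = [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
first_blocks = "GTGGAATTGT TGTCTGAATA TGGTTTGTTT"
print(translate(first_blocks))  # MELLSEYGLF
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` gives a way to compare against the reported protein sequence and the GC content field.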