Gene Information |
Locus tag | EcolC_2094 |
Symbol | |
ID | 6067300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2290665 |
End bp | 2291984 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641601502 |
Product | peptidase S49 |
Protein accession | YP_001725061 |
Protein GI | 170020107 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
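The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 2290665
END_BP = 2291984


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions the results agree with the listed Gene Length (1320 bp) and Protein Length (439 aa).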
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00430491 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAGCAG AGCTGCGTAA TCTCCCGCAT ATTGCCAGCA TGGCCTTTAA TGAGCCGCTG ATGCTTGAAC CCGCCTATGC GCGGGTTTTC TTTTGTGCGC TTGCAGGCCA GCTTGGGATC AGCCGCCTGA CGGATGCGGT GTCCGGCGAC AGCCTGACTG CCCAGGAGGC ACTCGCGACG CTGGCATTAT CCGGTGATGA TGACGGACCA CGACAGGCCC GCAGTTATCA GGTCATGAAC GGCATCGCCG TGCTGCCGGT GTCCGGCACG CTGGTCAGCC GGACGCGGGC GCTGCAGCCG TACTCGGGGA TGACCGGTTA CAACGGCATT ATCGCCCGTC TGCAACAGGC TGCCAGCGAT CCGATGGTGG ACGGCATTCT GCTCGATATG GACACGCCCG GCGGGATGGT GGCGGGGGCA TTTGACTGCG CTGACATCAT CGCCCGTGTG CGTGACATAA AACCGGTATG GGCGCTTGCC AACGACATGA ACTGCAGTGC AGGTCAGTTG CTTGCCAGTG CCGCCTCCCG GCGTCTGGTC ACGCAGACCG CCCGGACAGG CTCCATCGGC GTCATGATGG CTCACAGTAA TTACGGTGCT GCGCTGGAGA AACAGGGTGT GGAAATCACG CTGATTTACA GCGGCAGCCA TAAGGTGGAT GGCAACCCCT ACAGCCATCT TCCGGATGAC GTCCGGGAGA CACTGCAGTC CCGGATGGAT GCAACCCGCC AGATGTTTGC GCAGAAGGTG TCGGCATATA CCGGCCTGTC CGTGCAGGCT GTGCTGGATA CCGAGGCTGC AGTGTACAGC GGTCAGGAGG TCATTGATGC CGGACTGGCT GATGAACTTG TTAACAGCAC CGATGCGATC ACCGTCATGC GTGATGCACT GGATGCACGT AAATCCCGTC TCTCAGGAGG GCGAATGACC AAAGAGACTC AATCAACAAC TGTTTCAGCC ACTGCTTCGC AGGCTGACGT TACTGACGTG GTGCCAGCGA CGGAGGGCGA AAACGCCAGC GCGGCGCAGC CGGACGTGAA CGCGCAGATC ACCGCAGCGG TTGCGGCAGA AAACAGCCGC ATTATGGGGA TCCTCAACTG TGAGGAGGCT CACGGACGCG AAGAACAGGC ACGCGTGCTG GCCGAAACCC CCGGTATGAC CGTGGAAACG GCCCGCCGCA TTCTGGCCGC AGCACCACAG AGTGCACAGG CGCGCAGTGA CACTGCGCTG GATCGTCTGA TGCAGGGGGC ACCGGCACCG CTGGCTTCAG GTAACCCGGC ATCTGATGCC GTTAACGATT TGCTGAACAC ACCAGTGTAA
|
Protein sequence | MTAELRNLPH IASMAFNEPL MLEPAYARVF FCALAGQLGI SRLTDAVSGD SLTAQEALAT LALSGDDDGP RQARSYQVMN GIAVLPVSGT LVSRTRALQP YSGMTGYNGI IARLQQAASD PMVDGILLDM DTPGGMVAGA FDCADIIARV RDIKPVWALA NDMNCSAGQL LASAASRRLV TQTARTGSIG VMMAHSNYGA ALEKQGVEIT LIYSGSHKVD GNPYSHLPDD VRETLQSRMD ATRQMFAQKV SAYTGLSVQA VLDTEAAVYS GQEVIDAGLA DELVNSTDAI TVMRDALDAR KSRLSGGRMT KETQSTTVSA TASQADVTDV VPATEGENAS AAQPDVNAQI TAAVAAENSR IMGILNCEEA HGREEQARVL AETPGMTVET ARRILAAAPQ SAQARSDTAL DRLMQGAPAP LASGNPASDA VNDLLNTPV
|
| |
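The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG is an accepted start codon. The sketch below illustrates this on the first 30 bases of the gene sequence, along with a simple GC fraction helper. It is an illustration under stated assumptions, not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, the codon table is built from the standard NCBI amino acid string (which table 11 shares), and an initiator codon is always translated as M.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) mapped to amino acids.
# Table 11 uses the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # Assumption: an alternative start codon at position 0 is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence shown above.
FRAGMENT = "GTGACAGCAG AGCTGCGTAA TCTCCCGCAT"

if __name__ == "__main__":
    print(translate(FRAGMENT))  # MTAELRNLPH, the first 10 residues of the protein
    print(round(gc_content(FRAGMENT), 3))
```

Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the complete protein sequence and the listed GC content.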