Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1623 |
Symbol | |
ID | 6969781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1570981 |
End bp | 1572300 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643385583 |
Product | minor capsid protein C |
Protein accession | YP_002270077 |
Protein GI | 209398131 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.0216288 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAGCAG AGCTGCGTAA TCTCCCGCAT ATTGCCAGCA TGGCTTTTAA TGAGCCGCTG ATGCTTGAAC CCGCCTATGC GCGGGTTTTC TTTTGTGCGC TTGCAGGCCA GCTTGGGATC AGCCGCCTGA CGGATGCAGT ATCCGGCGAC AGCCTGACTG CCGGAGAGGC ACCCGCGGCG CTGGCGTTAT CCGGTGATGA TGACGGACCA CGACAGGCCC GGAGTTATCA GGTCATGAAC GGCATCGCCG TGCTGCCGGT GTCCGGTACG CTGGTCAGCC GGACGCGGGC GCTGCAGCCG TATTCGGGAA TGACCGGTTA CAACGGCATT ATCGCCCGTC TGCAACAGGC TGCCAGCGAT CCGATGGTGG ACGGCATTCT GCTCGATATG GACACACCGG GCGGGATGGT GGCGGGAGCA TTTGACTGTG CTGACATCAT CGCCCGTGTG CGAGACATAA AACCGGTATG GGCGCTGGCC AACGACATGA ACTGCAGTGC AGGTCAGCTG CTTGCCAGCG CCGCCTCCCG GCGTCTGGTC ACGCAGACCG CCCGGACAGG CTCCATCGGC GTCATGATGG CTCACAGTAA TTACGGTGCT GCCCTGGAGA AACAGGGCGT GGAAATCACG CTGATTTACA GCGGCAGCCA TAAGGTGGAT GGCAACCCCT ACAGCCATCT ACCGGGTGAT GTCCGGGAAA CACTGCAGTC CCGGATGGAT GCAACCCGCC GGATGTTTGC GCAGAAGGTG TCGGCATATA CCGGCCTGTC CGTGCAGGCT GTGCTGGATA CCGAGGCTGC AGTGTACAGC GGTCAGGAGG CCATTGATGC CGGACTGGCT GATGAACTTG TCAACAGCAC CGATGCGATC ACCGTTATGC GTGATGCACT GGATGCACGT AAATCCCGTC TCTCAGGAGG GCGAATGACC AAAGAGACTC AATCAACAAC TGTTTCAGCC ACTGCTTCGC AGGCTGACGT TACTGGCGTG GTGCCAGCGA CGGAGGGCGA AAACGCCAGC GCGGCGCAGC CGGACGTGAA CGCGCAGATC ACCGCTGCGG TTGCGGCAGA AAACAGCCGC ATTATGGGGA TCCTCAACTG TGAGGAGGCT CACGGACGCG AAGAACAGGC CCGCGTGCTG GCAGAAACCC CCGGTATGAC CGTGGAAACG GCCCGCCGCA TTCTGGCCGC AGCACCACAG AGTGCACAGG CGCGCAGTGA TACTGCGCTG GATCGTCTGA TGCAGGGGGC ACCGGCACCG CTGGCTGCAG GTAACCCGGC ATCTGATGCC GTTAACGATT TGCTGAACAC ACCAGTGTAA
|
Protein sequence | MTAELRNLPH IASMAFNEPL MLEPAYARVF FCALAGQLGI SRLTDAVSGD SLTAGEAPAA LALSGDDDGP RQARSYQVMN GIAVLPVSGT LVSRTRALQP YSGMTGYNGI IARLQQAASD PMVDGILLDM DTPGGMVAGA FDCADIIARV RDIKPVWALA NDMNCSAGQL LASAASRRLV TQTARTGSIG VMMAHSNYGA ALEKQGVEIT LIYSGSHKVD GNPYSHLPGD VRETLQSRMD ATRRMFAQKV SAYTGLSVQA VLDTEAAVYS GQEAIDAGLA DELVNSTDAI TVMRDALDAR KSRLSGGRMT KETQSTTVSA TASQADVTGV VPATEGENAS AAQPDVNAQI TAAVAAENSR IMGILNCEEA HGREEQARVL AETPGMTVET ARRILAAAPQ SAQARSDTAL DRLMQGAPAP LAAGNPASDA VNDLLNTPV
|
| |