Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3021 |
Symbol | |
ID | 6972250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2805937 |
End bp | 2807298 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643386856 |
Product | peptidase, U32 family |
Protein accession | YP_002271324 |
Protein GI | 209396381 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.0605791 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAAAC CGGAACTCCT TTCCCCGGCG GGAACGCTGA AAAATATGCG TTACGCTTTC GCTTATGGCG CAGATGCTGT TTATGCGGGC CAGCCGCGTT ACTCCCTGCG TGTGCGCAAC AACGAATTCA ACCACGAAAA CCTTCAGCTC GGCATCAATG AAGCCCACGC GCTGGGGAAA AAGTTTTATG TCGTGGTCAA CATTGCACCG CACAACGCCA AGCTGAAAAC CTTTATCCGT GACCTGAAAC CGGTGGTGGA AATGGGGCCG GATGCGCTGA TTATGTCCGA TCCAGGGCTG ATTATGCTGG TGCGTGAGCA CTTCCCTGAA ATGCCGATCC ATCTCTCGGT ACAGGCTAAC GCCGTGAACT GGGCGACGGT GAAATTCTGG CAGCAAATGG GACTGACCCG CGTGATCCTC TCTCGCGAGC TGTCGCTGGA AGAGATTGAA GAGATCCGCA ATCAGGTGCC GGATATGGAG ATCGAGATCT TCGTTCACGG CGCACTGTGC ATGGCCTACT CCGGTCGCTG CCTGCTCTCT GGCTATATCA ACAAGCGCGA TCCGAACCAG GGTACCTGCA CCAACGCCTG CCGCTGGGAG TACAACGTCC AGGAAGGGAA AGAAGATGAC GTTGGCAACA TCGTACACAA GTACGAGCCG ATTCCGGTGC AAAATGTTGA GCCGACGCTG GGTATCGGCG CGCCAACCGA CAAAGTGTTT ATGATCGAAG AAGCCCAGCG TCCGGGCGAG TATATGACCG CGTTTGAAGA TGAGCACGGC ACTTACATCA TGAACTCGAA AGATCTGCGC GCCATCGCCC ATGTAGAACG CCTGACCAAA ATGGGCGTGC ATTCGCTGAA AATCGAAGGC CGTACTAAAT CTTTCTACTA TTGCGCACGC ACCGCACAGG TTTATCGTAA AGCTATCGAT GATGCCGCTG CGGGCAAACC GTTCGATACC AGCCTGCTGG AAACTCTGGA AGGTCTGGCG CATCGTGGCT ATACCGAAGG TTTCCTGCGT CGTCATACTC ACGACGATTA TCAGAACTAC GAATACGGTT ATTCAGTTTC TGACCGCCAG CAGTTTGTTG GTGAGTTTAC CGGTGAGCGC AAGGGGGACC TCGCGGCGGT AGCGGTGAAA AATAAATTCT CCGTTGGCGA CAGCCTTGAG CTGATGACGC CGCAAGGCAA CATTAACTTT ACCCTTGAGC ACATGGAAAA CGCCAAAGGC GAAGCTATGC CGGTAGCACC AGGCGATGGT TATACTGTGT GGCTCCCGGT GCCGCAGGAT CTTGAGCTAA ATTACGCGCT GCTGATGCGT AATTTCTCCG GGGAAACTAC GCGTAACCCC CACGGTAAGT GA
|
Protein sequence | MFKPELLSPA GTLKNMRYAF AYGADAVYAG QPRYSLRVRN NEFNHENLQL GINEAHALGK KFYVVVNIAP HNAKLKTFIR DLKPVVEMGP DALIMSDPGL IMLVREHFPE MPIHLSVQAN AVNWATVKFW QQMGLTRVIL SRELSLEEIE EIRNQVPDME IEIFVHGALC MAYSGRCLLS GYINKRDPNQ GTCTNACRWE YNVQEGKEDD VGNIVHKYEP IPVQNVEPTL GIGAPTDKVF MIEEAQRPGE YMTAFEDEHG TYIMNSKDLR AIAHVERLTK MGVHSLKIEG RTKSFYYCAR TAQVYRKAID DAAAGKPFDT SLLETLEGLA HRGYTEGFLR RHTHDDYQNY EYGYSVSDRQ QFVGEFTGER KGDLAAVAVK NKFSVGDSLE LMTPQGNINF TLEHMENAKG EAMPVAPGDG YTVWLPVPQD LELNYALLMR NFSGETTRNP HGK
|
| |