Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3716 |
Symbol | |
ID | 6966624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3436354 |
End bp | 3437817 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643387510 |
Product | peptidase, M48 family |
Protein accession | YP_002271963 |
Protein GI | 209399555 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.997857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAGGC AGTTGAAAAA AAACCTGGTT GCAACCCTCA TTGCTGCTAT GACCATTGGT CAGGTAGCCC CGGCATTTGC CGACAGCGCA GACACCTTGC CGGATATGGG AACCTCCGCA GGAAGCACGC TTTCCATTGG TCAGGAAATG CAGATGGGCG ACTATTATGT CCGCCAGCTA CGCGGCAGCG CGCCGTTAAT TAATGACCCG CTGTTAACGC AATATATTAA TTCGCTGGGG ATGCGTCTGG TTTCGCATGC CAATTCGGTT AAGACACCGT TTCATTTCTT TCTGATCAAC AACGACGAAA TTAACGCCTT TGCTTTCTTT GGCGGCAACG TGGTGCTGCA CTCTGCCCTG TTCCGTTATT CCGATAACGA AAGTCAACTG GCTTCAGTTA TGGCGCACGA AATCTCCCAC GTCACCCAAC GTCACCTGGC GCGAGCGATG GAAGATCAGC AGCGCAACGC GCCGCTGACC TGGGTCGGCG CGTTAGGTTC TATTTTACTG GCGATGGCCA GTCCGCAGGC GGGGATGGCG GCGCTGACCG GTACACTGGC GGGAACGCGT CAGGGGATGA TCAGTTTCAC CCAGCAAAAT GAACAGGAAG CGGACCGCAT TGGTATTCAG GTGCTGCAAC GCTCGGGATT CGATCCGCAG GCGATGCCAA CCTTCCTCGA AAAATTACTC GATCAGGCGC GTTACTCCTC GCGCCCGCCG GAAATTTTAC TGACTCACCC GTTGCCGGAA AGTCGTCTGG CAGATGCCCG CAACCGTGCT AATCAGATGC GCCCGATGGT GGTGCAGTCG TCGGAAGATT TCTATCTGGC GAAAGCGCGC ACACTGGGGA TGTATAATTC CGGACGTAAC CAGCTCACCA GTGATTTGCT GGATGAATGG GCGAAAGGAA ACGTTCGTCA GCAACGAGCG GCGCAATATG GTCGTGCTTT ACAGGCGATG GAAGCCAATA AATACGACGA GGCGCGAAAA ACGCTGCAAC CGTTACTGGC GGCAGAACCT GGCAACGCAT GGTATCTCGA TCTGGCTACT GATATCGATC TTGGGCAAAA CAAAGCCAAT GAGGCGATCA ATCGTCTGAA AAATGCCCGC GATTTGCGCA CCAATCCTGT GTTGCAGCTC AACCTGGCGA ACGCTTATCT ACAAGGCGGT CAACCACAAG AAGCGGCCAA TATTCTTAAT CGCTACACCT TTAATAATAA AGATGACAGC AACGGCTGGG ATTTGCTGGC ACAGGCGGAA GCCGCGCTAA ATAACCGCGA TCAGGAGCTG GCTGCGCGAG CAGAAGGTTA TGCGCTCGCC GGACGACTCG ATCAGGCCAT TTCGCTGTTG AGTAGCGCCA GTTCGCAGGT GAAATTAGGC AGCCTGCAAC AAGCGCGTTA CGATGCGCGC ATCGACCAGT TGCGCCAGCT GCAGGAACGC TTTAAGCCTT ATACCAAGAT GTAA
|
Protein sequence | MFRQLKKNLV ATLIAAMTIG QVAPAFADSA DTLPDMGTSA GSTLSIGQEM QMGDYYVRQL RGSAPLINDP LLTQYINSLG MRLVSHANSV KTPFHFFLIN NDEINAFAFF GGNVVLHSAL FRYSDNESQL ASVMAHEISH VTQRHLARAM EDQQRNAPLT WVGALGSILL AMASPQAGMA ALTGTLAGTR QGMISFTQQN EQEADRIGIQ VLQRSGFDPQ AMPTFLEKLL DQARYSSRPP EILLTHPLPE SRLADARNRA NQMRPMVVQS SEDFYLAKAR TLGMYNSGRN QLTSDLLDEW AKGNVRQQRA AQYGRALQAM EANKYDEARK TLQPLLAAEP GNAWYLDLAT DIDLGQNKAN EAINRLKNAR DLRTNPVLQL NLANAYLQGG QPQEAANILN RYTFNNKDDS NGWDLLAQAE AALNNRDQEL AARAEGYALA GRLDQAISLL SSASSQVKLG SLQQARYDAR IDQLRQLQER FKPYTKM
|
| |