Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4891 |
Symbol | |
ID | 6971749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4525659 |
End bp | 4527155 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643388579 |
Product | peptidase, M16 (pitrilysin) family |
Protein accession | YP_002273007 |
Protein GI | 209399996 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.271502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGGCA CAAAAATTCG ACTTTTAGCG GGCGGTTTGC TGATGATGGC CACTGCTGGC TATGTGCAGG CAGATGCGCT CCAGCCTGAT CCAGCATGGC AACAGGGGAC GCTTTCCAAC GGTTTACAGT GGCAAGTGCT GACCACCCCC CAGCGTCCCA GCGATCGTGT TGAAATTCGC CTGCTGGTTA ATACCGGTTC GCTCGCCGAA AGTACACAAC AGAGCGGTTA CAGTCACGCC ATCCCTCGTA TTGCGCTAAC GCAAAGCGGT GGCCTTGACG CAGCACAGGC GCGTTCATTG TGGCAGCAGG GGATCGACCC TAAACGCCCG ATGCCGCCGG TAATTGTCTC TTATGACACC ACGCTGTTTA ATCTGAGTTT GCCCAATAAC CGTAACGATT TGCTGAAAGA GGCGCTCTCT TATCTGGCAA ATGCCACTGG CAAACTGACT ATCACGCCAG AAACCATCAA CCACGCGCTG CAAAGTCAGG ACATGGTGGC AACCTGGCCT GCCGATACTA AAGAGGGCTG GTGGCGCTAT CGTCTGAAAG GGTCAACCTT GTTAGGTCAC GATCCTGCCG ATCCGCTGAA ACAACCCGTT GAAGCGGAAA AAATTAAAGA TTTCTATCAG AAATGGTACA CCCCGGATGC AATGACGCTG CTGGTGGTGG GAAACGTGGA TGCGCGCTCG GTCGTCGACC AAATCAACAA AACGTTTGGC GAACTGAAAG GCAAACGTGA AACACCGGCT CCGGTGCCGA CGCTTTCTCC GCTGCGTGCG GAAGCGGTGA GTATTATGAC TGACGCGGTG CGTCAGGACC GGTTATCTAT CATGTGGGAT ACGCCGTGGC AGCCGATTCG TGAATCAGCC GCACTGCTGC GCTACTGGCG TGCGGACCTG GCCCGTGAGG CGCTGTTCTG GCATGTTCAG CAAGCGTTAA GCGCCAGTAA CAGCAAAGAC ATCGGTCTTG GATTTGACTG CCGTGTGCTG TATCTGCGTG CGCAGTGTGC CATCAACATC GAATCACCAA ACGACAAGCT GAACAGCAAC CTTAATCTGG TGGCGCGTGA ACTGGCGAAG GTTCGCGATA AAGGTCTGCC GGAAGAAGAG TTCAATGCGT TAGTGGCGCA AAAGAAACTG GAGCTGCAGA AACTGTTTGC CGCCTATGCA CGGGCTGATA CCGATATTCT GATGGGCCAG CGGATGCGTT CGTTGCAAAA TCAGGTCGTC GATATCGCGC CGGAGCAGTA TCAGAAACTG CGGCAGGATT TCCTTAACAG CCTGACTGTG GAGATGTTAA ATCAGGATCT GCGTCAGCAG TTGTCGAATG ATATGGCGTT AATACTGCTG CAGCCGAAAG GCGAGCCGGA ATTTAACATG AAAGCGTTGC AGGCGGCCTG GGATCAAATC ATGGCCCCAT CTACCGCCGC TGCGACCACC TCTGTCGCCA CGGATGACGT ACATCCTGAA GTGACGGATA TTCCACCTGC ACAGTAA
|
Protein sequence | MQGTKIRLLA GGLLMMATAG YVQADALQPD PAWQQGTLSN GLQWQVLTTP QRPSDRVEIR LLVNTGSLAE STQQSGYSHA IPRIALTQSG GLDAAQARSL WQQGIDPKRP MPPVIVSYDT TLFNLSLPNN RNDLLKEALS YLANATGKLT ITPETINHAL QSQDMVATWP ADTKEGWWRY RLKGSTLLGH DPADPLKQPV EAEKIKDFYQ KWYTPDAMTL LVVGNVDARS VVDQINKTFG ELKGKRETPA PVPTLSPLRA EAVSIMTDAV RQDRLSIMWD TPWQPIRESA ALLRYWRADL AREALFWHVQ QALSASNSKD IGLGFDCRVL YLRAQCAINI ESPNDKLNSN LNLVARELAK VRDKGLPEEE FNALVAQKKL ELQKLFAAYA RADTDILMGQ RMRSLQNQVV DIAPEQYQKL RQDFLNSLTV EMLNQDLRQQ LSNDMALILL QPKGEPEFNM KALQAAWDQI MAPSTAAATT SVATDDVHPE VTDIPPAQ
|
| |