Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4086 |
Symbol | ptrA |
ID | 6969226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3782793 |
End bp | 3785681 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643387844 |
Product | protease III |
Protein accession | YP_002272284 |
Protein GI | 209397695 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.851704 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGCA GCACCTGGTT CAAAGCATTA TTGTTGTTAG TTGCCCTTTG GGCACCCTTA AGTCAGGCAG AAACGGGATG GCAGCCGATT CAGGAAACCA TCCGTAAAAG TGATAAAGAT AACCGCCAGT ATCAGGCTAT ACGTCTGGAT AACGGTATGG TGGTCTTGCT GGTTTCTGAT CCGCAGGCAG TTAAATCGCT ATCGGCGCTG GTGGTGCCCG TTGGGTCGCT GGAAGATCCC GAGGCGTACC AGGGGCTGGC ACATTACCTT GAACATATGA GTCTGATGGG GTCGAAAAAG TACCCGCAGG CTGACAGTCT GGCCGAATAT CTCAAAATGC ACGGCGGTAG TCACAATGCC AGCACTGCGC CGTATCGCAC GGCTTTCTAT CTGGAAGTTG AGAACGACGC CTTGCCCGGT GCGGTAGACC GCCTGGCCGA TGCTATTGCA GAACCCTTGC TCGACAAGAA ATACGCCGAA CGTGAGCGTA ATGCAGTGAA CGCTGAATTA ACCATGGCGC GTACACGTGA CGGGATGCGC ATGGCACAGG TCAGCGCAGA AACCATTAAC CCGGCACACC CCGGTTCAAA GTTTTCTGGT GGTAACCTCG AAACTTTAAG CGACAAACCT GGTAATCCGG TGCAGCAGGC GCTGAAAGAT TTCCACGAGA AGTACTATTC CGCCAATTTG ATGAAGGCGG TTATTTACAG TAATAAACCG CTGCCGGAGT TGGCAAAAAT GGCGGCGGAC ACTTTTGGTC GCGTGCCGAA CAAAGAGAGC AAAAAACCGG AAATCACCGT GCCGGTAGTC ACCGACGCGC AAAAGGGCAT TATCATTCAT TACGTCCCAG CGCTGCCGCG TAAAGTTCTG CGCGTTGAGT TTCGCATCGA TAACAACTCA GCGAAGTTCC GCAGTAAAAC GGATGAATTG ATTACCTATC TGATTGGTAA TCGCAGCCCA GGTACACTGT CTGACTGGCT GCAAAAGCAG GGGTTAGTTG AGGGCATTAG CGCCAACTCC GATCCTATCG TCAACGGCAA CAGCGGCGTA TTAGCGATCT CTGCGTCTTT AACCGATAAA GGTCTGGCGA ATCGCGATCA GGTTGTGGCG GCTATTTTTA GCTACCTCAA TTTGTTACGT GAAAAAGGGA TCGATAAACA ATACTTCGAT GAACTGGCGA ATGTGCTGGA TATCGACTTC CGTTATCCGT CAATCACCCG TGATATGGAT TACGTCGAAT GGCTGGCAGA TACCATGATT CGCGTTCCTG TTGAGCATAC GCTGGATGCA GTCAATATTG CCGATCGGTA CGATGCTAAA GCAGTAAAAG AACGTCTGGC GATGATGACG CCGCAGAATG CGCGTATCTG GTATATCAGC CCGAAAGAGC CGCACAATAA AACGGCTTAT TTTGTCGATG CGCCGTATCA GGTCGATAAA ATCAGCGAAC AAACTTTCGC TGACTGGCAG CAAAAAGCTG CCAATATTGC GCTCTCCTTA CCGGAGCTTA ACCCCTATAT TCCTGACGAT TTCTCGCTGA TTAAGTCAGA GAAGAAATAT GACCATCCAG AGCTGATTGT TGATGAGTCG AATCTGCGCG TGGTGTATGC GCCAAGCCGT TATTTTTCCA GCGAACCCAA AGCTGATGTC AGCCTGATTT TGCGTAATCC GAAAGCCATG GACAGCGCCC GCAATCAGGT GATGTTTGCG CTCAATGATT ATCTCGCAGG GCTGGCGCTT GATCAGTTAA GCAACCAGGC GTCGGTTGGT GGCATAAGTT TTTCCACCAA CGCTAACAAC GGCCTTATGG TTAATGCTAA TGGTTACACC CAGCGTCTGC CGCAGCTGTT CCAGGCATTG CTCGAGGGAT ACTTTAGCTA TACCGCTACG GAAGATCAGC TTGAGCAGGC GAAGTCCTGG TATAACCAGA TGATGGATTC CGCAGAAAAG GGTAAAGCGT TTGAGCAGGC GATTATGCCC GCGCAGATGC TCTCGCAAGT GCCGTACTTC TCGCGAGATG AACGGCGTAA AATTTTGCCC TCCATTACGT TGAAAGAGGT GCTGGCTTAT CGTGACGCCT TAAAATCAGG GGCTCGACCA GAGTTTATGG TTATCGGCAA CATGACCGAG GCCCAGGCAA CAACGCTGGC ACGCGATGTG CAAAAACAGT TGGGCGCTGA TGGTTCAGAG TGGTGTCGAA ACAAAGATGT AGTGGTCGAT AAAAAACAAT CCGTCATCTT TGAAAAAGCC GGTAACAGCA CCGACTCCGC ACTGGCAGCG GTATTTGTAC CGACTGGCTA CGATGAATAC ACAAGCTCAG CCTATAGCTC TCTGTTGGGG CAGATCGTAC AGCCGTGGTT CTACAATCAG TTGCGTACCG AAGAACAATT GGGCTATGCC GTGTTTGCGT TTCCAATGAG CGTGGGGCGT CAGTGGGGCA TGGGCTTCCT TTTGCAAAGC AATGATAAAC AGCCTTCATT CTTGTGGGAG CGTTACAAGG CGTTTTTCCC AACCGCAGAG GCAAAATTGC GAGCGATGAA GCCAGATGAG TTTGCGCAAA TCCAGCAGGC GGTAATTACC CAGATGCTGC AGGCACCGCA AACGCTCGGC GAAGAAGCAT TGAAGTTAAG TAAAGATTTC GATCGCGGCA ATATGCGCTT CGATTCGCGT GATAAAATCG TGGCCCAGAT AAAACTGCTG ACGCCGCAAA AACTTGCTGA TTTCTTCCAT CAGGCGGTGG TCGAGCCGCA AGGCATGGCT ATTCTGTCGC AGATTTCCGG CAGCCAGAAC GGGAAAGCCG AATATGTGCA TCCTGAAGGC TGGAAAGTGT GGGAGAACGT CAGCGCGTTG CAGCAAACAA TGCCCCTGAT GAGTGAAAAG AATGAGTGA
|
Protein sequence | MPRSTWFKAL LLLVALWAPL SQAETGWQPI QETIRKSDKD NRQYQAIRLD NGMVVLLVSD PQAVKSLSAL VVPVGSLEDP EAYQGLAHYL EHMSLMGSKK YPQADSLAEY LKMHGGSHNA STAPYRTAFY LEVENDALPG AVDRLADAIA EPLLDKKYAE RERNAVNAEL TMARTRDGMR MAQVSAETIN PAHPGSKFSG GNLETLSDKP GNPVQQALKD FHEKYYSANL MKAVIYSNKP LPELAKMAAD TFGRVPNKES KKPEITVPVV TDAQKGIIIH YVPALPRKVL RVEFRIDNNS AKFRSKTDEL ITYLIGNRSP GTLSDWLQKQ GLVEGISANS DPIVNGNSGV LAISASLTDK GLANRDQVVA AIFSYLNLLR EKGIDKQYFD ELANVLDIDF RYPSITRDMD YVEWLADTMI RVPVEHTLDA VNIADRYDAK AVKERLAMMT PQNARIWYIS PKEPHNKTAY FVDAPYQVDK ISEQTFADWQ QKAANIALSL PELNPYIPDD FSLIKSEKKY DHPELIVDES NLRVVYAPSR YFSSEPKADV SLILRNPKAM DSARNQVMFA LNDYLAGLAL DQLSNQASVG GISFSTNANN GLMVNANGYT QRLPQLFQAL LEGYFSYTAT EDQLEQAKSW YNQMMDSAEK GKAFEQAIMP AQMLSQVPYF SRDERRKILP SITLKEVLAY RDALKSGARP EFMVIGNMTE AQATTLARDV QKQLGADGSE WCRNKDVVVD KKQSVIFEKA GNSTDSALAA VFVPTGYDEY TSSAYSSLLG QIVQPWFYNQ LRTEEQLGYA VFAFPMSVGR QWGMGFLLQS NDKQPSFLWE RYKAFFPTAE AKLRAMKPDE FAQIQQAVIT QMLQAPQTLG EEALKLSKDF DRGNMRFDSR DKIVAQIKLL TPQKLADFFH QAVVEPQGMA ILSQISGSQN GKAEYVHPEG WKVWENVSAL QQTMPLMSEK NE
|
| |