Gene ECH74115_4086 details


Gene Information

Locus tag: ECH74115_4086
Symbol: ptrA
ID: 6969226
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3782793
End bp: 3785681
Gene Length: 2889 bp
Protein Length: 962 aa
Translation table: 11
GC content: 51%
IMG OID: 643387844
Product: protease III
Protein accession: YP_002272284
Protein GI: 209397695
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like
TIGRFAM ID:
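
The coordinate and length fields in this table are related by simple arithmetic, which can be used as a sanity check on the record. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated on this page, and the function names are illustrative.

```python
# Values copied from the Gene Information table.
START_BP = 3782793
END_BP = 3785681
GENE_LENGTH_BP = 2889
PROTEIN_LENGTH_AA = 962


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced, complete CDS whose last codon is a stop.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for this record under the assumptions above.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3785681 - 3782793 + 1 gives 2889 bp, and 2889 / 3 - 1 gives 962 aa, matching the listed values.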


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.851704
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCGCA GCACCTGGTT CAAAGCATTA TTGTTGTTAG TTGCCCTTTG GGCACCCTTA 
AGTCAGGCAG AAACGGGATG GCAGCCGATT CAGGAAACCA TCCGTAAAAG TGATAAAGAT
AACCGCCAGT ATCAGGCTAT ACGTCTGGAT AACGGTATGG TGGTCTTGCT GGTTTCTGAT
CCGCAGGCAG TTAAATCGCT ATCGGCGCTG GTGGTGCCCG TTGGGTCGCT GGAAGATCCC
GAGGCGTACC AGGGGCTGGC ACATTACCTT GAACATATGA GTCTGATGGG GTCGAAAAAG
TACCCGCAGG CTGACAGTCT GGCCGAATAT CTCAAAATGC ACGGCGGTAG TCACAATGCC
AGCACTGCGC CGTATCGCAC GGCTTTCTAT CTGGAAGTTG AGAACGACGC CTTGCCCGGT
GCGGTAGACC GCCTGGCCGA TGCTATTGCA GAACCCTTGC TCGACAAGAA ATACGCCGAA
CGTGAGCGTA ATGCAGTGAA CGCTGAATTA ACCATGGCGC GTACACGTGA CGGGATGCGC
ATGGCACAGG TCAGCGCAGA AACCATTAAC CCGGCACACC CCGGTTCAAA GTTTTCTGGT
GGTAACCTCG AAACTTTAAG CGACAAACCT GGTAATCCGG TGCAGCAGGC GCTGAAAGAT
TTCCACGAGA AGTACTATTC CGCCAATTTG ATGAAGGCGG TTATTTACAG TAATAAACCG
CTGCCGGAGT TGGCAAAAAT GGCGGCGGAC ACTTTTGGTC GCGTGCCGAA CAAAGAGAGC
AAAAAACCGG AAATCACCGT GCCGGTAGTC ACCGACGCGC AAAAGGGCAT TATCATTCAT
TACGTCCCAG CGCTGCCGCG TAAAGTTCTG CGCGTTGAGT TTCGCATCGA TAACAACTCA
GCGAAGTTCC GCAGTAAAAC GGATGAATTG ATTACCTATC TGATTGGTAA TCGCAGCCCA
GGTACACTGT CTGACTGGCT GCAAAAGCAG GGGTTAGTTG AGGGCATTAG CGCCAACTCC
GATCCTATCG TCAACGGCAA CAGCGGCGTA TTAGCGATCT CTGCGTCTTT AACCGATAAA
GGTCTGGCGA ATCGCGATCA GGTTGTGGCG GCTATTTTTA GCTACCTCAA TTTGTTACGT
GAAAAAGGGA TCGATAAACA ATACTTCGAT GAACTGGCGA ATGTGCTGGA TATCGACTTC
CGTTATCCGT CAATCACCCG TGATATGGAT TACGTCGAAT GGCTGGCAGA TACCATGATT
CGCGTTCCTG TTGAGCATAC GCTGGATGCA GTCAATATTG CCGATCGGTA CGATGCTAAA
GCAGTAAAAG AACGTCTGGC GATGATGACG CCGCAGAATG CGCGTATCTG GTATATCAGC
CCGAAAGAGC CGCACAATAA AACGGCTTAT TTTGTCGATG CGCCGTATCA GGTCGATAAA
ATCAGCGAAC AAACTTTCGC TGACTGGCAG CAAAAAGCTG CCAATATTGC GCTCTCCTTA
CCGGAGCTTA ACCCCTATAT TCCTGACGAT TTCTCGCTGA TTAAGTCAGA GAAGAAATAT
GACCATCCAG AGCTGATTGT TGATGAGTCG AATCTGCGCG TGGTGTATGC GCCAAGCCGT
TATTTTTCCA GCGAACCCAA AGCTGATGTC AGCCTGATTT TGCGTAATCC GAAAGCCATG
GACAGCGCCC GCAATCAGGT GATGTTTGCG CTCAATGATT ATCTCGCAGG GCTGGCGCTT
GATCAGTTAA GCAACCAGGC GTCGGTTGGT GGCATAAGTT TTTCCACCAA CGCTAACAAC
GGCCTTATGG TTAATGCTAA TGGTTACACC CAGCGTCTGC CGCAGCTGTT CCAGGCATTG
CTCGAGGGAT ACTTTAGCTA TACCGCTACG GAAGATCAGC TTGAGCAGGC GAAGTCCTGG
TATAACCAGA TGATGGATTC CGCAGAAAAG GGTAAAGCGT TTGAGCAGGC GATTATGCCC
GCGCAGATGC TCTCGCAAGT GCCGTACTTC TCGCGAGATG AACGGCGTAA AATTTTGCCC
TCCATTACGT TGAAAGAGGT GCTGGCTTAT CGTGACGCCT TAAAATCAGG GGCTCGACCA
GAGTTTATGG TTATCGGCAA CATGACCGAG GCCCAGGCAA CAACGCTGGC ACGCGATGTG
CAAAAACAGT TGGGCGCTGA TGGTTCAGAG TGGTGTCGAA ACAAAGATGT AGTGGTCGAT
AAAAAACAAT CCGTCATCTT TGAAAAAGCC GGTAACAGCA CCGACTCCGC ACTGGCAGCG
GTATTTGTAC CGACTGGCTA CGATGAATAC ACAAGCTCAG CCTATAGCTC TCTGTTGGGG
CAGATCGTAC AGCCGTGGTT CTACAATCAG TTGCGTACCG AAGAACAATT GGGCTATGCC
GTGTTTGCGT TTCCAATGAG CGTGGGGCGT CAGTGGGGCA TGGGCTTCCT TTTGCAAAGC
AATGATAAAC AGCCTTCATT CTTGTGGGAG CGTTACAAGG CGTTTTTCCC AACCGCAGAG
GCAAAATTGC GAGCGATGAA GCCAGATGAG TTTGCGCAAA TCCAGCAGGC GGTAATTACC
CAGATGCTGC AGGCACCGCA AACGCTCGGC GAAGAAGCAT TGAAGTTAAG TAAAGATTTC
GATCGCGGCA ATATGCGCTT CGATTCGCGT GATAAAATCG TGGCCCAGAT AAAACTGCTG
ACGCCGCAAA AACTTGCTGA TTTCTTCCAT CAGGCGGTGG TCGAGCCGCA AGGCATGGCT
ATTCTGTCGC AGATTTCCGG CAGCCAGAAC GGGAAAGCCG AATATGTGCA TCCTGAAGGC
TGGAAAGTGT GGGAGAACGT CAGCGCGTTG CAGCAAACAA TGCCCCTGAT GAGTGAAAAG
AATGAGTGA
 
Protein sequence
MPRSTWFKAL LLLVALWAPL SQAETGWQPI QETIRKSDKD NRQYQAIRLD NGMVVLLVSD 
PQAVKSLSAL VVPVGSLEDP EAYQGLAHYL EHMSLMGSKK YPQADSLAEY LKMHGGSHNA
STAPYRTAFY LEVENDALPG AVDRLADAIA EPLLDKKYAE RERNAVNAEL TMARTRDGMR
MAQVSAETIN PAHPGSKFSG GNLETLSDKP GNPVQQALKD FHEKYYSANL MKAVIYSNKP
LPELAKMAAD TFGRVPNKES KKPEITVPVV TDAQKGIIIH YVPALPRKVL RVEFRIDNNS
AKFRSKTDEL ITYLIGNRSP GTLSDWLQKQ GLVEGISANS DPIVNGNSGV LAISASLTDK
GLANRDQVVA AIFSYLNLLR EKGIDKQYFD ELANVLDIDF RYPSITRDMD YVEWLADTMI
RVPVEHTLDA VNIADRYDAK AVKERLAMMT PQNARIWYIS PKEPHNKTAY FVDAPYQVDK
ISEQTFADWQ QKAANIALSL PELNPYIPDD FSLIKSEKKY DHPELIVDES NLRVVYAPSR
YFSSEPKADV SLILRNPKAM DSARNQVMFA LNDYLAGLAL DQLSNQASVG GISFSTNANN
GLMVNANGYT QRLPQLFQAL LEGYFSYTAT EDQLEQAKSW YNQMMDSAEK GKAFEQAIMP
AQMLSQVPYF SRDERRKILP SITLKEVLAY RDALKSGARP EFMVIGNMTE AQATTLARDV
QKQLGADGSE WCRNKDVVVD KKQSVIFEKA GNSTDSALAA VFVPTGYDEY TSSAYSSLLG
QIVQPWFYNQ LRTEEQLGYA VFAFPMSVGR QWGMGFLLQS NDKQPSFLWE RYKAFFPTAE
AKLRAMKPDE FAQIQQAVIT QMLQAPQTLG EEALKLSKDF DRGNMRFDSR DKIVAQIKLL
TPQKLADFFH QAVVEPQGMA ILSQISGSQN GKAEYVHPEG WKVWENVSAL QQTMPLMSEK
NE
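
Both sequences above are laid out in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of that cleanup, plus a simple GC-fraction helper of the kind that could be used to recompute the GC content field. The function names are illustrative and are not part of the source database.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from this page.
first_line = "ATGCCCCGCA GCACCTGGTT CAAAGCATTA TTGTTGTTAG TTGCCCTTTG GGCACCCTTA"
last_line = "AATGAGTGA"

if __name__ == "__main__":
    head = clean_sequence(first_line)
    tail = clean_sequence(last_line)
    print(len(head))             # six blocks of ten bases
    print(head.startswith("ATG"))  # gene begins with a start codon
    print(tail.endswith("TGA"))    # gene ends with a stop codon
```

To process the full record, pass the whole multi-line gene sequence to `clean_sequence` and then to `gc_fraction`; the cleaned string should be 2889 characters long if the copy is complete.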