Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2968 |
Symbol | ptrA |
ID | 6143373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3044627 |
End bp | 3047515 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617837 |
Product | protease III |
Protein accession | YP_001744989 |
Protein GI | 170682458 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.436713 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGCA GCATCTGGTT CAAAGCATTA TTGTTGTTTG TTGCCCTTTG GGCACCCTTA AGTCAGGCAG AAACGGGATG GCAGCCGATT CAGGAAACCA TCCGTAAAAG TGATAAAGAT AACCGCCAGT ATCAGGCTAT ACGTCTGGAT AACGGTATGG TGGTCTTGCT GGTTTCTGAT CCGCAGGCGG TTAAATCGCT CTCGGCGCTG GTGGTGCCCG TTGGGTCGCT GGAAGATCCC GAGGCGTATC AGGGGCTGGC GCATTACCTT GAACACATGA GTCTGATGGG GTCGAAAAAG TACCCGCAGG CTGACAGTCT GGCCGAATAT CTCAAAATGC ACGGCGGCAG TCACAATGCC AGCACGGCAC CGTATCGCAC GGCTTTCTAT CTGGAAGTTG AGAACGACGC CTTGCCCGGT GCGGTAGACC GCCTGGCGGA CGCTATTGCA GAACCCTTGC TCGACAAGAA ATACGCCGAA CGTGAACGTA ATGCAGTGAA TGCCGAATTA ACCATGGCGC GTACGCGTGA CGGGATGCGC ATGGCACAGG TCAGCGCAGA AACCATTAAC CCGGCACACC CCGGTTCAAA GTTTTCTGGT GGTAACCTCG AAACCTTAAG CGACAAACCA GGTAATCCGG TACAGCAGGC GCTGAAAGAT TTCCACGAGA AGTACTATTC CGCCAATCTG ATGAAGGCGG TTATTTACAG TAATAAACCG TTGCCGGAGT TGGCAAAAAT GGCGGCGGAC ACCTTTGGTC GCGTGCCGAA CAAAGAGAGC AAAAAACCGG AAATCACCGT GCCGGTAGTC ACCGACGCGC AAAAGGGCAT TATCATTCAT TACGTCCCGG CGTTGCCGCG TAAAGTTCTG CGCGTTGAGT TTCGCATCGA TAACAATTCA GCGAAGTTCC GTAGTAAAAC GGATGAATTG ATTACCTATC TGATTGGTAA TCGCAGCCCA GGTACACTTT CTGACTGGCT GCAAAAGCAG GGATTAGTTG AGGGTATTAG CGCCAATTCC GATCCTATCG TCAACGGCAA CAGCGGCGTA TTAGCGATCT CTGCGTCTTT AACCGATAAA GGTCTGGCGA ATCGCGATCA GGTTGTGGCG GCCATTTTTA GCTACCTCAA TCTGTTACGC GAAAAAGGGA TTGATAAACA ATACTTCGAT GAACTGGCGA ATGTGCTGGA TATCGACTTC CGTTATCCGT CAATCACCCG TGATATGGAT TACGTCGAAT GGCTGGCAGA TACCATGATT CGCGTTCCTG TTGAGCATAC GCTGGATGCA GTCAATATTG CCGATCGGTA CGACGCTAAA GCAGTAAAAG AACGCCTGGC GATGATGACG CCGCAGAATG CGCGTATCTG GTATATCAGC CCGAAAGAGC CGCACAACAA AACGGCTTAC TTTGTCGATG CGCCGTATCA GGTCGATAAA ATTAGCGCAC AAACTTTCGC TGACTGGCAG AAAAAAGCCG CCGACATTGC GCTCTCTTTG CCAGAGCTTA ACCCCTATAT TCCTGACGAT TTCTCGCTGA TTAAGTCAGA GAAGAAATAT GACCATCCAG AGCTGATTGT TGATGAGTCG AATCTGCGTG TGGTGTATGC GCCAAGCCGT TATTTTTCCA GCGAACCCAA AGCTGATGTC AGCCTGATTT TGCGTAATCC AAAAGCCATG GATAGCGCCC GCAATCAGGT GATGTTTGCG CTCAATGATT ATCTCGCAGG GCTGGCGCTT GATCAGTTAA GCAACCAGGC GTCGGTTGGT GGCATAAGTT TTTCCACCAA TGCTAACAAC GGCCTTATGG TTAATGCCAA TGGTTACACC CAGCGCCTGC CGCAGCTGTT CCAGGCATTG CTCGAGGGGT ACTTTAGCTA TACCGCTACG GAAGATCAGC TTGAGCAGGC GAAGTCCTGG TATAACCAGA TGATGGATTC TGCAGAAAAG GGCAAAGCGT TCGAGCAGGC GATTATGCCC GCGCAGATGC TCTCGCAAGT ACCATACTTC TCGCGAGATG AACGGCGCAA AATTTTGCCC TCCATTACTT TGAAAGAGGT GATGGCCTAT CGCGACGCCT TAAAATCAGG GGCTCGACCA GAGTTTATGG TTATCGGCAA CATGACCGAG GCCCAGGCAA CAACGCTGGC ACGCGATGTG CAAAAACAGT TGGGCGCTGA TGGTTCGGAG TGGTGTCGTA ACAAAGATGT CGTGGTCGAT AAAAAACAAT CCGTCATCTT TGAAAAAGCC GGTAACAGCA CCGACTCCGC ACTGGCAGCG GTATTTGTAC CGACTGGCTA CGATGAATAC ACCAGTTCAG CGTATAGTTC TCTGTTGGGG CAGATCGTAC AGCCGTGGTT CTACAATCAG TTGCGTACCG AAGAACAGTT GGGCTATGCC GTGTTTGCGT TTCCAATGAG CGTGGGGCGT CAGTGGGGCA TGGGCTTCCT GTTGCAAAGC AATGATAAAC AGCCTTCATT CTTGTGGGAG CGTTACAAGG CGTTTTTCCC GACCGCAGAA GCGAAACTGC GGGCGATGAA GCCAGAAGAG TTTGCGCAAA TCCAGCAGGC GGTAATTACC CAGATGCTGC AGGCACCGCA AACGCTCGGC GAAGAAGCAT CGAAGTTAAG TAAAGATTTC GATCGCGGCA ATATGCGCTT CGATTCGCGT GATAAAATCG TGGCCCAGAT AAAACTGCTG ACGCCGCAAA AACTTGCTGA TTTCTTCCAT CAGGCGGTGG TCGAGCCGCA AGGTATGGCT ATTCTGTCGC AGATTTCCGG CAGCCAGAAC GGGAAAGCCG AATATGTGCA TCCTGAAGGC TGGAAAGTGT GGGAGAACGT CAGCGCGTTG CAGCAAACAA TGCCCCTGAT GAGTGAAAAG AATGAGTGA
|
Protein sequence | MPRSIWFKAL LLFVALWAPL SQAETGWQPI QETIRKSDKD NRQYQAIRLD NGMVVLLVSD PQAVKSLSAL VVPVGSLEDP EAYQGLAHYL EHMSLMGSKK YPQADSLAEY LKMHGGSHNA STAPYRTAFY LEVENDALPG AVDRLADAIA EPLLDKKYAE RERNAVNAEL TMARTRDGMR MAQVSAETIN PAHPGSKFSG GNLETLSDKP GNPVQQALKD FHEKYYSANL MKAVIYSNKP LPELAKMAAD TFGRVPNKES KKPEITVPVV TDAQKGIIIH YVPALPRKVL RVEFRIDNNS AKFRSKTDEL ITYLIGNRSP GTLSDWLQKQ GLVEGISANS DPIVNGNSGV LAISASLTDK GLANRDQVVA AIFSYLNLLR EKGIDKQYFD ELANVLDIDF RYPSITRDMD YVEWLADTMI RVPVEHTLDA VNIADRYDAK AVKERLAMMT PQNARIWYIS PKEPHNKTAY FVDAPYQVDK ISAQTFADWQ KKAADIALSL PELNPYIPDD FSLIKSEKKY DHPELIVDES NLRVVYAPSR YFSSEPKADV SLILRNPKAM DSARNQVMFA LNDYLAGLAL DQLSNQASVG GISFSTNANN GLMVNANGYT QRLPQLFQAL LEGYFSYTAT EDQLEQAKSW YNQMMDSAEK GKAFEQAIMP AQMLSQVPYF SRDERRKILP SITLKEVMAY RDALKSGARP EFMVIGNMTE AQATTLARDV QKQLGADGSE WCRNKDVVVD KKQSVIFEKA GNSTDSALAA VFVPTGYDEY TSSAYSSLLG QIVQPWFYNQ LRTEEQLGYA VFAFPMSVGR QWGMGFLLQS NDKQPSFLWE RYKAFFPTAE AKLRAMKPEE FAQIQQAVIT QMLQAPQTLG EEASKLSKDF DRGNMRFDSR DKIVAQIKLL TPQKLADFFH QAVVEPQGMA ILSQISGSQN GKAEYVHPEG WKVWENVSAL QQTMPLMSEK NE
|
| |