Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A3321 |
Symbol | |
ID | 6871413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | - |
Start bp | 3197393 |
End bp | 3200281 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642786328 |
Product | protease 3 |
Protein accession | YP_002216967 |
Protein GI | 198242678 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 0.328834 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGCA GCACCTGGTT CAAAGCGTTA TTGTTATTGG TCGCCCTCTG GGGGCCTGCA GTTCAGGCGG ATATCGGTTG GCAACCGCTG CAAGAAACTA TCCGTAAAAG CGATAAAGAT ACCCGGCAGT ATCAGGCGAT ACGTCTTGAT AATGACATGG TGGTGTTGCT CGTATCCGAT CCGCAGGCTG TAAAGTCGCT TTCAGCGTTA GTGGTGCCCG TTGGATCGCT TGAAGATCCT GAGGCTCATC AGGGGCTTGC TCATTATCTT GAGCATATGT GCCTGATGGG GTCAAAAAAA TATCCGCAGG CGGATAGCCT TGCCGAATAC CTTAAAAGAC ATGGCGGTAG CCACAACGCC AGCACCGCGC CTTATCGAAC AGCCTTCTAC CTGGAGGTTG AAAACGACGC GCTTCCAGGT GCCGTAGATC GACTGGCCGA CGCCATTGCC GCGCCATTGC TCAATAAAAA GTATGCCGAA CGCGAACGAA ACGCGGTGAA TGCCGAGCTG ACGATGGCGC GCACCCGCGA CGGTATGCGT ATGGCGCAGG TAAGCGCCGA AACCATTAAC CCGGCGCATC CAGGCTCGCA CTTTTCTGGC GGCAATCTGG AAACGCTAAG CGATAAGCCG GGAAATCCGG TGCAACAGGC GTTGATCGCT TTTCATGAAA AATACTATTC ATCTAATCTG ATGAAGGCGG TGATTTACAG TAATAAACCC TTGCCGGAAC TGGCGAGTAT TGCCGCCGCA ACCTATGGTC GCGTGCCGAA TAAACAGATT AAAAAACCGG AAATTACCGT ACCTGTCATC ACCGAGGCGC AGAAGGGCAT CATTATTCAT TACGTGCCGG CGCTACCGCG TAAAGTGCTG CGCGTGGAGT TTCGTATTGA TAACAATAGC GCGCAGTTCC GTAGCAAGAC CGATGAACTG GTCTCTTATC TGATTGGCAA TCGTAGCCCG GGGACGCTCT CTGACTGGCT GCAAAAACAA GGACTGGTCG AAGGTATTAG CGCGGATTCC GATCCGATTG TTAATGGTAA CAGCGGCGTA TTTGCCATTT CGGCAACGCT GACTGATAAA GGTTTGGCCA ATCGTGATGA AGTCGTCGCA GCTATCTTTA GCTATCTCAA TACGTTACGT GAAAAAGGGA TAGATAAACG TTACTTTGAC GAGCTGGCGC ATGTGCTGGA TCTCGACTTC CGCTATCCGT CGATTACCCG CGATATGGAC TATGTCGAGT GGCTGGCGGA CACTATGATC CGCGTCCCGG TAGCGCATAC GCTGGATGCG GCGAATATCG CCGATCGTTA CGATCCTGCC GCTATCAAAA ATCGCCTGGC GATGATGACG CCGCAGAATG CGCGCATCTG GTACATCAGT CCCCAGGAAC CCCATAATAA GATTGCGTAT TTTGTGGATG CGCCTTATCA GGTCGATAAG ATTAGCGAAC AGACGTTTAA AAACTGGCAG CAAAAAGCAC AGGGCATTGC GTTGTCGCTG CCGGAGTTAA ACCCCTATAT TCCTGATGAT TTTTCGCTTG TTAAGAATGA CAAAAACTAC GTGCGGCCAG AACTGATTGT CGATAAAGCG GATTTGCGCG TGGTTTATGC GCCGAGTCGT TATTTCGCCA GCGAGCCAAA AGCTGACGTG AGCGTGGTAT TGCGTAACCC GCAGGCGATG GACAGCGCCC GCAATCAGGT ACTGTTCGCC CTTAATGATT ATCTGGCGGG AATGGCGCTC GATCAGCTCA GCAACCAGGC GGCAGTGGGC GGCATTAGCT TTTCCACTAA CGCCAACAAT GGTCTGATGG TCACTGCGAA CGGTTATACC CAGCGTTTGC CGCAGCTTTT CCTGGCGCTG CTGGAAGGAT ATTTTAGCTA CGACGCCACG GAGGAACAGC TGGCGCAGGC GAAATCCTGG TATACGCAGA TGATGGATTC TGCGGAGAAG GGAAAAGCCT ACGAACAGGC GATTATGCCG GTGCAGATGA TTTCGCAGGT GCCTTATTTT TCCCGTGATG AACGTCGCGC TTTGCTGCCG TCCATTACGT TAAAAGAGGT GATGGCCTAT CGTAATGCGT TAAAAACGGG CGTTCGTCCG GAATTTCTGG TTATAGGGAA TATGAGCGAA GCCCAGGCGA CCTCTCTGGC GCAAGATGTC CAAAAACAGC TCGCGGCGAA CGGATCGGCG TGGTGTCGTA ACAAAGACGT GGTGGTTGAG AAAAAGCAGT CCGTAATATT TGAAAAAGCG GGCAGTAGCA CCGACTCCGC GTTAGCAGCG GTCTTTGTCC CGGTCGGCTA CGACGAGTAC GTCAGCGCCG CCTACAGCGC GATGTTAGGT CAGATTGTTC AACCGTGGTT TTACAATCAG CTACGAACCG AAGAGCAACT GGGATACGCT GTTTTCGCCT TTCCGATGAG CGTTGGCCGT CAGTGGGGAA TGGGATTCCT GCTACAGAGC AACGATAAAC AGCCCTCTTA CCTGTGGCAA CGCTATCAGG CATTTTTCCC TGACGCCGAG GCGAAGCTGA GGGCGATGAA GCCGGAAGAG TTCGCCCAAA TTCAGCAGGC GATCATTACG CAAATGCGCC AGGCGCCGCA AACGTTGGGC GAAGAAGCAT CCCGTTTAAG CAAGGATTTC GATCGGGGTA ATATGCGCTT TGACTCGCGT GATAAAATCA TCGCTCAGAT AAAATTGCTG ACGCCACAAA AGCTTGCCGA TTTCTTCCAC CAGGCGGTGG TGGAACCACA AGGTATGGCA ATATTGTCAC AGATTGCCGG TAGCCAGAAT GGAAAAGCAG AATACGTGCA TCCGACAGGC TGGAAAGTGT GGGATAACGT CAGCGCTTTG CAGCAAACGT TACCTCTAAT GAGCGAAAAG AATGAATGA
|
Protein sequence | MPRSTWFKAL LLLVALWGPA VQADIGWQPL QETIRKSDKD TRQYQAIRLD NDMVVLLVSD PQAVKSLSAL VVPVGSLEDP EAHQGLAHYL EHMCLMGSKK YPQADSLAEY LKRHGGSHNA STAPYRTAFY LEVENDALPG AVDRLADAIA APLLNKKYAE RERNAVNAEL TMARTRDGMR MAQVSAETIN PAHPGSHFSG GNLETLSDKP GNPVQQALIA FHEKYYSSNL MKAVIYSNKP LPELASIAAA TYGRVPNKQI KKPEITVPVI TEAQKGIIIH YVPALPRKVL RVEFRIDNNS AQFRSKTDEL VSYLIGNRSP GTLSDWLQKQ GLVEGISADS DPIVNGNSGV FAISATLTDK GLANRDEVVA AIFSYLNTLR EKGIDKRYFD ELAHVLDLDF RYPSITRDMD YVEWLADTMI RVPVAHTLDA ANIADRYDPA AIKNRLAMMT PQNARIWYIS PQEPHNKIAY FVDAPYQVDK ISEQTFKNWQ QKAQGIALSL PELNPYIPDD FSLVKNDKNY VRPELIVDKA DLRVVYAPSR YFASEPKADV SVVLRNPQAM DSARNQVLFA LNDYLAGMAL DQLSNQAAVG GISFSTNANN GLMVTANGYT QRLPQLFLAL LEGYFSYDAT EEQLAQAKSW YTQMMDSAEK GKAYEQAIMP VQMISQVPYF SRDERRALLP SITLKEVMAY RNALKTGVRP EFLVIGNMSE AQATSLAQDV QKQLAANGSA WCRNKDVVVE KKQSVIFEKA GSSTDSALAA VFVPVGYDEY VSAAYSAMLG QIVQPWFYNQ LRTEEQLGYA VFAFPMSVGR QWGMGFLLQS NDKQPSYLWQ RYQAFFPDAE AKLRAMKPEE FAQIQQAIIT QMRQAPQTLG EEASRLSKDF DRGNMRFDSR DKIIAQIKLL TPQKLADFFH QAVVEPQGMA ILSQIAGSQN GKAEYVHPTG WKVWDNVSAL QQTLPLMSEK NE
|
| |