Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B3141 |
Symbol | |
ID | 6795775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 3067075 |
End bp | 3069963 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642777296 |
Product | protease 3 |
Protein accession | YP_002147903 |
Protein GI | 197248364 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCGCA GCACCTGGTT CAAAGCGTTA TTGTTATTGG TCGCCCTCTG GGGGCCTGCA GTTCAGGCGG ATATCGGTTG GCAACCGCTG CAAGAAACTA TCCGTAAAAG CGATAAAGAT ACCCGGCAGT ATCAGGCGAT ACGTCTTGAT AATGACATGG TGGTGTTGCT GGTATCCGAT CCGCAGGCTG TAAAGTCGCT TTCAGCGTTA GTGGTACCCG TTGGATCGCT TGAAGATCCT GAGGCTCATC AGGGGCTTGC TCATTATCTT GAGCATATGT GCCTGATGGG GTCAAAAAAA TATCCGCAGG CGGATAGCCT TGCCGAATAC CTTAAAAGAC ATGGCGGTAG CCACAACGCC AGCACCGCGC CTTATCGAAC AGCCTTCTAC CTGGAGGTTG AAAACGACGC GCTTCCAGGC GCCGTAGATC GCCTGGCCGA CGCCATTGCC GCGCCATTGC TCGATAAAAA GTATGCCGAA CGCGAACGAA ACGCGGTGAA TGCCGAGCTG ACGATGGCGC GCACCCGCGA CGGTATGCGT ATGGCGCAGG TAAGCGCCGA AACCATTAAC CCGGCGCATC CAGGCTCGCA CTTTTCTGGC GGCAATCTGG AAACGCTAAG CGATAAGCCG GGCAATCCGG TGCAACAGGC GTTGATCGCT TTTCATGAAA AATACTATTC ATCTAATCTG ATGAAGGCGG TGATTTACAG TAATAAACCC TTGCCGGAAC TGGCGAGTAT TGCCGCCGCA ACCTATGGTC GCGTGCCGAA TAAACAGATT AAAAAACCGG AAATTACCGT ACCTGTCATC ACCGAGGCGC AGAAGGGCAT CATTATTCAT TACGTGCCGG CACTACCGCG TAAAGTGCTG CGCGTGGAGT TTCGTATTGA TAACAATAGC GCGCAGTTCC GCAGCAAGAC CGATGAACTG GTCTCTTATC TGATTGGCAA TCGTAGCCCA GGGACGCTCT CTGACTGGCT GCAAAAACAA GGACTGGTCG AAGGTATTAG CGCGGATTCC GATCCGATTG TTAATGGTAA CAGCGGCGTA TTTGCCATTT CGGCAACGCT GACCGATAAA GGTTTGGCCA ATCGTGATGA AGTCGTCGCG GCTATTTTCA GCTATCTCAA TATGTTACGT GAAAAAGGGA TAGATAAACG TTACTTTGAC GAGCTGGCGC ATGTGCTGGA TCTCGACTTC CGCTATCCGT CGATTACCCG CGATATGGAC TATGTCGAAT GGCTGGCGGA CACTATGATC CGCGTCCCGG TAGCGCATAC GCTGGATGCG GCGAATATCG CCGATCGTTA CGATCCTGCC GCTATCAAAA ATCGCCTGGC GATGATGACG CCGCAGAATG CGCGCATCTG GTACATCAGT CCCCAGGAAC CCCATAATAA GACGGCGTAT TTTGTGGATG CGCCTTATCA GGTCGATAAG ATTAGCGAAC AGACGTTTAA AAACTGGCAG CAAAAAGCAC AGGGCATTGC GTTGTCGCTG CCGGAGTTAA ACCCCTATAT TCCTGATGAT TTTACGCTTG TTAAGAATGA CAAAAACTAC GTGCGGCCAG AACTGATTGT CGATAAAGCG GATTTGCGCG TGGTTTATGC GCCGAGTCGT TATTTCGCCA GCGAGCCAAA AGCTGACGTG AGCGTGGTAT TGCGTAACCC GCAGGCGATG GACAGCGCCC GCAATCAGGT ACTGTTCGCC CTTAATGATT ATCTGGCGGG AATGGCGCTC GATCAGCTCA GCAACCAGGC GGCAGTGGGC GGCATTAGCT TTTCCACTAA CGCCAACAAT GGTCTGATGG TCACTGCGAA CGGTTATACC CAGCGTTTGC CGCAGCTTTT CCTGGCGTTG CTGGAAGGAT ATTTTAGCTA CGACGCCACG GAGGAACAGC TGGCGCAGGC GAAATCCTGG TATACGCAGA TGATGGATTC TGCGGAGAAG GGAAAAGCCT ACGAACAGGC GATTATGCCG GTGCAGATGA TTTCGCAGGT GCCTTATTTT TCCCGTGATG AACGTCGCGC TTTGCTGCCG TCCATTACGT TGAAAGAGGT GATGGCCTAT CGTAATGCGT TAAAAACGGG CGCTCGTCCG GAATTTCTGG TTATAGGGAA TATGAGCGAA GCCCAGGCGA CCTCTCTGGC GCAAGATGTT CAAAAACAGC TCGCGGCGAA CGGATCGGCG TGGTGTCGCA ACAAAGACGT AGTGGTTGAG AAAAAGCAGT CCGTAATATT TGAAAAAGCG GGCAGTAGCA CCGACTCCGC GTTAGCCGCG GTCTTTGTCC CGGTCGGCTA CGACGAGTAC GTCAGCGCCG CCTACAGCGC GATGTTAGGT CAGATTGTTC AACCGTGGTT TTACAATCAG CTACGAACCG AAGAGCAACT GGGATACGCC GTTTTCGCCT TTCCGATGAG CGTTGGCCGT CAGTGGGGAA TGGGATTCCT GCTACAGAGC AACGATAAAC AGCCCTCTTA CCTGTGGCAA CGCTATCAGG CATTTTTCCC TGACGCCGAG GCGAAGCTGA GGGCGATGAA GCCGGAAGAG TTCGCCCAAA TTCAGCAGGC GATCATTACG CAAATGCGCC AGGCGCCGCA AACGTTGGGC GAAGAAGCAT CCCGTTTAAG CAAGGATTTC GATCGGGGTA ATATGCGCTT TGACTCGCGT GATAAAATCA TCGCTCAGAT AAAATTGCTG ACGCCACAAA AGCTTGCCGA CTTCTTCCAC CAGGCGGTGG TGGAACCACA AGGTATGGCA ATATTGTCAC AGATTGCTGG TAGCCAGAAT GGAAAAGCAG AATACGTGCA TCCGACAGGC TGGAAAGTGT GGGATAACGT CAGCGCATTG CAGCAAACGT TACCTCTAAT GAGCGAAAAG AATGAATGA
|
Protein sequence | MPRSTWFKAL LLLVALWGPA VQADIGWQPL QETIRKSDKD TRQYQAIRLD NDMVVLLVSD PQAVKSLSAL VVPVGSLEDP EAHQGLAHYL EHMCLMGSKK YPQADSLAEY LKRHGGSHNA STAPYRTAFY LEVENDALPG AVDRLADAIA APLLDKKYAE RERNAVNAEL TMARTRDGMR MAQVSAETIN PAHPGSHFSG GNLETLSDKP GNPVQQALIA FHEKYYSSNL MKAVIYSNKP LPELASIAAA TYGRVPNKQI KKPEITVPVI TEAQKGIIIH YVPALPRKVL RVEFRIDNNS AQFRSKTDEL VSYLIGNRSP GTLSDWLQKQ GLVEGISADS DPIVNGNSGV FAISATLTDK GLANRDEVVA AIFSYLNMLR EKGIDKRYFD ELAHVLDLDF RYPSITRDMD YVEWLADTMI RVPVAHTLDA ANIADRYDPA AIKNRLAMMT PQNARIWYIS PQEPHNKTAY FVDAPYQVDK ISEQTFKNWQ QKAQGIALSL PELNPYIPDD FTLVKNDKNY VRPELIVDKA DLRVVYAPSR YFASEPKADV SVVLRNPQAM DSARNQVLFA LNDYLAGMAL DQLSNQAAVG GISFSTNANN GLMVTANGYT QRLPQLFLAL LEGYFSYDAT EEQLAQAKSW YTQMMDSAEK GKAYEQAIMP VQMISQVPYF SRDERRALLP SITLKEVMAY RNALKTGARP EFLVIGNMSE AQATSLAQDV QKQLAANGSA WCRNKDVVVE KKQSVIFEKA GSSTDSALAA VFVPVGYDEY VSAAYSAMLG QIVQPWFYNQ LRTEEQLGYA VFAFPMSVGR QWGMGFLLQS NDKQPSYLWQ RYQAFFPDAE AKLRAMKPEE FAQIQQAIIT QMRQAPQTLG EEASRLSKDF DRGNMRFDSR DKIIAQIKLL TPQKLADFFH QAVVEPQGMA ILSQIAGSQN GKAEYVHPTG WKVWDNVSAL QQTLPLMSEK NE
|
| |