Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A3158 |
Symbol | |
ID | 6517727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | - |
Start bp | 3053922 |
End bp | 3056810 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642748169 |
Product | protease 3 |
Protein accession | YP_002115944 |
Protein GI | 194735677 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.782979 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.694921 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGCA GCACCTGGTT CAAAGCGTTA TTGTTATTGG TCGCCCTCTG GGGGCCTGCA GTTCAGGCGG ATATCGGTTG GCAACCGCTG CAAGAAACTA TCCGTAAAAG CGATAAAGAT ACCCGGCAGT ATCAGGCGAT ACGTCTTGAT AATGACATGG TGGTGTTGCT GGTATCCGAT CCGCAGGCTG TAAAGTCGCT TTCAGCGTTA GTGGTGCCCG TTGGATCGCT TGAAGATCCT GAGGCTCATC AGGGGCTTGC TCATTATCTT GAGCATATGT GCCTGATGGG GTCAAAAAAA TATCCGCAGG CGGATAGCCT TGCCGAATAC CTTAAAAGAC ATGGCGGTAG CCACAACGCC AGCACCGCGC CTTATCGAAC AGCCTTCTAC CTGGAGGTTG AAAACGACGC GCTTCCAGGC GCCGTAGATC GCCTGGCCGA CGCTATTGCC GCGCCATTGC TTGATAAAAA GTATGCCGAA CGCGAACGAA ACGCGGTGAA TGCCGAGCTG ACGATGGCGC GTACCCGCGA CGGTATGCGT ATGGCGCAGG TAAGCGCCGA AACCATTAAC CCGGCGCATC CAGGCTCGCA CTTTTCTGGC GGCAATCTGG AAACGTTAAG CGATAAGCCG GGCAATCCGG TGCAACAGGC GTTGATCGCT TTTCATGAAA AATACTATTC ATCTAATCTG ATGAAGGCGG TGATTTACAG TAATAAACCC TTGCCGGAAC TGGCGCGTAT TGCCGCCGCA ACCTATGGTC GCGTGCCGAA TAAACAGATT AAAAAACCGG AAATTAACGT ACCTGTCATC ACCGAGGCGC AGAAGGGCAT CATTATTCAT TACGTGCCGG CGCTACCGCG TAAAGTGTTG CGCGTGGAGT TTCGTATTGA TAACAATAGC GCGCAGTTCC GCAGCAAGAC CGATGAACTG GTCTCTTATC TGATTGGCAA TCGTAGCCCG GGGACGCTCT CTGACTGGCT GCAAAAACAA GGACTGGTCG AAGGTATTAG CGCGGATTCC GATCCGATTG TTAATGGTAA CAGCGGCGTA TTTGCCATTT CGGCAACGCT GACTGATAAA GGTTTGGCCA ATCGTGATGA AGTCGTCGCA GCTATCTTTA GCTATCTCAA TACGTTACGT GAAAAAGGGA TAGATAAACG TTACTTTGAC GAGCTGGCGC ATGTGCTGGA TCTCGACTTC CGCTATCCGT CGATTACCCG CGATATGGAC TATGTCGAAT GGCTGGCGGA CACTATGATC CGCGTCCCGG TAGCGCATAC GCTGGATGCG GCGAATATCG CCGATCGTTA CGATCCTGCC GCTATCAAAA ATCGCCTGGC GATGATGACG CCGCAGAATG CGCGCATCTG GTACATTAGT CCCCAGGAAC CCCATAATAA GATTGCGTAT TTTGTGGATG CGCCTTATCA GGTCGATAAG ATTAGCGAAC AGACGTTTAA AAACTGGCAG CAAAAAGCAC AGGGCATTGC GTTGTCGCTG CCGGAGTTAA ACCCCTATAT TCCTGATGAT TTTACGCTTG TTAAGAATGA TAAAAACTAC GTGCGGCCAG AACTGATTGT CGATAAAGCG GATTTGCGCG TGGTTTATGC GCCGAGTCGT TATTTCGCCA GCGAGCCAAA AGCTGACGTG AGCGTGGTAT TGCGTAATCC GCAGGCGATG GACAGCGCCC GCAATCAGGT ACTGTTCGCC CTTAATGATT ATCTGGCGGG AATGGCGCTC GATCAGCTCA GCAACCAGGC GGCAGTGGGC GGCATTAGCT TTTCTACTAA CGCCAACAAT GGTCTGATGG TCACTGCGAA CGGTTATACC CAGCGTTTGC CGCAGCTTTT CCTGGCGTTG CTGGAAGGAT ATTTTAGCTA CGACGCCACG GAAGAACAGC TGGCGCAGGC GAAATCCTGG TATACGCAGA TGATGGATTC TGCGGAGAAG GGAAAAGCCT ACGAGCAGGC GATTATGCCG GTGCAGATGA TTTCGCAGGT GCCTTATTTT TCCCGTGATG AACGTCGCGC TTTGCTGCCG TCCATTACGT TGAAAGAGGT AATGGCCTAT CGTAATGCGT TAAAAACGGG CGCTCGTCCG GAATTTCTGG TTATAGGGAA TATGAGCGAA GCCCAGGCGA CCTCTCTGGC GCAAGATGTT CAAAAACAGC TCGCGGCGAA CGGATCGGCG TGGTGTCGCA ACAAAGACGT GGTGGTTGAG AAAAAGCAGT CCGTAATATT TGAAAAAGCG GGCAGTAGTA CCGACTCCGC GTTAGCAGCG GTCTTTGTCC CGGTCGGCTA CGACGAGTAC GTCAGCGCCG CCTACAGCGC GATGTTAGGT CAGATTGTTC AACCGTGGTT TTACAATCAG CTACGAACCG AAGAGCAACT GGGATACGCC GTTTTCGCCT TTCCGATGAG CGTTGGCCGT CAGTGGGGAA TGGGATTCCT GCTACAGAGC AACGATAAAC AGCCCTCTTA CCTGTGGCAA CGCTATCAGG CATTTTTCCC TGACGCCGAG GCGAAGCTGA GGGCGATGAA GCCGGAAGAG TTCGCCCAAA TTCAGCAGGC GATCATTACG CAAATGCGCC AGGCGCCGCA AACGTTGGGC GAAGAAGCAT CCCGTTTAAG CAAGGATTTC GATCGGGGTA ATATGCGCTT TGACTCGCGT GATAAAATCA TCGCTCAGAT AAAATTGCTG ACGCCACAAA AGCTTGCCGA CTTCTTCCAC CATGCGGTGG TGGAACCACA AGGTATGGCA ATATTGTCAC AGATTGCCGG TAGCCAGAAT GGAAAAGCAG AATACGTGCA TCCGACAGGC TGGAAAGTGT GGGATAACGT CAGCGCATTG CAGCAAACGT TACCTCTAAT GAGCGAAAAG AATGAATGA
|
Protein sequence | MPRSTWFKAL LLLVALWGPA VQADIGWQPL QETIRKSDKD TRQYQAIRLD NDMVVLLVSD PQAVKSLSAL VVPVGSLEDP EAHQGLAHYL EHMCLMGSKK YPQADSLAEY LKRHGGSHNA STAPYRTAFY LEVENDALPG AVDRLADAIA APLLDKKYAE RERNAVNAEL TMARTRDGMR MAQVSAETIN PAHPGSHFSG GNLETLSDKP GNPVQQALIA FHEKYYSSNL MKAVIYSNKP LPELARIAAA TYGRVPNKQI KKPEINVPVI TEAQKGIIIH YVPALPRKVL RVEFRIDNNS AQFRSKTDEL VSYLIGNRSP GTLSDWLQKQ GLVEGISADS DPIVNGNSGV FAISATLTDK GLANRDEVVA AIFSYLNTLR EKGIDKRYFD ELAHVLDLDF RYPSITRDMD YVEWLADTMI RVPVAHTLDA ANIADRYDPA AIKNRLAMMT PQNARIWYIS PQEPHNKIAY FVDAPYQVDK ISEQTFKNWQ QKAQGIALSL PELNPYIPDD FTLVKNDKNY VRPELIVDKA DLRVVYAPSR YFASEPKADV SVVLRNPQAM DSARNQVLFA LNDYLAGMAL DQLSNQAAVG GISFSTNANN GLMVTANGYT QRLPQLFLAL LEGYFSYDAT EEQLAQAKSW YTQMMDSAEK GKAYEQAIMP VQMISQVPYF SRDERRALLP SITLKEVMAY RNALKTGARP EFLVIGNMSE AQATSLAQDV QKQLAANGSA WCRNKDVVVE KKQSVIFEKA GSSTDSALAA VFVPVGYDEY VSAAYSAMLG QIVQPWFYNQ LRTEEQLGYA VFAFPMSVGR QWGMGFLLQS NDKQPSYLWQ RYQAFFPDAE AKLRAMKPEE FAQIQQAIIT QMRQAPQTLG EEASRLSKDF DRGNMRFDSR DKIIAQIKLL TPQKLADFFH HAVVEPQGMA ILSQIAGSQN GKAEYVHPTG WKVWDNVSAL QQTLPLMSEK NE
|
| |