Gene Information |
Locus tag | BURPS668_A2497 |
Symbol | |
ID | 4888155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2411021 |
End bp | 2412709 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640132433 |
Product | sedolisin |
Protein accession | YP_001063490 |
Protein GI | 126445423 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
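The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range, hence the +1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # the stop codon encodes no residue
    return gene_length, protein_length


# Values copied from the Start bp and End bp rows of this record.
print(cds_lengths(2411021, 2412709))  # prints (1689, 562)
```

Under these assumptions the result agrees with the Gene Length (1689 bp) and Protein Length (562 aa) rows.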
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.458922 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGGCCGC TCGCGCTCGC CGCCGGCATC GCACATGGCG CGACGGATTG GGTCGATACG CATACCAAAG CTTTCCTGAA TCACGCGCAG ATCGAGACGC TCGCCCGCGG CGCGAACGCC GCATCGCTCG AGGTCGCGTC GGGCGAAGCC ACGCACGTCG TGGTCAGCCT GAAGCTGCGC AACGCCGAGC AATTGAAAGC CGTCGCGCGC AACGTCAACG ATCCGCATAG TTCGCAGTAT CGGCAGTACA TCACGAGCGC GCAGTTCCTC GCGAACTATG CGCCGACCGA AGCGCAGGTG AAACAGGTCG TCGCCTATCT GCGCAAGAAC GGCTTCGTCG ACATCCACGT CGCGCCGAAT CGCATGCTCG TCTCCGCGCG CGGCACCGCC GGCACGGTCA AGCAGGCGTT CAACACGTCG CTCGTGCATT TCGAGTACGC GGGCCGCGCG GGCTTCGCGA ACGCGTCGAC GGCGCAAGTG CCGCGCGCGC TCGGCGACAT CGTCGGCTCC GTGCTCGGCC TGCAGAACGT CGCGCGCGCC CGGCCGCTCA CGAAGATCGG CGCGATCGCG AAACCGCTCG CGCTCGCGTC CGGCACGGCG ACGGGCCACT ATCCATCCGA GTTTCCGGCG CTCTACAACG CAACGGGCGT GCCCACCGCG GCGAACGCGA CGGTCGGCAT CATCACGATC GGCGGCGTGT CGCAAGCGCT GTCGGATCTG CAGCAGTTCA CGAGCGCGAA CAGCTATCCG GACGTGTCGA CACAGACCAT CCAGACCAAC GGTTCCGGCG GCAACTACAG CGACGACCAG GAAGGCCAAG GCGAATGGGA TCTGGACAGC CAGTCGATCG TCGGCGCCGC GGGCGGCCAG CTCGGGCAAC TGATCTTCTA CATGGCCGAT CTCGACGCGT CGGGCAACAC CGGCCTCACG CAGGCATTCA ACCAGGCGGT GTCGGACAAC GCGGCGAAAG TGATCAACGT CTCGCTCGGC TGGTGCGAAA CCGATGCGAA CGCGGACGGC ACGCTTTCCG CCGAAGAACA GATCTTCACG CAGGCGGTCG CGCAAGGTCA GACGTTCGCG GTGTCCTCAG GCGACGAAGG CGTCTACGAG TGCAACAACC GCGGCTATCC CGATGGTTCG AACTACACGG TATCGTGGCC GGCGTCGTCG CCGCACGTGC TCGCGATCGG CGGCACGACG CTCTACACGA CTTCGTCGGG CGCCTTCTCG AACGAAACGG TATGGAACGA AGGGCTCGAC GGCAACGGCA AGCTGTGGGC GACGGGCGGC GGCGTCAGCA CGATCCTGCC GAACCCGTCA TGGCAGTCGG GCAGCCATCG CAAGCTGCCG GACATATCGT TCGACGCCGC GCAAAGCACG GGCGCGTATA TCTACAATTA CGGCCAGTTG CAGCAGATCG GCGGCACGAG CCTGTCGGCG CCGATTTTCA CGGGCTTCTG GGCGCGGCTC CTGTCGGCGA ACGGCACGGG TCTCGGCTTC CCGGCCGCGC GCTTCTACCA CTCGATTCCG ACCCACGCGT CACTCGTGCG CTACGACGTC ACGTCGGGCA ACAACGGCTA TTCGGGATAC GGCTACAAGG CATCGACCGG CTGGGACTAC CCGACCGGCT GGGGCAGCAT CAACATCTCG AACCTGAATC AGTTGATCCA GTCGGGCGGC TTCAATTGA
|
Protein sequence | MWPLALAAGI AHGATDWVDT HTKAFLNHAQ IETLARGANA ASLEVASGEA THVVVSLKLR NAEQLKAVAR NVNDPHSSQY RQYITSAQFL ANYAPTEAQV KQVVAYLRKN GFVDIHVAPN RMLVSARGTA GTVKQAFNTS LVHFEYAGRA GFANASTAQV PRALGDIVGS VLGLQNVARA RPLTKIGAIA KPLALASGTA TGHYPSEFPA LYNATGVPTA ANATVGIITI GGVSQALSDL QQFTSANSYP DVSTQTIQTN GSGGNYSDDQ EGQGEWDLDS QSIVGAAGGQ LGQLIFYMAD LDASGNTGLT QAFNQAVSDN AAKVINVSLG WCETDANADG TLSAEEQIFT QAVAQGQTFA VSSGDEGVYE CNNRGYPDGS NYTVSWPASS PHVLAIGGTT LYTTSSGAFS NETVWNEGLD GNGKLWATGG GVSTILPNPS WQSGSHRKLP DISFDAAQST GAYIYNYGQL QQIGGTSLSA PIFTGFWARL LSANGTGLGF PAARFYHSIP THASLVRYDV TSGNNGYSGY GYKASTGWDY PTGWGSINIS NLNQLIQSGG FN
|
| |
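The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline: it assumes the amino-acid assignments and alternative start codons of NCBI translation table 11 as written in the code, and it treats an alternative start codon in the first position (here GTG) as methionine. The names `translate` and `gc_content` are hypothetical helpers introduced for the example.

```python
from itertools import product

# Codon order follows the common T, C, A, G layout (third base varies fastest).
BASES = "TCAG"
# Amino-acid assignments assumed for translation table 11 ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons assumed for table 11; at position 0 they are read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence in this record.
print(translate("GTGTGGCCGC TCGCGCTCGC CGCCGGCATC"))  # prints MWPLALAAGI
```

The printed residues match the first group of the protein sequence row. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content row.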