Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_2473 |
Symbol | |
ID | 4901398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 2426984 |
End bp | 2428870 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640135700 |
Product | serine protease |
Protein accession | YP_001066732 |
Protein GI | 126454247 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTCAA GAAAATGGGC CAGGCCTCGC GCTTCGCAGG CGAAGCATGC GATTTACGCG GCGACGTTCT TTGCCGCGGC GGCATTGAGC GCGCACGCGG CGGCGGCATG GGTCGACACG CAAACCGGCG CGTATCCGGC GCTTGCGCAG CAGGCGCTTG CCGCGTCGCA AGCTTCGGCG GCGGCAACCG CGGCCGGAAA GGCGATCGAC ACGGCGCCCG GCGAGCCGGT GCGGGTCGTC GTCAGTCTCA ATCTCAACGA CGAAGCGAGG CTCGATCGCT TCCTGCGCGA TCTGCATACG CCCGGCAGTG CCGCCTACGG CCGGCATCTG ACGCCCGCCG AATTCGCCGC GCAGTATGCG CCGACGCCGC AGCAGGTCGC GCTCGTCGAA GCGCATCTGC GCCGGGCCGG ATTCCGCGAC ATCGAGGTGG CGCCGAACCG GCTGTTGATC TCGGCGACGG GCACCGCCGC CGCGGTCAAG ACGGCGTTCA ACACGCGGCT CAAGCGCTTT ACGCTCGAAG GCCGGCGCGT GTATGCGAAC CAGGACGCGG CGCAGGTGCC CGCCGAGCTC GGCCGAATCG TTGGCGCGGT GCTCGGGCTC GATAATGCGA CGCTCGCGCG CACGTACAAC CGCCAGGCGG CGGTGACGGG CGCGGTCGGC GGCGCGAAGG CGTCGCTGGC CGCGCGCGCG AGCGACGCGT CGGCCGCCGC GAGCGGCGCG CCCGTGCTGA CGGGACACGA TCCGCTCGAA TTCTCGCGAA TCTATCGCGC GGGCTCGACG CCGACGGCGT CGCAGACGAC GGTCGGCGTG ATCATGGCGG GCGATGCGGC GCCCGTGCTG CGGGATCTCG ATACGTTCGC GGCGAAGGCC GGGCTCGCGC GCGTCGCGGC GACGGTCACG CGTACCGGGC CGCCCGGCAG CGATTACAGC GACAATTCGG GCCTGAGCGA ATGGGATATG GACAGCCAGG CGATCGTCGG CGCGGCGGGC GGCGCGGTGA AGGGGCTTGT GCTCTACGCG GCGCCCTCGA TGCTGCTCTC CGACATCACC TCGGCGTACA ACCGCGCGGT CGTCGACAAC GTCGCGAAGG TGATCAACGT GTCGCTCGGC GTGTGCGAGG CGGACGCACG CGCGTCCGGC ACGCAGGCGG CGGACGACCG CATCTTCAAG AGCGCGGTCG CGCAGGGGCA GACGTTCGTC GTCGCGGCGG GCGACGCGGG CGCGTACGAA TGCAGCGTGA GCCGCGTGTC GGGCGGGCAG GGCGTGCCGG CGCGATCGAA CTATTCGGTG AGCGAGCCTG CGACGTCGCC GTACGTCGTC GCGGTCGGCG GCACGACGCT GTCGACCGAC AGGACGACGC TCGCCTACGC GGGCGAAGTC GCATGGAACG AGGGTTTGCA GCCGATCGGC GTGTACGACG CGTACGGCAG CTACGACGGC ACGAGGCGTC TGTGGGCGAC GGGCGGCGGC TACAGCCGAA GCGAAGCGGC ACCGGCGTGG CAGCGAAGCG TGCTCGGCGC GTCGGCGAAA GCGCGCGCGC TGCCCGATGT CGCATTCGAT GCGGACGGCC GCAGCGGCGC GCACGTCTAC GTGAACGGCC GGACCGAGCA ATGGGGCGGC ACGAGCCTCG CGGCGCCGAT CTTCACGGGC ATCTGGGCGC GCGTGCAATC CGACAACGGC AACCGGCTCG GTTTTCCGCT CGCGAGCCTC TATCGCTACG CGCCGGCCAA CGGCGCGTTC GCGCATGACG TGAAATCCGG CAACAACGGC TCGGGCGGCT ATGGCTACAA GGCCGGTGCG GGCTGGGATC CGGTGACGGG CTTCGGCAGC CTCGACATCG CGAACTTCGC CGCGTTCGTC AAGCAGACGG CCGATTTCGC GCGATAA
|
Protein sequence | MTSRKWARPR ASQAKHAIYA ATFFAAAALS AHAAAAWVDT QTGAYPALAQ QALAASQASA AATAAGKAID TAPGEPVRVV VSLNLNDEAR LDRFLRDLHT PGSAAYGRHL TPAEFAAQYA PTPQQVALVE AHLRRAGFRD IEVAPNRLLI SATGTAAAVK TAFNTRLKRF TLEGRRVYAN QDAAQVPAEL GRIVGAVLGL DNATLARTYN RQAAVTGAVG GAKASLAARA SDASAAASGA PVLTGHDPLE FSRIYRAGST PTASQTTVGV IMAGDAAPVL RDLDTFAAKA GLARVAATVT RTGPPGSDYS DNSGLSEWDM DSQAIVGAAG GAVKGLVLYA APSMLLSDIT SAYNRAVVDN VAKVINVSLG VCEADARASG TQAADDRIFK SAVAQGQTFV VAAGDAGAYE CSVSRVSGGQ GVPARSNYSV SEPATSPYVV AVGGTTLSTD RTTLAYAGEV AWNEGLQPIG VYDAYGSYDG TRRLWATGGG YSRSEAAPAW QRSVLGASAK ARALPDVAFD ADGRSGAHVY VNGRTEQWGG TSLAAPIFTG IWARVQSDNG NRLGFPLASL YRYAPANGAF AHDVKSGNNG SGGYGYKAGA GWDPVTGFGS LDIANFAAFV KQTADFAR
|
| |