Gene Information |
Locus tag | Bphy_0471 |
Symbol | |
ID | 6241972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phymatum STM815 |
Kingdom | Bacteria |
Replicon accession | NC_010622 |
Strand | + |
Start bp | 534937 |
End bp | 536439 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642592235 |
Product | protease Do |
Protein accession | YP_001856709 |
Protein GI | 186475239 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
| 

|
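The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(534937, 536439)
    print(length)                  # prints 1503
    print(protein_length(length))  # prints 500
```

Under these assumptions the values agree with the Gene Length (1503 bp) and Protein Length (500 aa) fields of this record.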
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0438945 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCAA ATATGCTGAC CCGTAGCGCC GTCGCGGCGG CCGTCGTCAT CGCTTTGTCG GCGGGTTATG TGGCCGGCCA CCGCAATGTC CCGGCGCCCG CGGTGATCAC CCCGGCCGTG GCCGCGATGA TGCCCGCCGA AGCCGCGGCC AAAACGGGTA TCCCCGATTT CTCGGGTCTC GTCGAAACTT ATGGTCCCGC TGTCGTCAAC ATCAGCGCGA AGCACGTCGT CAAGCAGACG GCGATGCGCG GCGGCAACAA CAGCGGCCAG TTGCCCATCG ACCCGAGCGA TCCGTTTTAT CAGTTCTACC GCCACTTTTT TGGCCAGATG CCGGGTGGCC CCAATGGCGG CGGTGACGAC GGCGGCGACC GTCCGAGCGC GAGCCTCGGC TCCGGCTTCA TTATCAGCAG CGATGGTTAT GTGCTGACCA ATGCGCACGT CGTCGACGGT GCGAACGTCG TCACCGTGAA ACTCACCGAC AAGCGCGAGT ACAGGGCGAA AGTGGTCGGC GCCGACAAGC AGTCGGATGT CGCAGTCCTG AAGATCGACG CGAAGGATCT GCCGACCGTG AAGATCGGCG ATCCGCGCCA GAGCAAGGTC GGCCAGTGGG TCGTCGCGAT CGGCTCGCCC TACGGCTTCG ACAACACGGT GACGTCGGGC ATCATCAGCG CGAAGTCGCG CTCGCTGCCG GACGAGAACT ACACGCCGTT CATCCAGACG GACGTGCCCG TGAATCCGGG TAACTCGGGC GGCCCGCTGT TCAATCTGCA AGGCGAAGTG ATCGGCATCA ATTCGATGAT CTATTCGCAG ACGGGCGGCT TCCAGGGCCT TTCGTTCGCT ATCCCGATCA ATGAGGCGAT CAAGGTCAAG GACGATCTCG TAAAGACCGG CCACGTGAGC CGCGGCCGTC TCGGCGTCGC GGTGCAGAGT GTGAACCAGA CGCTCGCGGA TTCATTCGGC ATGAAGAAGC CGCAAGGCGC GCTGGTCAGC TCCGTCGATC CGGGCGGCCC GGCGGCGAAA GCAGGCCTGC AGCCCGGCGA CGTGATCCTG TCGGTGGATG GCGTCGACGT GGTGGATTCG GCGGCGCTGC CTTCGCAGAT CGCGGGCATC AGGCCGGGCA AGCAAGTCGA CGTGCAGGTG TGGCGCGACA AGTCGACGAA AGATATGAAA GTGACGATCG GCTCGTTGTC CGACGTGAAA GCGGCCGCGA ACGACGACGG TGGTCCCGCG CAGATGCAAG GCCGCCTCGG CGTCGCCGTG CGTCCGCTGA CGCCGCAGGA GAAGAGCGGC GCGTCTGTGT CGCACGGTCT GTTGGTGCAG GACGCGAGCG GCGCGGCGGC GAGCGCGGGT ATCCAGCCCG GGGACGTGAT TCTGGCCGTC AACGGACGCG CGGTGTCGAG CGTCGATCAG CTGAAGCAGG CGGTATCTGG AGCAGGTAAC AGCATCGCGC TCCTGATCCA GCGCGACAAC TCGCAGATCT TCGTGCCCGT CGATCTGGGC TGA
|
Protein sequence | MKANMLTRSA VAAAVVIALS AGYVAGHRNV PAPAVITPAV AAMMPAEAAA KTGIPDFSGL VETYGPAVVN ISAKHVVKQT AMRGGNNSGQ LPIDPSDPFY QFYRHFFGQM PGGPNGGGDD GGDRPSASLG SGFIISSDGY VLTNAHVVDG ANVVTVKLTD KREYRAKVVG ADKQSDVAVL KIDAKDLPTV KIGDPRQSKV GQWVVAIGSP YGFDNTVTSG IISAKSRSLP DENYTPFIQT DVPVNPGNSG GPLFNLQGEV IGINSMIYSQ TGGFQGLSFA IPINEAIKVK DDLVKTGHVS RGRLGVAVQS VNQTLADSFG MKKPQGALVS SVDPGGPAAK AGLQPGDVIL SVDGVDVVDS AALPSQIAGI RPGKQVDVQV WRDKSTKDMK VTIGSLSDVK AAANDDGGPA QMQGRLGVAV RPLTPQEKSG ASVSHGLLVQ DASGAAASAG IQPGDVILAV NGRAVSSVDQ LKQAVSGAGN SIALLIQRDN SQIFVPVDLG
|
| |
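The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the pipeline used to build this record: it assumes the sequence is pasted as plain text with the space-separated blocks of 10 shown above, and it uses the standard codon assignments, which translation table 11 shares for amino acids. It does not handle alternative start codons (such as GTG or TTG), which table 11 permits; this gene starts with ATG, so that does not matter here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    first_blocks = "ATGAAAGCAA ATATGCTGAC CCGTAGCGCC"
    print(translate(first_blocks))  # prints MKANMLTRSA
```

Passing the full Gene sequence field to `translate` and `gc_fraction` allows the result to be compared against the Protein sequence and GC content fields of this record.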