Gene Information |
Locus tag | BURPS1710b_3678 |
Symbol | |
ID | 3688404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | + |
Start bp | 4023457 |
End bp | 4024623 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637730133 |
Product | DegQ protease |
Protein accession | YP_335042 |
Protein GI | 76809512 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family; [TIGR02038] periplasmic serine peptidase DegS |
|
|
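The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (which matches the reported 1167 bp) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(4023457, 4024623)
print(length)                      # 1167
print(protein_length_aa(length))   # 388
```

Both results agree with the Gene Length (1167 bp) and Protein Length (388 aa) fields.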
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.320686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTGCTCG CACTGATGTT CATCGTGGCG ACGCTCAAGC CGCAATGGCT CCAACGGCAG GGCCAGCTCG GCAAGCAGCT CGCGGCGCCG ATCGTCGCGC TGCGGGAAGT CGCGCCCGGC GTCGGCGGCG CGCCCGCTCA ATCCTCGTAT GCGGACGCCG CGCAAAAGGC GATGCCCGCG GTCGTCAACG TGTTCTCCAG CAAGGACGGT TCGCTGCCGC CCGATCCGCG CGCGAAGGAT CCGCTGTTTC GCTATTTCTT CGGCGACCGC AACGCGCGCC GGCAGCAGGA GGAACCCGCG GCCAATCTGG GGTCGGGCGT CATCGTAAGC TCGGAGGGTT ACATTCTAAC GAACCAGCAC GTCGTCGACG GCGCGGACCA GATCGAAGTC GCGCTCGCCG ACGGCCGCAC GGCCACCGCG AAGGTGATCG GCAGCGATCC GGAAACCGAC CTCGCGGTGC TCAAGATCAA CATGACGAAC CTACCGACGA TCACGCTCGG CCGCTCCGAC CAGTCGCGCG TGGGCGACGT CGTGCTCGCG ATCGGCAACC CGTTCGGGGT CGGCCAGACG GTCACGATGG GGATCATCAG CGCGCTCGGG CGCAACCACC TCGGCATCAA CACGTTCGAG AACTTCATCC AGACCGACGC GCCGATCAAC CCGGGCAATT CGGGCGGCGC GCTCGTCGAC GTAAACGGCA ACCTGCTCGG CATCAATACG GCGATCTACT CGCGCTCGGG CGGCTCGCTC GGCATCGGCT TCGCGATCCC CGTGTCGACC GCGCGCAACG TGCTCGAGAG CATCATCACG ACGGGCACCG TCACGCGCGG CTGGATCGGC GTCGAGCCGC AGGACGTGAC GCCGGAGATC GCCGAATCGT TCAGCCTTGC GCAAAAATCG GGCGCGATCG TTGCGGGCGT GCTGCAAGGC GGCCCGGCCG ACAAGGCGGG CATCAAGCCG GGCGATATTC TGATGTCGAT CGACGGCGAG GACATCACCG ATACGACGAA GCTGCTGAAC GTCGTCGCGC AGATCAAGCC CGGCACGCCG GCGAAGGTTC ACGTGGTGCG CAAGGGCAAG GAGCTCGACG TCACCGTCGT GATCGGCAAG CGGCCGCCGC CGCCGAAGCA GGCGCTCGAC GACCAGAACA GCGACGAGGA GGAGTGA
|
Protein sequence | MLLALMFIVA TLKPQWLQRQ GQLGKQLAAP IVALREVAPG VGGAPAQSSY ADAAQKAMPA VVNVFSSKDG SLPPDPRAKD PLFRYFFGDR NARRQQEEPA ANLGSGVIVS SEGYILTNQH VVDGADQIEV ALADGRTATA KVIGSDPETD LAVLKINMTN LPTITLGRSD QSRVGDVVLA IGNPFGVGQT VTMGIISALG RNHLGINTFE NFIQTDAPIN PGNSGGALVD VNGNLLGINT AIYSRSGGSL GIGFAIPVST ARNVLESIIT TGTVTRGWIG VEPQDVTPEI AESFSLAQKS GAIVAGVLQG GPADKAGIKP GDILMSIDGE DITDTTKLLN VVAQIKPGTP AKVHVVRKGK ELDVTVVIGK RPPPPKQALD DQNSDEEE
|
| |
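The gene and protein sequences are related through translation table 11 (bacterial, archaeal and plant plastid code), in which an initiating GTG is read as methionine. The sketch below is a minimal, dependency-free illustration: it builds the codon table from the standard amino acid string (table 11 shares its codon assignments with the standard code and differs in the permitted start codons), strips the spacing used in the record, and translates a CDS. The helper names `clean`, `gc_percent` and `translate_cds` are hypothetical, and only the first 30 bases of the gene are used as input to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11; all are read as Met at position 1.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS_TABLE_11:
            aa = "M"  # an alternative start codon still initiates with Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "GTGCTGCTCG CACTGATGTT CATCGTGGCG"
print(translate_cds(prefix))  # MLLALMFIVA
```

The output matches the first ten residues of the protein sequence in the record. Note that the leading GTG becomes M, while the in-frame GTG later in the same prefix is read as V. Passing the full gene sequence to `gc_percent` is one way to check the reported GC content.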