Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_3590 |
Symbol | |
ID | 4899255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 3497062 |
End bp | 3498891 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640136816 |
Product | PDZ domain-containing protein |
Protein accession | YP_001067821 |
Protein GI | 126453535 |
COG category | [R] General function prediction only |
COG ID | [COG3975] Predicted protease with the C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCGA TTTGCTACAC GATCGTTCCG AAAGATCCCG CCGCGCACCT GTTCGAGGTG ACGCTCACGC TCGCCGATCC GGACCCGGCG GGCCAGCGCT TCGCGCTGCC CGTATGGATT CCGGGCAGCT ATATGGTGCG CGAGTTCGCG CGCAATATCG TGACGCTGCG TGCGTTCAAC GAAGCGGGCC GCAAGCTGCG GATCGGCAAG CTCGACAAGC AGACCTGGCA GGCCGCGCCG GCGCCCGGGC CGATCACGCT GCGCTACGAC GTCTACGCGT GGGACCTGTC GGTGCGCGCC GCGCACCTGG ACGACACGGG CGGCTTCTTC AACGGCACGA GCGTGTTTCT CGCGCCGCTC GGCCGCGAGG ATGCGCCGTG CGAGGTAATG ATCGAGCGGC CGGCGGGCGA CGCGTACCGG CGCTGGCGCG TCGCGACGGC GCTGCCGGAG GCGCGCGGCA CGAAACGCTA CGGCTTCGGC GCGTACCGCG CGGAGAATTA CGACGAGCTG ATCGATCATC CGGTCACGCT CGGCGAATTC GCGCTCGCGT CGTTCGACGC GCACGGCGTG CCGCACGACA TCGCGATCGC GGGCCGCGTG ACCGGGCTCG ATCTCGAGCG GCTCGCGGCC GACCTGAAGC GGGTGTGCGA GGCGCAGATC GCGCTGTTCG AGCCGAAGAC CAGGCGCGCG CCGATGTCGC GCTACGTGTT CATGACGCAG GCGGTCAGCG ACGGCTACGG CGGGCTCGAG CATCGCGCGT CGACGGCGCT CGTCTGCAAT CGCACCGATC TGCCGGTGAA GGGGCGCCCT GAGAAGACGG ACGGCTATCG GACTTACCTC GGCCTGTGCA GCCACGAGTA CTTCCATACA TGGAACGTGA AGCGGATCAA GCCGGCCGCG TTCGCGCCGT ACGATCTGTC GCAGGAGAAT TACACGTCGC TGCTGTGGCT CTTCGAGGGC TTCACGTCGT ACTACGACGA CCTGATGCTC GCGCGCAGCG GCCTCATCTC GCAGGACGAC TATTTCGCGC TCGTCGGCCG CACGATCGCC GGCGTGCAGC GCGGCGCCGG CCGGCTCAGG CAGAGCGTCG CCGAAAGCTC GTTCGATGCG TGGATCAAGT ATTACCGGCA GGACGAAAAC GCGACGAACG CGATCGTCAG CTATTACACG AAGGGCTCGC TCGTCGCGCT CGCGTTCGAT CTGGCGATTC GCGCGCGCAG CCGCCACCGC AAATCGCTCG ACGACGTGAT GCGGCTTTTG TGGCAGCGCT TCGGGCGCGA CTTCTACCAC GGCAAGCCGC AAGGCGTCGG CGAGGACGAC GTGAAGGCGC TGATCGCCGA AGCGACGGGT GTCGATCTCG GCCGTCTTTT CGACGAAGCG GTGTCCGGCA CGCGCGATCT GCCGCTCGCC GAACTCTTCG AGCCGTTCGG CGTGACGCTC GCGCCGGACG GGGGCGCGGG CGGCGCGGCC GATGCGCCCG CGAAGCCGAC GCTCGGCGCG CGCACGCGCG GCGGCGCGGA ATGCACGCTC GCGGCCGTCT ACGAAGGCGG CGCCGCGCAT CGGGCCGGGC TGTCCGCGGG CGACGCGCTC GTCGCGATCG ACGGGCTGCG CGTGACGGGC TCGAACCTCG ACGCGCTGCT CGCGCGCTAC CGCGTCGGCG ACAAGGTCGA GATCCACGCA TTCCGGCGCG ACGAACTGCG CGTGGTGCAG CTCAAGCTCG ACGGCCCGGA CATCGCGCGC TACAAGCTGG CCGCTCAGCC GAAGCCCGCG GCCGCGCGCG CCCGTCGCGA CGCGTGGCTC GGGCTGCCGG CGGCGCGCGG CGGTCGATAA
|
Protein sequence | MKPICYTIVP KDPAAHLFEV TLTLADPDPA GQRFALPVWI PGSYMVREFA RNIVTLRAFN EAGRKLRIGK LDKQTWQAAP APGPITLRYD VYAWDLSVRA AHLDDTGGFF NGTSVFLAPL GREDAPCEVM IERPAGDAYR RWRVATALPE ARGTKRYGFG AYRAENYDEL IDHPVTLGEF ALASFDAHGV PHDIAIAGRV TGLDLERLAA DLKRVCEAQI ALFEPKTRRA PMSRYVFMTQ AVSDGYGGLE HRASTALVCN RTDLPVKGRP EKTDGYRTYL GLCSHEYFHT WNVKRIKPAA FAPYDLSQEN YTSLLWLFEG FTSYYDDLML ARSGLISQDD YFALVGRTIA GVQRGAGRLR QSVAESSFDA WIKYYRQDEN ATNAIVSYYT KGSLVALAFD LAIRARSRHR KSLDDVMRLL WQRFGRDFYH GKPQGVGEDD VKALIAEATG VDLGRLFDEA VSGTRDLPLA ELFEPFGVTL APDGGAGGAA DAPAKPTLGA RTRGGAECTL AAVYEGGAAH RAGLSAGDAL VAIDGLRVTG SNLDALLARY RVGDKVEIHA FRRDELRVVQ LKLDGPDIAR YKLAAQPKPA AARARRDAWL GLPAARGGR
|
| |