Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1815 |
Symbol | |
ID | 4885895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1771829 |
End bp | 1773325 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640131753 |
Product | OmpW family outer membrane protein |
Protein accession | YP_001062810 |
Protein GI | 126445456 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3047] Outer membrane protein W |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.198724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGACA AGCATCGTCG TCGACGTGCG CGCCGCGACT GGCGCGCTCC GGTGTCCTGC TGGCTCGGCG CGACGCTGCT CGCGTGCGCG TGGTCCGCGC ATGCGCAGGA CAGCGGCGCG GCCAGATGGC GCGACGGCGC GGACGGCATC GGCTTCTTTC CGGGCGGCGA CGCGCCCGGC TTCGACGCGC GCGCGTGGGG GCCGGTGCCG GGCGACGCGC GTCGCGCCGC GGCGAGCGAC GCGCGCAACG GGGTCGCCGC GTCGGCGGGA TCGGAGGCCA CGGCGGCGGC GCCCGCCGCG GATGCGAACG CGGCGCCGGC GCGCAAGCTC ACCGAAGAAA GGATCACGCT CGGCGAGCGC GTCGCGCCGG TGGCCGACGC CGCGCGGCGC GTGCGCGCCG ACGGCGACGA CGGCATCGGC TTCGCCGACG CGCCCGGCGG GCCGCCGGCG GGCGGCGCGA CGCCTTCGGC CACGTGCGAC GACGGCGCAT GCGTGCCCGA CGGCGGCGAC GCAGGCCGCG CGCCGCGCCG CCCGCCCGCC GGCGCGACGC CGCGCTTCAT CGCCGGCGTG CGCTACGACC GGATGCCGTA CGAACTGCAT CCGATCGACC CGGAGCGGCT GCCCGATTTG CCGGAGGCGC AGGGCCCGAC GCTGCTCGAG CAGTTGCAGG GCGACGACAG CAACATGATC GGCGTCGGCT GGCACTACGT GCTGTCGACG GGGCGCTCGA CGCCCGTGAC GACGTCGACG GCGGCGCTCG GCATCGGCAG CTTCGCGAAT CCGGGCTCCG CGGTGTCGAT CAGCAACACG AACACGCCGG CGTTCACGTT CACGCATTTC TTCGGCGAGC ACGTCGCGGC CGAGATCGTC GCGGGGATTC CGCCCGAGCT GACGATGCGC GGGCACGGCA GCATCGGGCT GCCGTTCGAC AAGATCTTCC CGGGCGTGCA GGGGCGGCTG CCGCTCGTCG ATCTCGGCAA CACGCAGAGC AACCCGCTCG GCACGACGCG CGCGTGGCTC GGCTCGGCCG TGTTCAAGTA TTACCTCGGC AAGCGCGAGG ACCGGCTGCG GCCGTACGTC GGCCTCGGCC TCAGCTACAC GCGCTTCACG AACACGAACC TGAACCCGGT GTTCGCGCAC AAGCTCGCAT CGCTCGGCGG CCTGCTGTCG GCGGGCATCT CGCTCGGCGA TCTGCAATCG CTGCTCACCG ATTCGGGCGC GCTCGACCGG CTGCTGCAGG CAGGCGCGAA CCTGATCCTG CCGAACGGCG TGCGCGCGAC GGCGGACGTG AAAAGCGCGT GGACGCCCGT GTTCGTCGTC GGCGCGAACT ATCAGCTCAC GCGTCAGCTG TCGCTGTCGA CCGCGCTGTC GTACATCCCG CTGAAGGCGG CGATCACGGT CAACATCAAC GATACGAAGG GCATCCTCGC ATCGAACACG ACGACGCTTT CCGCGAACGT GCTGCTCTGC ACGATGCTGC TCAATTTCCG GTTCTGA
|
Protein sequence | MGDKHRRRRA RRDWRAPVSC WLGATLLACA WSAHAQDSGA ARWRDGADGI GFFPGGDAPG FDARAWGPVP GDARRAAASD ARNGVAASAG SEATAAAPAA DANAAPARKL TEERITLGER VAPVADAARR VRADGDDGIG FADAPGGPPA GGATPSATCD DGACVPDGGD AGRAPRRPPA GATPRFIAGV RYDRMPYELH PIDPERLPDL PEAQGPTLLE QLQGDDSNMI GVGWHYVLST GRSTPVTTST AALGIGSFAN PGSAVSISNT NTPAFTFTHF FGEHVAAEIV AGIPPELTMR GHGSIGLPFD KIFPGVQGRL PLVDLGNTQS NPLGTTRAWL GSAVFKYYLG KREDRLRPYV GLGLSYTRFT NTNLNPVFAH KLASLGGLLS AGISLGDLQS LLTDSGALDR LLQAGANLIL PNGVRATADV KSAWTPVFVV GANYQLTRQL SLSTALSYIP LKAAITVNIN DTKGILASNT TTLSANVLLC TMLLNFRF
|
| |