Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A1567 |
Symbol | glcE |
ID | 4906138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 1519495 |
End bp | 1524087 |
Gene Length | 4593 bp |
Protein Length | 1530 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640144673 |
Product | polyketide synthase, type I |
Protein accession | YP_001075601 |
Protein GI | 126455887 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAGG TAGCACTGAT TGGATCCAGA CTGCGGCTGC CCGGCGCCGA TACCGTCGAT GCGTTCTGGC AGAACGTTCT GGCCGGCCGG GATTGCATCG ATTCGTTGTC CGACGCGCAG CTGCTCGCGG CGGGCGTCGA TCCCGCGTTC TCGGGCCTGC CCGATTACGT GAAGCGCGCG GGCGTGCTCG CCGACGTCGA CCGCTTCGAT TACCGCTTCT TCGGCTACAC GTTCCGGGAA GCGCAGGCGA TCGACCCGCA GCAGCGCGTG CTGCTCACGC TCGCGCATCA ACTGCTCGAG CAGGTCGGCG CGCCGGGCCG AGACGTCGGC GTCTACACGT CGGTCGGCTT CCCGCACTAT CTGCTGAACA ACCTGAGCAC CCAGCCGCCG GGCCGGGTCG CGCTGTCGGA CGTGGTGTTC GGCAACAGCG GCGATTGCGC GTCCACCCGC ATCGCGTACA AGCTCGATCT GCACGGCCCG GCGATGTCGA TCCAGTCGGG GTGCTCGTCC GCGCTGATGG CGCTGCACAA CGCGCGGATC GCGATTCTCA CCGGGCAGTG CCGGATGGCG CTCGTCGGCG CGGCCGCCAT CCGCACGCCG CAAACGGAAG GCTATCTGTA CCAGCGCGAC GGCGTGCTCG CGAAGGACGG CGTTTGCCGG CCGTTCGACG CGCGCGCGTC CGGCACCGTG TTCACGAACG GCGCGGTGGT GCTCGCGTTG AAGGCGCTGT CCGCCGCGCA GCGCGACGGG GACGACATCA TCGGCGTCAT TCGCGGCTCG GCGATCAACA ACGACGGCCA GCGCAAATCG GGCTACACGG CGCCGAGCGT CGCCGGGCAG AGCGAGGCGA TCCGGCGTGC GTACGAACGC AGCGGGATCG GGCCGGAGAC GATCGGCTAT GTCGAGACGC ACGGCACGGG CACGGCGCTC GGCGATCCGA TCGAGATTCA GGCGCTGAAG GACGCATACG GCGGCGATGC CGGCGTGGGC GCGCGCGCGC GCTGCGCGCT CGGCTCGACG AAGGCGAACA TCGGGCACAC GGACGTCGCG GCGGGGCTCG CGGGCGTGCT CAAGGCGGCG TTGTGCGTGA AGCACGCGAT CAAGCCGCCG CTCGCCGGGT TCGAGCGCGC GAACCCGAAT CTGCCGCTCG ACGGCTCGCC GTTCTATATT CCGAGCACGC CCGAGCCCTG GCCGAACGCC GAAGGCCAGC CGCGACGCGC GGCGGTCAGC GCGCTCGGCG TCGGCGGCAG CAACGCGCAC GTGATTCTCG AGCAGGCGCC GCAGCGGGAT CTGTCGCGGT GCGTCGACAC GGGGCCGCTG TTGATGTCCG CGCAAACGCC GCACGCGCTC GACGCGCTCG ATGCGCAATA CGACGATGCG TTCGCGCGGC AAGTGCTGCC GCGCGGTGAC GCGTGCTACA CATCGCAACT GTTCCGTCGG CATCTGCCGG AGAAGCGGGC GTTCGTGTTC GACGCCGCGG GCGGGCGGCG CCGCGTCGCG CCGTCGGGCG ACTGGCGTCA CGCGCATGCG GCGCTGCTGT TTCCCGGCCA GGGCACGCAG TATGCGGGCA TGGGCCGCGC GCTCTACGCG CGCGGCGGGC AGTTCCGCGC GACGTTCGAC GATTGCGCGG ACCGCTTCGT GCGCGAAGGC TGCGCGGACC CGCGCGAGCT GCTGAATGCG GACGATGCGC GCATTCGCGA CACGGCGGTT CTGCAGCCCT ACCTGTTCAC GCTCGAATAT GCGTTGGGCG CGACGCTGCT GGCGATGCGA TTGCCCGTCG CGGCGGCGAT CGGCCACAGC CTCGGCGAAT ACGTCGCGGC GACGCTCGCC GGCGTGTTCG ATCTGGCCGA CGCGATCGCG ATCGTCGCGA TCCGCGCCCG CATCATGAGC CGCGCGCCGC GCGGCGCGAT GCTCGCGGTG CTGGCCGAAG AAGCGCGGGT CACGGCGTTC CTCGACGAAG CGCTGTCGCT GTGCGCGGTC AACAGCGATA CGTCGTGCGT CGTCGGCGGC ACCGAGCAGG CGATCGACGC GCTCGCCGCG CGGCTCGCCG GTGCCGGGCT CGCATCGGTG AGACTGCAGA CGTCGCACGC GTTTCATTCG CACCTGATGG AGGCGAGCAG CCGCGAATTC GCGGCGGCGT TCGACGGCGT GCCGCTGCGC GCGCCGCGCT TTCCGATCGT CTCCAATCTC GACGGGCGCG CCGACTTGCC GGAGCGCTTC GCGACAGCCG GCTATTGGGT CGATCACCTG CGTCGTCCGG TTCGCTTCAA CGACGGCCTC GCGACGCTGG CTTCGCTCGT GCCGTTCGGT GCGTGGGTCG AGGCGGGGCC GGGCAAGTCG ATCGCCAACA CGCTCGCGCG GATGCCGCTC ACCGAGGTGG CCTGCCTGTC GACGATGCTG CCCGGCAGCG AGACGGCGCT GTTCGACACG CTGGCCGCGC AATGCTGGGC GAACGGCATC GAGATCGACT GGACGCCGCT CTACGGCGAC ACGCGCGGCA ACGTCGTGCC GCTCGTGCCG CATCCGCTCG ACGAGATCTC GTGCTGGATC GACGCGCCGG CGAAGCGGGC CGTCGCCGGC GAGGCGCCCG AATACGAGAA GCAGGGCGAC ATCGATCAAT GGTTCTACGA ATACGACTGG GCGCCCGTGC AGCCGGACGG CGAAGCCGCG CCGGGGCATG CGGCGGCGCG GGCGCCGATC GCCGATTCGG TTCTGCTGAT CGGCGACGCG AGCCGCGATG CCGCGCGGCT CGCCGAGCGC GTCGCGCAGG ATGCCGAATC GTTCTTCGTC GTGGACGGCG CGCATCCGCA GGCACTGGAC GGGGCGCTCG CGACGGTCGC GGCGCAGGCC GCGGCGAAGC GCGTGCGCAT TTCGCGCGTG TTGATCGTCG TGCCGTCGCG GCGCGGCGGC GCGGATACGC AGGCGGACGG GGCGCGCGAC GAATCCGGCT GCGCGCTCGA TGCGATGCTC GCGATGCAAC GCACGTTCGA TGCGCTGCGC AACGCGCTGC CGGGCAAGCT CGACGTCGCG CTGATCGCGT TGTCCGGCGC GCGCGGCGAC GGCGGCGAGC CGTCGGTCGC GAGCGCATGG ATCGACAGCT TCGCGACCGT CGTGCATCAG GAATTCTCGC GGGTGGTCTG CCGCGCGGTT CATGTCGATG CCGCGCCCGC CGCGGGCGAG GCCGACGATG CCGCCGCGCG CCGGTGCACG ATCGATACGC TCGCCCACGC GTGCCTGCGC CATCCCGGCC GGTTCCTCCG GATTCGGGAC GGCCGGTTGA TCGAGCGCGG GCTCAAGCGC GGCGGCGCGC GCGCGGCGAA CGCGCACCTC GCGCCCGATG CGCCCGACGC GACGCCGAAG TCGCCCCGAA CCGTGCTGGT GGTCGGCGGC GCGGGGAACG TCGGGATGAT CTACGCGACG TTCTTCGCGA CGGTCGTCGG CGCGGACGTC GCGATCGTCA GCCGCCGCGC GCGCGCGTTC GCCGACTCGC TGCGCGATCC GTCGGCTGCG ACGGACCGCG CGCTGCGCCG CCGCAAGGCG CTGTACGAGC GCATGGTCGC GGCCGGCGGG CGGCTCATGT TCGTCGATGC CGACGCCACC GATCCGCGGC AGCTCGAGCG CGCCGCGCGA TCGGTCGCCG ACGCGTTCGG CTCGCTCGAT CTCGTCGTCC ATGCGGCGGG CGCGCCCGCC GACATGCACT ACCGGACGTT CGACGATACC GACGTCGCCT ACCTGGATGC GCTCGTCTCG CCGAAGCTCG ACGTCTGCGC GAACCTGCAC GCGCTGACTC GCTCGCTGCG CATTGCGCGC GTGATGATCG TGTCGTCGAT TTCGGCCACG CTCGGCGGGA TCGGGCTGTA CGGCTACGCG GCGTCCCATT CGTTGCTGAA CGCGTATGCG CAGTCGGCGA GCAGCGCGGC GTGCCGCTGG ACCGTGATCG ACTGGGACGC GTGGGAGTTC TTCAAGGACA CCCGCGACGA GGCGAACGAC GACGTGGGCA TCGATCACTA CGCGATCAGC GAGCAGGAGG GGTTGTCGGT GCTCGAGCGT CTTCACGCGC TCGGCTGGCC CGCGCACATC GTCGTCGCGA GCGGCGACCT GATCCAGCGC TATCGCAACT GGGTGCTGTC CGAGCGCGAC GACACGCCCG CCGCGGCGCA GATCGTCGCG CCGCGCCCGC TGCTGAAGGA CGAGCTGGTC GCGCCGCGCA CCGGCACCGA GGCGGCGCTC GCGAAGCTGT GGAGCGAGTG CATCGGCGTC GAGCCGGTCG GCGTGCGCGA CAACTTCTTC GAGCTGGGCG GCCATTCGCT GATCGCGCTG AAGCTCGTCG ACCGGATCAA TCAGACGCTC GACTGGGATC TGTCGGCGGT CGACATGTTC AAGTTCCCGA CGATCGAACG CCTGGCCGAC GCGAACGCGG CGCACGCGCC CGGCGACGCG GACGATGCCG GCGCACCGCG CGGCGCGGGT CACGACGCGC GCGACGACGC CCCGCGTCCG CCGGCGCAGC CCGATGCGGG CGCGCATGCG GATCGGCGCC GCCGCCACTA CTACCAAAGC AGAAAACACA GCATGGAGTC GAAAAGTGAA TAA
|
Protein sequence | MEQVALIGSR LRLPGADTVD AFWQNVLAGR DCIDSLSDAQ LLAAGVDPAF SGLPDYVKRA GVLADVDRFD YRFFGYTFRE AQAIDPQQRV LLTLAHQLLE QVGAPGRDVG VYTSVGFPHY LLNNLSTQPP GRVALSDVVF GNSGDCASTR IAYKLDLHGP AMSIQSGCSS ALMALHNARI AILTGQCRMA LVGAAAIRTP QTEGYLYQRD GVLAKDGVCR PFDARASGTV FTNGAVVLAL KALSAAQRDG DDIIGVIRGS AINNDGQRKS GYTAPSVAGQ SEAIRRAYER SGIGPETIGY VETHGTGTAL GDPIEIQALK DAYGGDAGVG ARARCALGST KANIGHTDVA AGLAGVLKAA LCVKHAIKPP LAGFERANPN LPLDGSPFYI PSTPEPWPNA EGQPRRAAVS ALGVGGSNAH VILEQAPQRD LSRCVDTGPL LMSAQTPHAL DALDAQYDDA FARQVLPRGD ACYTSQLFRR HLPEKRAFVF DAAGGRRRVA PSGDWRHAHA ALLFPGQGTQ YAGMGRALYA RGGQFRATFD DCADRFVREG CADPRELLNA DDARIRDTAV LQPYLFTLEY ALGATLLAMR LPVAAAIGHS LGEYVAATLA GVFDLADAIA IVAIRARIMS RAPRGAMLAV LAEEARVTAF LDEALSLCAV NSDTSCVVGG TEQAIDALAA RLAGAGLASV RLQTSHAFHS HLMEASSREF AAAFDGVPLR APRFPIVSNL DGRADLPERF ATAGYWVDHL RRPVRFNDGL ATLASLVPFG AWVEAGPGKS IANTLARMPL TEVACLSTML PGSETALFDT LAAQCWANGI EIDWTPLYGD TRGNVVPLVP HPLDEISCWI DAPAKRAVAG EAPEYEKQGD IDQWFYEYDW APVQPDGEAA PGHAAARAPI ADSVLLIGDA SRDAARLAER VAQDAESFFV VDGAHPQALD GALATVAAQA AAKRVRISRV LIVVPSRRGG ADTQADGARD ESGCALDAML AMQRTFDALR NALPGKLDVA LIALSGARGD GGEPSVASAW IDSFATVVHQ EFSRVVCRAV HVDAAPAAGE ADDAAARRCT IDTLAHACLR HPGRFLRIRD GRLIERGLKR GGARAANAHL APDAPDATPK SPRTVLVVGG AGNVGMIYAT FFATVVGADV AIVSRRARAF ADSLRDPSAA TDRALRRRKA LYERMVAAGG RLMFVDADAT DPRQLERAAR SVADAFGSLD LVVHAAGAPA DMHYRTFDDT DVAYLDALVS PKLDVCANLH ALTRSLRIAR VMIVSSISAT LGGIGLYGYA ASHSLLNAYA QSASSAACRW TVIDWDAWEF FKDTRDEAND DVGIDHYAIS EQEGLSVLER LHALGWPAHI VVASGDLIQR YRNWVLSERD DTPAAAQIVA PRPLLKDELV APRTGTEAAL AKLWSECIGV EPVGVRDNFF ELGGHSLIAL KLVDRINQTL DWDLSAVDMF KFPTIERLAD ANAAHAPGDA DDAGAPRGAG HDARDDAPRP PAQPDAGAHA DRRRRHYYQS RKHSMESKSE
|
| |