Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1648 |
Symbol | glcE |
ID | 4885903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1584447 |
End bp | 1589033 |
Gene Length | 4587 bp |
Protein Length | 1528 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640131587 |
Product | polyketide synthase, type I |
Protein accession | YP_001062644 |
Protein GI | 126445068 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.5134 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAGG TAGCACTGAT TGGATCCAGA CTGCGGCTGC CCGGCGCCGA TACCGTCGAT GCGTTCTGGC AGAACGTTCT GGCCGGCCGG GATTGCATCG ATTCGTTGTC CGACGCGCAG CTGCTCGCGG CGGGCGTCGA TCCCGCGTTC TCGGGCCTGC CCGATTACGT GAAGCGCGCG GGCGTGCTCG CCGACGTCGA CCGCTTCGAT TACCGCTTCT TCGGCTACAC GTTCCGGGAA GCGCAGGCGA TCGACCCGCA GCAGCGCGTG CTGCTCACGC TCGCGCATCA ACTGCTCGAG CAGGTCGGCG CGCCGGGCCG AGACGTCGGC GTCTACACGT CGGTCGGTTT CCCGCACTAT CTGCTGAACA ACCTGAGCAC CCAGCCGCCG GGCCGGGTCG CGCTGTCGGA CGTGGTCTTC GGCAACAGCG GCGATTGCGC GTCCACCCGC ATCGCGTACA AGCTCGATCT GCACGGCCCG GCGATGTCGA TCCAGTCGGG GTGCTCGTCC GCGCTGATGG CGCTGCACAA CGCGCGGATC GCGATTCTCA CCGGGCAGTG CCGGATGGCG CTCGTCGGCG CGGCCGCCAT CCGCACGCCG CAAACGGAAG GCTATCTGTA CCAGCGCGAC GGCGTGCTCG CGAAGGACGG CGTTTGCCGG CCGTTCGACG CGCGCGCGTC CGGCACCGTG TTCACGAACG GCGCGGTGGT GCTCGCGTTG AAGGCGCTGT CCGCCGCGCA GCGCGACGGG GACGACATCA TCGGCGTCAT TCGCGGCTCG GCGATCAACA ACGACGGCCA GCGCAAATCG GGCTACACGG CGCCGAGCGT CGCCGGGCAG AGCGAGGCGA TCCGGCGTGC GTACGAACGC AGCGGGATCG GGCCGGAGAC GATCGGCTAT GTCGAGACGC ACGGCACGGG CACGGCGCTC GGCGATCCGA TCGAGATTCA GGCGCTGAAA GACGCATACG GCGGCGATGC CGGCGCGCGC GCGCGCTGCG CGCTCGGCTC GACGAAGGCG AACATCGGGC ACACGGACGT CGCGGCGGGG CTCGCGGGCG TGCTCAAGGC GGCGTTGTGC GTGAAGCACG CGATCAAGCC GCCGCTCGCC GGGTTCGAGC GCGCGAACCC GAATCTGCCG CTCGACGGCT CGCCGTTCTA TATTCCGAGC ACGCCCGAGC CCTGGCCGAA CGCCGAAGGC CAGCCGCGAC GCGCGGCGGT CAGCGCGCTC GGCGTCGGCG GCAGCAACGC GCACGTGATT CTCGAGCAGG CGCCGCAGCG GGATCTGTCG CGGTGCGTCG ACACGGGGCC GCTGTTGATG TCCGCGCAAA CGCCGCACGC GCTCGACGCG CTCGATGCGC AATACGACGA TGCGTTCGCG CGGCAAGCGC TGCCGCGCGG TGACGCGTGC TACACATCGC AACTGTTCCG TCGGCATCTG CCGGAGAAGC GGGCGTTCGT GTTCGACGCC GCGGGCGGGC GGCGCCGCGT CGCGCCGTCG GGCGACTGGC GTCACGCGCA TGCGGCGCTG CTGTTTCCCG GCCAGGGCAC GCAGTATGCG GGCATGGGCC GCGCGCTGTA CGCGCGCGGC GGGCAGTTCC GCGCGACGTT CGACGATTGC GCGGACCGCT TCGTGCGCGA AGGCTGCGCG GACCCGCGCG AGCTGCTGAA TGCGGACGAT GCGCGCATTC GCGACACGGC GGTTCTGCAG CCCTACCTGT TCACGCTCGA ATATGCGTTG GGCGCGACGC TGCTGGCGAT GCGATTGCCC GTCGCGGCGG CGATCGGCCA CAGCCTCGGC GAATACGTCG CGGCGACGCT CGCCGGCGTG TTCGATCTGG CCGACGCGAT CGCGATCGTC GCGATCCGCG CCCGCATCAT GAGCCGCGCG CCGCGCGGCG CGATGCTCGC GGTGCTGGCC GAAGAAGCGC GGGTCACGGC GTTCCTCGAC GAAGCGCTGT CGCTGTGCGC GGTCAACAGC GATACGTCGT GCGTCGTCGG CGGCACCGAG CAGGCGATCG ACGCGCTCGC CGCGCGGCTC GCCGGCGCCG GGCTCGCATC GGTGAGACTG CAGACGTCGC ACGCGTTTCA TTCGCACCTG ATGGAGGCGA GCAGCCGCGA ATTCGCGGCG GCGTTCGACG GCGTGCCGCT GCGCGCGCCG CGCTTTCCGA TCGTCTCCAA TCTCGACGGG CGGGCCGACT TGCCGGAGCG CTTCGCGACA GCCGGCTATT GGGTCGATCA CCTGCGTCGT CCGGTTCGCT TCAACGACGG CCTCGCGACG CTGGCTTCGC TCGTGCCGTT CGGTGCGTGG GTCGAGGCGG GGCCCGGCAA GTCGATCGCC AACACGCTCG CGCGGATGCC GCTCACCGAG GTGGCCTGCC TGTCGACGAT GCTGCCCGGC AGCGAGACGG CGCTGTTCGA CACGCTGGCC GCGCAATGCT GGGCGAACGG CATCGAGATC GACTGGACGC CGCTCTACGG CGACACGCGC GGCAACGTCG TGCCGCTCGT GCCGCATCCG CTCGACGAGA TCTCGTGCTG GATCGACGCG CCGGCGAAGC GGGCCGTCGC CGACGAGGCG CCCGAATACG AGAAGCAGGG CGACATCGAT CAATGGTTCT ACGAATACGA CTGGACGCCC GTGCAGCCGG ACGGCGAAGC CGCGCCGGGG CATGCGGCGG CGCGGGCGCC GATCGCCGAT TCGGTTCTGC TGATCGGCGA CGCGAGCCGC GATGCCGCGC GGCTCGCCGA GCGCGTCGCG CAGGATGCCG AATCGTTCTT CGTCGTGGAC GGCGCGCATC CGCAGGCACT GGACGGGGCG CTCGCGACGG TCGCGGCGCA GGCCGCGGCG AAGCGCGCGC GCATTTCGCG CGTGCTGATC GTCGTGCCGT CGCGGTGCGG CGGCGCGGAT ACGCAGGCGG ACGGGGCGCG CGACGAATCC GGTCGCGCGC TCGATGCGAT GCTCGCGATG CAACGCACGT TCGATGCGCT GCGCAACGCG CTGCCGGGCA AGCTCGACGT CGCGCTGATC GCGTTGTCCG GCGCGCGCGC CGACGGCGGC GAGCCGTCGG TCGCGAGCGC ATGGATCGAC AGCTTCGCGA CCGTCGTGCA TCAGGAATTC TCGCGGGTGG TCTGCCGCGC GGTTCATGTC GATGCCGCGC CCGCCGCGGG CGAGGCCGAC GATGCCGCCG CGCGCCGGCG CACGATCGAT ACGCTCGCCC ACGCGTGTCT GCGCCATCCC GGCCGGTTCC TCCGGATTCG GGACGGCCGG TTGATCGAGC GCGGGCTCAA GCGCGGCGGC GCGCGCGCGG CGAACGCGCA CCCCGCGCCC GATGCGCCCG ACGCGACGCC GAAGTCGCCC CGAACCGTGC TGGTGGTCGG CGGCGCGGGG AACGTCGGGA TGATCTACGC GACGTTCTTC GCGACGGTCG TCGGCGCGGA CGTCGCGATC GTCAGCCGCC GCGCGCGCGC GTTCGCCGAC TCGCTGCGCG ATCCGTCGGC CGCGACAGAC CGCGCGCTGC GCCGCCGCAA GGCGCTGTAC GAGCGCATGG TCGCGGCCGG CGGGCGGATC ATGTTCGTCG ATGCCGACGC CACCGATCCG CGGCAGCTCG AGCGCGCCGC GCGATCGGTC GCCGACGCGT TCGGCTCGCT CGATCTCGTC GTCCATGCGG CGGGCGCGCC CGCCGACATG CACTACCGGA CGTTCGACGA TACCGACGTC GCCTACCTGG ATGCGCTCGT CTCGCCGAAG CTCGACGTCT GCGCGAACCT GTACGCGCTG ACTCGCTCGC TGCGCATTGC GCGCGTGATG ATCGTGTCGT CGATTTCGGC CACGCTCGGC GGGATCGGGC TGTACGGCTA CGCGGCGTCC CATTCGTTGC TGAACGCGTA TGCGCAGTCG GCGAGCAGCG CGGCGTGCCG CTGGACCGTG ATCGACTGGG ACGCGTGGGA GTTCTTCAAG GACACCCGCG ACGAGGCGAA CGACGACGTG GGCATCGATC ACTACGCGAT CAGCGAGCAG GAGGGGTTGT CGGTGCTCGA GCGTCTTCAC GCGCTCGGCT GGCCCGCGCA CATCGTCGTC GCGAGCGGCG ACCTGATCCA GCGCTATCGC AACTGGGTGC TGTCCGAGCG CGACGACACG CCCGCCGCGG CGCAGATCGT CGCGCCGCGC CCGCTGCTGA AGGACGAGCT GGTCGCGCCG CGCACCGGCA CCGAGGCGGC GCTCGCGAAG CTGTGGAGCG AGTGCATCGG CGTCGAGCCG GTCGGCGTGC GCGACAACTT CTTCGAGCTG GGCGGCCATT CGCTGATCGC GCTGAAGCTC GTCGACCGGA TCAATCAGAC GCTCGACTGG GATCTGTCGG CGGTCGACAT GTTCAAGTTC CCGACGATCG AACGCCTGGC CGACGCCAAC GCGGCGCACA CGCCCGACGC CGCGGACGAT GCCGGCGCAC CGCACGGCGC GGGTCACGAC GCGCGCGACG ACGCCCCGCG TCCGCCGGCG CAGCCCGATG CGGGCGCGCA TGCGGATCGG CGCCGCCGCC ACTACTACCA AAGCAGAAAA CACAGCATGG AGTCGAAAAG TGAATAA
|
Protein sequence | MEQVALIGSR LRLPGADTVD AFWQNVLAGR DCIDSLSDAQ LLAAGVDPAF SGLPDYVKRA GVLADVDRFD YRFFGYTFRE AQAIDPQQRV LLTLAHQLLE QVGAPGRDVG VYTSVGFPHY LLNNLSTQPP GRVALSDVVF GNSGDCASTR IAYKLDLHGP AMSIQSGCSS ALMALHNARI AILTGQCRMA LVGAAAIRTP QTEGYLYQRD GVLAKDGVCR PFDARASGTV FTNGAVVLAL KALSAAQRDG DDIIGVIRGS AINNDGQRKS GYTAPSVAGQ SEAIRRAYER SGIGPETIGY VETHGTGTAL GDPIEIQALK DAYGGDAGAR ARCALGSTKA NIGHTDVAAG LAGVLKAALC VKHAIKPPLA GFERANPNLP LDGSPFYIPS TPEPWPNAEG QPRRAAVSAL GVGGSNAHVI LEQAPQRDLS RCVDTGPLLM SAQTPHALDA LDAQYDDAFA RQALPRGDAC YTSQLFRRHL PEKRAFVFDA AGGRRRVAPS GDWRHAHAAL LFPGQGTQYA GMGRALYARG GQFRATFDDC ADRFVREGCA DPRELLNADD ARIRDTAVLQ PYLFTLEYAL GATLLAMRLP VAAAIGHSLG EYVAATLAGV FDLADAIAIV AIRARIMSRA PRGAMLAVLA EEARVTAFLD EALSLCAVNS DTSCVVGGTE QAIDALAARL AGAGLASVRL QTSHAFHSHL MEASSREFAA AFDGVPLRAP RFPIVSNLDG RADLPERFAT AGYWVDHLRR PVRFNDGLAT LASLVPFGAW VEAGPGKSIA NTLARMPLTE VACLSTMLPG SETALFDTLA AQCWANGIEI DWTPLYGDTR GNVVPLVPHP LDEISCWIDA PAKRAVADEA PEYEKQGDID QWFYEYDWTP VQPDGEAAPG HAAARAPIAD SVLLIGDASR DAARLAERVA QDAESFFVVD GAHPQALDGA LATVAAQAAA KRARISRVLI VVPSRCGGAD TQADGARDES GRALDAMLAM QRTFDALRNA LPGKLDVALI ALSGARADGG EPSVASAWID SFATVVHQEF SRVVCRAVHV DAAPAAGEAD DAAARRRTID TLAHACLRHP GRFLRIRDGR LIERGLKRGG ARAANAHPAP DAPDATPKSP RTVLVVGGAG NVGMIYATFF ATVVGADVAI VSRRARAFAD SLRDPSAATD RALRRRKALY ERMVAAGGRI MFVDADATDP RQLERAARSV ADAFGSLDLV VHAAGAPADM HYRTFDDTDV AYLDALVSPK LDVCANLYAL TRSLRIARVM IVSSISATLG GIGLYGYAAS HSLLNAYAQS ASSAACRWTV IDWDAWEFFK DTRDEANDDV GIDHYAISEQ EGLSVLERLH ALGWPAHIVV ASGDLIQRYR NWVLSERDDT PAAAQIVAPR PLLKDELVAP RTGTEAALAK LWSECIGVEP VGVRDNFFEL GGHSLIALKL VDRINQTLDW DLSAVDMFKF PTIERLADAN AAHTPDAADD AGAPHGAGHD ARDDAPRPPA QPDAGAHADR RRRHYYQSRK HSMESKSE
|
| |