Gene Information |
Locus tag | BTH_II1667 |
Symbol | |
ID | 3846010 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1989915 |
End bp | 1995620 |
Gene Length | 5706 bp |
Protein Length | 1901 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637838968 |
Product | polyketide synthase, putative |
Protein accession | YP_439861 |
Protein GI | 83716287 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
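The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1989915, 1995620

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 5706
print(protein_length_aa(length))  # 1901
```

Both results agree with the Gene Length (5706 bp) and Protein Length (1901 aa) fields.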
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATCCA GCAAGGAGAT CTTCGAGGCG CTGCGCGACG GGCGGCTGTC GCGCGAAGAG GCGCATGCGG CGCTGCGCAG CGCGCGGGCG GCGGCGGATG CGGCGGGTGC GAACGGCGCG GCCGATGCGG CCGATGCGGG CGATGCGGCA AGCGGGGCGA GCGCGGCGAA CGGCGAGCGT GCCGCGCATG CGGCGAAGAC GGCCGAATCG GCGGCCGCGG CCGACGCCAT GCCGATCGCG ATCATCGGCC GCTCGGGGCG CTATCCGGAC GCGCCGGATC TCGCCGCGTA CTGGCGCAAT CTGGCGGCGG GGCGCGACTC GGTGCGCCCG ATCGCGCCGT CGCGCTGGGA CACGGACGCA TGGTTCGACG CGCGGCGCGG CGAGCCGGGC AGGATCTATT GCCGCGCGCT CGGCGCGCTG GACGACATCG ATTGCTTCGA TCCGCTGTTT TTCCACATCT CGCCCGCCGA GGCGCAGTAC ATCGATCCGC AGCACCGCAT CGTGCTCGAG GAGGCGTACC GCGCGTTCGA GGATGCCGGC TACACGCCGC AGCGGCTCGG CGGCCGCGCG TGCGGCGTCT ATCTCGGCAT CATGAGCAAC GAATACGCGA TGATGCTCGC GCGCGAGCGC GCGCCGATCG TGAACGCGAC CGGCAACGCG TTCTCGATCG CGGCCGCGCG CATCGCGTAC TGGCTCGACC TCAAGGGGCC CGCGCTCGCG CTCGACACCG CGTGCTCGTC GTCGCTCGTC GCCGTCCATC TCGCGTGCCA GGCGCTGCGC GCGGGCGAGG TGGAGATGGC GCTCGCATGC GGCGTGTCGC TGTATCTGAC GGCCGAGTCG TACGTCGCGA TGTGCGGCGC GAACATGCTG TCGGCGCGCG GCGCGTGCCG CGCGTTCGAC AACGACGCCG ACGGCTTCGT GCCGGGCGAG GGCGCGGGCG CGGTGCTGCT CAAGCGGCTC GACGCGGCCG AGCGCGACGG CGACCGGATT CTCGCGACGA TCGCGGCGAG CGGCATCAAC CAGGACGGCA AGACGAACGG CATCACCGCG CCGAGCGCGG CGAGCCAGAG CGCGCTCGTG CGCGACGTGT ACGCGCGTTT CGGGATCGAC GCGACGAGCG TCGACTACTG CGAGATGCAC GGCACGGGCA CGAAGCTCGG CGATCCGATC GAGCTGCGCG CGCTCGCCGA CGTCTATCGC GAAGCGGGCG CGGCCGCCGG CGCGTGCGCG ATCGGCTCGG TGAAGACGAA CATCGGTCAC ACGTCGGCGG CGGCGGGCGT CGCGAGCCTG CACAAGGTCA TGCTCGCGCT CGAGCACGGC GCGCTCGCGC CGAGCCTGCA TTTCGACGCG CCGAACGAGC ATTTCGATTT CGCCGCCTCG CCGTTTCGCG TCGTCACGCA CGCGCAGCCG TGGCCGCGGC GCGCGGATCG CCCGCGCCGC GCCGCGATCA GCTCGTTCGG GCTGAGCGGG ACGAACGCGC ACGTGATCGT CGACGAATAT CGCGGCCGGG CGCGCGGCGC CGCCTCGCAT GCGTCGGGCG CGGCGCACGC GTCGGGCGTA CCGGATGCGT CAAACGTCTC CGGCGTGCGG GACGCGCTGC CGGCGTCGAA CGCCGCGAAC GCACCGCGCC GACCGGAAGT CCCGGAAGCC CCGGAAACCC CGAGCGCGCG GCACCCGGCG ACGGACGCGC TCGTCGTCCT GTCCGCGCGC GACGCCGAGC GCCTGCGCGA CTATGCGCGG CGTCTGCTGC GCCACGTGAG CGACACGTCG CCGGTATCGT TGCCGGACCT CGCGTACACG 
CTGCAGACGG GGCGCGCGGC GATGCCGCAT CGGCTCGCGA TCGCCGCGGA TGCCGTCGAT ACGCTGCGCG ACGCGTTGAG CGCGTTCGTC GACGGCCGGC CGCATCCGGC GCTCGTCGCA GGCGCGGCGC AGGGCGGCGC CGGTACGGGG GCGGCGCGCG AAGCGCTCAC GCATGCGCAC GCCTGGGACG CGCTCGCGCG CGCATGGGTG GCGGGCGCGG AGATCGACTG GCCCGCCGCG CATGGCGCGG CGCTCGCCGA GCGCGTGCGC GTGAACGCGC CGACGTATCC GTTCGCGCGC GAGCGCTATT GGCTGCCGCG GCCGCCCGCG GACGAGGCTG GCGCGCGCCG GCGCGTCGCG CTCGTGAGGG ACTGGACGTC CGCGCCGCTC GGCGCGCCGG CCGTGTCCGG CGCGCGGCCT GTGGCGACGC GCATCCTGAT GGTCGGCCGC TGCGCGGCGA ACCGCGCGCT CGCCGACGCG CTCGCGCCGA TGCTGCCGGA TGCGGCGCTG ATCGACGTGT GCGACGACGC CTCGTTCGAC CGCCTGCCCG CGGCGGCGGG CGTCGTCGAC CTGACAGGCT GCGCGCGCGA GCCGATCGAC GAGATGCGGT GGCTCGCGCT CGTGCAGCGG GTGGTCGCGA GCCGGCCGCG CGCGCTGTAC GCCGTCACGG CAGGGCTCGA GACGCCCGCG CGCGGCGCGG CCGGCCTCGC GGGCGCCTGG CAGGCGGGGC TGTATCTCGC GCTGGCCGCC GAATATCCGA GCGTCGCGTC GCGCGTCGTC GATTTGCCCG AAGGCGGCGA GCCGTCGGCG CTCGCGGCGC TCGTGGCGGC CGAATACGAC GGCGCGTCGG CCGATGTGCA TTGCCGCTAT CGCGACGGCG TGCGCGAGCG CGCGATCGTG CGTCCGCTCG ACATCGATGC CGACGTCGAC GCGCCCGGGC GCGCGCCGCG CGCGAGCGTG CTCGGCCCCG ACGATTGCGT ATGGATCACG GGCGGCACGC GCGGACTCGG CCTGACGTGC GCGCGGCACC TCGTGTCGCG CCACGGGGTG CGCCGGCTGC TGCTGACGGG CAGGACCGCG CTGCCGCCGC GTTCGGAATG GGACGCGCTC GGCGCGCGCG ACGCGGATTT CGCGCGGCGG GCGGCCGGGC TGCGCGCGCT CGAGGCGATG GGCGCGCGGA TCGAATATTC GGCCGTCGCG CTCGATGACG CGCGCGCCGT CGCGGCCGAG CTCGCGCGCG TGCGTCCGAC GCTCGGCGCG GTGACGGCGC TCGTTCATTG CGCGGGCGCG GTCGACTGGT CCGAGCCGGC GTTTTTCGCG AAGTCGCGCG AATCGATGCG CGCGGTGCTG CAGCCGAAAA CGGCCGGGCT CGCGACCGTC GTCGACGCGC TCGCGGGCGC GCCCGTCAAG CGGATCGTGC TGTTTTCGTC GGTTGCGGCG GCGATCCCCG CGCTCGGCGC GGGCCAGGCC GACTACGCGA CGGCGAACGC GTGGCTTGAC TACTTCGCGC GCGCCTATGC GCGCGCGCTG CCCGTCGTCA GCGTGCAATG GCCGAACTGG CGGGGCGCGG GAATGGGCGA GGTGCGCAGC CGCGCGTATG CGCAAAGCGG GCTGTCGGCG CTCGACGACG CGCAAGGGCT TGCGCTGTTC GACTGGGCGC TCGAGCGGAT GCCCGCGCCG GTGGTGTTGC CCGCCTTCGT GCGCGACGGC GAACGCCCGT GGCTGGACTG GCTGGGAGCG CGGACGGCGC ACCGAGCGGC ATTTGACGCC GAGTCCGACG CAGGCCCGGC CGCGGCGCGG GCCTCGGTGG 
TCGCGGCGGA GGGGGAAGCC GGCCGCGACC GCCTGTCCGG CGGCGCAAGC GGCATCGCGA GCGGGGCGGC GTCCGTCGAC GGCGGTGGCG ATGGCCACCG CATTGGCGAG GCCGCCCGCG ACGGCGACGG CGAAGGCGAG GGCAATGGCG AGGGCAATGG CGAGGGCGAA GGCGAGGGCG CTTTCGTCGC CGCCGTGCGC GCATGGCTCG CGTCGACGTT CGCGCGGGAG CTGGCGCTGC CGAGCGGGAC GCTCGATCCG GCGAAGCCGT TTCGCGAGTA CGGCGTCGAT TCGATCATGC TGACGCAACT GCTGCGTCCG CTGAACCGGC TAGCCGATGC GCCGCTCGAT CCGTCGTTGC TGTTCGAGTA CGGCGACGTC GCGCAACTGG CCGATTGGCT CGTCGCGCAT CGCGCGGAGC TGATGCGGCA AGCGCTCGCG GCCGAAGGCG CGCGCGCGCC GGCCGACGCG CGGCCGGCTG GGATCGCCGC GCCGGACCCG CGGGCGCTTT CGCCGGCCGC CGCGACGCAC GCCGACGCGG CGCGGGTGGA GACGACTTGC GCGATGCGCG GGACGGCCGC GACATCGGCG GCAATCGAGA CGCCCGCGCC GCCTGCGAGT CCGGCAGCGC CGACAGTCCC GGCAACATCG GCAACAGCGG CAACATCGGC AACAGCGGCG ACAGCGGCAA CAGCGGCGAC ACCCGCCGCC CCGCCATCCC CGATGCGGGA TTCGCAGGCG CGCCCGTGCG ACATCGCGGT CGTCGGGCTG GCCTGCCGCT TTCCCGGCGC GCCGTCGGTC GACGCGTACT GGGCGCTGCT GCGCGACGGC GCGCGCGGCA TCGGCCCCGC GCCGCGCGAG CGCTTCGCGC AAGCCGATCG CTTTTGCGGC GGCTTTCTCG ACGCCGTCGG CCGGTTCGAC CCCGACCATT TCGGCATCGC GCCCGGCGAC GCGCGCGCGA TGGACCCGCA GGCGCTCTTG CTGCTCGAGC TCGGCGTCGA GCTGTTTCAT CACGCGGGCT ACCGTCCGGA AGAGCTCAGG GGCGGGGCCG TCGGCGTGTT TCTCGGCGGC CGCAGCCAGC ACGCGCCCGA CGCCGCGCTG CTTGCGCACG CGCATCACCC GATCGTCGCG GTCGGACAAA ACTATCTGGC CGCGAACCTG TCGCGGCACT TCGACCTGAA CGGCGCGTGC GCGCTCGTCG ACACCGCGTG CTCGTCGGCG CTCGTCGCGA TGCACTCGGC CGTGCTCGCG CTCGCGGCGG GCGAGATCGA CGCGGCCGTC GTGGGCGGCG TGAGCCTGCT GTCGTCGGAC GCCGGCCATC GCCTTTTCGA GCAGCGCGGC CTGCTCGCGC CGGACGGCGC GTTCCATCTG TTCGACGAGC GCGCGAACGG CACGGTGCTG AGCGAGGGCG CGGGCCTCGT GATGCTCAAG CCGCTTGCCG CCGCGCGCGC GCACGGCGAC ACGATCTACG CGGTGCTCAA GGGGCTCGCG GTCAACAACG ACGGCCGCAC GGCCGGCCCG TCGAGCCCGA ATTTCGCCGC CCAGCAGGCG GTGATGCGCC GCGCGCTCGC GCAAAGCGGG CTGCGCGCCG ACGACGTGCG GCACGTCGAG GCGAACGGCT CGGGCTCGCG CGTGACCGAT CTGCTCGAGC TCAAGAGCAT CCGCGCCGTG TACGGCGGCC AGTCGCGGGA CGCCGCCTGG TGCGCGCTCG GCTCCGTGAA GCCGAGCATC GGCCACACGC TGTGCGCGCA GGGCATCGCG GCGTTCATCA AGAGCGTGCT GATGCTGCAT CACCGCAGCG TGCCGCCGTT 
CCTGTCGGGG CAGCAGCCGA TGCAGCACAG CCCGATCGAG CGCTCGCGGC TGCGCTTCGT CAGGGAAACG ATCCCGTTCG ACGTCGCGGC CCCCGCGGTC GCACTCAATT GCTTCGCCGA CGGCGGCACG AACGTGCACG CGGTGCTGCA GGCGTGGGAG GGCCCGACGC GGCCGTCGCG CACGCCGCTC GCCGCGCCCG TGTTCGCGCG CCGGAGGCTC GACGCCCGCA CGTCACATGC GAGCGCGCGG ACGGCGGCGG CATGGCCGCG TTACGTCGCG CGCTGA
|
Protein sequence | MRSSKEIFEA LRDGRLSREE AHAALRSARA AADAAGANGA ADAADAGDAA SGASAANGER AAHAAKTAES AAAADAMPIA IIGRSGRYPD APDLAAYWRN LAAGRDSVRP IAPSRWDTDA WFDARRGEPG RIYCRALGAL DDIDCFDPLF FHISPAEAQY IDPQHRIVLE EAYRAFEDAG YTPQRLGGRA CGVYLGIMSN EYAMMLARER APIVNATGNA FSIAAARIAY WLDLKGPALA LDTACSSSLV AVHLACQALR AGEVEMALAC GVSLYLTAES YVAMCGANML SARGACRAFD NDADGFVPGE GAGAVLLKRL DAAERDGDRI LATIAASGIN QDGKTNGITA PSAASQSALV RDVYARFGID ATSVDYCEMH GTGTKLGDPI ELRALADVYR EAGAAAGACA IGSVKTNIGH TSAAAGVASL HKVMLALEHG ALAPSLHFDA PNEHFDFAAS PFRVVTHAQP WPRRADRPRR AAISSFGLSG TNAHVIVDEY RGRARGAASH ASGAAHASGV PDASNVSGVR DALPASNAAN APRRPEVPEA PETPSARHPA TDALVVLSAR DAERLRDYAR RLLRHVSDTS PVSLPDLAYT LQTGRAAMPH RLAIAADAVD TLRDALSAFV DGRPHPALVA GAAQGGAGTG AAREALTHAH AWDALARAWV AGAEIDWPAA HGAALAERVR VNAPTYPFAR ERYWLPRPPA DEAGARRRVA LVRDWTSAPL GAPAVSGARP VATRILMVGR CAANRALADA LAPMLPDAAL IDVCDDASFD RLPAAAGVVD LTGCAREPID EMRWLALVQR VVASRPRALY AVTAGLETPA RGAAGLAGAW QAGLYLALAA EYPSVASRVV DLPEGGEPSA LAALVAAEYD GASADVHCRY RDGVRERAIV RPLDIDADVD APGRAPRASV LGPDDCVWIT GGTRGLGLTC ARHLVSRHGV RRLLLTGRTA LPPRSEWDAL GARDADFARR AAGLRALEAM GARIEYSAVA LDDARAVAAE LARVRPTLGA VTALVHCAGA VDWSEPAFFA KSRESMRAVL QPKTAGLATV VDALAGAPVK RIVLFSSVAA AIPALGAGQA DYATANAWLD YFARAYARAL PVVSVQWPNW RGAGMGEVRS RAYAQSGLSA LDDAQGLALF DWALERMPAP VVLPAFVRDG ERPWLDWLGA RTAHRAAFDA ESDAGPAAAR ASVVAAEGEA GRDRLSGGAS GIASGAASVD GGGDGHRIGE AARDGDGEGE GNGEGNGEGE GEGAFVAAVR AWLASTFARE LALPSGTLDP AKPFREYGVD SIMLTQLLRP LNRLADAPLD PSLLFEYGDV AQLADWLVAH RAELMRQALA AEGARAPADA RPAGIAAPDP RALSPAAATH ADAARVETTC AMRGTAATSA AIETPAPPAS PAAPTVPATS ATAATSATAA TAATAATPAA PPSPMRDSQA RPCDIAVVGL ACRFPGAPSV DAYWALLRDG ARGIGPAPRE RFAQADRFCG GFLDAVGRFD PDHFGIAPGD ARAMDPQALL LLELGVELFH HAGYRPEELR GGAVGVFLGG RSQHAPDAAL LAHAHHPIVA VGQNYLAANL SRHFDLNGAC ALVDTACSSA LVAMHSAVLA LAAGEIDAAV VGGVSLLSSD AGHRLFEQRG LLAPDGAFHL FDERANGTVL SEGAGLVMLK PLAAARAHGD TIYAVLKGLA VNNDGRTAGP SSPNFAAQQA VMRRALAQSG LRADDVRHVE ANGSGSRVTD LLELKSIRAV YGGQSRDAAW CALGSVKPSI GHTLCAQGIA AFIKSVLMLH 
HRSVPPFLSG QQPMQHSPIE RSRLRFVRET IPFDVAAPAV ALNCFADGGT NVHAVLQAWE GPTRPSRTPL AAPVFARRRL DARTSHASAR TAAAWPRYVA R
|
| |
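The sequence fields can be checked against the GC content and protein sequence reported in the record. The sketch below is illustrative only and is not the database's own method. It assumes the gene sequence is displayed in coding orientation (it begins with ATG even though the strand is "-"), strips the spaces and line breaks that group the bases into blocks of ten, and translates with the standard codon assignments, which translation table 11 shares for non-initiator codons. Alternative start codons permitted by table 11 are not handled, which does not matter for this ATG-initiated gene. All function names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
head = "ATGCGATCCA GCAAGGAGAT CTTCGAGGCG"
print(translate(head))  # MRSSKEIFEA
```

The ten residues printed match the first block of the protein sequence above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content field and the complete protein sequence.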