Gene BURPS668_A1645 details

Gene Information

Locus tag: BURPS668_A1645
Symbol:
ID: 4888928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1574500
End bp: 1576968
Gene Length: 2469 bp
Protein Length: 822 aa
Translation table: 11
GC content: 73%
IMG OID: 640131584
Product: polyketide synthase, type I
Protein accession: YP_001062641
Protein GI: 126443101
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
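
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `expected_lengths` is hypothetical and not part of any database tool.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes inclusive 1-based coordinates and one untranslated stop codon.
    """
    gene_length = end_bp - start_bp + 1      # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Values copied from the Gene Information fields of this record.
gene_bp, protein_aa = expected_lengths(1574500, 1576968)
print(gene_bp, protein_aa)  # prints "2469 822"
```

Under those assumptions the result agrees with the recorded Gene Length (2469 bp) and Protein Length (822 aa).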


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCAAT CGCACAGTGA CACGGGCGCG CATGCGGCCG GCGCGCCGGA AGCCGGCCCG 
ATGGACATCG CGATCGTCGG CCTGAGCGCG CGCGTGCCCG GCGCGGGCGA CGCCGACGCG
CTGTGGTCGC TGCTGATGTC GGGCGGCGAC GGGCTCACGA CGTTCGAGCC GCACGCGCTC
ACGGCCGAGC TCGCGCAACT GACCGATGTC GAGCGCGCGC GCTACGTCGC GCGGCGCGGC
GTGCTCGACG GCATCGAAGC GTTCGACGCG GGCTTCTTCA ACCTGTCGGC GGCGGAGGCG
ACGCTGCTCG ATCCGCAGCA TCGCGTGCTG ATGGAACTGG TGTGGGAGGC GCTCGAGGAC
GCGGGCGCGG CCGATTGCGC GGGGCGCCGC GTCGGCGTGT TCACGACGTC GGGCCTGAGC
CATTACCTGA TCAAGCACCT GCTGCCCGAT CCGTCGATCG GCGAGCGGCA CGGGCAGTTG
CAGCTGCTGA TGCTCAACGA CAAGGATTTT CTCGCGACCC GCATCGCGTA CTTCCTGAAC
GTGCACGGGC CGGCCGTCAA CGTGCAGACC GGATGCTCGA GCTCGCTCGT CGCGCTGCAC
TACGCGTGCC TGTCGCTGCT GCGGCGCGAT TGCGAAATGG CCGTCGTCGG CGGCGTGTCG
ATCGCGCTGC CGCAGCAGTC GGGCTATGTG TTCTCGGAGA ACATGATCGG CTCGCGCGAT
GGCTTTTGCC GCGCGTTCGA CAGCGACGCG TCGGGCACGG TGCGCGGCAA CGGCGCGTGC
GCGGTCGTGC TCAGGCCGCT CGCCGACGCG CTCGCGAACC GCGATCCGGT GTGGGCGGTG
ATCAAGGGCA CGGCGCTCAA CAACGACGGG CGCGACAAGG TCGGGTTCAC CGCGCCGAGC
CCGAAGTGGC AGGCCGACGT GATCGACGCC GTCTATCGCC GAAGCGGCGT CGCGCCCGAC
GATGTCGATT TCGTCGAGGC GCACGGCACG GGCACGCCGC TCGGCGATCC GATCGAGGCG
AGGGCGCTCA ATCAGGTGTT CGCGGGCGCG CGGCGGCCGC GCTATCTCGG CGCGCTGAAG
ACGCAGATCG GGCACCTCGA CACGGCGGCC GGCCTCGCCG GGCTGATCAA GCTGTGCATG
TCGCTGCGAA ACGGCGTGAT TCCGCCGACC GCGCACTTCC GCACGCTCAA TCCGCGTATC
TCGTTCGACG ATTCGCGGTT CACGATCAAC ACCGAACCCG TCGCGTGGCC GAGGCGCGAT
GCGCCGCGCT ACGCGGCGCT CAGCTCGTTC GGCATCGGCG GCACCAACGC GCATGTGGTC
GTCGCGGACG GGCCAGCCGA CGCGCGGCAA GCCGGGGCGC CGTGCGACGA ACGGGCGTCA
AGCGCGCAGC GCGGCCCGCA TCTGATCGTC GTGTCCGCGA AGAGCGCCGC TTCGCTCGCC
GCGCTCACGC GCGCCTACGC GGACGCGATC GCCGCGCTCG ACGACGCGGC GCTCGCCGAT
TTCGCGTACT CGACGCGCGT CGGCCGCCGC GGCTTCGAGC ATCGGCTGGC CGTTTGCGCG
GACGATCGCG AAAGCGCGGT TGCGCGGCTG CGCGCGGCGC GCGCGCGCTT CAGCGCGAAG
CCGATCGCCG GCGTGAGCGT CGCCGCGGCC GACGCGCACG CCGTCGCCGC GCAGCTCGGC
GTCACGCCGC CCGACGGCGC GTCGCGCGCG ATCGACGCGA TCGCCGCGCG GCTCGCCGAG
TGGGGCATCG CGATCGACGC GGGCTCGGCG GTGCGGCTCG GCGCCGCCGG CGGCCGGCTG
CACGTCGCCG CGCCCGGCGC ACCGTCCGAG ACGATCGACG CCGCGACGCC CGGCACGATC
GGCTGCTATC TGCAGGCGGC GCTGTGGTGC GCGGGCGTGG CGGTCGATTT CAGCCGCGAA
GCCGCGCGTG GAGCTGCCGG CGCGGTTGCC GGTGAAATCG GCGGAGAAGC CGGTGAAAAG
GCCCGCGCGC CCGCGCGCAG AAGAATCCGG ATTCCGACCT ACCGGTTCGA CCGGCGGCGG
CACTGGATCG ACGCGCCGGG CGAGCGGCGC GGCGACGGCC GCGACAGCGG GCCCGGCGAC
GCCGCCGCAC CGGAGGCGGC CGCGTCCGAC GCGCGGCCGC GCAAGCCGCC CGCCGAGCAG
CCGACGAGCC TCGTCGCGCT CGAGGCGGTG CTGCTGTCGC ACTGGCGCAG CCTGTTGGGA
CTGCCGACGC TGCGCAGCAC CGACAACGTC TTCGAGCAAG GCGCGGATTC GCTGACGGTC
GCGCAGTTCG TCGCGCAACT GAGCGCGGAG CGCGCGCTGC CCGTGCACGT CGTCGATTGC
TACAGCGAGC CGAGCGTGGG CGGCCAGGCG CGCCTCGTCG GCGCGCGCAT CGGGCTGGGC
GGCCGCGCCG CGGCAGCCGC GCAAGCGAGC GCGCCGGCGC CCGAAGTCGA GAGATTTGAC
AACCTTTAA
 
Protein sequence
MTQSHSDTGA HAAGAPEAGP MDIAIVGLSA RVPGAGDADA LWSLLMSGGD GLTTFEPHAL 
TAELAQLTDV ERARYVARRG VLDGIEAFDA GFFNLSAAEA TLLDPQHRVL MELVWEALED
AGAADCAGRR VGVFTTSGLS HYLIKHLLPD PSIGERHGQL QLLMLNDKDF LATRIAYFLN
VHGPAVNVQT GCSSSLVALH YACLSLLRRD CEMAVVGGVS IALPQQSGYV FSENMIGSRD
GFCRAFDSDA SGTVRGNGAC AVVLRPLADA LANRDPVWAV IKGTALNNDG RDKVGFTAPS
PKWQADVIDA VYRRSGVAPD DVDFVEAHGT GTPLGDPIEA RALNQVFAGA RRPRYLGALK
TQIGHLDTAA GLAGLIKLCM SLRNGVIPPT AHFRTLNPRI SFDDSRFTIN TEPVAWPRRD
APRYAALSSF GIGGTNAHVV VADGPADARQ AGAPCDERAS SAQRGPHLIV VSAKSAASLA
ALTRAYADAI AALDDAALAD FAYSTRVGRR GFEHRLAVCA DDRESAVARL RAARARFSAK
PIAGVSVAAA DAHAVAAQLG VTPPDGASRA IDAIAARLAE WGIAIDAGSA VRLGAAGGRL
HVAAPGAPSE TIDAATPGTI GCYLQAALWC AGVAVDFSRE AARGAAGAVA GEIGGEAGEK
ARAPARRRIR IPTYRFDRRR HWIDAPGERR GDGRDSGPGD AAAPEAAASD ARPRKPPAEQ
PTSLVALEAV LLSHWRSLLG LPTLRSTDNV FEQGADSLTV AQFVAQLSAE RALPVHVVDC
YSEPSVGGQA RLVGARIGLG GRAAAAAQAS APAPEVERFD NL
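
Both sequences are printed in blocks of 10 characters, 60 per line, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup with two simple checks (GC fraction and codon splitting). The helper names `clean_sequence`, `gc_fraction` and `split_codons` are hypothetical, and the stop-codon set is the one assumed for translation table 11.

```python
def clean_sequence(blocked: str) -> str:
    """Strip spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def split_codons(dna: str) -> list[str]:
    """Split a sequence into complete in-frame codons, ignoring any remainder."""
    usable = len(dna) - len(dna) % 3
    return [dna[i:i + 3] for i in range(0, usable, 3)]


# Stop codons assumed for translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}

# First and last lines of the gene sequence in this record.
first_line = clean_sequence(
    "ATGACGCAAT CGCACAGTGA CACGGGCGCG CATGCGGCCG GCGCGCCGGA AGCCGGCCCG"
)
last_line = clean_sequence("AACCTTTAA")

print(len(first_line))                                # prints 60
print(split_codons(first_line)[0])                    # prints ATG
print(split_codons(last_line)[-1] in STOP_CODONS)     # prints True
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field; that result is not reproduced here.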