Gene BURPS668_A1470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1470 
Symbol 
ID4887782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1369202 
End bp1370461 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content64% 
IMG OID640131409 
Productputative polyketide biosynthesis protein pksG 
Protein accessionYP_001062467 
Protein GI126443435 
COG category[I] Lipid transport and metabolism 
COG ID[COG3425] 3-hydroxy-3-methylglutaryl CoA synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0349844 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGCCG TTGGTATTGA GGCGCTCAAC GTTTATGCCG GGGTTGCCAG TCTCGACGTC 
TCGAGGCTGG CGGAGCATCG CAAGCTGGAC ATGGCGAGAT TCCAGAACCT GCTGATGCGG
GAAAAATCGG TTGCGCTGCC CTATGAGGAT CCGATCACCT ATGGCGTCAA CGCCGCCAAG
CCGATCGTCG ACGCGCTGAC GCCCGACGAG CGCGACCGCA TCGAAATGCT GATCACCTGC
ACCGAATCGG CATTCGATTT CGGCAAATCG ATGAGCACCT ACTTTCACCA CCATCTCGGC
CTGAAGCGCA ACTGCCGCCT GTTCGAAGTC AAGAACGCGT GCTACTCCGG CGTCGCCGGC
CTGCAAACGG CGATCAACTT CATCCTGGCC CAGGTCTCGC CCGGCGCGAA GGCGCTCGTG
ATCGCGACCG ACCTGTCGCG CTTCATCGTC GAGGAAGGCG GCGAGGCCTT GTCCGCCGAC
TGGTCGTTCG CCGAGCCGAG CAGCGGCGCG GGCGCGGTCG CGATGCTCGT CAGCGACACG
CCGCACGTGT TTCGCATCGA CGTCGGCGCG AACGGGTACT ACGGCTACGA GGTGATGGAC
ACCTGCCGGC CGACCACCGA TAGCGAAGCC GGCAATTCGG ATCTGTCGCT CCTGTCGTAT
CTCGACTGCT GCGAAAACGC GTTCCTCGAA TACCAGAAGC GCGTGTGCGA CGTCGATTAC
GCGAGCACGT TCGGATTCCT CGCTTTTCAC ACGCCGTTCG GCGGCATGGT GAAGGGCGCG
CACCGCAATC TGATGCGCAA GGCGAGCCGC TGCTCGACGC AGGAAATCGA GCAGGACTTC
CAGCGCCGCG CGGGCCCCGG GCTCGTCTAC TGCCAGCGGC TCGGCAACAT CATGGGTGCG
ACGGCGATGC TGTCGGTCGC CAGCACGATC GACAACGGCG AGTATCGCGC GCCGCAGCGC
GTGGGCGTGT TCTCGTACGG CTCGGGCTGC TGCTCGGAGT TCTTCAGCGG CATCGTCGAC
GAGGAAGGCC AGCGCCGGCT GCGCGGCATG CGCATCGGCG AGCAGTTGGA CCGCCGCTAC
GCGCTGTCCA TCGACGAATA CGAGCACGTG CTCAAGGAAA GCCGGGTCGT GCGCTTCGGC
ACCCGCAACG CGAAAATCGA CGACGGCTTC ATCCCCGCGG CGCGGCGCGC GCACGGCCGC
GAAACGCTCT TCCTGAGCCG GATCAACGAA TACCATCGGG AATACGAATG GATATGCTGA
 
Protein sequence
MTAVGIEALN VYAGVASLDV SRLAEHRKLD MARFQNLLMR EKSVALPYED PITYGVNAAK 
PIVDALTPDE RDRIEMLITC TESAFDFGKS MSTYFHHHLG LKRNCRLFEV KNACYSGVAG
LQTAINFILA QVSPGAKALV IATDLSRFIV EEGGEALSAD WSFAEPSSGA GAVAMLVSDT
PHVFRIDVGA NGYYGYEVMD TCRPTTDSEA GNSDLSLLSY LDCCENAFLE YQKRVCDVDY
ASTFGFLAFH TPFGGMVKGA HRNLMRKASR CSTQEIEQDF QRRAGPGLVY CQRLGNIMGA
TAMLSVASTI DNGEYRAPQR VGVFSYGSGC CSEFFSGIVD EEGQRRLRGM RIGEQLDRRY
ALSIDEYEHV LKESRVVRFG TRNAKIDDGF IPAARRAHGR ETLFLSRINE YHREYEWIC