Gene BURPS668_1544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1544 
Symbol 
ID4882028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1510265 
End bp1513660 
Gene Length3396 bp 
Protein Length1131 aa 
Translation table11 
GC content70% 
IMG OID640127472 
Productalpha-glucosidase 
Protein accessionYP_001058585 
Protein GI126440601 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases
[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCCCG CAGACACGCT CGAAGACGTC GGCCGCGCGC AGTTCGTCGC GCACATGGCC 
CGAAGGCCGC GCCGCGCGCG CAGGCGAGAA GCGCCCGGCG TCGCCGACAC GCCGCTGTGG
TACAAGGATG CGATCATCTA CCAGTTGCAC GTGAAGTCGT TCTTCGACGC CAATAACGAC
GGCGTCGGCG ATTTCGCCGG CCTGATCGCG AAGCTCGACT ACATCGCGGA GCTCGGCGTC
GACACGGTGT GGCTGCTGCC GTTCTATCCG TCGCCGCGCC GCGACGACGG CTACGACATC
GCCGACTACC GCGGCGTGCA TCCCGACTAC GGCACGCTCG CGGACGTGCG CCGCTTCATC
CGCGAGGCGC ACGCGCGCGG GCTGCGGGTG ATCACCGAGC TCGTGATCAA CCACACGTCG
GATCAGCATC CGTGGTTCCA GCGCGCCCGG CGCGCGAAGC GCGGCTCCGT CCACCGCGAC
TACTACGTGT GGTCCGACAC CGATCTCAAG TACGCGGGCA CGCGGATCAT CTTCCTCGAT
ACCGAGACGT CGAACTGGAC GCACGATCCG GTCGCGGGCC AGTACTACTG GCATCGCTTC
TATTCGCATC AGCCCGACCT GAACTTCGAC AATCCGGCCG TGGTGCGCGA AGTGCTGCAG
ATCATGCGCT TCTGGCTCGA TCTCGGCATC GACGGGCTGC GGCTCGACGC GGTGCCCTAC
CTCGTCGAGC GCGAGGGCAC GAGCAACGAG AACCTGCCGC AGACGCACGC GATCCTCAAG
CTGATCCGCG CGACGATCGA CGCCGAGTAC CCGAACCGGA TGCTGCTCGC CGAGGCGAAC
CAGTGGCCCG AGGACGTCCA GGAATATTTC GGCGACGAGG ACGAGTGCCA CATGGCCTTC
CACTTCCCGC TGATGCCGCG CATCTACATG TCGATCGCGA GCGAGGACCG CTTTCCGATC
ATCGACATCA TGCGGCAGAC GCCCGCCCTC GCGCCGAGCA ATCAATGGGC GGTGTTCCTG
CGCAACCACG ACGAGCTCAC GCTCGAGATG GTCACCGATT CGGAGCGCGA CCTGCTCTGG
CAAGCCTACG CGAGCGAGCG CCGCGCGCGG CTGAACCTCG GCATCCGCCG CCGGCTCGCG
CCGCTGATGG AGCGCGACCG CCGCAGGATC GAGCTGATCA ACTCGATCCT GCTGTCGATG
CCCGGCACGC CCGTCATCTA CTACGGCGAC GAGCTCGGCA TGGGCGACAA CCTGCACCTC
GGCGATCGCG ACGGCGTGCG CACGCCGATG CAATGGTCGT CGGACCGCAA CGGCGGCTTC
TCGCGCGCCG ATCCCGAACT GCTCGTGCTG CCTCCGGTGA TGGGCACGCT GTACGGCTAC
GACGCGATCA ACGTCGAGGC GCAGACGCGC GATCCGCATT CGCTGCTGAA CTGGACGCGG
CGCATCCTGT CGACGCGCCG CGCGACGCGC GTGTTCGGAC GCGGCGCGAT CCGCTTCCTG
CGCCCGGGCA ACCGCAAGAT CCTCGCGTAT CTGCGCGAGC TCGACGGCGA AACGCCGGTG
CTCTGCGTCG CGAACCTGTC GCGCGCGTCG CAGGCGGTCG AGCTCGACCT GTCGGAATTC
GCCGGCTGCG TGCCGACCGA AATGACGTCG GACTCGCCGT TCCCGCCGAT CGGCCAACTG
CCGTATCTGC TGACGTTCCC GCCGTACGGC TTCCTGTGGT TCGCGCTGTC CGAGCACGGC
CGCGAGCCGG CCTGGCATCA GCAGTACGCC GAGCCGCTGC CCGAATTCCT GACGCTCGTG
ATGCGGCGCG GCGAGACGCA GCCGGGCGCC GCGCTGCTCG ACACGCTCGC GCAGGACGCG
CTGCCGTCAT GGCTCGCGCG GCGGCGCTGG TTCGCATCGA AGGAGCGGCG CGTCGACAGC
GCGCGCTTCG ACGCGCTCAC GCCGATTCCG GGCGAGCCGT TTCAGTATGC GGAGGCGCGC
GTCGCCGTCG ACGGCCGCGA AGAACGCTAC GTGGTGCCGC TCGCGAGCGC CTGGGGCAGC
GAGACGCCTC AGCCGCTGTT CGCGCAGCTC GCGCTCGCGC GCGTGCGGCG CGGCCATACG
GTCGGCCTGC TGACCGACGC GTTCGCGCTG CCGTCGTTCG CGCACGGCGT GCTGCGCCAG
CTGCGCGCGG GCGCGGCTGT GCCGGTGGCG GGCGGCGGGC GTCTCGAATT CCGGCCCGAG
CCGGACCTGG CGCGGCTCGA TCCCGGCGAT GCGCCCGGCG TGCGGTGGTT CGCCGCCGAG
CAGAGCAACA GCTCGCTCGT GATCGGCGAG GCGATCGTGC TGAAGATCGT GCGCAAGCTC
GCCGCGGGCG TCCACCCGGA AGCGGAGATG GGCCGGCACC TGCGGCGCAT CGGCTACCGG
AACGTCGCGC CGCTCGTCGG CGAGGTCGTG CGCATCGGCG CGGACGGCGC GCCGCACACC
GTCGCGATCC TGCAGGAATA CGTCGACAAT CAGGGCGACG CGTGGACGCG CTCGTGCGAT
TTTCTCAAGC GCGCGATCGA GGAGCTCGCG CTGCCCGCGG CAAACGACGC CGACGCGGCG
GCGCCGAGCG CCGAGCCGGA GATGATCGAC GGCTACGCGA CGTTCGCGGG CATCGTCGGC
AAGCGGCTCG GCGAACTGCA CGCGGCGCTC GCGCAGCCGA GCGACGATGC GGCGTTCGCG
CCGCAGCGCG TGTCGCCCGC GCGCGTCGAC GGCTGGATCG GCGATGCGCT GGGCTGGTTC
GAACGCGCGG TGGGCCTGCT GGCCGAACGG CTCGACACGC TCGAAGGCCA AACGCGCGCG
GCCGCCGAGC TGCTCGTCGC GCAGCGCGCC GCGGTGGCCG AGGCGCTGCG TGCGCTGGTG
CCGCGCGAGC TCGACGGATG CTGCATCCGG ATTCACGGCG ACTTCCACCT GGGCCAGGTG
CTCGACGTGC AAGGCGACGC GCTTCTCATC GATTTCGAGG GCGAACCCGC GCGCCCGCTC
GGGCAGCGCC GCGCGCAATC GCATCCGTTA CGCGACGTGG CCGGGTTCCT GCGGTCGCTG
TCGTACGCGA GCGCGGCCGC GCAGTTCACG ATCGAAAAGG CGCCGCAGCA GGCCGCCGAG
CGCAAGCGCG CGCTGTTCGA GCGCTTCGGG CAGGCGGCCG CGGACCGCTT CGTCGCGCAA
TACCGCGAAG CGCTGTCGGC CGCGTCGCGC GAATTCGTCG AGCCGCGCTA TGCGGACCGG
CTGCTCGCGC TGTTCCTCAT CGAGAAGGCG TCCTACGAGC TGTGCTACGA AGCCGCGAAC
CGGCCGGACT GGCTGAGCGT GCCCGCAAAC GGCCTCGCCG CGCTCGTCGC GCGCCTGATC
GGCGGCGCCG CGCCGCAGCA GGAGGACGCG CGATGA
 
Protein sequence
MKPADTLEDV GRAQFVAHMA RRPRRARRRE APGVADTPLW YKDAIIYQLH VKSFFDANND 
GVGDFAGLIA KLDYIAELGV DTVWLLPFYP SPRRDDGYDI ADYRGVHPDY GTLADVRRFI
REAHARGLRV ITELVINHTS DQHPWFQRAR RAKRGSVHRD YYVWSDTDLK YAGTRIIFLD
TETSNWTHDP VAGQYYWHRF YSHQPDLNFD NPAVVREVLQ IMRFWLDLGI DGLRLDAVPY
LVEREGTSNE NLPQTHAILK LIRATIDAEY PNRMLLAEAN QWPEDVQEYF GDEDECHMAF
HFPLMPRIYM SIASEDRFPI IDIMRQTPAL APSNQWAVFL RNHDELTLEM VTDSERDLLW
QAYASERRAR LNLGIRRRLA PLMERDRRRI ELINSILLSM PGTPVIYYGD ELGMGDNLHL
GDRDGVRTPM QWSSDRNGGF SRADPELLVL PPVMGTLYGY DAINVEAQTR DPHSLLNWTR
RILSTRRATR VFGRGAIRFL RPGNRKILAY LRELDGETPV LCVANLSRAS QAVELDLSEF
AGCVPTEMTS DSPFPPIGQL PYLLTFPPYG FLWFALSEHG REPAWHQQYA EPLPEFLTLV
MRRGETQPGA ALLDTLAQDA LPSWLARRRW FASKERRVDS ARFDALTPIP GEPFQYAEAR
VAVDGREERY VVPLASAWGS ETPQPLFAQL ALARVRRGHT VGLLTDAFAL PSFAHGVLRQ
LRAGAAVPVA GGGRLEFRPE PDLARLDPGD APGVRWFAAE QSNSSLVIGE AIVLKIVRKL
AAGVHPEAEM GRHLRRIGYR NVAPLVGEVV RIGADGAPHT VAILQEYVDN QGDAWTRSCD
FLKRAIEELA LPAANDADAA APSAEPEMID GYATFAGIVG KRLGELHAAL AQPSDDAAFA
PQRVSPARVD GWIGDALGWF ERAVGLLAER LDTLEGQTRA AAELLVAQRA AVAEALRALV
PRELDGCCIR IHGDFHLGQV LDVQGDALLI DFEGEPARPL GQRRAQSHPL RDVAGFLRSL
SYASAAAQFT IEKAPQQAAE RKRALFERFG QAAADRFVAQ YREALSAASR EFVEPRYADR
LLALFLIEKA SYELCYEAAN RPDWLSVPAN GLAALVARLI GGAAPQQEDA R