Gene BURPS1106A_A2247 details


Gene Information

Locus tag: BURPS1106A_A2247
Symbol:
ID: 4906422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2230201
End bp: 2232396
Gene Length: 2196 bp
Protein Length: 731 aa
Translation table: 11
GC content: 68%
IMG OID: 640145352
Product: putative beta-D-glucosidase
Protein accession: YP_001076280
Protein GI: 126457301
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
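
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the listed gene length); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2230201
end_bp = 2232396

# Assumption: coordinates are inclusive, so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS of this length holds gene_length // 3 codons; the final codon is the stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 2196 731
```

Both results agree with the Gene Length and Protein Length fields.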


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACGCAA AACGCCTTTC CATCGCCGTT CTTTCCGCCA CGCTGTGCGC GCTCGCGCAT 
GCCGCCGGCA ACGACGCGCC GTCGCCGGAC ATCGCGTCCC GCGACGCTTA CGCGCTTCGC
CGCGCGCACG CGCTGGTTCG CCAGATGACG CTCGACGAAA AGCTTCAACT GATTCATTCG
AAGTACCCAA TGAGCGACGT GCCGGGCGGC GGCGCGGGCT TCATCCAGGG TATCGCGCGG
CTTGGCATTC CCGATCTGAA CATGGTGGAT TCGGCGACGG GCTCGGGCAG CACGTCGCAG
CCGAGCACGA CGTTTCCCGC GACGATCGGG CTCGCGGCGA GCTGGGACAA GCGCCTTTCG
TACGCATTCG GCGCGGTGAT CGCCGACCAG TTGCGCGCGC AAGGATTCGC GATGAGCCTG
GGCGGAGGCA CCAACCTCGC GCGCGAGCCG CGCGGCGGAC GCCTGTTCGA GTATCTCGGC
GAAGATCCCG TCCTCGCCGG CGAAATGCTC GCGGCGCGCA CGCGCGGCAC GCAGGACCGC
AAGGTGATCG CGACGATCAA GCACTACGTC GGCAACGAAC AGGAAACGAA CCGGATGGGC
GGCGACGACC AGATCGACGA GCGCACATTG CGCGAGCTCT ATCTGCTGCC GTTCGAAATC
GCGATGAAGG CCGCGCGCCC CGGCAATGTG ATGTGCAGCT ACAACCGCCT TAACGGCGAC
TATGCATGCG AGAACGCACA CGTGCTCACC GACGTGCTCA AGAACGAATG GCATTTCCAG
GGGCAGGTGC AGTCCGACTG GGGCGCCGCG CATAGCACCG CGAAGGCGAT CAACGCGGGG
CTCGACGAAG AGGAAGACGT CGGGCCGACC GTGTTCCTCA CGCCCGCGCT CGTCAAGCAG
GCGCTCGCGA CTCGCGAGAT CGCGCCGGCG CGCCTCGACG ACATGGTCCG GCGCAAGCTC
TACGCGATGA TCCGCACGGG CGTGATGGAC GATCCGCCGC GCGGCGGCGG CACGATCGAT
TTCGCCGCGG CCAATCGATT TGTTCAATAT GCGGCGGAAC AGTCGATCGT GCTCCTCAAG
AATCAGGACC GCCAACTTCC GCTCGATGCC GCGAGCCTGA AGCGGATCGC CGTGATCGGC
GGCCATGCGG ACGCGGCCGT ACTCGCGGGA GGCGGATCGG GCAATACGCG GCATCCCGTC
ACCGGCGCGT TTCCCGGATG CGGCGGCCTC ACGTTCCCGA CCACGACGGG CTGCAACTGG
TGGCCGAATC CGTGGCTGAA GCTCGACGTG CCGATCGTCC AGGCGATCCG CGACCTCGCG
CCGGGAGCAA CGGTCGCTTT CGCCGGGAAC AGCGATCGGC AATCGCCGTT CGCCGCGTAC
ACACCGCAGC AAATCGATGC GGCCGCCGAT CTCGCGCGAC GCTCGGACGT GGCGATCGTC
TTCGTCACGC AGGCCGCCGG CGAGGACTTC GGCGAACTGC GCAGCCTCGC GCTCGCGAAC
CCGACGAATC AGGACGCGCT CGTCCAGGCC GTCGCGCAAG CCAATCCGCG CGTGATCGTC
GTCGTCGAGA GCGGCAACCC GGTGCTGATG CCGTGGCGCG ACCAGGTGCC CGCGATCGTC
CAGGCATGGT TCCCCGGTGA AGGCGGCGGC AACGCGATCG CCAACGTGCT GTTCGGCAAG
GTCAACCCGT CGGGCAAGCT GCCCGTCACG TTCCCCGCGC GCGACGAGGA CACGCCGACC
TGGGGCGCGG ACGGCACGCT CGCGCCGAAC CCCGTCTACT CGGAGAAGCT GAAGATCGGC
TATCGCTGGT ACGACGCGCA TCGCATCGCG CCGATGTTCC CGTTCGGACA CGGCCTGTCG
TACACGCACT TCTCGTATTC CGGGCTCGAA GTCAAGCAGC GCCCGGACGC GGCGACGACG
GTGTCGTTTG CGCTGACCAA CGATGGCCCG GTGGCCGGCG CCGAAGTGCC GCAGGTCTAT
CTCGGCGATC TCGATGATCC GCAGGAACCG CCGAAGCGCC TCGTCGGATG GGACAAGGTG
GGCCTGCGCG CGGGCGAAAC GCGGCGCGTG CGTATCGTGA TTCCCGCCGA GATGCGGCGC
GTGTGGGATG CGAGCCGCAA CGGATGGGCG CTCGCGAAGG GCGGGCGCAT CTACGTGGGC
GCGTCTTCGC GCGACATTCG GCTTCAGCAG CCGTGA
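
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction, using only the first line of the sequence as sample input; `clean` and `gc_fraction` are hypothetical helper names, not part of any database tool.

```python
def clean(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGCACGCAA AACGCCTTTC CATCGCCGTT CTTTCCGCCA CGCTGTGCGC GCTCGCGCAT"
seq = clean(first_line)

print(len(seq), round(gc_fraction(seq), 3))  # 60 0.617
```

Applied to the full 2196 bp sequence rather than this 60-base sample, the same function gives a value that can be compared with the GC content field.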
 
Protein sequence
MHAKRLSIAV LSATLCALAH AAGNDAPSPD IASRDAYALR RAHALVRQMT LDEKLQLIHS 
KYPMSDVPGG GAGFIQGIAR LGIPDLNMVD SATGSGSTSQ PSTTFPATIG LAASWDKRLS
YAFGAVIADQ LRAQGFAMSL GGGTNLAREP RGGRLFEYLG EDPVLAGEML AARTRGTQDR
KVIATIKHYV GNEQETNRMG GDDQIDERTL RELYLLPFEI AMKAARPGNV MCSYNRLNGD
YACENAHVLT DVLKNEWHFQ GQVQSDWGAA HSTAKAINAG LDEEEDVGPT VFLTPALVKQ
ALATREIAPA RLDDMVRRKL YAMIRTGVMD DPPRGGGTID FAAANRFVQY AAEQSIVLLK
NQDRQLPLDA ASLKRIAVIG GHADAAVLAG GGSGNTRHPV TGAFPGCGGL TFPTTTGCNW
WPNPWLKLDV PIVQAIRDLA PGATVAFAGN SDRQSPFAAY TPQQIDAAAD LARRSDVAIV
FVTQAAGEDF GELRSLALAN PTNQDALVQA VAQANPRVIV VVESGNPVLM PWRDQVPAIV
QAWFPGEGGG NAIANVLFGK VNPSGKLPVT FPARDEDTPT WGADGTLAPN PVYSEKLKIG
YRWYDAHRIA PMFPFGHGLS YTHFSYSGLE VKQRPDAATT VSFALTNDGP VAGAEVPQVY
LGDLDDPQEP PKRLVGWDKV GLRAGETRRV RIVIPAEMRR VWDASRNGWA LAKGGRIYVG
ASSRDIRLQQ P
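
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the database's own method: it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it translates only the first 30 bases and the last line of the gene sequence. `translate` is a hypothetical helper name.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... with amino acids in NCBI's layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Start and end of the gene sequence above (the last line begins in frame).
gene_start = "ATGCACGCAA AACGCCTTTC CATCGCCGTT"
gene_end = "GCGTCTTCGC GCGACATTCG GCTTCAGCAG CCGTGA"

print(translate(gene_start))  # MHAKRLSIAV
print(translate(gene_end))    # ASSRDIRLQQP*
```

The two outputs match the first ten and last eleven residues of the protein sequence, with the trailing `*` corresponding to the TGA stop codon.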