Gene BURPS1710b_A0720 details


Gene Information

Locus tag: BURPS1710b_A0720
Symbol:
ID: 3693374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 958252
End bp: 961020
Gene Length: 2769 bp
Protein Length: 922 aa
Translation table: 11
GC content: 67%
IMG OID: 637730974
Product: beta-glucosidase
Protein accession: YP_335879
Protein GI: 76818251
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
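
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; neither assumption is stated on the page. The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 958252
end_bp = 961020
gene_length_bp = 2769
protein_length_aa = 922


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length):
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(gene_length_bp)

# Both comparisons hold for the values in this record.
print(computed_length == gene_length_bp)      # True
print(computed_protein == protein_length_aa)  # True
```

Under these assumptions the reported gene length (2769 bp) matches the coordinate span, and 2769 / 3 = 923 codons leaves 922 residues once the stop codon is excluded.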


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0469034
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGACGGC CGGCAAACGT ACGCTACCAA ACGGCGCGTG CTGCGCAACC GACGGCGACA 
AGCCGGATCT CATCGGCATC GGCATCGGCA TCGGCATCGG CATCCGCAGC GACTGCGACT
GCGACTGCGA CTGCGACTGC GGCTGCGGCT GCGGCAATCC CACGATGCCG GCCGGCGTCC
ATCTTGCCGC TCGATTCACG ATCGATTCGG TCGCCCAAAC CTGATCCGGC AACGGCCCGC
CGGGCCCGAG CCCGCCCGGC CGGCGGCCGT TCCGTTGCGT ACGCACGGCC CAATCCGCGC
CAGTCGAAAA CGTCATCGAT CGATCACCCC ATCGTTCAAA AGTTTACTTG TAGCCAGTCT
ACTTATGCGC TATCACTGAA CCTGAAGTTT TGCTACATCC CCACGCAACC GCCGCTCACG
CCGGATTCGT CACCGGCCGC GCGCGGCCGG TCCGTGACGC ATGACCGAGT GTCATCCGTC
ACACGCTTGA CGGCGCCCGT TGCCGCGCGC TCGCGCGGCG TTTCGCGTCG CGCCGAACGA
CAGAACCAAG CAAACATTCA CGGAGACAAA CGCATGCACG CAAAACGCCT TTCCATCGCC
GTTCTTTCCG CCACGCTGTG CGCGCTCGCG CATGCCGCCG GCAACGACGC GCCGTCGCCG
GACATCGCGT CCCGCGACGC TTACGCGCTT CGCCGCGCGC ACGCGCTGGT TCGCCAGATG
ACGCTCGACG AAAAGCTTCA ACTGATTCAT TCGAAGTACC CAATGAGCGA CGTGCCGGGC
GGCGGCGCGG GCTTCATCCA GGGTATCGCG CGGCTTGGCA TTCCCGATCT GAACATGGTG
GATTCGGCGA CGGGCTCGGG CAGCACGTCT CAGCCGAGCA CGACGTTTCC CGCGACGATC
GGGCTCGCGG CGAGCTGGGA CAAGCGCCTT TCGTACGCAT TCGGCGCGGT GATCGCCGAC
CAGTTGCGCG CGCAAGGATT CGCGATGGGC CTGGGCGGAG GCACCAACCT CGCGCGCGAG
CCGCGCGGCG GACGCCTGTT CGAGTATCTC GGCGAAGATC CCGTCCTCGC CGGCGAAATG
CTCGCGGCGC GCACGCGCGG CACGCAGGAC CGCAAGGTGA TCGCGACGAT CAAGCACTAC
GTCGGCAACG AACAGGAAAC GAACCGGATG GGCGGCGACG ACCAGATCGA CGAGCGCACA
TTGCGCGAGC TCTATCTGCT GCCGTTCGAA ATCGCGATGA AGGCCGCGCG CCCCGGCAAT
GTGATGTGCA GCTACAACCG CCTTAACGGC GACTATGCAT GCGAGAACGC ACACGTGCTC
ACCGACGTGC TCAAGAACGA ATGGCATTTC CAGGGGCAGG TGCAGTCCGA CTGGGGCGCC
GCGCATAGCA CCGCGAAGGC GATCAACGCG GGGCTCGACG AAGAGGAAGA CGTCGGGCCG
ACCGTGTTCC TCACGCCCGC GCTCGTCAAG CAGGCGCTCG CGAATCGCGA GATCGCGCCG
GCGCGCCTCG ACGACATGGT CCGGCGCAAG CTCTACGCGA TGATCCGCAC GGGCGTGATG
GACGATCCGC CGCGCGGCGG CGGCACGATC GATTTCGCCG CGGCCAATCG ATTTGTTCAA
TATGCGGCGG AACAGTCGAT CGTGCTCCTC AAGAATCAGG ACCGCCAACT TCCGCTCGAT
GCCGCGGGCC TGAAGCGGAT CGCCGTGATC GGCGGCCATG CGGACGCGGC CGTACTCGCG
GGAGGCGGAT CGGGCAATAC GCGGCATCCC GTCACCGGCG CGTTTCCCGG ATGCGGCGGC
CTCACCTTCC CGACCACGAC GGGCTGCAAC TGGTGGCCGA ATCCGTGGCT GAAGCTCGAC
GTGCCGATCG TCCAGGCGAT CCGCGACCTC GCGCCGGGAG CAACGGTCGC TTTCGCCGGG
AACAGCGATC GGCAATCGCC GTTCGCCGCG TACACACCGC AGCAAATCGA TGCGGCCGCC
GATCTCGCGC GACGCTCGGA CGTGGCGATC GTCTTCGTCA CGCAGGCCGC CGGCGAGGAC
TTCGGCGAAC TGCGCAGCCT CGCGCTCGCG AACCCGACGA ATCAGGACGC GCTCGTCCAG
GCCGTCGCGC AAGCCAATCC GCGCGTGATC GTCGTCGTCG AGAGCGGCAA CCCGGTGCTG
ATGCCGTGGC GCGACCAAGT GCCCGCGATC GTCCAGGCAT GGTTCCCCGG TGAAGGCGGC
GGCAACGCGA TCGCCAACGT GCTGTTCGGC AAGGTCAACC CGTCGGGCAA GCTGCCCGTC
ACGTTCCCCG CGCGCGACGA GGACACGCCG ACCTGGGGCG CGGACGGCAC GCTCGCGCCG
AACCCCGTCT ACTCGGAGAA GCTGAAGATC GGCTATCGCT GGTACGACGC GCATCGCATC
GCGCCGATGT TCCCGTTCGG ACACGGCCTG TCGTACACGC ACTTCTCGTA TTCCGGGCTC
GAAGTCAAGC AGCGCCCGGA CGCGGCGACG ACGGTGTCGT TTGCGCTGAC CAACGATGGC
CCGGTGGCCG GCGCCGAAGT GCCGCAGGTC TATCTCGGCG ATCTCGATGA TCCGCAGGAA
CCGCCGAAGC GCCTCGTCGG ATGGGACAAG GTGGGCCTGC GCGCGGGCGA AACGCGGCGC
GTGCGCATCG TGATTCCCGC CGAGATGCGG CGCGTGTGGG ATGCGAGCCG CAACGGATGG
GCGCTCGCGA AGGGCGGGCG CATCTACGTG GGCGCATCTT CGCGCGACAT TCGGCTTCAG
CAGCCGTGA
 
Protein sequence
MRRPANVRYQ TARAAQPTAT SRISSASASA SASASAATAT ATATATAAAA AAIPRCRPAS 
ILPLDSRSIR SPKPDPATAR RARARPAGGR SVAYARPNPR QSKTSSIDHP IVQKFTCSQS
TYALSLNLKF CYIPTQPPLT PDSSPAARGR SVTHDRVSSV TRLTAPVAAR SRGVSRRAER
QNQANIHGDK RMHAKRLSIA VLSATLCALA HAAGNDAPSP DIASRDAYAL RRAHALVRQM
TLDEKLQLIH SKYPMSDVPG GGAGFIQGIA RLGIPDLNMV DSATGSGSTS QPSTTFPATI
GLAASWDKRL SYAFGAVIAD QLRAQGFAMG LGGGTNLARE PRGGRLFEYL GEDPVLAGEM
LAARTRGTQD RKVIATIKHY VGNEQETNRM GGDDQIDERT LRELYLLPFE IAMKAARPGN
VMCSYNRLNG DYACENAHVL TDVLKNEWHF QGQVQSDWGA AHSTAKAINA GLDEEEDVGP
TVFLTPALVK QALANREIAP ARLDDMVRRK LYAMIRTGVM DDPPRGGGTI DFAAANRFVQ
YAAEQSIVLL KNQDRQLPLD AAGLKRIAVI GGHADAAVLA GGGSGNTRHP VTGAFPGCGG
LTFPTTTGCN WWPNPWLKLD VPIVQAIRDL APGATVAFAG NSDRQSPFAA YTPQQIDAAA
DLARRSDVAI VFVTQAAGED FGELRSLALA NPTNQDALVQ AVAQANPRVI VVVESGNPVL
MPWRDQVPAI VQAWFPGEGG GNAIANVLFG KVNPSGKLPV TFPARDEDTP TWGADGTLAP
NPVYSEKLKI GYRWYDAHRI APMFPFGHGL SYTHFSYSGL EVKQRPDAAT TVSFALTNDG
PVAGAEVPQV YLGDLDDPQE PPKRLVGWDK VGLRAGETRR VRIVIPAEMR RVWDASRNGW
ALAKGGRIYV GASSRDIRLQ QP
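
The sequences in this record are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the demonstration uses only the first line of the gene sequence, so it is not expected to reproduce the 67% GC content reported for the full gene.

```python
def clean_sequence(text):
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from this record.
first_line = (
    "ATGCGACGGC CGGCAAACGT ACGCTACCAA ACGGCGCGTG CTGCGCAACC GACGGCGACA"
)

seq = clean_sequence(first_line)
print(len(seq))                    # 60 bases in six blocks of ten
print(round(gc_fraction(seq), 2))  # GC fraction of this first line only
```

Passing the full gene sequence block to `clean_sequence` in the same way would give a string whose length can be compared against the Gene Length field.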