Gene Information |
Locus tag | BURPS1710b_A0720 |
Symbol | |
ID | 3693374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007435 |
Strand | + |
Start bp | 958252 |
End bp | 961020 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637730974 |
Product | beta-glucosidase |
Protein accession | YP_335879 |
Protein GI | 76818251 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
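The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive, and that the last codon of the CDS is an untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 958252
end_bp = 961020

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2769 922
```

Under these assumptions the computed values match the reported Gene Length (2769 bp) and Protein Length (922 aa).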
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0469034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACGGC CGGCAAACGT ACGCTACCAA ACGGCGCGTG CTGCGCAACC GACGGCGACA AGCCGGATCT CATCGGCATC GGCATCGGCA TCGGCATCGG CATCCGCAGC GACTGCGACT GCGACTGCGA CTGCGACTGC GGCTGCGGCT GCGGCAATCC CACGATGCCG GCCGGCGTCC ATCTTGCCGC TCGATTCACG ATCGATTCGG TCGCCCAAAC CTGATCCGGC AACGGCCCGC CGGGCCCGAG CCCGCCCGGC CGGCGGCCGT TCCGTTGCGT ACGCACGGCC CAATCCGCGC CAGTCGAAAA CGTCATCGAT CGATCACCCC ATCGTTCAAA AGTTTACTTG TAGCCAGTCT ACTTATGCGC TATCACTGAA CCTGAAGTTT TGCTACATCC CCACGCAACC GCCGCTCACG CCGGATTCGT CACCGGCCGC GCGCGGCCGG TCCGTGACGC ATGACCGAGT GTCATCCGTC ACACGCTTGA CGGCGCCCGT TGCCGCGCGC TCGCGCGGCG TTTCGCGTCG CGCCGAACGA CAGAACCAAG CAAACATTCA CGGAGACAAA CGCATGCACG CAAAACGCCT TTCCATCGCC GTTCTTTCCG CCACGCTGTG CGCGCTCGCG CATGCCGCCG GCAACGACGC GCCGTCGCCG GACATCGCGT CCCGCGACGC TTACGCGCTT CGCCGCGCGC ACGCGCTGGT TCGCCAGATG ACGCTCGACG AAAAGCTTCA ACTGATTCAT TCGAAGTACC CAATGAGCGA CGTGCCGGGC GGCGGCGCGG GCTTCATCCA GGGTATCGCG CGGCTTGGCA TTCCCGATCT GAACATGGTG GATTCGGCGA CGGGCTCGGG CAGCACGTCT CAGCCGAGCA CGACGTTTCC CGCGACGATC GGGCTCGCGG CGAGCTGGGA CAAGCGCCTT TCGTACGCAT TCGGCGCGGT GATCGCCGAC CAGTTGCGCG CGCAAGGATT CGCGATGGGC CTGGGCGGAG GCACCAACCT CGCGCGCGAG CCGCGCGGCG GACGCCTGTT CGAGTATCTC GGCGAAGATC CCGTCCTCGC CGGCGAAATG CTCGCGGCGC GCACGCGCGG CACGCAGGAC CGCAAGGTGA TCGCGACGAT CAAGCACTAC GTCGGCAACG AACAGGAAAC GAACCGGATG GGCGGCGACG ACCAGATCGA CGAGCGCACA TTGCGCGAGC TCTATCTGCT GCCGTTCGAA ATCGCGATGA AGGCCGCGCG CCCCGGCAAT GTGATGTGCA GCTACAACCG CCTTAACGGC GACTATGCAT GCGAGAACGC ACACGTGCTC ACCGACGTGC TCAAGAACGA ATGGCATTTC CAGGGGCAGG TGCAGTCCGA CTGGGGCGCC GCGCATAGCA CCGCGAAGGC GATCAACGCG GGGCTCGACG AAGAGGAAGA CGTCGGGCCG ACCGTGTTCC TCACGCCCGC GCTCGTCAAG CAGGCGCTCG CGAATCGCGA GATCGCGCCG GCGCGCCTCG ACGACATGGT CCGGCGCAAG CTCTACGCGA TGATCCGCAC GGGCGTGATG GACGATCCGC CGCGCGGCGG CGGCACGATC GATTTCGCCG CGGCCAATCG ATTTGTTCAA TATGCGGCGG AACAGTCGAT CGTGCTCCTC AAGAATCAGG ACCGCCAACT TCCGCTCGAT GCCGCGGGCC TGAAGCGGAT CGCCGTGATC GGCGGCCATG CGGACGCGGC CGTACTCGCG GGAGGCGGAT CGGGCAATAC GCGGCATCCC GTCACCGGCG CGTTTCCCGG ATGCGGCGGC 
CTCACCTTCC CGACCACGAC GGGCTGCAAC TGGTGGCCGA ATCCGTGGCT GAAGCTCGAC GTGCCGATCG TCCAGGCGAT CCGCGACCTC GCGCCGGGAG CAACGGTCGC TTTCGCCGGG AACAGCGATC GGCAATCGCC GTTCGCCGCG TACACACCGC AGCAAATCGA TGCGGCCGCC GATCTCGCGC GACGCTCGGA CGTGGCGATC GTCTTCGTCA CGCAGGCCGC CGGCGAGGAC TTCGGCGAAC TGCGCAGCCT CGCGCTCGCG AACCCGACGA ATCAGGACGC GCTCGTCCAG GCCGTCGCGC AAGCCAATCC GCGCGTGATC GTCGTCGTCG AGAGCGGCAA CCCGGTGCTG ATGCCGTGGC GCGACCAAGT GCCCGCGATC GTCCAGGCAT GGTTCCCCGG TGAAGGCGGC GGCAACGCGA TCGCCAACGT GCTGTTCGGC AAGGTCAACC CGTCGGGCAA GCTGCCCGTC ACGTTCCCCG CGCGCGACGA GGACACGCCG ACCTGGGGCG CGGACGGCAC GCTCGCGCCG AACCCCGTCT ACTCGGAGAA GCTGAAGATC GGCTATCGCT GGTACGACGC GCATCGCATC GCGCCGATGT TCCCGTTCGG ACACGGCCTG TCGTACACGC ACTTCTCGTA TTCCGGGCTC GAAGTCAAGC AGCGCCCGGA CGCGGCGACG ACGGTGTCGT TTGCGCTGAC CAACGATGGC CCGGTGGCCG GCGCCGAAGT GCCGCAGGTC TATCTCGGCG ATCTCGATGA TCCGCAGGAA CCGCCGAAGC GCCTCGTCGG ATGGGACAAG GTGGGCCTGC GCGCGGGCGA AACGCGGCGC GTGCGCATCG TGATTCCCGC CGAGATGCGG CGCGTGTGGG ATGCGAGCCG CAACGGATGG GCGCTCGCGA AGGGCGGGCG CATCTACGTG GGCGCATCTT CGCGCGACAT TCGGCTTCAG CAGCCGTGA
|
Protein sequence | MRRPANVRYQ TARAAQPTAT SRISSASASA SASASAATAT ATATATAAAA AAIPRCRPAS ILPLDSRSIR SPKPDPATAR RARARPAGGR SVAYARPNPR QSKTSSIDHP IVQKFTCSQS TYALSLNLKF CYIPTQPPLT PDSSPAARGR SVTHDRVSSV TRLTAPVAAR SRGVSRRAER QNQANIHGDK RMHAKRLSIA VLSATLCALA HAAGNDAPSP DIASRDAYAL RRAHALVRQM TLDEKLQLIH SKYPMSDVPG GGAGFIQGIA RLGIPDLNMV DSATGSGSTS QPSTTFPATI GLAASWDKRL SYAFGAVIAD QLRAQGFAMG LGGGTNLARE PRGGRLFEYL GEDPVLAGEM LAARTRGTQD RKVIATIKHY VGNEQETNRM GGDDQIDERT LRELYLLPFE IAMKAARPGN VMCSYNRLNG DYACENAHVL TDVLKNEWHF QGQVQSDWGA AHSTAKAINA GLDEEEDVGP TVFLTPALVK QALANREIAP ARLDDMVRRK LYAMIRTGVM DDPPRGGGTI DFAAANRFVQ YAAEQSIVLL KNQDRQLPLD AAGLKRIAVI GGHADAAVLA GGGSGNTRHP VTGAFPGCGG LTFPTTTGCN WWPNPWLKLD VPIVQAIRDL APGATVAFAG NSDRQSPFAA YTPQQIDAAA DLARRSDVAI VFVTQAAGED FGELRSLALA NPTNQDALVQ AVAQANPRVI VVVESGNPVL MPWRDQVPAI VQAWFPGEGG GNAIANVLFG KVNPSGKLPV TFPARDEDTP TWGADGTLAP NPVYSEKLKI GYRWYDAHRI APMFPFGHGL SYTHFSYSGL EVKQRPDAAT TVSFALTNDG PVAGAEVPQV YLGDLDDPQE PPKRLVGWDK VGLRAGETRR VRIVIPAEMR RVWDASRNGW ALAKGGRIYV GASSRDIRLQ QP
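The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline. The names `translate`, `CODON_TABLE`, `BASES`, and `AMINO_ACIDS` are hypothetical. The sketch covers only the amino-acid assignments, which table 11 shares with the standard genetic code, and it does not model alternative start codons (this gene starts with ATG, so that does not matter here).

```python
# Compact codon table in NCBI ordering (bases T, C, A, G at each position).
# Assumption: only the amino-acid assignments of table 11 are modeled.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    # Remove the whitespace that separates the 10-base groups in the record.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGCGACGGC CGGCAAACGT ACGCTACCAA"))  # MRRPANVRYQ
print(translate("CAGCCGTGA"))                         # QP
```

The two outputs match the first ten residues and the last two residues of the protein sequence in this record. The final TGA codon is a stop and produces no residue.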