Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2247 |
Symbol | |
ID | 4906422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 2230201 |
End bp | 2232396 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640145352 |
Product | putative beta-D-glucosidase |
Protein accession | YP_001076280 |
Protein GI | 126457301 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGCAA AACGCCTTTC CATCGCCGTT CTTTCCGCCA CGCTGTGCGC GCTCGCGCAT GCCGCCGGCA ACGACGCGCC GTCGCCGGAC ATCGCGTCCC GCGACGCTTA CGCGCTTCGC CGCGCGCACG CGCTGGTTCG CCAGATGACG CTCGACGAAA AGCTTCAACT GATTCATTCG AAGTACCCAA TGAGCGACGT GCCGGGCGGC GGCGCGGGCT TCATCCAGGG TATCGCGCGG CTTGGCATTC CCGATCTGAA CATGGTGGAT TCGGCGACGG GCTCGGGCAG CACGTCGCAG CCGAGCACGA CGTTTCCCGC GACGATCGGG CTCGCGGCGA GCTGGGACAA GCGCCTTTCG TACGCATTCG GCGCGGTGAT CGCCGACCAG TTGCGCGCGC AAGGATTCGC GATGAGCCTG GGCGGAGGCA CCAACCTCGC GCGCGAGCCG CGCGGCGGAC GCCTGTTCGA GTATCTCGGC GAAGATCCCG TCCTCGCCGG CGAAATGCTC GCGGCGCGCA CGCGCGGCAC GCAGGACCGC AAGGTGATCG CGACGATCAA GCACTACGTC GGCAACGAAC AGGAAACGAA CCGGATGGGC GGCGACGACC AGATCGACGA GCGCACATTG CGCGAGCTCT ATCTGCTGCC GTTCGAAATC GCGATGAAGG CCGCGCGCCC CGGCAATGTG ATGTGCAGCT ACAACCGCCT TAACGGCGAC TATGCATGCG AGAACGCACA CGTGCTCACC GACGTGCTCA AGAACGAATG GCATTTCCAG GGGCAGGTGC AGTCCGACTG GGGCGCCGCG CATAGCACCG CGAAGGCGAT CAACGCGGGG CTCGACGAAG AGGAAGACGT CGGGCCGACC GTGTTCCTCA CGCCCGCGCT CGTCAAGCAG GCGCTCGCGA CTCGCGAGAT CGCGCCGGCG CGCCTCGACG ACATGGTCCG GCGCAAGCTC TACGCGATGA TCCGCACGGG CGTGATGGAC GATCCGCCGC GCGGCGGCGG CACGATCGAT TTCGCCGCGG CCAATCGATT TGTTCAATAT GCGGCGGAAC AGTCGATCGT GCTCCTCAAG AATCAGGACC GCCAACTTCC GCTCGATGCC GCGAGCCTGA AGCGGATCGC CGTGATCGGC GGCCATGCGG ACGCGGCCGT ACTCGCGGGA GGCGGATCGG GCAATACGCG GCATCCCGTC ACCGGCGCGT TTCCCGGATG CGGCGGCCTC ACGTTCCCGA CCACGACGGG CTGCAACTGG TGGCCGAATC CGTGGCTGAA GCTCGACGTG CCGATCGTCC AGGCGATCCG CGACCTCGCG CCGGGAGCAA CGGTCGCTTT CGCCGGGAAC AGCGATCGGC AATCGCCGTT CGCCGCGTAC ACACCGCAGC AAATCGATGC GGCCGCCGAT CTCGCGCGAC GCTCGGACGT GGCGATCGTC TTCGTCACGC AGGCCGCCGG CGAGGACTTC GGCGAACTGC GCAGCCTCGC GCTCGCGAAC CCGACGAATC AGGACGCGCT CGTCCAGGCC GTCGCGCAAG CCAATCCGCG CGTGATCGTC GTCGTCGAGA GCGGCAACCC GGTGCTGATG CCGTGGCGCG ACCAGGTGCC CGCGATCGTC CAGGCATGGT TCCCCGGTGA AGGCGGCGGC AACGCGATCG CCAACGTGCT GTTCGGCAAG GTCAACCCGT CGGGCAAGCT GCCCGTCACG TTCCCCGCGC GCGACGAGGA CACGCCGACC TGGGGCGCGG ACGGCACGCT CGCGCCGAAC CCCGTCTACT CGGAGAAGCT GAAGATCGGC TATCGCTGGT ACGACGCGCA TCGCATCGCG CCGATGTTCC CGTTCGGACA CGGCCTGTCG TACACGCACT TCTCGTATTC CGGGCTCGAA GTCAAGCAGC GCCCGGACGC GGCGACGACG GTGTCGTTTG CGCTGACCAA CGATGGCCCG GTGGCCGGCG CCGAAGTGCC GCAGGTCTAT CTCGGCGATC TCGATGATCC GCAGGAACCG CCGAAGCGCC TCGTCGGATG GGACAAGGTG GGCCTGCGCG CGGGCGAAAC GCGGCGCGTG CGTATCGTGA TTCCCGCCGA GATGCGGCGC GTGTGGGATG CGAGCCGCAA CGGATGGGCG CTCGCGAAGG GCGGGCGCAT CTACGTGGGC GCGTCTTCGC GCGACATTCG GCTTCAGCAG CCGTGA
|
Protein sequence | MHAKRLSIAV LSATLCALAH AAGNDAPSPD IASRDAYALR RAHALVRQMT LDEKLQLIHS KYPMSDVPGG GAGFIQGIAR LGIPDLNMVD SATGSGSTSQ PSTTFPATIG LAASWDKRLS YAFGAVIADQ LRAQGFAMSL GGGTNLAREP RGGRLFEYLG EDPVLAGEML AARTRGTQDR KVIATIKHYV GNEQETNRMG GDDQIDERTL RELYLLPFEI AMKAARPGNV MCSYNRLNGD YACENAHVLT DVLKNEWHFQ GQVQSDWGAA HSTAKAINAG LDEEEDVGPT VFLTPALVKQ ALATREIAPA RLDDMVRRKL YAMIRTGVMD DPPRGGGTID FAAANRFVQY AAEQSIVLLK NQDRQLPLDA ASLKRIAVIG GHADAAVLAG GGSGNTRHPV TGAFPGCGGL TFPTTTGCNW WPNPWLKLDV PIVQAIRDLA PGATVAFAGN SDRQSPFAAY TPQQIDAAAD LARRSDVAIV FVTQAAGEDF GELRSLALAN PTNQDALVQA VAQANPRVIV VVESGNPVLM PWRDQVPAIV QAWFPGEGGG NAIANVLFGK VNPSGKLPVT FPARDEDTPT WGADGTLAPN PVYSEKLKIG YRWYDAHRIA PMFPFGHGLS YTHFSYSGLE VKQRPDAATT VSFALTNDGP VAGAEVPQVY LGDLDDPQEP PKRLVGWDKV GLRAGETRRV RIVIPAEMRR VWDASRNGWA LAKGGRIYVG ASSRDIRLQQ P
|
| |