Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcen_5207 |
Symbol | |
ID | 4095038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia AU 1054 |
Kingdom | Bacteria |
Replicon accession | NC_008061 |
Strand | + |
Start bp | 2498328 |
End bp | 2499743 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638018488 |
Product | glycoside hydrolase family protein |
Protein accession | YP_625054 |
Protein GI | 107027543 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR03356] beta-galactosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.226411 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCATCG ATACGTCCAT TCCGCGCTCG CCGGCGCGCC GCACCTTCCT GCGCGCCGGC GGCGCGGCCG CCGCCGGCCT GGCCTGCGCA CCCACGTTCG CCCGCGACGA CGGCATGCGT TTTGCCGACG ATTTCGTGTG GGGCGTCGCG GCGTCGGCTC CGCAAACGGA AAGCCGGGAA GGCCGCGGGC GGAGCAACTG GGACGTGTTC GCGGAGCGCG CCGGCACGAT CGCCGACGGC TCGACCAATG CGCGCTGCAC CGAATTCGAG CGGCGCTATC CGGCCGATAT CGCGTGGATG GCGGCCGCCG GGATTCGCGC CTTCCGCTTC TCGATCGCGT GGCCGCGCGT GCAGCCGCAA GGGCCCGGCG CGCCGAGCGA CGCCGGCCTC GCCACCTATG ACCGGATGAT CGACACGATG CTCGCGCGCG GCATCGAGCC GTTTCCGACG CTGTTTCACT GGGATACGCC GGTCTGGGCC GGCGATTTCC GGACACGCGA CATCGCCTAT CGGCTGGCCG ATTACGCCGA TCGGGTGACG CGCCGGCTCG GCGATCGCGT GAAGCACTGG ATCGTGCTGA ACGAGCCCAA TAGTCTCGCG CTGCGCGGCT ACGGGATGGG CGTGCATGCG CCGGGGCTGC GTTCGCCGGA AGGCGTGTTC GCGGCGATGC ATCACCAGAA CCTCGCACAG GGCCTCGCAT TCCAGGTGCT GCGGGCGAAC CTGCGCGACG CGCGGATCGG CACGACGATC AACCTTCAGC CGGTGCGCCC GGCGGCGGCG CGCGACGAAG ACCGCAAGGC GGCCGGCCTC GTCGACGTGC TGTGGAATCG TGCGTTCCTC GATCCGCTGT ACGGTCACGG GTATCCCGAG CCGCTCGCGC ATTCGCTCGC CGGCCTCGTG CGGCCCGGCG ACATGGCCAT CGTCGCCGCG AAGCCGGATT TCCTCGGGAT GAACTACTAC TCGCGCATCT ACGTTCGCGC CAATCCGTCC GCCCCGTTCG GCGTCGAGCA GGCCGAGCCG CCGGCCGATC TGCCGCGCAC TGCGTACTTC CAGGTCGAGC CGGACGGGAT GACCGAGATG CTGCTGCGCG TGCATCGCGA TTACGGCGCG CCGGACATCT ACATCACCGA AACCGGCTTC GCGCTCGACG ATCCGGCGCC GCACGACGGC GTCGTCGACG ACGGCCCGCG CGGCGATTAC CTGTCGTGCT ATCTGCGCGC GGCGCACGAC GCCTATCGGC AAGGTGTGCG ACTGAAAGGG CTGTTCTACT GGGCCGCCAC CGACAACTGG GAATGGGGGC AGGGGTTTTC GAAGCGATTC GGCCTCGTGC ACGTCGATCT CGACACGCAG GTTCGTACAC CCAAACGCAG CCTCGCGTAC TACTCGCGAT GCATCGCGCA GAACGCGGTG GCGTGA
|
Protein sequence | MPIDTSIPRS PARRTFLRAG GAAAAGLACA PTFARDDGMR FADDFVWGVA ASAPQTESRE GRGRSNWDVF AERAGTIADG STNARCTEFE RRYPADIAWM AAAGIRAFRF SIAWPRVQPQ GPGAPSDAGL ATYDRMIDTM LARGIEPFPT LFHWDTPVWA GDFRTRDIAY RLADYADRVT RRLGDRVKHW IVLNEPNSLA LRGYGMGVHA PGLRSPEGVF AAMHHQNLAQ GLAFQVLRAN LRDARIGTTI NLQPVRPAAA RDEDRKAAGL VDVLWNRAFL DPLYGHGYPE PLAHSLAGLV RPGDMAIVAA KPDFLGMNYY SRIYVRANPS APFGVEQAEP PADLPRTAYF QVEPDGMTEM LLRVHRDYGA PDIYITETGF ALDDPAPHDG VVDDGPRGDY LSCYLRAAHD AYRQGVRLKG LFYWAATDNW EWGQGFSKRF GLVHVDLDTQ VRTPKRSLAY YSRCIAQNAV A
|
| |