Gene Bcep18194_C7540 details


Gene Information

Locus tag: Bcep18194_C7540
Symbol:
ID: 3734992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007509
Strand:
Start bp: 1160894
End bp: 1163782
Gene Length: 2889 bp
Protein Length: 962 aa
Translation table: 11
GC content: 68%
IMG OID: 637761241
Product: Beta-galactosidase/beta-glucuronidase family protein
Protein accession: YP_367228
Protein GI: 78060653
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
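
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1160894
end_bp = 1163782
gene_length_bp = 2889
protein_length_aa = 962

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS, one codon per residue plus one terminal stop codon.
expected_protein_length = span // 3 - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 2889 bp is 963 codons, that is 962 residues plus a stop codon.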


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.263418
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.712319
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATCGC AATCGCGCCG GAAATTCCTG TCTTACAGCG CAGCACTCGC CGGCTCGGGC 
TGGCTCGCGG GCTGCAACGG CGATATCGAT TCGACGTCGG GCGCCGTCGG CACGCCAGGC
GCCGATGGCC CGGCGACGGT CGCGCCCACC GTGCCGGGCG TGCCGAGCGA CCCGGAACAC
AGCCTGCTGG CCGCGCAGGA GCTCGCGACG AACTGGACGT TCGCCTCGGC GAGCAGCCTG
CCGGGCGCCG GCGGTGCGCA ACTGTCGGGC GCGGCCGGCG TGACGGGCAT GATGCCCGCG
ACGGTGCCCG GCACCGTGCT GAACAGCATG ATCGTCAACG GCAAGTATCC CGATCCGCTC
TATGGCCGCA TCGTCACGGA CACCATCCCC GATACGCTGA AGGACACCGA CTACTGGTAC
CGGACGACCT TCGCGGCGCC GGCACGGCAG CCGGGGCAGC GGTTGTGGCT GCGTTTCGGC
GGCGTCAATT ACTGCGCCGA GGTCTGGCTG AACGGAGCGC TCGTCGGCCG GCTGGAAGGC
GCGTTCAAGC AGGGCGCGTT CGATATCTCG CGGCTCGTGC CGCAGGCGGG CGGCGCGGCC
AACCTCGCGG TACGCGTGGT CAAGCTGGAT TTCTCCGAAG GGCCGCTGCT GCCGAGCTAT
AAAAGCGGCG TCACGCGCGG CGGTCGCAAC GGCGGCCCGA CCGGCGTCAC GCTGAAGAAC
GGGCCGACGT TCTTCTGCTC CGCGGGCTGG GACTGGTTGC CGACGATCCC CGATCGCGAG
CTCGGCATCT GGCAGCCGGT GACCTGGTTC ACGACCGGCG CGGTGCGCAT CGCGGCGATC
AATGTCGCGC ACACGCTGTC CACCGACCTG TCGCGTGCCG AGCTGCGACT CGATCTCGAA
CTGGACAACG GCTCGGGTGC GGATCTCGTC GCCACTGTCG TCGGCACCAT CGGCAACGGT
GTGCCGTTCC GCCACGACAT CGCGATTCCC GCATCGAGTA CGACCACGAA GGTGTCGCTC
ACGTCGTCGG ATATCGCGGC GCTGTCGATC AGGCAACCGC GCCTGTGGTG GCCGAACGGG
TATGGCGAGC CGAATCTCTA CGCGGTGAAG GTCGGTGTCG ACGTCGCGCA CCGGCGCTCG
GACGAGCGCA CGCTGAATAT CGGCCTGCGC CGCATCGAAT ACGCGCGCGA CATCGGCATG
GGCCAGCAGT TGAGCATTAC GGTCAACGGC CTGCCGATCC TCGTGATGGG CGGCAACTGG
GGGCTCGACG AAGCGCTGAA GCGCATTCCG CGCACCCGGC TGTTCAACCA GGTGCGGCTC
CATCGCGACG CGAACCTCAA CCTGATCCGC AACTGGAACG GGCAGAGCAC GAGCGACGAT
TTCTTCGACG CATGCGATCG CTACGGGATC CTCGTCTGGC AGGATTTCTT CTTCTCGACC
GAAGGAGACG GATCGGGCCC GGCCAATGTG CCGCGCGATC TCGACAACAT CCGCGACGTG
ATCGCACGCA ACCGTCATCG TCCGTCGATC CTGCTCTGGT GCGGCGGCAA CGAAGGGTCG
CCGCCGCCGG CACTCGTCAA GGGGCTCGAC GCGCTCGTCG CCGAGCTGGA CCCGCAACGC
CTGTGCCTCA CGAGTTCGGC CGGCGATACG GGCGCCGGCG CGGTGAACGG GTATTCGTCC
GGCGGTCCGT ACAACTGGGC CTCGCCGCAG GCCGCGTTCA GCCGCGGCTA CGGCACGACG
TCGGTCGCGT TTCACAACGA AGTCGGCTCG CATTCGATCC CGACGCTCGA ATTCGTCGAA
TCGATGCTGC CGCCCGGCTC GTACGAATGC CCCGACGATT TCTGGGCCGA TCGCGACATG
AACGGCAACG GTGCGTACTA CCCGGCGGTC GGCAAGCAGG GCGGTGCCGG GTACATCGCG
ATGACGGCGC TGCGCTACGG CGCGATCCGG AATCTCGCGG ATTTCGTGCG CAAGGCGCAG
ATGATGAACT ACGAATGCAT CCGCGCGATC TACGAGGCGA ACGCGGCCGT GATGATCGGC
CCGGTGGCCG GGAGGATCAC GTCGCCGGCC ACCGGCGTGA TCATGTGGAT GACGAACCCC
GCACAGCCGA GCTTCGTGTG GCAGATGTAC AGCCACGATC TCGAGGAACA TGCGTCGTTC
TTCGCGGTGC AGCACGGGTG CCGTCGCGTC AATGCGATTC TCGATGCCGG CACGGCCGAC
GTGACGATCG CGAATCACAC GGCGGCAGCC GTCACCGGCC GCGTCGAGAT GCGCGTGTAC
AACCTCGACG GCACGCTGAG CAGCCGGACG ACCGCGGATG TCGGTGGCGT CGCGAAGGCG
TCGTATCGTG TGGTGGCGAA TCTTGCGTCG GCGCTGGCTG CCGCGAAGTC CGACGTGTGC
ATCGTCGCGC TTGCGCTGAC CGATTCGGGC GGCACGACGC TGGCCGAGAA CGTCTACTGG
CGGCAGCGCG ACGGGGGCGA CAACGCATAC ACGTCGCTCG ACACGATGCC CGGTGCGGCG
GTTTCCGTCA GTGCGACATC GACCGAGACC GACGCGACGA CGACGCGCAT CACGGTCGAC
GTCGCGAACA TCGGCACCGC CGTCGCGCTG ATGACGCACC TGCAGGTGTT CGACCCGTCA
ACCGGCGTGC GTGTGCTGCC TGCGTTCTAC AGCGACAACT ACCTGAACCT GGTACCGGGC
GCGAAGCGGC AGGTTACGAT CGACCTGCCG CATGCGGGCG GCGCGCCGGT GCCGCGCGTC
GCGCTGCGCG TCGACGGGTG GCGGCTCGAT CGCCCGAACT GCCGGCTGGG GCTAGGCGGC
GTGCCGGTCG TGTTCAACGA GCGCGCGCTG GCGGTCGCGC CGGCGGTGCC GACGTTCGCG
GCGTGCTGA
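
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the reported figure is the fraction of G and C bases over the whole gene; the function name `gc_fraction` is hypothetical, and the example string is only the first row of the sequence, so its value will not necessarily equal the whole-gene figure.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First row of the gene sequence as printed above (10-base blocks separated by spaces).
first_row = "ATGAAATCGC AATCGCGCCG GAAATTCCTG TCTTACAGCG CAGCACTCGC CGGCTCGGGC"
print(f"GC fraction of the first row: {gc_fraction(first_row):.2f}")
```

Passing the full gene sequence to the same function would give the value to compare against the GC content field.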
 
Protein sequence
MKSQSRRKFL SYSAALAGSG WLAGCNGDID STSGAVGTPG ADGPATVAPT VPGVPSDPEH 
SLLAAQELAT NWTFASASSL PGAGGAQLSG AAGVTGMMPA TVPGTVLNSM IVNGKYPDPL
YGRIVTDTIP DTLKDTDYWY RTTFAAPARQ PGQRLWLRFG GVNYCAEVWL NGALVGRLEG
AFKQGAFDIS RLVPQAGGAA NLAVRVVKLD FSEGPLLPSY KSGVTRGGRN GGPTGVTLKN
GPTFFCSAGW DWLPTIPDRE LGIWQPVTWF TTGAVRIAAI NVAHTLSTDL SRAELRLDLE
LDNGSGADLV ATVVGTIGNG VPFRHDIAIP ASSTTTKVSL TSSDIAALSI RQPRLWWPNG
YGEPNLYAVK VGVDVAHRRS DERTLNIGLR RIEYARDIGM GQQLSITVNG LPILVMGGNW
GLDEALKRIP RTRLFNQVRL HRDANLNLIR NWNGQSTSDD FFDACDRYGI LVWQDFFFST
EGDGSGPANV PRDLDNIRDV IARNRHRPSI LLWCGGNEGS PPPALVKGLD ALVAELDPQR
LCLTSSAGDT GAGAVNGYSS GGPYNWASPQ AAFSRGYGTT SVAFHNEVGS HSIPTLEFVE
SMLPPGSYEC PDDFWADRDM NGNGAYYPAV GKQGGAGYIA MTALRYGAIR NLADFVRKAQ
MMNYECIRAI YEANAAVMIG PVAGRITSPA TGVIMWMTNP AQPSFVWQMY SHDLEEHASF
FAVQHGCRRV NAILDAGTAD VTIANHTAAA VTGRVEMRVY NLDGTLSSRT TADVGGVAKA
SYRVVANLAS ALAAAKSDVC IVALALTDSG GTTLAENVYW RQRDGGDNAY TSLDTMPGAA
VSVSATSTET DATTTRITVD VANIGTAVAL MTHLQVFDPS TGVRVLPAFY SDNYLNLVPG
AKRQVTIDLP HAGGAPVPRV ALRVDGWRLD RPNCRLGLGG VPVVFNERAL AVAPAVPTFA
AC
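
The relationship between the two sequences can be sketched as a codon-by-codon translation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the set of permitted start codons; because this gene begins with ATG, the sketch below ignores alternative start codons, which is a simplification. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAAATCGC AATCGCGCCG GAAATTCCTG"))
print(translate("GCGTGCTGA"))
```

The two fragments translate to the opening residues (MKSQSRRKFL) and the closing residues (AC) of the protein sequence above, with TGA as the stop codon.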