Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_C7540 |
Symbol | |
ID | 3734992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007509 |
Strand | - |
Start bp | 1160894 |
End bp | 1163782 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637761241 |
Product | Beta-galactosidase/beta- glucuronidase family protein |
Protein accession | YP_367228 |
Protein GI | 78060653 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.263418 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.712319 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCGC AATCGCGCCG GAAATTCCTG TCTTACAGCG CAGCACTCGC CGGCTCGGGC TGGCTCGCGG GCTGCAACGG CGATATCGAT TCGACGTCGG GCGCCGTCGG CACGCCAGGC GCCGATGGCC CGGCGACGGT CGCGCCCACC GTGCCGGGCG TGCCGAGCGA CCCGGAACAC AGCCTGCTGG CCGCGCAGGA GCTCGCGACG AACTGGACGT TCGCCTCGGC GAGCAGCCTG CCGGGCGCCG GCGGTGCGCA ACTGTCGGGC GCGGCCGGCG TGACGGGCAT GATGCCCGCG ACGGTGCCCG GCACCGTGCT GAACAGCATG ATCGTCAACG GCAAGTATCC CGATCCGCTC TATGGCCGCA TCGTCACGGA CACCATCCCC GATACGCTGA AGGACACCGA CTACTGGTAC CGGACGACCT TCGCGGCGCC GGCACGGCAG CCGGGGCAGC GGTTGTGGCT GCGTTTCGGC GGCGTCAATT ACTGCGCCGA GGTCTGGCTG AACGGAGCGC TCGTCGGCCG GCTGGAAGGC GCGTTCAAGC AGGGCGCGTT CGATATCTCG CGGCTCGTGC CGCAGGCGGG CGGCGCGGCC AACCTCGCGG TACGCGTGGT CAAGCTGGAT TTCTCCGAAG GGCCGCTGCT GCCGAGCTAT AAAAGCGGCG TCACGCGCGG CGGTCGCAAC GGCGGCCCGA CCGGCGTCAC GCTGAAGAAC GGGCCGACGT TCTTCTGCTC CGCGGGCTGG GACTGGTTGC CGACGATCCC CGATCGCGAG CTCGGCATCT GGCAGCCGGT GACCTGGTTC ACGACCGGCG CGGTGCGCAT CGCGGCGATC AATGTCGCGC ACACGCTGTC CACCGACCTG TCGCGTGCCG AGCTGCGACT CGATCTCGAA CTGGACAACG GCTCGGGTGC GGATCTCGTC GCCACTGTCG TCGGCACCAT CGGCAACGGT GTGCCGTTCC GCCACGACAT CGCGATTCCC GCATCGAGTA CGACCACGAA GGTGTCGCTC ACGTCGTCGG ATATCGCGGC GCTGTCGATC AGGCAACCGC GCCTGTGGTG GCCGAACGGG TATGGCGAGC CGAATCTCTA CGCGGTGAAG GTCGGTGTCG ACGTCGCGCA CCGGCGCTCG GACGAGCGCA CGCTGAATAT CGGCCTGCGC CGCATCGAAT ACGCGCGCGA CATCGGCATG GGCCAGCAGT TGAGCATTAC GGTCAACGGC CTGCCGATCC TCGTGATGGG CGGCAACTGG GGGCTCGACG AAGCGCTGAA GCGCATTCCG CGCACCCGGC TGTTCAACCA GGTGCGGCTC CATCGCGACG CGAACCTCAA CCTGATCCGC AACTGGAACG GGCAGAGCAC GAGCGACGAT TTCTTCGACG CATGCGATCG CTACGGGATC CTCGTCTGGC AGGATTTCTT CTTCTCGACC GAAGGAGACG GATCGGGCCC GGCCAATGTG CCGCGCGATC TCGACAACAT CCGCGACGTG ATCGCACGCA ACCGTCATCG TCCGTCGATC CTGCTCTGGT GCGGCGGCAA CGAAGGGTCG CCGCCGCCGG CACTCGTCAA GGGGCTCGAC GCGCTCGTCG CCGAGCTGGA CCCGCAACGC CTGTGCCTCA CGAGTTCGGC CGGCGATACG GGCGCCGGCG CGGTGAACGG GTATTCGTCC GGCGGTCCGT ACAACTGGGC CTCGCCGCAG GCCGCGTTCA GCCGCGGCTA CGGCACGACG TCGGTCGCGT TTCACAACGA AGTCGGCTCG CATTCGATCC CGACGCTCGA ATTCGTCGAA TCGATGCTGC CGCCCGGCTC GTACGAATGC CCCGACGATT TCTGGGCCGA TCGCGACATG AACGGCAACG GTGCGTACTA CCCGGCGGTC GGCAAGCAGG GCGGTGCCGG GTACATCGCG ATGACGGCGC TGCGCTACGG CGCGATCCGG AATCTCGCGG ATTTCGTGCG CAAGGCGCAG ATGATGAACT ACGAATGCAT CCGCGCGATC TACGAGGCGA ACGCGGCCGT GATGATCGGC CCGGTGGCCG GGAGGATCAC GTCGCCGGCC ACCGGCGTGA TCATGTGGAT GACGAACCCC GCACAGCCGA GCTTCGTGTG GCAGATGTAC AGCCACGATC TCGAGGAACA TGCGTCGTTC TTCGCGGTGC AGCACGGGTG CCGTCGCGTC AATGCGATTC TCGATGCCGG CACGGCCGAC GTGACGATCG CGAATCACAC GGCGGCAGCC GTCACCGGCC GCGTCGAGAT GCGCGTGTAC AACCTCGACG GCACGCTGAG CAGCCGGACG ACCGCGGATG TCGGTGGCGT CGCGAAGGCG TCGTATCGTG TGGTGGCGAA TCTTGCGTCG GCGCTGGCTG CCGCGAAGTC CGACGTGTGC ATCGTCGCGC TTGCGCTGAC CGATTCGGGC GGCACGACGC TGGCCGAGAA CGTCTACTGG CGGCAGCGCG ACGGGGGCGA CAACGCATAC ACGTCGCTCG ACACGATGCC CGGTGCGGCG GTTTCCGTCA GTGCGACATC GACCGAGACC GACGCGACGA CGACGCGCAT CACGGTCGAC GTCGCGAACA TCGGCACCGC CGTCGCGCTG ATGACGCACC TGCAGGTGTT CGACCCGTCA ACCGGCGTGC GTGTGCTGCC TGCGTTCTAC AGCGACAACT ACCTGAACCT GGTACCGGGC GCGAAGCGGC AGGTTACGAT CGACCTGCCG CATGCGGGCG GCGCGCCGGT GCCGCGCGTC GCGCTGCGCG TCGACGGGTG GCGGCTCGAT CGCCCGAACT GCCGGCTGGG GCTAGGCGGC GTGCCGGTCG TGTTCAACGA GCGCGCGCTG GCGGTCGCGC CGGCGGTGCC GACGTTCGCG GCGTGCTGA
|
Protein sequence | MKSQSRRKFL SYSAALAGSG WLAGCNGDID STSGAVGTPG ADGPATVAPT VPGVPSDPEH SLLAAQELAT NWTFASASSL PGAGGAQLSG AAGVTGMMPA TVPGTVLNSM IVNGKYPDPL YGRIVTDTIP DTLKDTDYWY RTTFAAPARQ PGQRLWLRFG GVNYCAEVWL NGALVGRLEG AFKQGAFDIS RLVPQAGGAA NLAVRVVKLD FSEGPLLPSY KSGVTRGGRN GGPTGVTLKN GPTFFCSAGW DWLPTIPDRE LGIWQPVTWF TTGAVRIAAI NVAHTLSTDL SRAELRLDLE LDNGSGADLV ATVVGTIGNG VPFRHDIAIP ASSTTTKVSL TSSDIAALSI RQPRLWWPNG YGEPNLYAVK VGVDVAHRRS DERTLNIGLR RIEYARDIGM GQQLSITVNG LPILVMGGNW GLDEALKRIP RTRLFNQVRL HRDANLNLIR NWNGQSTSDD FFDACDRYGI LVWQDFFFST EGDGSGPANV PRDLDNIRDV IARNRHRPSI LLWCGGNEGS PPPALVKGLD ALVAELDPQR LCLTSSAGDT GAGAVNGYSS GGPYNWASPQ AAFSRGYGTT SVAFHNEVGS HSIPTLEFVE SMLPPGSYEC PDDFWADRDM NGNGAYYPAV GKQGGAGYIA MTALRYGAIR NLADFVRKAQ MMNYECIRAI YEANAAVMIG PVAGRITSPA TGVIMWMTNP AQPSFVWQMY SHDLEEHASF FAVQHGCRRV NAILDAGTAD VTIANHTAAA VTGRVEMRVY NLDGTLSSRT TADVGGVAKA SYRVVANLAS ALAAAKSDVC IVALALTDSG GTTLAENVYW RQRDGGDNAY TSLDTMPGAA VSVSATSTET DATTTRITVD VANIGTAVAL MTHLQVFDPS TGVRVLPAFY SDNYLNLVPG AKRQVTIDLP HAGGAPVPRV ALRVDGWRLD RPNCRLGLGG VPVVFNERAL AVAPAVPTFA AC
|
| |