Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I1526 |
Symbol | |
ID | 3849863 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 1726472 |
End bp | 1728082 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637841198 |
Product | 3-octaprenyl-4-hydroxybenzoate carboxy-lyase |
Protein accession | YP_442071 |
Protein GI | 83719856 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.74472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCGCC CCCATCGCCG CCGCCTGTCG CGCGCGGCCC GGACATCCTT CATGAAATAC AAAGACTTAC GCGATTTCAT CCACAGCCTC GAGCAGCGCG GCGAATTGCG ACGCATCACG CAGCCCGTGT CGCCCGCCCT CGAAATGACC GAACTCTGCG ACCGCGTGCT GCGCGCGGGC GGTCCCGCAC TGCTGTTCGA TGCGCCGGAC GGCTACCGGT TTCCGGTGCT CGGCAACCTG TTCGGCACGC CGCGGCGCGT CGCGCTCGGG ATGGGCGTCG ACGCCGACGA CAACGCGACG CTCGCGTCGC TGCGCGACAT CGGCCGCCTG CTGTCCGCGC TCAAGGAGCC CGATCCGCCG AAGAGCCTGA AGGATGCCGG CAAGCTGCTA TCGCTCGCGA AGGCCGTGTG GGACATGGGC CCGAAGACGG TGTCCGCGCC GCCGTGCCAG GAGATCGTCT GGGAAGGCGA CGACGTCGAT CTGCACAAGC TGCCGATCCA GACCTGCTGG CCAGGCGACG CCGGGCCGCT GCTCACGTGG GGCCTGACCG TCACGCGCGG ACCGAACAAG ACGCGCCAGA ATCTCGGCAT CTACCGGCAG CAACTGATCG GACGCAACAA ACTGATCATG CGCTGGCTCG CGCATCGCGG CGGCGCACTC GACTTCCGCG AATTCGCGCT GAAGCATCCG GGCCAGCCCT ATCCCGTCGC CGTCGTGCTC GGCGCCGATC CGGCGACAGC GCTCGGCGCC GTCACGCCCG TGCCCGACAC GCTGTCCGAA TACCAGTTCG CGGGCCTGCT GCGCGGCGCG CGCACCGAGC TCGCGAAATG CCTGACGCCC GGCGTCGACA CGCTGCAGGT GCCGGCGCGC GCGGAAATCG TGCTCGAAGG CTTCATTCAC CCGCAGCAAG GCGCGCCCGC GCCGGCGCCC GAGGGCGCGC CGCCAAGGCC GACGGCGGGC GCCGCAGCCG GCTACGAGCA TGCGCTCGAA GGCCCGTACG GCGACCACAC CGGCTACTAC AACGAGCAAG AGTGGTTTCC GGTCTTCACG GTCGAGCGGA TCACGATGCG CCGCGACGCG ATCTATCACT CGACGTACAC CGGCAAGCCG CCCGACGAGC CGGCGGTGCT CGGCGTCGCG CTGAACGAGG TGTTCGTGCC GCTGCTGCAG AAGCAGTTCT CCGAGATCAC TGACTTCTAT CTGCCGCCCG AGGGATGCAG CTACCGGATG GCGATCGTCC AGATGAAGAA GAGCTACGCG GGCCACGCGA AGCGCGTGAT GTTCGGTGTC TGGAGCTTCC TGCGGCAGTT CATGTATACG AAGTTCATCG TCGTCGTCGA CGAGGACGTG AACGTGCGCG ACTGGAAGGA AGTGATCTGG GCGATCACGA CACGCGTCGA TCCCGCGCGC GACACGGTGC TCGTCGAGAA CACGCCGATC GACTACCTCG ATTTCGCGTC GCCCGTCGCC GGCCTCGGCT CGAAGATGGG GCTCGACGCG ACCAACAAGT GGCCGGGCGA AACCCAGCGC GAATGGGGCC GGCCGATCGA GATGGACGCC GCCGTGAAAG CGCGCGTCGA TCGTCTGTGG ACCGAAATCG GCCTGTCGTG A
|
Protein sequence | MRRPHRRRLS RAARTSFMKY KDLRDFIHSL EQRGELRRIT QPVSPALEMT ELCDRVLRAG GPALLFDAPD GYRFPVLGNL FGTPRRVALG MGVDADDNAT LASLRDIGRL LSALKEPDPP KSLKDAGKLL SLAKAVWDMG PKTVSAPPCQ EIVWEGDDVD LHKLPIQTCW PGDAGPLLTW GLTVTRGPNK TRQNLGIYRQ QLIGRNKLIM RWLAHRGGAL DFREFALKHP GQPYPVAVVL GADPATALGA VTPVPDTLSE YQFAGLLRGA RTELAKCLTP GVDTLQVPAR AEIVLEGFIH PQQGAPAPAP EGAPPRPTAG AAAGYEHALE GPYGDHTGYY NEQEWFPVFT VERITMRRDA IYHSTYTGKP PDEPAVLGVA LNEVFVPLLQ KQFSEITDFY LPPEGCSYRM AIVQMKKSYA GHAKRVMFGV WSFLRQFMYT KFIVVVDEDV NVRDWKEVIW AITTRVDPAR DTVLVENTPI DYLDFASPVA GLGSKMGLDA TNKWPGETQR EWGRPIEMDA AVKARVDRLW TEIGLS
|
| |