Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0234 |
Symbol | |
ID | 3844488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 283921 |
End bp | 284937 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637837540 |
Product | class III extradiol-type catecholic dioxygenase, putative |
Protein accession | YP_438436 |
Protein GI | 83718071 |
COG category | [S] Function unknown |
COG ID | [COG3384] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02298] 3,4-dihydroxyphenylacetate 2,3-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGATCT CGCGGCCCGC CGCACGACCG GCAAGCTGCT GCTCGTGCCG TGACACCCCT TCGTTCGCAC GCTCACCCAC TCATCGAACA GGATGCCTAT TCATGGGAAA GATCATCGGT GCCGGCCTGA TTTCGCACGC GCCCGTCGTG ATGATGCCGC GGGCCGTGCG CCTGCGCGAA AACGACGGCC GGGACTTCAC GCTCGCGACC GGCCTCGCGC GATTGCGGCG CGAAGTGTTC GACGCGCACG ACTACGATAC GGTGCTCGTT CTCGACAGCC ACTGGCGAAC GACGACCGAG GCGGTCGTCA CCGCGCACGC GCGGCGCACG GGCCGCTTCA CGTCGGACGA GATGCCGAAC GCGATCAGGC AACTGCCGTA CGATCTGGCG GGCGACCCCG AGCTCGCGCG CGCGATTGCC GAGTTGGCGA CGCGGCGCGC GTGCTGGATC GCCGCGGTCG ACGATCCGTG CCTGCCGATT CATTACGCGA CGCTCAATCC GTGGACCTAT CTCGGCCGTC CCGACAAACG CTGGATTTCG ATGTCCGTGT GCCAGACCGC GACGACCGAC GATTTCCTGC GGATGGGCGA GATCGTCGCG CAGGCGATCT CGCGGCTCGA TCGCAACGTG CTGCTGGTCG CTTCGGGCGG GCTGTCGCAC GCGTTCTGGC CGCTCGCCGA ACTGCGTCGC CGGATGGCGG GCGCCGCGTC GAACATCGTG ACGCCCGCCG CGCGCGCGGC CGACGAGCGG CGGATCGCGT GGCTCGAACA AGGGCGGCAC GATCGGGTGA TCGATGCGAT GTCGGAATTC CTGCGCTTCG ATCCGGAGGC GAACTTCGGC CACTATCTGA TGATGGCGGG CGCGATCGGC GCGCGCGCGT GCGCGGCGCG AGCGCGCCGC TTCAGCGAGT ATGAGAACGG CATCGGCACC GGGCATGTTC ATCTGTGGTT CGGTCCCGTC GACGGTGGAT GGACGCGGGC TGAAACGCGG GCTGAGCGAG AAGCGGCGCG CGCGTGA
|
Protein sequence | MLISRPAARP ASCCSCRDTP SFARSPTHRT GCLFMGKIIG AGLISHAPVV MMPRAVRLRE NDGRDFTLAT GLARLRREVF DAHDYDTVLV LDSHWRTTTE AVVTAHARRT GRFTSDEMPN AIRQLPYDLA GDPELARAIA ELATRRACWI AAVDDPCLPI HYATLNPWTY LGRPDKRWIS MSVCQTATTD DFLRMGEIVA QAISRLDRNV LLVASGGLSH AFWPLAELRR RMAGAASNIV TPAARAADER RIAWLEQGRH DRVIDAMSEF LRFDPEANFG HYLMMAGAIG ARACAARARR FSEYENGIGT GHVHLWFGPV DGGWTRAETR AEREAARA
|
| |