Gene Information |
Locus tag | Bcenmc03_6983 |
Symbol | |
ID | 6125892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia MC0-3 |
Kingdom | Bacteria |
Replicon accession | NC_010512 |
Strand | + |
Start bp | 1095394 |
End bp | 1097010 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641647987 |
Product | tryptophan halogenase |
Protein accession | YP_001774579 |
Protein GI | 170735465 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | |
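The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1095394, 1097010)
print(length_bp)                  # 1617
print(protein_length(length_bp))  # 538
```

Both results agree with the Gene Length (1617 bp) and Protein Length (538 aa) rows.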
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.448897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.89609 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACTCACC CGATCAAGAA TATCGTCATC GTGGGCGGCG GCACGGCGGG CTGGATGTCC GCCTCGTACC TCGTCCGGGC GCTCCAGCAG CAGGCGAACA TCACGCTCGT CGAATCCGCG GCGATCCCCC GGATCGGCGT GGGCGAGGCG ACCATTCCGA GTTTGCAGAA GGTGTTCTTC GATTTCCTCG GGATACCGGA GCGGGAGTGG ATGCCGCAGG TGAACGGCGC GTTCAAGTCC GCCATCAGGT TCGTGAACTG GAGGAAATCT CCCGACGGTT CACGTAGCGA CCACTTCTAC CACTTGTTCG GCAACGTGCC GAACTGCGAC GGCGTGCCGC TGACCCACTA CTGGCTGCGC AAGCGCGAAC AGGGTTTCCA GCAGCCGATG GAATACGCGT GTTATCCGCA GACCGAGGCG CTCGACGGCA AGCTGGCACC GTGCTCGCTC GACGGCACCC GCCAGATGTC CCACGCGTGG CACTTCGACG CGCACCTGGT GGCCGACTTC CTGAAACGCT GGGCCACCGA ACGCGGGGTG AAGCGCGTGG TCGACGAGGT CGAGCAGGTC CACCTGAACG ACCGCGGCCA CATCTCGAGC CTGTCCACCA AGGAGGGGCG CACGCTCGAA GCCGACCTGT TCATCGACTG TTCCGGCATG CGGGGGCTGT TGATCAACCA GGCCCTGAAG GAGCCCTTCA TCGACATGTC CGACTACCTG CTGTGCGACA GCGCGGTCGC CAGCGCGGTG CCCAGCGACG ACGCGCGTGT GGGGATCGAG CCGTACACGT CGGCGATCGC GATGAACTCG GGGTGGACCT GGAAGATTCC GATGCTGGGC CGGTTCGGCA GCGGCTATGT GTTCTCGAGC AAGTTCACGT CGCGCGACCA GGCTACCGCC GACTTCCTCG ACCTCTGGGG GCTCTCGGAC AAGCAGCCGC TCAACCAGAT CAAGTTCCGG GTCGGGCGCA ACAGGCGCGC GTGGGTCAAT AATTGCGTCT CCATCGGGCT GTCGTCGTGC TTTCTGGAGC CGCTGGAATC GACGGGCATC TACTTCATCT ACGCGGCGCT TTACCAGCTC GTGAAGCACT TCCCCGATAC CGCGTTCGAC CCGCGGTTGG CCGACGCCTT CAACGCCGAG ATCGTCTACA TGTTCGACGA CTGCCGGGAT TTCGTGCAGG CGCACTATTT CACGACGTCG CGCGACGACA CCCCGTTCTG GCGCGCGAAC CGGCACGACC TGCGGCTCTC CGACGCCATC AAGGAGAAGG TCGAACGTTA CAAGGCGGGG CTGCCGCTGA CCACCACGTC GTTCGACGAT TCGACGTACT ACGAAACCTT CGACTACGAA TTCCGGAACT TCTGGTTGAA CGGAAACTAC TACTGCATTT TTGCCGGCCT GGGGCTGCTG CCCGACCAGT CGCTGCCGCT GTTGCGGCAC CGGCCGGAGT CGATCGACAA GGCCGAGGCG ATGTTCGCCC GAATCCGGCG CGAGGCCGAG CGTCTGCGGT CGAGCCTGCC GACGAACCAT GATTACCTGC GCTCGCTGCG CGAGCGCGCC GCGGGGCGGT CTCGCAGCCA GCCCGGGCCG ACGCTCGCGA CGCCGGAGAC GCTGTAG
Protein sequence | MTHPIKNIVI VGGGTAGWMS ASYLVRALQQ QANITLVESA AIPRIGVGEA TIPSLQKVFF DFLGIPEREW MPQVNGAFKS AIRFVNWRKS PDGSRSDHFY HLFGNVPNCD GVPLTHYWLR KREQGFQQPM EYACYPQTEA LDGKLAPCSL DGTRQMSHAW HFDAHLVADF LKRWATERGV KRVVDEVEQV HLNDRGHISS LSTKEGRTLE ADLFIDCSGM RGLLINQALK EPFIDMSDYL LCDSAVASAV PSDDARVGIE PYTSAIAMNS GWTWKIPMLG RFGSGYVFSS KFTSRDQATA DFLDLWGLSD KQPLNQIKFR VGRNRRAWVN NCVSIGLSSC FLEPLESTGI YFIYAALYQL VKHFPDTAFD PRLADAFNAE IVYMFDDCRD FVQAHYFTTS RDDTPFWRAN RHDLRLSDAI KEKVERYKAG LPLTTTSFDD STYYETFDYE FRNFWLNGNY YCIFAGLGLL PDQSLPLLRH RPESIDKAEA MFARIRREAE RLRSSLPTNH DYLRSLRERA AGRSRSQPGP TLATPETL
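The relationship between the gene and protein sequences can be illustrated with a small translation sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares for internal codons. It does not handle table 11's alternative start codons, which is harmless here because this gene starts with ATG. The helper names are hypothetical, and only the first three 10-base blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI ordering: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def translate(raw_dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(raw_dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGACTCACC CGATCAAGAA TATCGTCATC"
print(translate(first_blocks))  # MTHPIKNIVI
```

The output matches the first block of the protein sequence. Passing the complete gene sequence to the same function would allow the full 538-residue protein to be compared in the same way.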