Gene Information |
Locus tag | BamMC406_5264 |
Symbol | |
ID | 6181497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia ambifaria MC40-6 |
Kingdom | Bacteria |
Replicon accession | NC_010552 |
Strand | + |
Start bp | 2477052 |
End bp | 2478668 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641685019 |
Product | tryptophan halogenase |
Protein accession | YP_001811924 |
Protein GI | 172064273 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | |
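
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2477052
END_BP = 2478668


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but encodes no residue.
    return codons - 1 if includes_stop else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1617, matching "Gene Length"
print(protein_length(length_bp))  # 538, matching "Protein Length"
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields reported in the record.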
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAATC CGATCAAGAA TATCGTTATC GTGGGCGGCG GCACCGCGGG GTGGATGACC GCCTCGTACC TCGTCCGGGC GCTCCAGCAG CAGGCGAACA TTACGCTCAT CGAGTCCGCG GCGATCCCGC GGATCGGCGT GGGCGAGGCA ACCATCCCGA GTTTGCAGAA GGTGTTCTTC GACTTCCTCG GGATACCGGA GCGGGAGTGG ATGCCCCAGG TGAACGGCGC GTTCAAGGCC GCGATCAAGT TCGTGAACTG GAGGAAGGCT CCCGACTCCT CGCGCGACGA TCACTTCTAC CATTTGTTCG GCAACGTGCC GAACTGCGAC GGCGTGCCGC TTACCCACTA CTGGCTGCGC AAGCGCGAGC AGGGCTTCCA GCAGCCGATG GAGTACGCGT GCTACCCGCA GCCCGGCGCG CTCGACGGCA AGCTGGCACC GTGCCTGGCC GACGGCACGC GCCAGATGTC CCACGCGTGG CACTTCGACG CGCACCTGGT GGCCGACTTC CTGAAGCGCT GGGCCGTCGA ACGCGGGGTG AAGCGCGTGG TCGACGAGGT CGTGGACGTT CACCTTGACG AGCGCGGCTA CATCTCCAGC CTGTCCACCA AGGAGGGGCG CACGCTGGAG GCGGACCTGT TCATCGACTG CTCCGGCATG CGCGGGCTCC TGATCAATCA GGCCCTGAAG GAACCCTTCA TCGACATGTC CGACCACCTG CTGTGCGACA GCGCGGTCGC CAGCGCCGTG CCCAACGACG ACACGCGCGA CGGGATCGAG CCGTACACCT CCGCGATCGC GATGAACTCC GGGTGGACCT GGAAGATTCC GATGCTCGGC CGGTTCGGCA GCGGCTACGT CTTCTCGAGC AAGTTCACGT CGCGCGACCA GGCCACCGCC GACTTCCTCA AACTCTGGGG CCTCTCGGAC AATCAGCCGC TCAACCAGAT CAAGTTCCGC GTCGGGCGCA ACAGGCGCGC GTGGGTCAAC AACTGCGTGT CGATCGGGCT GTCGTCGTGC TTTCTGGAGC CGCTGGAATC GACGGGCATC TACTTCATCT ACGCGGCGCT TTACCAGCTC GTGAAGCACT TTCCCGACAC GTCGTTCGAC CCGCGGTTGA GCGACGCCTT CAACGCCGAG ATCGTCTACA TGTTCGACGA CTGCCGGGAT TTCGTCCAGG CGCACTACTT CACCACGTCG CGCGACGACA CGCCGTTCTG GCTCGCGAAC CGGCACGACC TGCGGCTCTC GGATGCGATC AAGGAGAAGG TTCAGCGCTA CAAGGCGGGA CTGCCGCTGA CCACCACGTC GTTCGACGAT TCGATGTACT ACGAAACCTT CGACTACGAA TTCAAGAACT TCTGGTTGAA CGGCAACTAC TACTGCATCT TTGCCGGCTT GGGGCTGCTG CCCGACCGGT CGCTGCCGCT CCTGCAGCAC CGACCGGAAT CGATCGAGAA GGCCGAGGCG ATGTTCGCCA GCATCCGGCG CGAGGCCGAG CGTCTGCGCA CCAGCCTGCC GACGAACTAC GACTACCTGC GCTCGCTGCG CGACGGCGAC GCGGGGCTGT CTCGCAGCCA GCGCGGGTCG ACGCTCGCGA CGCAGGAAAT CCTGTAG
|
Protein sequence | MSNPIKNIVI VGGGTAGWMT ASYLVRALQQ QANITLIESA AIPRIGVGEA TIPSLQKVFF DFLGIPEREW MPQVNGAFKA AIKFVNWRKA PDSSRDDHFY HLFGNVPNCD GVPLTHYWLR KREQGFQQPM EYACYPQPGA LDGKLAPCLA DGTRQMSHAW HFDAHLVADF LKRWAVERGV KRVVDEVVDV HLDERGYISS LSTKEGRTLE ADLFIDCSGM RGLLINQALK EPFIDMSDHL LCDSAVASAV PNDDTRDGIE PYTSAIAMNS GWTWKIPMLG RFGSGYVFSS KFTSRDQATA DFLKLWGLSD NQPLNQIKFR VGRNRRAWVN NCVSIGLSSC FLEPLESTGI YFIYAALYQL VKHFPDTSFD PRLSDAFNAE IVYMFDDCRD FVQAHYFTTS RDDTPFWLAN RHDLRLSDAI KEKVQRYKAG LPLTTTSFDD SMYYETFDYE FKNFWLNGNY YCIFAGLGLL PDRSLPLLQH RPESIEKAEA MFASIRREAE RLRTSLPTNY DYLRSLRDGD AGLSRSQRGS TLATQEIL
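
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a minimal sketch, not the database's own pipeline: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and as a simplification it does not handle alternative start codons such as GTG or TTG (this gene begins with ATG, so that does not matter here).

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10 and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a + strand CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record above.
prefix = "ATGAGCAATC CGATCAAGAA TATCGTTATC"
print(translate(prefix))  # MSNPIKNIVI, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way should allow the Protein sequence and GC content fields to be checked against the nucleotide data.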