Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BamMC406_4091 |
Symbol | |
ID | 6179750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia ambifaria MC40-6 |
Kingdom | Bacteria |
Replicon accession | NC_010552 |
Strand | + |
Start bp | 1116885 |
End bp | 1118132 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641683861 |
Product | tryptophan halogenase |
Protein accession | YP_001810772 |
Protein GI | 172063121 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAA TGAAATCGAA TTCCGTCGAC GTCGCGATCA TCGGCGCGGG GCCGTCCGGC GCGGTCGCGG CCGCATTGTT GCGCAGGGCG AGCCGTTCCG TGGTGGTGCT CGAGCGCCAG CATTTTCCGC GCTTTTCGAT CGGCGAGAGT CTGTTGCCTC AAAGCATGCA GTATCTCGAA GAGGCCGGTA TGTTGCAGGC CGTCGTCGAG GCCGGCTTCC AGTTCAAGAA CGGCGCCTAT TTCGTTTATC GCGACAGGAT GTCGTCGTTC GATTTCCGGG AAAAATTCTC CGACGGCTGG GGGACGGCCT ATCAGGTCGA ACGCGCCGCC TTCGACGATC TGCTGATCCG TTGTGCGGCG GAACAGGGCG CGGACGTGCG ATTCGGCCAT ACCGTGCAAG CTTTCCATCC TGGCGATATG CAACGGCTCG AGGTCGTCGA CGAAGCAGGG TGCCAATATT CCGTTCACGC GTCGTTCGTA CTCGACGCCA GCGGATTCGC GCGCGTGCTG CCGCGGCTGC TCGATCTCGA GGCGCCGACC GGCATGCCGA CGCGCGCGGC AATCTTTTCG CACGTCGAAG ACGGTCTGCC GGCCGGGTCG ACCGATCGCG ACAAGATCTG CATTGCCGTC CATCCGGAGC GTCGCGACGT GTGGTTCTGG ATGATCCCGC TGACGAACGG CCGCTCGTCG GTCGGCTGCG TGGCCGACGC CGGTTTTCTC GACGTGCCGC AAGCGCAGCA GGAATCGTTG CTGCGGGAAT TGCTGCAGAG CGAGCCGACG TTGTCGCGGC TCGTCGGCAG CAAGCCGTTC GTGATGCCGG TGCGTCGCAT CGCGGGCTAC GCGTCGAATG TCGAGCACCT GCACGGGCGC GGCTATGCGT TGCTCGGCAA CGCGGGCGAA TTCCTCGATC CGATCTTCTC GTCCGGTGTG ACGATCGCGA TGCGCTCCGC GCAACTGGCG GTCGCCGTAC TCGAACGTCA GTTGCGCGGC GAGACGGTCG ATTGGACGCG CGATTACGAC ATCGCGCTGC GCAAGGGCAT CGACACCTTC CGCGCGTTCG TCGAGCGCTG GTACTCAGGG GCGCTTCAGG ACATCGTGTT TCATGAAGAC AAGGCATCCG ACGTGAAACG CATGGTCTGC TCGATTCTGG CCGGCTATGC GTGGGACGAG TCGAATCCGT TCGTGCGGGA ACCGGCGCGC GGCCTCGACG TGCTCGCCGA ATTCTGCCGT GCCGGTGGCC TCGGGTGA
|
Protein sequence | MSEMKSNSVD VAIIGAGPSG AVAAALLRRA SRSVVVLERQ HFPRFSIGES LLPQSMQYLE EAGMLQAVVE AGFQFKNGAY FVYRDRMSSF DFREKFSDGW GTAYQVERAA FDDLLIRCAA EQGADVRFGH TVQAFHPGDM QRLEVVDEAG CQYSVHASFV LDASGFARVL PRLLDLEAPT GMPTRAAIFS HVEDGLPAGS TDRDKICIAV HPERRDVWFW MIPLTNGRSS VGCVADAGFL DVPQAQQESL LRELLQSEPT LSRLVGSKPF VMPVRRIAGY ASNVEHLHGR GYALLGNAGE FLDPIFSSGV TIAMRSAQLA VAVLERQLRG ETVDWTRDYD IALRKGIDTF RAFVERWYSG ALQDIVFHED KASDVKRMVC SILAGYAWDE SNPFVREPAR GLDVLAEFCR AGGLG
|
| |