Gene Information |
Locus tag | BBta_3631 |
Symbol | |
ID | 5153142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 3797608 |
End bp | 3798678 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640558477 |
Product | putative phenol hydroxylase (phenol 2-monooxygenase P5 component) |
Protein accession | YP_001239623 |
Protein GI | 148255038 |
COG category | [C] Energy production and conversion; [H] Coenzyme transport and metabolism |
COG ID | [COG0543] 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases; [COG0633] Ferredoxin |
TIGRFAM ID | |
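The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3797608, 3798678)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Both values match the Gene Length and Protein Length fields of this record.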
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.411014 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGATGT CGGTTGGTAC GAAGGCTGTG GTGACGGACG TGCAGGTTCA CAAGGTGCGG TTCGAGCCTG TAGGGATCGA GATGGAGGTC GAAGAAGGCG AGACCGTGCT CGATGCGGCG TTCCGCCAGG GCATCTCGCT GATGCATGGC TGCAAGGAGG GCCAGTGCGG AAGCTGCAAG TCGAAGCTCG TCGATGGTGA CATCGAGCTC CTGAAATATT CGACCTTCGC GCTGCCGGAT TACGAGAGCG AGAACGGGCA TGTTCTGCTG TGCCGGACCC ACGCCTATAG CGACGTCAGC TTCGAATTGC TGAACTATGA CGAGGACATG CTGCGGCGAT CGATCGCGGT CAAGGCGTTT CGCGGCCGCG TCGCGGCGAT CACGGCGCTG ACCTCGGACA TCCGCCTGCT GGAGATCGAG ATCGACAAGC CGATGAAGTT CTGGGCCGGG CAGTATGTTG ACCTGACGAT CGACGACGGC CGCATCACCC GTGCCTTCTC GATGGCTAAT GCGCCGGGCG AGGGCACGCG GCTCAGCTTC ATCATCAAGA AATATCCGAA TGGCGCGTTT TCCGCCCAGC TCGACGGCGG ACTTGGTGTC GGCGATGTCG TGATGGCGAA GGGTCCCTAT GGCACCTGCT TCCGGCGCGA GGAACGGCCT GGCCCGATGC TGCTGATCGG CGGCGGCTCG GGGATGTCGC CGCTGTGGTC GATCCTGGCC GACCATATCG CCAGCGGCGA ACAGCGGCCG GTCCGCTTCT TCTACGGCGC CAGGACGCGC GCCGATCTGT TCTATCTCGA CGAACTGGCC GCGATCGGCC GGCAGCTCAA CGATTTCAAA TTCGTCCCGG CGCTGTCGCA CGCCTCGCCC GAGGACGGTT GGGACGGCGA GACCGGCTTC GTGCACGAGG TCGTGGCCCG TCACCTGAAG CAGGAGAACC TGGCCGGGGC AATCGACGCC TATGCCTGCG GCCCGACGCC GATGATCGAT GCTGTGCTGC CGGTCCTGCA GATCAATGGC GTCGAGCCGG ACCACATCTA TTTCGACAAG TTTACGCCGG CGGTGCGATG A
Protein sequence | MSMSVGTKAV VTDVQVHKVR FEPVGIEMEV EEGETVLDAA FRQGISLMHG CKEGQCGSCK SKLVDGDIEL LKYSTFALPD YESENGHVLL CRTHAYSDVS FELLNYDEDM LRRSIAVKAF RGRVAAITAL TSDIRLLEIE IDKPMKFWAG QYVDLTIDDG RITRAFSMAN APGEGTRLSF IIKKYPNGAF SAQLDGGLGV GDVVMAKGPY GTCFRREERP GPMLLIGGGS GMSPLWSILA DHIASGEQRP VRFFYGARTR ADLFYLDELA AIGRQLNDFK FVPALSHASP EDGWDGETGF VHEVVARHLK QENLAGAIDA YACGPTPMID AVLPVLQING VEPDHIYFDK FTPAVR
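The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a field might be cleaned and how a GC percentage like the one in the GC content field could be computed, the snippet below uses two hypothetical helpers, `clean_sequence` and `gc_percent`; it is illustrative only and is not the database's own method.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used here as a small example.
example = clean_sequence("ATGTCGATGT CGGTTGGTAC")
print(len(example))         # 20
print(gc_percent(example))  # 50.0
```

Applying the same two calls to the full Gene sequence field gives the overall GC percentage for the gene.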