Gene BBta_5237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_5237 
Symbol 
ID5154413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp5461022 
End bp5462221 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content72% 
IMG OID640560007 
Productputative salicylate 1-monooxygenase 
Protein accessionYP_001241132 
Protein GI148256547 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.192496 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCATCGC GTACCATCTT CGTTGCAGGC GCCGGGATCG GCGGCCTGAC AGCAGCGCTC 
GCACTGGCCA CCAAGGGCTT TCGTGTCGTC GTGCTGGAAA AGGCCGAGCG GCTCGAAGAG
GTCGGCGCCG GGCTGCAACT GTCCCCGAAT GCCAGTCGCA TTCTGATCGA TCTCGGATTG
GGGCCGCGGC TTGCCTCCCG CGTGGTGGTG CCCGAGGCCG TCAGCATCCT CAGCGCCCGG
GCCGGCGGTG AAATCGCGCG GCTGCCGCTC GGCGCGGCGG CGAGCGAGGC GGCCGGCGCG
CCCTATTGGG TCGTGCATCG CGCCGATCTG CAGGCGGCCC TGGCGGCCGA GGCGATGGCG
CATCCGGATG TCGAGCTGCG CCTGGGCTGC CAGTTCGAGG ATGTGGCGGC CCACGCCAAG
GGCTTGACCG TGGTGCATCG CCGCGGCGAT GAACGCCGGC AGGACGTGGC GCTGGCGCTG
ATCGGCGCCG ACGGCGTCTG GTCGGCAGTG CGGCATCATC TGTTTCCCCA GGTCCGCGCC
GAATTCTCCG GCCTGATCGC GTGGCGCGGC ACGCTCGACG CCCGGCAATT GCCCCGCGAC
CATGCCAGCG CGCGGGTGCA GCTCTGGATG GGACCCGGTG CTCATCTGGT CGCCTACCCG
ATCTCAGCCG GGCGGCAGGT CAACGTCGTC GCGGTGGTGC CCGGCACCTG GAACCGGCCG
GGCTGGAGCG CCGAGGGCGA TCCAGCGGAG CTGAAGGCGG CGTTCGGCCC GCCACGCTGG
CCGGCCACGG CCCGCTTGCT GCTCAATGCG GTCGATGGCT GGCGCAAATG GGCCTTGTTC
GGGGTTCCCG AGGGCATCGC GTGGACCGCA GGCAATGTCG CGCTGCTGGG CGACGCCGCC
CATGCGATGT TGCCCTTCGC CGCGCAGGGC GCGGGCATGG CGATCGAGGA CGCAGCCGTG
CTCGCCAAGA CCCTGAGCGA GGCGCGCCCG GATGGCCCTG GCGCCATCGA GGGGGCATTG
CAGCGCTACG CCAGGCTGCG TCGCCCCCGC GTCGGCCGGG TGCAGCGCAC GGCGCGCCAG
CAAGGGCGCG TCTACCACAT GACGGGTCCG CTGGCGCTGG CACGCGACCT CAGCATCAAG
GCGCTCGGCC CGCAACGGCT ATCGGCGCGG CAAAGCTGGA TCTACGATTG GCGGCTCTAG
 
Protein sequence
MASRTIFVAG AGIGGLTAAL ALATKGFRVV VLEKAERLEE VGAGLQLSPN ASRILIDLGL 
GPRLASRVVV PEAVSILSAR AGGEIARLPL GAAASEAAGA PYWVVHRADL QAALAAEAMA
HPDVELRLGC QFEDVAAHAK GLTVVHRRGD ERRQDVALAL IGADGVWSAV RHHLFPQVRA
EFSGLIAWRG TLDARQLPRD HASARVQLWM GPGAHLVAYP ISAGRQVNVV AVVPGTWNRP
GWSAEGDPAE LKAAFGPPRW PATARLLLNA VDGWRKWALF GVPEGIAWTA GNVALLGDAA
HAMLPFAAQG AGMAIEDAAV LAKTLSEARP DGPGAIEGAL QRYARLRRPR VGRVQRTARQ
QGRVYHMTGP LALARDLSIK ALGPQRLSAR QSWIYDWRL