Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3583 |
Symbol | |
ID | 3911385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4107540 |
End bp | 4108709 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885485 |
Product | 4-hydroxybenzoate 3-monooxygenase |
Protein accession | YP_487189 |
Protein GI | 86750693 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | [TIGR02360] 4-hydroxybenzoate 3-monooxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.114816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.502455 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCACGC AAGTCGCCAT CATCGGCGCC GGCCCATCCG GCCTGCTGCT CGGCCAGTTG CTGCACCGCT ACGGCATCGA CGCGGTCATT CTGGAACGCA AAGACCCCGA TTATGTGCTG TCCCGGATCC GCGCCGGGGT GCTGGAACAG GGCCTGGTCG GGCTGCTCGA CGAAGCCGGG GTCGGCGCAC GGCTGCATCA GGAAGGCCTG GTGCATGACG GCTTCGAGAT CGCGTTCTCC GGCAAGCGCC ACCGCATCGA TCTCGCCGGC ACCACGGGCG GCAAGCACGT CACCGTCTAC GGCCAGACCG AGGTGACGCG CGACCTGATG GAGGCGCGCA AGGCCGCCGG CCTCACCACC GTCTACGATG CCGCCGATGT CAGCCTGCAC GATTTCGACG GCGACACGCC GAAGGTGCGC TGGGTCAAGG ACGGCGTCAC CCACGAGCTC GCCTGCGATT TCATCGCCGG CTGCGACGGC TTCCACGGCG TGTCGCGGCA AAGCGTGGCC GGCGCGGTCC AGAGCTTCGA GCGGGTGTAT CCGTTCGGCT GGCTCGGCGT GCTGTCGGAT ACGCCGCCGG TGTCGCACGA ACTGATCTAC GTCAACCACG AGCGCGGCTT CGCGCTGTGC TCGATGCGCT CGACGCAGCG CAGCCGCTAT TACGTGCAGT GCCCGCTCTC CGACGACGTC GCGCAATGGA GCGACGACCG GTTCTGGGAC GAGTTGAAGC ACAGGCTCGA TCCTGAAGCC GCGGACAAGC TGGTCACCGG GGCGTCGATC GAGAAGAGCA TCGCGCCGCT GCGCTCATTC GTCGCCGAGC CGATGCGGTT CGGGAGATTA TTTCTGGCCG GCGACGCCGC CCACATCGTG CCGCCGACCG GCGCCAAGGG CCTCAACCTC GCCGCCAGCG ACGTGTACTA CCTGTCGCGC GCGTTGCGCG AATTCTACGG CGAGCACTCC AAGGCGGGGA TCGACGCCTA TTCGGCCGAC GCGCTGCGCC GGGTGTGGAA GGCCGAGCGG TTCTCGTGGT GGATGACCTC GATGCTGCAC CGCTTCCCCG ACAGCGACGC CTTCTCCCAA CGCATCCAGA CCGCCGAGCT CGACTATCTG ATCAGCTCGC AGGCCGCGAT CACCTCGCTG GCGGAAAACT ACGTCGGCCT GCCGTACTGA
|
Protein sequence | MRTQVAIIGA GPSGLLLGQL LHRYGIDAVI LERKDPDYVL SRIRAGVLEQ GLVGLLDEAG VGARLHQEGL VHDGFEIAFS GKRHRIDLAG TTGGKHVTVY GQTEVTRDLM EARKAAGLTT VYDAADVSLH DFDGDTPKVR WVKDGVTHEL ACDFIAGCDG FHGVSRQSVA GAVQSFERVY PFGWLGVLSD TPPVSHELIY VNHERGFALC SMRSTQRSRY YVQCPLSDDV AQWSDDRFWD ELKHRLDPEA ADKLVTGASI EKSIAPLRSF VAEPMRFGRL FLAGDAAHIV PPTGAKGLNL AASDVYYLSR ALREFYGEHS KAGIDAYSAD ALRRVWKAER FSWWMTSMLH RFPDSDAFSQ RIQTAELDYL ISSQAAITSL AENYVGLPY
|
| |