Gene RPB_3583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3583 
Symbol 
ID3911385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4107540 
End bp4108709 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content67% 
IMG OID637885485 
Product4-hydroxybenzoate 3-monooxygenase 
Protein accessionYP_487189 
Protein GI86750693 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID[TIGR02360] 4-hydroxybenzoate 3-monooxygenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.114816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.502455 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACGC AAGTCGCCAT CATCGGCGCC GGCCCATCCG GCCTGCTGCT CGGCCAGTTG 
CTGCACCGCT ACGGCATCGA CGCGGTCATT CTGGAACGCA AAGACCCCGA TTATGTGCTG
TCCCGGATCC GCGCCGGGGT GCTGGAACAG GGCCTGGTCG GGCTGCTCGA CGAAGCCGGG
GTCGGCGCAC GGCTGCATCA GGAAGGCCTG GTGCATGACG GCTTCGAGAT CGCGTTCTCC
GGCAAGCGCC ACCGCATCGA TCTCGCCGGC ACCACGGGCG GCAAGCACGT CACCGTCTAC
GGCCAGACCG AGGTGACGCG CGACCTGATG GAGGCGCGCA AGGCCGCCGG CCTCACCACC
GTCTACGATG CCGCCGATGT CAGCCTGCAC GATTTCGACG GCGACACGCC GAAGGTGCGC
TGGGTCAAGG ACGGCGTCAC CCACGAGCTC GCCTGCGATT TCATCGCCGG CTGCGACGGC
TTCCACGGCG TGTCGCGGCA AAGCGTGGCC GGCGCGGTCC AGAGCTTCGA GCGGGTGTAT
CCGTTCGGCT GGCTCGGCGT GCTGTCGGAT ACGCCGCCGG TGTCGCACGA ACTGATCTAC
GTCAACCACG AGCGCGGCTT CGCGCTGTGC TCGATGCGCT CGACGCAGCG CAGCCGCTAT
TACGTGCAGT GCCCGCTCTC CGACGACGTC GCGCAATGGA GCGACGACCG GTTCTGGGAC
GAGTTGAAGC ACAGGCTCGA TCCTGAAGCC GCGGACAAGC TGGTCACCGG GGCGTCGATC
GAGAAGAGCA TCGCGCCGCT GCGCTCATTC GTCGCCGAGC CGATGCGGTT CGGGAGATTA
TTTCTGGCCG GCGACGCCGC CCACATCGTG CCGCCGACCG GCGCCAAGGG CCTCAACCTC
GCCGCCAGCG ACGTGTACTA CCTGTCGCGC GCGTTGCGCG AATTCTACGG CGAGCACTCC
AAGGCGGGGA TCGACGCCTA TTCGGCCGAC GCGCTGCGCC GGGTGTGGAA GGCCGAGCGG
TTCTCGTGGT GGATGACCTC GATGCTGCAC CGCTTCCCCG ACAGCGACGC CTTCTCCCAA
CGCATCCAGA CCGCCGAGCT CGACTATCTG ATCAGCTCGC AGGCCGCGAT CACCTCGCTG
GCGGAAAACT ACGTCGGCCT GCCGTACTGA
 
Protein sequence
MRTQVAIIGA GPSGLLLGQL LHRYGIDAVI LERKDPDYVL SRIRAGVLEQ GLVGLLDEAG 
VGARLHQEGL VHDGFEIAFS GKRHRIDLAG TTGGKHVTVY GQTEVTRDLM EARKAAGLTT
VYDAADVSLH DFDGDTPKVR WVKDGVTHEL ACDFIAGCDG FHGVSRQSVA GAVQSFERVY
PFGWLGVLSD TPPVSHELIY VNHERGFALC SMRSTQRSRY YVQCPLSDDV AQWSDDRFWD
ELKHRLDPEA ADKLVTGASI EKSIAPLRSF VAEPMRFGRL FLAGDAAHIV PPTGAKGLNL
AASDVYYLSR ALREFYGEHS KAGIDAYSAD ALRRVWKAER FSWWMTSMLH RFPDSDAFSQ
RIQTAELDYL ISSQAAITSL AENYVGLPY