Gene RPD_1884 details

Gene Information

Locus tag: RPD_1884
Symbol:
ID: 4022366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2112887
End bp: 2114059
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 64%
IMG OID: 637962077
Product: 4-hydroxybenzoate 3-monooxygenase
Protein accession: YP_569020
Protein GI: 91976361
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID: [TIGR02360] 4-hydroxybenzoate 3-monooxygenase
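
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed 1173 bp) and that the last codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2112887, 2114059)
print(length)                  # 1173
print(protein_length(length))  # 390
```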


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.316632
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.150769
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCACGC AAGTCGGCAT CATCGGCGCC GGGCCGTCGG GTCTGCTGCT CGGCCAGTTG 
CTGCACACAT ACGGGATCGA GGCCGTCATT CTCGAGCGCA AGAACCCCGA CTACGTGCTT
TCACGCATCC GCGCCGGCGT ACTTGAACAG GGGATGGTCG ATCTGCTCGA CGAGGCCGGG
GTCGGCCGGC GCCTGCATCA GGAAGCGTTG GTGCACGATG GCTTCGAGAT CGCGTTTTCC
GGCCGGCGAC ACCGCATCGA CCTCAAGCAC TCGACCGGCG GCAGAACCGT CACCGTGTAC
GGCCAGACCG AGGTGACGCG CGATCTGATG GAGGCCCGGA AAGCCGCCGG CCTGACTACG
ATTTACGAAG CGGCCGATAT CACCCTTCAC GATTTCGACG GCGAACGCCC CAGGGTGCGT
TACTTCAAGG ACGGCGTCAG TCAGGAGCTC GCCTGCGATT TCATCGCCGG CTGCGACGGC
TTCCACGGAG TCGCGCGGCA GAGCGCGCCG GCCAACGCGT TACAGACCTA CGAGCGGGTC
TATCCGTTCG GCTGGCTCGG GGTGTTGTCC GACACGCCGC CGGTGTCGTC GGAACTGATC
TACGTCAACC ACGACCGCGG GTTTGCGCTG TGCTCGATGC GCTCGGCACA TCGCAGCCGC
TATTACGTGC AGTGTCCGCT GTCCGACGAT GTCGGTGAAT GGAGCGACGA TCGGTTCTGG
GACGAACTGA AACAAAGGCT CGGCCCGGAA ACCGCCGGCC ATCTCGTCAC CGGCGCGTCG
ATCGAGAAGA GCATCGCTCC ACTGCGTTCC TTCGTTGCCG AGCCGATGCG GTTCGGCCGG
CTGTTCCTCG CCGGCGACGC CGCCCACATC GTGCCGCCGA CCGGTGCCAA GGGCCTCAAC
CTCGCAGCCA GCGACGTGTA CTATCTTTCG CGCGCGCTGC GCGAGTTCTA TGATGAGGGA
TCGAAAGGTG GGATCGATGC TTATTCCGCC AACGCGCTTC GCCGGGTGTG GAAGGCCGAA
CGATTCTCGT GGTGGATGAC GTCGATTCTT CATCGCTTCC CCGACAGCGA CGCCTTCACC
CAACGCATCC AGACCGCCGA ACTCGACTAT CTGGTCAGTT CGCAAGCCGC GACGACCTCG
CTCGCGGAAA ACTACGTCGG GCTGCCTTAC TAA
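
The GC content reported in the Gene Information section is the fraction of G and C bases in this sequence. A minimal sketch of that calculation, assuming the sequence is pasted as shown here, in 10-base blocks separated by whitespace. `gc_content` is a hypothetical helper, and the example call uses only the first block of the sequence, so it does not reproduce the 64% figure for the full gene:

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 10-base block of the gene sequence above.
print(gc_content("ATGCGCACGC"))  # 0.7
```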
 
Protein sequence
MRTQVGIIGA GPSGLLLGQL LHTYGIEAVI LERKNPDYVL SRIRAGVLEQ GMVDLLDEAG 
VGRRLHQEAL VHDGFEIAFS GRRHRIDLKH STGGRTVTVY GQTEVTRDLM EARKAAGLTT
IYEAADITLH DFDGERPRVR YFKDGVSQEL ACDFIAGCDG FHGVARQSAP ANALQTYERV
YPFGWLGVLS DTPPVSSELI YVNHDRGFAL CSMRSAHRSR YYVQCPLSDD VGEWSDDRFW
DELKQRLGPE TAGHLVTGAS IEKSIAPLRS FVAEPMRFGR LFLAGDAAHI VPPTGAKGLN
LAASDVYYLS RALREFYDEG SKGGIDAYSA NALRRVWKAE RFSWWMTSIL HRFPDSDAFT
QRIQTAELDY LVSSQAATTS LAENYVGLPY
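
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch, assuming the compact 64-letter amino-acid string that NCBI uses to describe its genetic codes, with bases ordered T, C, A, G. The codon-to-residue assignments of table 11 match the standard code, and alternative start codons are not handled here. `translate` is a hypothetical helper:

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCGCACGC AAGTCGGCAT CATCGGCGCC"))  # MRTQVGIIGA
```

Translating the last line of the gene sequence the same way gives the final residues of the protein, and the trailing TAA stop codon ends the translation.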