Gene RPD_3292 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3292 
Symbol 
ID4023801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3644111 
End bp3645325 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content71% 
IMG OID637963495 
Productsalicylate 1-monooxygenase 
Protein accessionYP_570417 
Protein GI91977758 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0110847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCAGCCC CCCGCACAAT CATCGTTGCT GGTGCGGGAA TTGGCGGGCT GACGGCGTCG 
CTCGCGCTCG CGGCAAAGGG CTTCCGGGTC ATCAATCTGG AGAAGGCGGA ACGGCTCGAG
GAAGCCGGCG CCGGACTCCA GCTTTCCCCC AACGCCAGCC GCGTGCTGAT CGATCTCGGT
CTTGCCGGCC GGCTCGCGCA GCGCGCGATC GTGCCGGACG CGGTGACGGT GATGAGCGCG
CGGACCGGCC GCGCGCTGGT GCGGCTGCCG CTCGGCGACG CCGCGGGCGC ACGCGCCGGC
GCGCCCTATT GGGTGATCCA CCGCGCCGAT TTGCAAGCCG CGCTCGAAGC GCAGGTCAAC
GCCCACCCGT CGATCGATCT GCGGCTCGGC TGCCGGTTCG AGGATTTCGC CAACGACGTC
CACGGCGTCA GTATCGGCCA TCGCTGCCGC GCCGAGCGCA AGCAGGACTC TGCGCTGGCG
CTGATCGGCG CCGACGGCAT CTGGTCGACG GTGCGCGGGA AATTGTTTCC GACGGCGCAG
CCTCGTTTCA GCGGACTAAT TGCCTGGCGC GGCACGGTCG AGGCCAAGGC GCTGCCGCAA
CGCGCCGCGC TCGCCGGCGT GCAGCTCTGG ATGGGACCGG ACGCGCATCT CGTGGTCTAT
CCGATCTCCG GCGGGCGGCT CGTCAATCTG GTGGCGATTG TTGCGGACGA CTGGCGCCGC
GAGGGTTGGA GCGCACCCGG CGACGCCCGT GACATCCAAC GCCGGTTCGC CGCCGCCGGC
TGGGCGTCCG CGGCGAGGCT GCTGATTGAC TCGGTCGAAA ACTGGAAGCG CTGGGCGCTG
TTCGCGATGC CGGATGGCGG GGTGTGGACC GCGGGCTCGA CCGCGCTGCT CGGCGACGCG
GCGCATGGAA TGCTGCCGTT CGCAGCGCAG GGCGCGGGCA TGGCGATCGA GGACGCCGCG
GTGCTGGCGA AATGCCTCGG CGAAAGCCAT GGCGCGGACG CTTCAGACGC CGCGCTCCCG
GTTGCGGCGT CGCTCCAGCG CTACGCGCAG GCGCGCAGCA CGCGGGTGGC GCGGGTGCAG
CGGCTGGCGC GGCAGAACGG CGGCATCTAT CACCTCAAGG GTCCGATCGC ACTGGCGCGC
GATCTGGCGA TGCAGGCGCT CGGCGGCGAA CTGCTGCTGG CGCGGCAGAA TTGGATCTAC
GACTGGCGGG CGTGA
 
Protein sequence
MSAPRTIIVA GAGIGGLTAS LALAAKGFRV INLEKAERLE EAGAGLQLSP NASRVLIDLG 
LAGRLAQRAI VPDAVTVMSA RTGRALVRLP LGDAAGARAG APYWVIHRAD LQAALEAQVN
AHPSIDLRLG CRFEDFANDV HGVSIGHRCR AERKQDSALA LIGADGIWST VRGKLFPTAQ
PRFSGLIAWR GTVEAKALPQ RAALAGVQLW MGPDAHLVVY PISGGRLVNL VAIVADDWRR
EGWSAPGDAR DIQRRFAAAG WASAARLLID SVENWKRWAL FAMPDGGVWT AGSTALLGDA
AHGMLPFAAQ GAGMAIEDAA VLAKCLGESH GADASDAALP VAASLQRYAQ ARSTRVARVQ
RLARQNGGIY HLKGPIALAR DLAMQALGGE LLLARQNWIY DWRA