Gene RPC_3149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3149 
Symbol 
ID3972682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3492705 
End bp3493901 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content70% 
IMG OID637926258 
Productsalicylate 1-monooxygenase 
Protein accessionYP_533010 
Protein GI90424640 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.136472 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACAGCCC CGCGAACCAT CGTCGTCGCC GGCGCCGGGA TCGGAGGATT GACCGCGTCG 
CTGACGCTTG CAGCCAAGGG TTTTCGCGTC ATCGTGCTGG AAAAGGCGCA GCGGCTGGAA
GAAGCCGGCG CCGGGCTGCA GCTGTCACCC AATGCCAGCC GCGTGCTGAT CGATCTCGGA
TTGCGGCCGC GGCTGGAGCC CAGCGTCATT ACGCCTGACG CCGTCACCAT CATGAGCGCG
CGCGGCCAGG GCGAGATCGT TCGCCTGCCG CTCGGCGACG AGGCCAGGTT TCGCGCCGGG
GCTCCCTATT GGGTGATTCA TCGCGCCGAT CTGCAGACCG CGCTGGCGGC GCAGGTCCGC
GACCATCGCA ACATCGAACT GCGGCTCGGC TGGCAGTTCG AAGAGGTCAG CAGCACCGCC
GACGGCGTTT CGGTGACGCA GCGCAACGGA CTGTCGCGGC TGCATGAGCC GGCTTTAGCG
TTGGTCGGAG CCGACGGCAT CTGGTCGGCG GTCCGCCGCC AGCTGTTTCC CGATGCGCAG
CCGAAGTTCT CCGGACTGAT CGCATGGCGC GGCACGTTCG AGGCCGACCG CTTGCCGGCC
GGGTTTGCCG CACGCAATGT GCAGCTGTGG ATGGGCGGCA ATGCGCATCT GATCGCCTAT
CCGATTTCCG CAGGCCGTCG CATCAACATC GTGGCGATCG TCGCCGGATC CTGGAACCGG
CCGGGCTGGA GCGCGCCCGG CGATCCCGGC GAGATCAACA GCCAGTTCGC CCCGCCGCAT
TGGCCGGACC AGGCTCGCGT CCTGATCGAC GCCGTGCAGG GCTGGCGCCG CTGGGCGCTG
TTCACCATGC AGGATGGCGG GGTGTGGAAT CACGGCGCCG CGGCGATGCT CGGCGATGCC
GCGCACGGCA TGCTGCCGTT CGCCGCGCAG GGCGCCGGCA TGGCGATCGA AGACGCCGCG
GTGCTGGCCG CCTGCCTCGG CGACATCTCG GCCCCGGAGG CGGTGCCGGC GGCGCTGCAG
CGTTACGCGG AGCTACGGCA GCCGCGCGTC GGCCGGGTGC AGCGCACCGC GCGACTGAAT
GGTCAGATCT ATCACCTCGC CGGCGCCGCG GCGCTGGCGC GCGACCTGAC GATGCGCGCG
CTGGGCGGCC CGCGGCTCTT GGCGCGGCAG CGCTGGATCT ATGATTGGCG GGTGTGA
 
Protein sequence
MTAPRTIVVA GAGIGGLTAS LTLAAKGFRV IVLEKAQRLE EAGAGLQLSP NASRVLIDLG 
LRPRLEPSVI TPDAVTIMSA RGQGEIVRLP LGDEARFRAG APYWVIHRAD LQTALAAQVR
DHRNIELRLG WQFEEVSSTA DGVSVTQRNG LSRLHEPALA LVGADGIWSA VRRQLFPDAQ
PKFSGLIAWR GTFEADRLPA GFAARNVQLW MGGNAHLIAY PISAGRRINI VAIVAGSWNR
PGWSAPGDPG EINSQFAPPH WPDQARVLID AVQGWRRWAL FTMQDGGVWN HGAAAMLGDA
AHGMLPFAAQ GAGMAIEDAA VLAACLGDIS APEAVPAALQ RYAELRQPRV GRVQRTARLN
GQIYHLAGAA ALARDLTMRA LGGPRLLARQ RWIYDWRV