Gene Rpal_1983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1983 
Symbol 
ID6409643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp2143761 
End bp2144933 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content66% 
IMG OID642711869 
Product4-hydroxybenzoate 3-monooxygenase 
Protein accessionYP_001990981 
Protein GI192290376 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID[TIGR02360] 4-hydroxybenzoate 3-monooxygenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.334759 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGACAC AAGTGGCCAT CATCGGCGCG GGTCCGTCCG GACTGCTGCT CGGCCAGCTG 
CTGCACAAAT ACGGCATCGA CGCCGTCATC GTGGAACGCA AGGATCCGGA CTACGTGCTG
TCGCGGATTC GCGCCGGCGT GCTCGAACAG GGCATGGTCG ACCTGCTCGA CGAAGCCGGC
GTCTCCACCC GGTTGCACCA GGAAGCGCTG GTGCACGGCG GCTTCGAGAT CGCGTTCGCC
GGTCAGCGGC ACCACATCGA TCTGAGGGGC GCGACCGGCG GCAAGAGCGT CACCGTGTAT
GGCCAGACCG AGGTCACCCG CGACCTGATG GAAGCGCGTA GCGCCGCCGG TCTCACCACG
ATCTACGACG CCGCCGACGT CAGCCTGCAC GACTTCGAAG GCGCCCATCC CAAGGTGCGC
TACGTCAAGG ACGGCACCAC GCGCGAGATC GTGTGCGATT TCATCGCCGG CTGCGACGGC
TTCCACGGCA TCAGCCGGCA GAGCGTGCCG GCAAGTGCGG TGCAGTCGTT CGAGCGGGTG
TATCCGTTCG GCTGGCTCGG ATTGCTCTCC GACACTCCGC CGGTGTCGTC GGAGCTGATT
TACGTCAACC ACGACCGCGG CTTTGCGTTG TGCTCGATGC GCTCGATGCA CCGCAGCCGC
TATTACGTGC AGTGCCCGCT CACCGACGAC GTCGCCGACT GGAGCGACGA CCGGTTCTGG
GACGAGCTGA AGAGCCGGCT CGATCCGGAA ACCGCCGGAA AGCTGGTCAC CGGCCCATCG
ATTGAAAAGA GCATCGCGCC ACTACGCTCA TTTGTCGCCG AGCCGATGCG GTTCGGCCGG
CTGTTCCTCG CCGGCGACGC CGCCCACATC GTGCCGCCGA CCGGCGCCAA GGGCCTGAAC
CTCGCCGCCA GCGACGTTTA CTATCTCTCT CGCGTCATGC GCGAATACTA CGCGGAGAAG
TCGGAGGCCG GCATCGATGC CTATTCGGCG AGCGCGCTTC GCCGGGTGTG GAAGGCCGAG
CGGTTCTCCT GGTGGATGAC CTCGCAGCTC CACCGCTTCC CCGACAGCGA CGCCTTCTCC
CAGCGCATCC AGACCGCCGA ACTCGACTAC CTGGTCAACT CTAAGGCCGC GCTCACCTCG
CTGGCGGAGA ACTACGTCGG CCTGCCGTAC TGA
 
Protein sequence
MRTQVAIIGA GPSGLLLGQL LHKYGIDAVI VERKDPDYVL SRIRAGVLEQ GMVDLLDEAG 
VSTRLHQEAL VHGGFEIAFA GQRHHIDLRG ATGGKSVTVY GQTEVTRDLM EARSAAGLTT
IYDAADVSLH DFEGAHPKVR YVKDGTTREI VCDFIAGCDG FHGISRQSVP ASAVQSFERV
YPFGWLGLLS DTPPVSSELI YVNHDRGFAL CSMRSMHRSR YYVQCPLTDD VADWSDDRFW
DELKSRLDPE TAGKLVTGPS IEKSIAPLRS FVAEPMRFGR LFLAGDAAHI VPPTGAKGLN
LAASDVYYLS RVMREYYAEK SEAGIDAYSA SALRRVWKAE RFSWWMTSQL HRFPDSDAFS
QRIQTAELDY LVNSKAALTS LAENYVGLPY