Gene RPB_3504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3504 
Symbol 
ID3911306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4010789 
End bp4012585 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content63% 
IMG OID637885406 
Productchloride peroxidase 
Protein accessionYP_487110 
Protein GI86750614 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATCA GGCGAGTTTT CGTCGCGACG TTCATACTGT CGCTCACTCC CGTGGTTTCC 
ACGGCCCAGC CGGTGCTTCA GACACAATCG CTCGACGACT TCCGGCGAAT CCTCAGTCCC
GGCCTGATTT CGCGCGAGGT GACCAGCGAA AAGGTGAGTC AACTCAACGC GGAGGCTGCG
AAGAGCTGCT TCGCCCCCGA CGCGTTTCAG CGCGCGATCC CGGCACTGCC GATCAGATAT
AAGCGGCCAC TTCCTTGCGC CCCGCAGAAT GCGTTCGACC GGTTCGTGAT GTGGAATCAG
ATCGCACTCG ACACCACATC GATGGATCAT GCTCCGCCGC CATTGGGCAC GCCGGACAAT
CCCGCCCTGC ACAATTACGG CCCGCACCGC TCGAGCCGGG TGATGGCGAT CGTTCACATC
GCGATGTTCG ACGCGATCAA TCTGGCGATG CGGTTCGACT CGAAATCACC CGACAAGCCG
AGATACGTGT TCGCGACCTA TCTGCAGGGC ATTCCCAATC CGAAAGAAGA CGCCTCGGTC
GACATGGCGA TCACGTATTC GGCGGGTGAG ACCCTGAAGG CGCTTTATCC TCAACAGGTG
GATGTGATCG AGAATCTCAT CGCCATCGAC GAGGGCGCCG TCCTGAGCGG CAAGGATCGC
AACGCGCCGA TGTATCGGGC CGGCCGCGAA CTCGGAATCA CCGTCGCGAA GGCGATTCTC
CGGGCACGCC GGAACGACGG CTCGGCGCAC GAAGAGCCGC AGGTCGGCGA TCCTGGCTTT
CCGCTCGCGG AAAAATCAGG TCAGTGGCGA CCCGACCCCG TCAGCAACAT CGCCGCGGCG
CTCGGCGGCA AGTGGAAGGA CGTTCAGCCG TTCGTCATCA AGAACGTGCC GACCTTCCGG
CCGCCCCCGC CGCCGTCGAC CGACAGCGCC GAATACGCCC GGGACTTCGA AGAGGTCAGG
AAGCTCGGCG GCGAGAACAC CGAGCGCTCG AGGTCTGAAC GCAGTGCGGA CCAGACACTG
ATCGGAACGT TCTGGGCTTA TGACGGCACC GCTTTCCTGT GCGCTCCGCC GCGGCTCTAC
AATCAGGTGA TCCGGCAGAT CGTGCAGCAG CAGATCAAGG ACGGCGATAC CAAGGACGAG
GATCCGAAGC TGCTCAACTA CGCCCGATTG TTCGCGCTCG CCAACATCGC AATGGCGGAC
GCGGCGATCG CCGCATGGGA CGCAAAGTAT CACTATCGCT ACTGGCGCCC GGTCGTCGGC
ATTCGGGCGG CCGCGACCGA CGGCAACGAT GCCACCCATG CCGACGTCTA CTGGAAGGCG
CTGGGGGCCC CGGCCAGCAA CTCCGTTCGC GGTCCCAACT TCACGCCTCC TTTCCCGGCC
TATCCTTCCG GACACGCCAC GTTCGGCGGC GCTCTGTTCG AAGTGCTGCG CGCGTTCTAC
CCGGACGACA CGTCCTTCAG CTTCATCTCC GACGAGTACA ACGGGCAAAA CAAGCCGGCC
GGCTCGGACG TGCCGCGACC GGAAGTGACC CGACGCTTCG TCAACTTCCG CGCGGCGGAG
GACGAAAATG CGCGCAGCCG GGTCTATCTC GGTGTGCACT GGCAGTTCGA TGCCGATGCC
GGGATCGCGC AGGGCAACCA GGTCGGCAGC TTCGTGGTCG GCAGCACGCT GCGCTGTCTC
GACGATGACG GCAGAGCGCT GGATTGCAAA CCGGGCAGCG GATTCGACGT CAAGCGCAAG
TTCCTGATCT CGACGGAGAA GCGGGTGCTG AGCACTCCGT TCACTCCGTC CCAGTAA
 
Protein sequence
MTIRRVFVAT FILSLTPVVS TAQPVLQTQS LDDFRRILSP GLISREVTSE KVSQLNAEAA 
KSCFAPDAFQ RAIPALPIRY KRPLPCAPQN AFDRFVMWNQ IALDTTSMDH APPPLGTPDN
PALHNYGPHR SSRVMAIVHI AMFDAINLAM RFDSKSPDKP RYVFATYLQG IPNPKEDASV
DMAITYSAGE TLKALYPQQV DVIENLIAID EGAVLSGKDR NAPMYRAGRE LGITVAKAIL
RARRNDGSAH EEPQVGDPGF PLAEKSGQWR PDPVSNIAAA LGGKWKDVQP FVIKNVPTFR
PPPPPSTDSA EYARDFEEVR KLGGENTERS RSERSADQTL IGTFWAYDGT AFLCAPPRLY
NQVIRQIVQQ QIKDGDTKDE DPKLLNYARL FALANIAMAD AAIAAWDAKY HYRYWRPVVG
IRAAATDGND ATHADVYWKA LGAPASNSVR GPNFTPPFPA YPSGHATFGG ALFEVLRAFY
PDDTSFSFIS DEYNGQNKPA GSDVPRPEVT RRFVNFRAAE DENARSRVYL GVHWQFDADA
GIAQGNQVGS FVVGSTLRCL DDDGRALDCK PGSGFDVKRK FLISTEKRVL STPFTPSQ