Gene RPC_3814 details

Gene Information

Locus tag: RPC_3814
Symbol:
ID: 3969273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4245090
End bp: 4246439
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 65%
IMG OID: 637926924
Product: coproporphyrinogen III oxidase
Protein accession: YP_533667
Protein GI: 90425297
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00538] oxygen-independent coproporphyrinogen III oxidase
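
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4245090
END_BP = 4246439
GENE_LENGTH_BP = 1350
PROTEIN_LENGTH_AA = 449


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span covers 1350 bp, which corresponds to 450 codons, or 449 residues plus a stop codon.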


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.156886
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACATCTG CTTTGGAAAA ATACGCCAAG AGTTCGGTGC CGCGCTACAC CAGCTATCCC 
ACCGCACCGC ATTTTGCCGC GGAGTTTCCC GAATCGATCT ATCGCGGCTG GCTGGCGCAG
CTCGACCTCG ACGAGCCGAT TTCGCTGTAT CTGCACGTGC CGTTCTGCAA GCAGATGTGC
TGGTACTGCG GCTGCAACAT GAAGCTCGCC TCGCGCTATG AGCCGGTTGC CACCTATCTG
CAGCACCTGC TCGACGAGAT CGACCTTCTG GCCAACGCCA TGCCGGGCCG GATGCCGGTC
GGTCACGTGC ATTTTGGCGG CGGCACCCCG ACGGTGCTCG AGCCGGACGA TCTCGGCGCG
GTGATGGCGC TGATCGCCGA TCGCTTCCGC CTGATCGAGG GCGCTGAGAT CGCAATCGAA
AGCGATCCGC GCACCTTGAC CGACGACATG GTCGACAAGA TCGGCGCGAT CGGCTTCACC
CGCGCGAGCT TCGGCGTGCA GGAGTTCGAC CCCAAGGTGC AGGCCGCGAT CAACCGGATC
CAGCCGCCCG AGATGGTCAA GCACGCGATG GACCGTTTCC GTGCGGTCGG CGTCAAAAAA
CTCAATTTCG ACCTGATCTA CGGGCTACCC TATCAGACTG CGGAGGATCT GCGCCGCACC
GTCGAACAAT GCGTCGAGAT GCGGCCGGAC CGCGTCGCGC TGTTCGGCTA CGCCCATGTG
CCGTGGGTCG CCAAGAACCA ACGGATGATT CCGGACGAAT CGCTGCCGGA ACCGGCGCAG
CGCGCCGAGC AGGCCCGCGC CGCCGCCGAC GCCTTGGTCA AGGGCGGTTA CGTGCGGATC
GGCATCGACC ATTTCGCGCT GCCCGGCGAT ACGCTGGCGG TGGCCGCAGC GACCGGCGAG
CTGCACCGCA ACTTCCAGGG CTACACCCCC GACGCCTCGC CCACCCTGAT CGGCATCGGC
GCCACCTCGA TCGGCCGCAC GCCGTCGGGC TATGTGCAGA ACATCAGCGA GACCGGCGCC
TATATGCGCG CGGTCGAAGC CGGCAAGTTG CCGATCGCCC GCGGCCACGC CTTCAAGGGC
CAAGACGACC TGCGCGCCCA CGTCATCGAG CGCATCATGT GCGACGGCAA GGTCGACCTC
AACGCTGTGG GGCGGATCTT CGGCGCCGCC GAGGACTGGT ACGACGGCGA ACGCGAGGCA
ATCGCAGAGC TACAGAAGGA CGGTGTGTTG ACCTGCGCCA ATGGCACGCT GACGCTCACT
CCCGCCGGCG AGCCGCTCGC CCGCGTCGTC GCCGCGGTGT TTGACACTTA TTTGCGCAAC
TCCTCCGTCC GCCATTCGAT CGCGGTCTGA
 
Protein sequence
MTSALEKYAK SSVPRYTSYP TAPHFAAEFP ESIYRGWLAQ LDLDEPISLY LHVPFCKQMC 
WYCGCNMKLA SRYEPVATYL QHLLDEIDLL ANAMPGRMPV GHVHFGGGTP TVLEPDDLGA
VMALIADRFR LIEGAEIAIE SDPRTLTDDM VDKIGAIGFT RASFGVQEFD PKVQAAINRI
QPPEMVKHAM DRFRAVGVKK LNFDLIYGLP YQTAEDLRRT VEQCVEMRPD RVALFGYAHV
PWVAKNQRMI PDESLPEPAQ RAEQARAAAD ALVKGGYVRI GIDHFALPGD TLAVAAATGE
LHRNFQGYTP DASPTLIGIG ATSIGRTPSG YVQNISETGA YMRAVEAGKL PIARGHAFKG
QDDLRAHVIE RIMCDGKVDL NAVGRIFGAA EDWYDGEREA IAELQKDGVL TCANGTLTLT
PAGEPLARVV AAVFDTYLRN SSVRHSIAV
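
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the space-separated 10-character blocks are purely cosmetic, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which codons may act as start codons). The helper names `clean` and `translate` are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence shown above.
FIRST_LINE = "ATGACATCTG CTTTGGAAAA ATACGCCAAG AGTTCGGTGC CGCGCTACAC CAGCTATCCC"
LAST_LINE = "TCCTCCGTCC GCCATTCGAT CGCGGTCTGA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))
    print(translate(LAST_LINE))
```

Each full line of the gene sequence holds 60 bases, so every line starts on a codon boundary and can be translated on its own. The first line yields the opening residues of the protein sequence, and the last line yields its final residues before the TGA stop codon.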