Gene RPC_1971 details


Gene Information

Locus tag: RPC_1971
Symbol:
ID: 3973644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2144676
End bp: 2146091
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 68%
IMG OID: 637925082
Product: coproporphyrinogen III oxidase
Protein accession: YP_531847
Protein GI: 90423477
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID:
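The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2144676, 2146091)
print(length)                     # 1416
print(protein_length_aa(length))  # 471
```

Both values agree with the Gene Length and Protein Length fields of this record.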


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.794108
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCGA TCGCCGCGGC GTCGGCGTTG GCCGCACCCC CGCACGCCGT GATGGGGCGC 
GGCCATCCGC ATGGCGGGCC GAAGCCGGTC AGTGCCTTCC TTGTCGAGCC CGGTCGCGAT
GCCTTGACGC AGGCGTTCAG CGGTCGAGAG TTCTCGCCGC CATGGCACGG CAGCATCGAG
GTTGCGGCGA GCGAGGCCGC GGCCGCCTTC GAGCGATTGT TCGCGACGCC GCGTAGCGCG
CCTGCGATCG CCTATATCCA TATGCCTTAC TGTCAGAACC ACTGCCTGTT CTGCGGCTTC
TTTCAGAATG TCTGGCGACC GGAGGTCGCA GAGCCCTTTG TCGACGACGT GATTGCCGAA
ATCGCCGCCA AGGCGGCGAC GCCGTTGGTG GCGTCGGCGC CGATCGAGGC GGTCTATATC
GGCGGCGGCA CGCCCTCGGC GCTGCCGGAC CAACAGCTCG TTCGGCTGAT CGGCAGCCTG
CGGCAATTGT TGCCGTTGAC GCCGGATTGC GAGATCACGC TGGAGGGCCG CTCGCACGGT
TTCGGCGTGG CCAAGGCGGC TGCGGTGCTG GAGGCCGGAG CGACCCGGAT CTCGCTCGGC
GTGCAAACCT TCTCCACCGC TGTCCGCCGC CGGCTCGGAC GCAAACAGTC CGGGCCGGAG
GTGGCGTCGT TTCTCGAGGA TCTGGTCGGG CTCGGTCGTG CCAGTATCGT GTGCGATCTG
ATCTATGGCC TGCCCGGACA GGATCACGAC GGCTGGCTGC GCGACATCGA CATCGCCAAC
GCCGTCGGGC TCGATGGCGT GACCTTGTAT GCGCTCAACC TGTTTCCCGG CGGCCCGCTC
GCCACCGCGA TCGAGCATGG CAAGTTGCCG CCCGCGGCCA ACATCGCCGC GCAGGCGCGG
GACTACGCCG CTGGCGTCGA TCGGTTGCTG GGGTTCGGCT GGCGGCAGGC CTATCAGTCG
CATCTGATCC GCTCGCCGCA CGAGCAGAAC CGTTACAACG CCCTGATCAA GCAGGGATCG
GCCTGCCTGC CGTTCGGTCC GGGCGCCGGC GGGCAGGCGC ACGGTTATCG TTGGCGCAAT
GTGATCGACG TCGAGCAGCG CCGCGCCATG CTGGCGCAGG GCGTGGCCCC GGTGGAGGGG
CTGTCGCGGG TGCCGCTGCA ATATGCCGCG CAAGCGGTGA TCACCGCGGG CCTCGAGGCC
GGGCGGCTCG ATCTCGCCGC GGTGGAGAGC TTGCATCCGG GGTTTCGCGT CGCGGCGGCA
CCGTTGCTGG CGAACTGGAC CGAGGTCGGG CTCGGCGAGA TCGTCAAAGA CCATTTCCAG
CCGAGCCGCG CCGGCGCATT CTGGATCACC AAACTGACCG GCGGCTTCTA TGCCGCGCTA
CGATCGGCGC CGTCGGCAAA GGGCGAAGGA GGCTAA
 
Protein sequence
MTAIAAASAL AAPPHAVMGR GHPHGGPKPV SAFLVEPGRD ALTQAFSGRE FSPPWHGSIE 
VAASEAAAAF ERLFATPRSA PAIAYIHMPY CQNHCLFCGF FQNVWRPEVA EPFVDDVIAE
IAAKAATPLV ASAPIEAVYI GGGTPSALPD QQLVRLIGSL RQLLPLTPDC EITLEGRSHG
FGVAKAAAVL EAGATRISLG VQTFSTAVRR RLGRKQSGPE VASFLEDLVG LGRASIVCDL
IYGLPGQDHD GWLRDIDIAN AVGLDGVTLY ALNLFPGGPL ATAIEHGKLP PAANIAAQAR
DYAAGVDRLL GFGWRQAYQS HLIRSPHEQN RYNALIKQGS ACLPFGPGAG GQAHGYRWRN
VIDVEQRRAM LAQGVAPVEG LSRVPLQYAA QAVITAGLEA GRLDLAAVES LHPGFRVAAA
PLLANWTEVG LGEIVKDHFQ PSRAGAFWIT KLTGGFYAAL RSAPSAKGEG G
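The gene and protein sequences above are printed in blocks of 10 characters, so they need whitespace removed before any analysis. The sketch below uses only the Python standard library to clean a sequence, compute its GC percentage, and translate it with the codon assignments of NCBI translation table 11. The helper names (clean, gc_percent, translate) are hypothetical. Table 11 also permits alternative start codons that are read as M at initiation; this sketch does not handle that case, which does not matter here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGACGGCGA TCGCCGCGGC GTCGGCGTTG GCCGCACCCC CGCACGCCGT GATGGGGCGC"
print(translate(clean(first_line)))  # MTAIAAASALAAPPHAVMGR
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full cleaned gene sequence to gc_percent and translate would allow the GC content and Protein Length fields to be checked in the same way.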