Gene RPD_0397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0397 
Symbol 
ID4020863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp463569 
End bp464726 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content68% 
IMG OID637960582 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_567536 
Protein GI91974877 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGCGCTG ATCCCGACAA GGCGTTCGGC GTCTACGTGC ACTGGCCGTT CTGCCTGTCG 
AAGTGCCCGT ATTGCGACTT CAACAGCCAC GTCCGCCACG CCGCGATCGA CGAGGCGCGA
TTCGCCCGCG CCTTCGCCCG CGAGATTGCA GCCACTGCGG CGCGGATCGG TCCGCGTGAT
GTCAGCTCGA TCTTTCTCGG CGGCGGCACG CCGTCGCTGA TGCAGCCCTC GACCGTCGGC
GCGATCCTCG ACGCGATCGG CAAACATTGG CGCGTCACGC CCGATGCCGA AGTATCACTC
GAAGCCAATC CGACCAGCGT CGAGGCGACG CGGTTTCGCG GCTATCGCGC CGCCGGCGTC
AACCGCGTCT CGCTCGGCGT GCAGGCGCTC GACGACGCCT CGCTGAAGAC GCTCGGCCGG
CTGCACACCG CGCAGGAGGC GATGGACGCG GTGGCGATCG CGCGCTCGGC TTTCGATCGT
TATTCGTTCG ACCTGATCTA TGCACGTCCT GGACAAACGC CGGCGATGTG GGAGGCCGAG
CTGCGCCGGG CGATCGGCGA GGCGGCCGAA CATCTGTCGC TCTATCAGCT CACCATCGAA
GCCGAGACGC CGTTCTTTGC GCTGCATCGG GCGGGCAAAC TGCAGACACC GGACGAATCC
GCGTCGCGCG CGCTCTACGA CGTCACGCAA TCGGTCTGCG CAGAGCTCGG ACTCCCTGCC
TACGAGATTT CCAATCACGC CCGCCCGGGT GCCGAGTGCA AGCACAACCT CGTCTACTGG
CGCGGCCAGG AATATGCCGG GATCGGACCC GGCGCGCATG GCCGGCTCGA CATCGGCGGC
ACGCGCTACG CGATCGCCAC CGAAAAACGG CCGGAGAGCT GGATGATGCG GGTCGAAGCC
ACCGGCACCG GCGTGATCAC TGACGACCGG CTGAACAGCG AAGAGCGCGC GGACGAATTC
CTGCTGATGG GGCTGCGGCT CGCCGAGGGC ATCGACCCGC GGCGCTATCA GGCGCTGGCC
GGCCGCTCGC TCGATCCGTC GCGGATCGCG CTGTTGTGCG CGGAAGGCGC CATCGTGGTC
GACGCTGATG GTCGTCTGCG CGTGACCCAA GCCGGCTTCC CGGTGCTCGA CGCCGTGGTG
GCGGATCTGG CGGCTTGA
 
Protein sequence
MGADPDKAFG VYVHWPFCLS KCPYCDFNSH VRHAAIDEAR FARAFAREIA ATAARIGPRD 
VSSIFLGGGT PSLMQPSTVG AILDAIGKHW RVTPDAEVSL EANPTSVEAT RFRGYRAAGV
NRVSLGVQAL DDASLKTLGR LHTAQEAMDA VAIARSAFDR YSFDLIYARP GQTPAMWEAE
LRRAIGEAAE HLSLYQLTIE AETPFFALHR AGKLQTPDES ASRALYDVTQ SVCAELGLPA
YEISNHARPG AECKHNLVYW RGQEYAGIGP GAHGRLDIGG TRYAIATEKR PESWMMRVEA
TGTGVITDDR LNSEERADEF LLMGLRLAEG IDPRRYQALA GRSLDPSRIA LLCAEGAIVV
DADGRLRVTQ AGFPVLDAVV ADLAA