Gene Rpal_1862 details

Gene Information

Locus tag: Rpal_1862
Symbol:
ID: 6409521
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 1998764
End bp: 2000113
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 66%
IMG OID: 642711750
Product: coproporphyrinogen III oxidase
Protein accession: YP_001990863
Protein GI: 192290258
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00538] oxygen-independent coproporphyrinogen III oxidase
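
The coordinates and lengths listed above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: the helper name `cds_lengths` is hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # drop the terminal stop codon
    return gene_len, protein_len


print(cds_lengths(1998764, 2000113))  # (1350, 449)
```

Under those assumptions the result matches the reported gene length of 1350 bp and protein length of 449 aa.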


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTCTG CCCTCGACAA ATACGCCAAG AGCTCGGTGC CGCGCTACAC CAGCTATCCC 
ACCGCACCGC ATTTCGCGAA GGACTTCCCG GAGTCGATCT ATCGCGGCTG GCTCGCCCAG
CTCGACACCG ACGAGCCGGT TTCGCTGTAT CTTCATGTGC CGTTCTGCAA GCAGATGTGC
TGGTACTGCG GCTGCAACAT GAAGCTGGCG GCGAAGTACG ATCCGGTCGC CGACTACGTC
GAGCACCTGA TCGACGAAAT CGATCTGGTC GCGGACGCTC TCCCCGGCAC CATGCCGGTG
CGTCATCTGC ATTTCGGCGG CGGCACCCCG ACGGTGATCG ATCCGCAGGA TCTCGGCGCG
CTGATGACGC TGCTGCGCGA GCGCTTCGAG TTCCTGCCCG ATGCCGAGAT CGCGATCGAG
AGCGACCCGC GCACGCTGAC CGAAGACATG GCCGCCAAGA TCGGCGAGCT CGGCTTCACC
CGCGCCAGCT TCGGCGTGCA GGAATTCGAC CCGAAGGTGC AGGAAGCGAT CAACCGCGTC
CAGCCGCCCG AAATGGTCGC GCGCGCGATG CAGCTGTTCA AATCGGCCGG CGTCGAGCGC
ATCAATTTCG ACCTGATCTA CGGCCTGCCC TATCAGACCG CCGAAGACCT GCGCCGCACC
GTCGAACAGT GCGTCGAGAT GAAGCCCGAC CGGGTCGCGC TATTCGGCTA CGCTCACGTG
CCGTGGGTCG CCAAGAACCA GCGGATGATC CCGGACGAGT CGCTGCCGAA GCCGGAGCTG
CGCGCCACGC AGGCCGAGAC CGCTGCCGAA GCCCTGGTGA AGGGCGGCTA CGTCCGCATC
GGCATCGACC ATTTCGCGCT GCCGGGTGAC TCGCTCGCGA TCGCGGCCAA GACCGGCGAA
CTGCACCGGA ATTTCCAGGG CTACACCTCC GACGCGGCGC AGACCCTGAT CGGCCTGGGC
GCCACATCGA TCGGCCGCAC CCCGAGCGGC TATGTGCAGA ACATCAGCGA AACCGGTGCC
TGGTCGCGCG CGGTCGAAGC CGGCAAGATC CCGGTCGCGC GTGGTCACGC TCTGACCCAG
CAGGACAATC TGCGCGCCCA CGTGATCGAA CGCATCATGT GCGACGGCAA GGTCGACCTC
GCCGCAGCCG GCAAGGCCTT CGGCTGTGGC GAAGACTGGT ACGCGCCGGA GCAGGACTCG
CTCGCCGAAC TGCAGCGCGA CGGCGCCGTG GTGTGCAATG GCAGCAAGCT GACGCTGACG
CCGGAAGGCG TCCGGCTGTC GCGTGTGGTG GCGTCGGTGT TCGACACCTA CCTGCGCAAC
TCGTCGGTCC GGCACTCGAT CGCGGTCTGA
 
Protein sequence
MSSALDKYAK SSVPRYTSYP TAPHFAKDFP ESIYRGWLAQ LDTDEPVSLY LHVPFCKQMC 
WYCGCNMKLA AKYDPVADYV EHLIDEIDLV ADALPGTMPV RHLHFGGGTP TVIDPQDLGA
LMTLLRERFE FLPDAEIAIE SDPRTLTEDM AAKIGELGFT RASFGVQEFD PKVQEAINRV
QPPEMVARAM QLFKSAGVER INFDLIYGLP YQTAEDLRRT VEQCVEMKPD RVALFGYAHV
PWVAKNQRMI PDESLPKPEL RATQAETAAE ALVKGGYVRI GIDHFALPGD SLAIAAKTGE
LHRNFQGYTS DAAQTLIGLG ATSIGRTPSG YVQNISETGA WSRAVEAGKI PVARGHALTQ
QDNLRAHVIE RIMCDGKVDL AAAGKAFGCG EDWYAPEQDS LAELQRDGAV VCNGSKLTLT
PEGVRLSRVV ASVFDTYLRN SSVRHSIAV
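
The protein sequence above can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is illustrative only: the names `CODON_TABLE` and `translate` are hypothetical, and it assumes the sequence is given 5' to 3' on the coding strand, in frame from the first base. It uses the standard codon assignments, which translation table 11 shares for sense and stop codons; the tables differ only in which codons may act as start codons.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; the first
# base varies slowest). Table 11 uses the same sense and stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Codons containing characters other than A, C, G, T raise a KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above
print(translate("ATGTCGTCTG CCCTCGACAA ATACGCCAAG"))  # MSSALDKYAK
```

Passing the full gene sequence in the same way should give the listed protein sequence, with the final TGA codon read as the stop.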