Gene RPC_3812 details

Gene Information

Locus tag: RPC_3812
Symbol:
ID: 3969271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4242590
End bp: 4244299
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 62%
IMG OID: 637926922
Product: magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase
Protein accession: YP_533665
Protein GI: 90425295
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID: [TIGR02026] magnesium-protoporphyrin IX monomethyl ester anaerobic oxidative cyclase
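
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single trailing stop codon. The function names `span_length` and `coding_codons` are hypothetical, chosen for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 4242590
END_BP = 4244299
GENE_LENGTH_BP = 1710
PROTEIN_LENGTH_AA = 569


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length agrees with the coordinates, and the protein length agrees with the gene length minus one stop codon.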


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATGCT TTCTTGACAA TGACGGGAGA GAGTCAGTGC GCATTATTCT CATTCATCCG 
AATTACCATT CTGGCGGCGC CGAGATCGCA GGAAACTGGC CGCCGGCATG GGCGGCCTAT
CTCGCCGGCG CACTGAAGTC GAATGGCTTC ACCGACCTTC GCTTCATCGA TGCGATGACC
GAGAACATCT CCGACGAAGA CCTCGCGGCG ATTCTCGCCG AGGAGAAGCC GGACATGATC
GGCTGCACCT CGATCACGCC ATCGATCTAC AAGGCCGAGC GTACGCTGCA GATCGCCAAA
CAGGTGCATC CGGATTGCGT CACCGTGTTG GGCGGCGTGC ACGCCACCTT CATGTTCCAG
CAGGTGCTGG GCGAAGCGCC CTGGATCGAC GTGGTGGTGC GCGGCGAAGG CGAAGAAATC
CTCGTCGAGC TGGCGCGCGC GATCGAGAGC GGCCAATGGC CGGCCAACCG CAACGACATC
AAGGGCATCG CCTTCCCCGA TCAGGCCGCC GACGGCACCC AGGTGGTCGC CACCGCGGCC
GCGCCCACCA TCAAGGACCT CGACGGCATC AGCCCGGACT GGGGCATCCT GAAGTGGGAC
ATGTACAAAT ACATCCCGAT GAACACCCGG GTCGCGATCC CGAACATGGC GCGCGGTTGC
CCGTTCACCT GTTCGTTCTG TTCGCAGTGG AAGTTCTGGC GCGACTATCG GGTCCGCGAC
CCGAAGAAGG TGGTCGACGA AATCGAGGAG TTGGTCAACA AGTACCAAGT CGGTTTCTTC
ATCCTCGCCG ACGAAGAGCC CACCATCAAC CGCAAGAAGT TCATCCAGTT CTGCGAAGAG
CTGATCGCCC GCGGCCTGAA CAAGAAGGTG CAGTGGGGCA TCAACACCCG CGTCACCGAC
ATTCTGCGCG ACGAGAAGCT GCTGAAGTTC TACAACGAGG CGGGCTTGAT GCACGTCTCG
CTCGGCACCG AGGCCGCGGC GCAGCTCAAG CTCGACCTGT TCAACAAAGA GACCAAGATC
TCCGACAACA AGAAGGCGAT CCGTCTGCTG CGCGAAGCCG GCATCGTCTG CGAAGCCCAG
TTCATCGTCG GCCTCGATAG CGAGACCCCG GAGACGCTGG AAGAAACCTA CCGCATGGCG
ATGGACTGGA AGCCCGACCT CGCCAACTGG TCGATGTACA CGCCGTGGCC GTTCACGCCG
CTGTTCAAGG AACTCAGCGA CAAGGTCGAG GTGTTCGACT TCGACAAGTA CAACTTCGTC
ACCCCGATCC TGAAGCCGGC GGCGATGGAG CGCGGCGAAT TGCTCGACCG GGTGATGAAC
AACTATCGCC GGTTCTACAT GTACAAGGCG TTCTTCTCCT ATCCGTGGTC GGGCACCGGG
CGGCGTCGCC GCTATCTGCT GGGCTGCCTG AAGGCATTCC TGAAGGCCGG CTTCGAACGC
AAGTTCTACG ATCTCGGCCG GGTCGGCTAT TGGGGCCCGC AGTCGAAGAA GAAGGTGGAC
TTCCACTTCG ACAACACGCG CTCCAAGGCG TTCGGGCAGA CCGCGGATTG GGAAGCCAAC
GCCGACCGTT CGCGCAAGGC GACCACGCCC ACCATCGTCT CCGCCTGCGG CGGCGGCACC
GAGCAGATGG CGGAAGACGC CGAATGCGCG GCCCTGCAGG CCAAGGTATT GGACAAAGCC
GATGGCGCCG GCGCAGTCAG CCGCCATTAA
 
Protein sequence
MSCFLDNDGR ESVRIILIHP NYHSGGAEIA GNWPPAWAAY LAGALKSNGF TDLRFIDAMT 
ENISDEDLAA ILAEEKPDMI GCTSITPSIY KAERTLQIAK QVHPDCVTVL GGVHATFMFQ
QVLGEAPWID VVVRGEGEEI LVELARAIES GQWPANRNDI KGIAFPDQAA DGTQVVATAA
APTIKDLDGI SPDWGILKWD MYKYIPMNTR VAIPNMARGC PFTCSFCSQW KFWRDYRVRD
PKKVVDEIEE LVNKYQVGFF ILADEEPTIN RKKFIQFCEE LIARGLNKKV QWGINTRVTD
ILRDEKLLKF YNEAGLMHVS LGTEAAAQLK LDLFNKETKI SDNKKAIRLL REAGIVCEAQ
FIVGLDSETP ETLEETYRMA MDWKPDLANW SMYTPWPFTP LFKELSDKVE VFDFDKYNFV
TPILKPAAME RGELLDRVMN NYRRFYMYKA FFSYPWSGTG RRRRYLLGCL KAFLKAGFER
KFYDLGRVGY WGPQSKKKVD FHFDNTRSKA FGQTADWEAN ADRSRKATTP TIVSACGGGT
EQMAEDAECA ALQAKVLDKA DGAGAVSRH
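
As an illustration of how the gene and protein sequences above relate, here is a minimal sketch that translates a nucleotide string and computes its GC fraction. It is not the tool that produced this record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons. The sketch does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGTCATGCT TTCTTGACAA TGACGGGAGA"))  # MSCFLDNDGR
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence if the listing is in frame, and `gc_fraction` can be compared against the reported GC content.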