Gene RPB_3971 details

Gene Information

Locus tag: RPB_3971
Symbol:
ID: 3911778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4535830
End bp: 4536927
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 61%
IMG OID: 637885875
Product: magnesium-protoporphyrin IX monomethyl ester cyclase
Protein accession: YP_487575
Protein GI: 86751079
COG category:
COG ID:
TIGRFAM ID: [TIGR02029] magnesium-protoporphyrin IX monomethyl ester aerobic oxidative cyclase
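
The start, end, gene length and protein length fields above agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


length = gene_length_bp(4535830, 4536927)
print(length)                     # 1098
print(protein_length_aa(length))  # 365
```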


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.209511
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.408306
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCCGA TGGAAGGTGG CGCCCAGAGC GCGCTGCGAA GCCGGCCGGC GATCAAGGGC 
AGTGTCGAAA GTCTGAACAT CGCGAAAGAG GATACCATCC TCACGCCGCG GTTCTACACC
ACCGACTATG CCGCGATGGA CAAGCTCGAC GTCAGCCTGG TTCGTTCCGA ATGGAACGTG
ATGATGAACG AGATGCGCGC CGACTACAAC AAGTCGCACT TCAAGAAGAA CGACGAGTTT
CTGGAAAGCG ACCTCGACAA GCTGCCGCCG GCGCTGCGCG CCGAGTTCAA GGACTTCCTG
GTGTCGTCGC TCACCGCGGA ATTTTCCGGC TGCGTGCTTT ACGCCGAGAT CAAGAAGCGC
ATCAAGAATC CCGAAATCCG CGAATTGTTC GGTCTGCTGA GCCGCGACGA GGCCCGTCAT
GCCGGCTTCA TCAACGAGAT CCTCAAGGAT CACGGCATCG GCGTCGACCT GTCGTTCCTG
ACCAAGGTCA AGAAGTACAC CTATTTCCGG CCGAAGTTCA TCTTCTACGC GACCTATCTG
TCGGAGAAGA TCGGCTACGC CCGCTACATC ACGATCTATC GCCAGATGGA GCGGCATCCC
GAGCGCCGGT TCCATCCGAT CTTCAAATGG TTCGAGCGCT GGTGCAACGA CGAGTTCCGC
CACGGCGAGG CTTTCGCGCT GCTGATGCGC GCCGACCCGT CGCTGCTTTC GGGCGTGAAC
AAGCTGTGGA TCCGCTTCTT CCTGCTCGCC GTGTTCGCGA CGATGTACGT CCGCGATCAT
ATGCGGCCGG CGTTCTACGA GGCGCTCGGC ATGGACGCCG CCGAGTACGG CATGCAGGTT
TTCCGCATCA CGACCGAGAT CTCGAAGCAG GTTTTCCCGG TCACGATCAA CCTCGACGAC
CCGCGCTTCC TGGCGGGCCT CGAGCGCCTG CGCGTGGCCT CGGAGAAGCT CGCCGACTGT
CGCAGCCAAG GGTTCGTCGG CAAGCTGAAG CGGCCGTTCT ACGTGGCGTC TGCGGCGTTG
GCCTTCGGCC GGCTTTTCCT TCTGCCGGCG AAGCGCAACG AGTTGCCGCG CGTCATCGGC
CTTCGGCCGG CGTGGTGA
 
Protein sequence
MIPMEGGAQS ALRSRPAIKG SVESLNIAKE DTILTPRFYT TDYAAMDKLD VSLVRSEWNV 
MMNEMRADYN KSHFKKNDEF LESDLDKLPP ALRAEFKDFL VSSLTAEFSG CVLYAEIKKR
IKNPEIRELF GLLSRDEARH AGFINEILKD HGIGVDLSFL TKVKKYTYFR PKFIFYATYL
SEKIGYARYI TIYRQMERHP ERRFHPIFKW FERWCNDEFR HGEAFALLMR ADPSLLSGVN
KLWIRFFLLA VFATMYVRDH MRPAFYEALG MDAAEYGMQV FRITTEISKQ VFPVTINLDD
PRFLAGLERL RVASEKLADC RSQGFVGKLK RPFYVASAAL AFGRLFLLPA KRNELPRVIG
LRPAW
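
The protein sequence can be reproduced from the gene sequence, for example to confirm that the two listings match. The sketch below assumes the sequences are pasted as shown here, in space-separated blocks of ten characters. It builds the codon table by hand instead of using a library. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons, which does not matter here because this gene begins with ATG. The helper names `clean`, `translate` and `gc_fraction` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 18 bases of the gene sequence listed above.
print(translate("ATGATTCCGA TGGAAGGTGG CGCCCAGAGC"))  # MIPMEGGAQS
print(translate("CTTCGGCCGG CGTGGTGA"))               # LRPAW
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the protein sequence and the GC content given in the record.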