Gene RPB_4542 details


Gene Information

Locus tag: RPB_4542
Symbol: hemH
ID: 3912359
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5136270
End bp: 5137307
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 64%
IMG OID: 637886446
Product: ferrochelatase
Protein accession: YP_488136
Protein GI: 86751640
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0276] Protoheme ferro-lyase (ferrochelatase)
TIGRFAM ID: [TIGR00109] ferrochelatase
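
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the stated 1038 bp) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 5136270, End bp 5137307.
length = gene_length(5136270, 5137307)
print(length, protein_length(length))  # 1038 345
```

Both results agree with the Gene Length and Protein Length fields of the record.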


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.220243
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.879431
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTGA TTGTCCCCAT TCACGGCCCT GCACCCGCTC TGGCACCTGC GCCCGAGCGC 
GTCGGCGTAT TGCTGGTCAA TCTCGGCACC CCCGACAGCT GCGACACCAA GGGCGTGCGG
GTCTATTTGC GCGAGTTCCT GTCGGACCCG CGGGTGATCG AGAATCAGGG GATCTTCTGG
AAGCTGGCGC TGAACGGCAT CATCCTGAAC ACCCGTCCGG CCCGCAAGGC CAAGGACTAC
CAGAAGATCT GGAACCAGGA GAAGAACGAG TCGCCGCTGA AGACCATCAC CCGCGCGCAG
GCCGAGAAGC TCGCCGCGTC GCTGAGCGAT CGCAGCCACC TGGTGGTGGA CTGGGCGATG
CGTTACGGCA ACCCGTCGAT GCGCGACCGG ATCGAGGCGC TGGTGGCGCA AGGCTGCTCG
CGGCTGCTGG TGGTGCCCCT CTATCCACAA TATTCGGCGG CGACCTCGGC CACCGTGTGC
GACCAGGCGT TTCGCGTGCT GCGCGAATTG CGCGCCCAGC CGACGCTGCG GGTGACACCG
CCTTACTACC GCGACGACGC CTATATCGAC GCGCTGGCGA ATTCGATCCA TGCGCATCTG
GCGACGCTGC CGTTCAAGCC GGAGATGATC GTCGCCTCTT TTCACGGCAT GCCGCAGGCC
TATATCGAGA AGGGCGATCC GTATCAGTCG CAATGCGTCG CCACCGTCGA TGCGCTGCGC
GAGCGGATGG GGCTGGACGA CAAGAAGCTG CTGCTGACGT TCCAGTCGCG GTTCGGCTTC
GACCAGTGGC TGCAGCCCTA CACCGACAAG ACCATCGAGA AGCTCGCCAA GGACGGCGTG
CGCAAGCTCG CCGTGGTGAT GCCCGGCTTC GCCGCGGACT GCCTCGAGAC GCTGGAAGAA
ATCGCGCAGG AGAATGCCGA GATCTTCATG CACAATGGCG GCGAGGAGTT CTCCGCGATC
CCCTGCCTCA ACGACAGCGC CGACGGCATC GCGGTGATCC GGCAACTGGT GATGCGCGAA
CTGGAAGGTT GGCTGTAG
 
Protein sequence
MTVIVPIHGP APALAPAPER VGVLLVNLGT PDSCDTKGVR VYLREFLSDP RVIENQGIFW 
KLALNGIILN TRPARKAKDY QKIWNQEKNE SPLKTITRAQ AEKLAASLSD RSHLVVDWAM
RYGNPSMRDR IEALVAQGCS RLLVVPLYPQ YSAATSATVC DQAFRVLREL RAQPTLRVTP
PYYRDDAYID ALANSIHAHL ATLPFKPEMI VASFHGMPQA YIEKGDPYQS QCVATVDALR
ERMGLDDKKL LLTFQSRFGF DQWLQPYTDK TIEKLAKDGV RKLAVVMPGF AADCLETLEE
IAQENAEIFM HNGGEEFSAI PCLNDSADGI AVIRQLVMRE LEGWL
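
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is illustrative only: it builds the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in codon meanings), and translates just the first 30 bases of the gene sequence as an excerpt. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard codon assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied with their display spacing.
excerpt = "ATGACGGTGA TTGTCCCCAT TCACGGCCCT"
print(translate(excerpt))  # MTVIVPIHGP
```

The translated excerpt matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would extend the same check to the Protein Length and GC content fields.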