Gene RPC_4842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4842 
SymbolhemH 
ID3973546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5403976 
End bp5405088 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content64% 
IMG OID637927954 
Productferrochelatase 
Protein accessionYP_534683 
Protein GI90426313 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGG CTCGATTTCG GCGAATGTTG CCTTATCTTG ACAACATTAT GCCAACGCGC 
GAAGCCAACC GCCTCATGAC AGTCATTGTT CCCATTCACG GCCCTGAGGC CGTTGCCGAT
CAGGCGCCGG AACGGGTGGG CGTGCTGTTG GTCAATCTCG GCACGCCGGA CAGCGCCGAC
ACCAAGGGGG TGCGTGACTA TCTGCGCGAG TTCCTGTCGG ATCCGCGCGT CATCGAGAAC
CAGGGCATCG TCTGGAAGCT GGCGCTGAAC GGATTGATCC TGCGTACCCG CCCGGCGCGC
AAGGCGCGCG ATTACCAAAA GATCTGGAAC AACGAAGCCA ACGAATCGCC GCTGAAGACC
ATCACAAGGG CGCAGGCCGA CAAACTCGCC GCGACGCTGA CCGCGCACGA CCACATCGTG
GTCGATTGGG CGATGCGCTA CGGCAATCCG TCGATGCGCT CGCGGATCGA CGCACTGGTC
GCGCAAGGCT GCAACCGGCT GCTGGTGGTG CCGCTGTATC CGCAATATTC CGCGGCGACC
TCGGCCACGG TCTGCGACCA GGCGTTCCGG GTGCTCAGCG AGATGCGCGC GCAGCCGACG
CTGCGGGTGA CGCCGCCGTA TTATCGCGAC GCCGCCTATA TCGACGCGCT GGCCAATTCG
ATCAGCAGCC ATCTGGCGAC GCTGCCGTTC GAGCCCGAAC GTATCGTCGC CTCGTTCCAC
GGCATGCCGC AGGCCTACAT CAACAAGGGC GACCCCTACC AGTCGCATTG CATCGCCACG
GTGGATGCGT TGCGCGAGCG CATGGGCCTC GACGAGAAGC GTCTGATGCT GACCTTCCAG
TCGCGGTTCG GCTTCGATCA GTGGCTGCAG CCCTACACCG ACAAGACCAT CGAACAACTC
GGCAAGGACG GCGTCCGCCG GCTCGCCGTG GTGATGCCGG GCTTCGCCTC GGACTGCCTG
GAAACGCTGG AAGAAATCGC GCAGGAAAAC GCCGAGATCT TCATGCACAA TGGCGGCGAA
AAGTTCGCCG CGGTGCCCTG CCTCAACGAC AGCGACGACG GCATCGCGGT GATCCGTCAA
CTGGTGCTGC GCGAGCTCGA GGGCTGGCTG TAG
 
Protein sequence
MSKARFRRML PYLDNIMPTR EANRLMTVIV PIHGPEAVAD QAPERVGVLL VNLGTPDSAD 
TKGVRDYLRE FLSDPRVIEN QGIVWKLALN GLILRTRPAR KARDYQKIWN NEANESPLKT
ITRAQADKLA ATLTAHDHIV VDWAMRYGNP SMRSRIDALV AQGCNRLLVV PLYPQYSAAT
SATVCDQAFR VLSEMRAQPT LRVTPPYYRD AAYIDALANS ISSHLATLPF EPERIVASFH
GMPQAYINKG DPYQSHCIAT VDALRERMGL DEKRLMLTFQ SRFGFDQWLQ PYTDKTIEQL
GKDGVRRLAV VMPGFASDCL ETLEEIAQEN AEIFMHNGGE KFAAVPCLND SDDGIAVIRQ
LVLRELEGWL