Gene RPB_4014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4014 
SymbolhemE 
ID3911821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4580766 
End bp4581812 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content66% 
IMG OID637885918 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_487618 
Protein GI86751122 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.104329 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0629126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACACAGA AACTCGTGAC GAAACCGTTC ATTGAGGTGC TTTCCGGAAA TCGGCAGGCA 
TCTCCCCCGA TGTGGATGAT GCGGCAGGCC GGCCGTTACC TGCCGGAATA CCGCGCGACC
CGCGCCGAAG CCGGCAGCTT CCTCGATCTG TGCTTCAACG CCAAGCTCGC CGCCGAGGTG
ACGTTGCAGC CGATCCGGCG CTTCGGCTTC GACGCCGCGA TCATCTTTTC GGATATTCTG
GTCGTGCCTT ACGCGCTCGG ACGCGCGGTG CGCTTCGAGG TCGGCGAAGG CCCGCGGCTC
GATCCGTTGA ATTCGCCGGA CCTGGTCGGC ACGCTGAATG GCGCGATCGA CCTGTCGAAG
CTCGAGCCGG TGTTCGAAGC GCTGCGCATC GTGCGCAGCG AGCTCGCCCC GGAGACGACG
CTGATCGGCT TCTGTGGCGC GCCGTTCACC GTCGCGACCT ACATGGTCGC GGGTCAGGGC
ACGTCGGATC AGCACCCGGC GCGACTGATG GCGTATCAGC ACCCCGGCGC GTTCGCCAGG
ATCATCGACG TGCTGGTCGA GAGTTCGATC CAGTATCTGT TGAAGCAGCT CGAGGCCGGC
GCCGACGTGC TGCAGATCTT CGACACCTGG GGCGGCATCC TGCCGCCCCG CGAATTCGAG
AAGTGGTGCA TCGAGCCGAC CCGCCGCATC GTCGAGGGTG TCCGCAAGGT GAGCCCCGGC
GCCAAGATCA TCGGCTTCCC GCGCGGCGCC GGCGCGATGC TGCCGGACTT CATCGCGCGC
ACCGGCGTCG ACGCCGTGAG CATCGACTGG ACGGCCGAGC CGAACATGAT CCGCGAACGG
GTGCAGAGCA AGGTCGCGGT TCAGGGCAAC CTCGATCCGC TGCTGCTGAT CGCCGGCGGT
TCGGCGCTCG ATCAAGGCGT CGACGACGTG CTGAAGAACT TCTCGGCCGG ACGCCACATC
TTCAATCTCG GCCACGGCAT CACGCCGGAC GCGCCGGTGG CGCATGTCGA GCAGATGGTG
AAACGGGTCC GCGCCTACAA AGGCTGA
 
Protein sequence
MTQKLVTKPF IEVLSGNRQA SPPMWMMRQA GRYLPEYRAT RAEAGSFLDL CFNAKLAAEV 
TLQPIRRFGF DAAIIFSDIL VVPYALGRAV RFEVGEGPRL DPLNSPDLVG TLNGAIDLSK
LEPVFEALRI VRSELAPETT LIGFCGAPFT VATYMVAGQG TSDQHPARLM AYQHPGAFAR
IIDVLVESSI QYLLKQLEAG ADVLQIFDTW GGILPPREFE KWCIEPTRRI VEGVRKVSPG
AKIIGFPRGA GAMLPDFIAR TGVDAVSIDW TAEPNMIRER VQSKVAVQGN LDPLLLIAGG
SALDQGVDDV LKNFSAGRHI FNLGHGITPD APVAHVEQMV KRVRAYKG