Gene RPD_3769 details


Gene Information

Locus tag: RPD_3769
Symbol: hemE
ID: 4024285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4206123
End bp: 4207169
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 65%
IMG OID: 637963973
Product: uroporphyrinogen decarboxylase
Protein accession: YP_570891
Protein GI: 91978232
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0407] Uroporphyrinogen-III decarboxylase
TIGRFAM ID: [TIGR01464] uroporphyrinogen decarboxylase
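
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal Python sketch of that consistency check. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4206123
END_BP = 4207169
GENE_LENGTH_BP = 1047
PROTEIN_LENGTH_AA = 348


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # 1047
    print(protein_length(GENE_LENGTH_BP))  # 348
```

Under these assumptions both computed values agree with the Gene Length and Protein Length reported in the record.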


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.477463
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0130148
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACACAGA AACTGGTGAC AAAACCTTTC ATTGAGGTCA TTTCCGGAAA ACGGCAAGCC 
TCGCCCCCGA TGTGGATGAT GCGACAGGCA GGCCGTTACC TGCCGGAGTA CCGCGCCACC
CGCACCGAAG CCGGAAGCTT CCTCGATCTG TGCTTCAACC CCAAACTCGC CGCCGAGGTG
ACGTTGCAGC CGATCCGGCG CTTCGGTTTC GACGCCGCGA TCATCTTTTC CGACATTCTG
GTGGTGCCGT ACGCGCTCGG GCGCGCGGTC CGCTTCGAGG TCGGCGAGGG TCCGAGGCTC
GATCCGCTGA ACTCGCCGGA CCTGGTCGGC ACGCTCAACG GCGCGATCGA CCTCGGCAAG
CTCGAGCCGG TGTTCGAAGC GCTGCGAATT GTGCGCAGCG AGCTCGCGCC CGAGACGACG
CTGATCGGCT TCTGCGGCGC GCCGTTCACG GTCGCGACCT ACATGGTCGC CGGCCAGGGC
ACGTCGGATC AACATCCGGC GCGGCTGATG GCGTATCAGC ACCCCGGCGC CTTCGCCAAG
ATCATCGATG TTCTGGTCGA GAGTTCGATC CAGTATCTGC TGAAGCAGCT CGAAGCCGGC
GCCGACGTGC TGCAGATCTT CGACACCTGG GGCGGCATCC TGCCGCCGCG CGAATTCGAA
AAGTGGTGCA TCGAGCCGAC CCGCCGCATC GTCGAAGGCG TCCGCAAGGT GAAGCCCGAC
GCCAAGATCA TCGGCTTCCC GCGCGGCGCC GGCGCGCTGC TGCCGGCCTT CATCGAGCGC
ACCGGCGTTG ACGCCGTCAG CATCGACTGG ACGGCGGAGC CGAAGATGGT GCGCGACCAG
GTGCAGACCA AGGTCGCCGT GCAGGGCAAC CTCGACCCGC TTTTGCTGAT CGCCGGCGGC
TCGGCGCTCG ACCAAGGGGT GGACGACGTG CTGAAGAACT TCTCGGCCGG TCGCCACATC
TTCAATCTCG GCCACGGCAT CACGCCGGAC GCCTCGATCG CGCATGTCGA GCAGATGGTG
AAGCGAGTCC GCGCCTTCAG AGGCTGA
 
Protein sequence
MTQKLVTKPF IEVISGKRQA SPPMWMMRQA GRYLPEYRAT RTEAGSFLDL CFNPKLAAEV 
TLQPIRRFGF DAAIIFSDIL VVPYALGRAV RFEVGEGPRL DPLNSPDLVG TLNGAIDLGK
LEPVFEALRI VRSELAPETT LIGFCGAPFT VATYMVAGQG TSDQHPARLM AYQHPGAFAK
IIDVLVESSI QYLLKQLEAG ADVLQIFDTW GGILPPREFE KWCIEPTRRI VEGVRKVKPD
AKIIGFPRGA GALLPAFIER TGVDAVSIDW TAEPKMVRDQ VQTKVAVQGN LDPLLLIAGG
SALDQGVDDV LKNFSAGRHI FNLGHGITPD ASIAHVEQMV KRVRAFRG
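
The gene sequence begins with TTG, yet the protein sequence begins with M. This is consistent with translation table 11, where TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this in plain Python. The codon table is the standard amino-acid assignment shared by table 11; the `START_CODONS` set is an assumed subset of the table 11 initiators, and `translate_cds` is a hypothetical helper, not the tool used to produce this record.

```python
from itertools import product

# Standard codon-to-amino-acid assignment (also used by translation table 11),
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = frozenset({"ATG", "GTG", "TTG"})


def translate_cds(dna: str, start_codons=START_CODONS) -> str:
    """Translate a CDS, reading a recognized first codon as M and
    stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in start_codons:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("TTGACACAGA AACTGGTGAC AAAACCTTTC"))  # MTQKLVTKPF
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence should reproduce the whole protein under the same assumptions.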