Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4014 |
Symbol | hemE |
ID | 3911821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4580766 |
End bp | 4581812 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637885918 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_487618 |
Protein GI | 86751122 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.104329 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0629126 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACACAGA AACTCGTGAC GAAACCGTTC ATTGAGGTGC TTTCCGGAAA TCGGCAGGCA TCTCCCCCGA TGTGGATGAT GCGGCAGGCC GGCCGTTACC TGCCGGAATA CCGCGCGACC CGCGCCGAAG CCGGCAGCTT CCTCGATCTG TGCTTCAACG CCAAGCTCGC CGCCGAGGTG ACGTTGCAGC CGATCCGGCG CTTCGGCTTC GACGCCGCGA TCATCTTTTC GGATATTCTG GTCGTGCCTT ACGCGCTCGG ACGCGCGGTG CGCTTCGAGG TCGGCGAAGG CCCGCGGCTC GATCCGTTGA ATTCGCCGGA CCTGGTCGGC ACGCTGAATG GCGCGATCGA CCTGTCGAAG CTCGAGCCGG TGTTCGAAGC GCTGCGCATC GTGCGCAGCG AGCTCGCCCC GGAGACGACG CTGATCGGCT TCTGTGGCGC GCCGTTCACC GTCGCGACCT ACATGGTCGC GGGTCAGGGC ACGTCGGATC AGCACCCGGC GCGACTGATG GCGTATCAGC ACCCCGGCGC GTTCGCCAGG ATCATCGACG TGCTGGTCGA GAGTTCGATC CAGTATCTGT TGAAGCAGCT CGAGGCCGGC GCCGACGTGC TGCAGATCTT CGACACCTGG GGCGGCATCC TGCCGCCCCG CGAATTCGAG AAGTGGTGCA TCGAGCCGAC CCGCCGCATC GTCGAGGGTG TCCGCAAGGT GAGCCCCGGC GCCAAGATCA TCGGCTTCCC GCGCGGCGCC GGCGCGATGC TGCCGGACTT CATCGCGCGC ACCGGCGTCG ACGCCGTGAG CATCGACTGG ACGGCCGAGC CGAACATGAT CCGCGAACGG GTGCAGAGCA AGGTCGCGGT TCAGGGCAAC CTCGATCCGC TGCTGCTGAT CGCCGGCGGT TCGGCGCTCG ATCAAGGCGT CGACGACGTG CTGAAGAACT TCTCGGCCGG ACGCCACATC TTCAATCTCG GCCACGGCAT CACGCCGGAC GCGCCGGTGG CGCATGTCGA GCAGATGGTG AAACGGGTCC GCGCCTACAA AGGCTGA
|
Protein sequence | MTQKLVTKPF IEVLSGNRQA SPPMWMMRQA GRYLPEYRAT RAEAGSFLDL CFNAKLAAEV TLQPIRRFGF DAAIIFSDIL VVPYALGRAV RFEVGEGPRL DPLNSPDLVG TLNGAIDLSK LEPVFEALRI VRSELAPETT LIGFCGAPFT VATYMVAGQG TSDQHPARLM AYQHPGAFAR IIDVLVESSI QYLLKQLEAG ADVLQIFDTW GGILPPREFE KWCIEPTRRI VEGVRKVSPG AKIIGFPRGA GAMLPDFIAR TGVDAVSIDW TAEPNMIRER VQSKVAVQGN LDPLLLIAGG SALDQGVDDV LKNFSAGRHI FNLGHGITPD APVAHVEQMV KRVRAYKG
|
| |