Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_2335 |
Symbol | hemE |
ID | 4896506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 2471118 |
End bp | 2472149 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640112931 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_001044209 |
Protein GI | 126463095 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.540822 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.254547 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAGA CGATGCTGCG TGCGCTGAAG GGCGAGACGC TGCCCACGCC TCCCATCTGG CTCATGCGTC AGGCGGGGCG CTATCTGCCG GAATATCGCG CCACGCGCGC CCAGGCGGGG GACTTCCTCT CGCTCTGCTA CACGCCGGAT CTCGCGGCGG AAGTGACGCT TCAGCCGATC CGCCGCTATG GGTTCGACGC GGCGATCCTC TTTGCCGACA TCCTGCTCTT GCCGCAGGCG CTGGGGGCGG ACCTGTGGTT CGAGACCGGC GAAGGGCCGC GCATGTCGAC CATCACCGAC ATGGAGGGCG TGACCGCGCT GAAGGGCCGC GACGACATCC ACGAGACGCT CGCGCCGGTC TATGAGACCT GCCGCATCCT CGCCCGCGAA CTTCCCAAGG AGACAACCTT TATCGGCTTT GCGGGCATGC CCTGGACGGT CGCGACCTAC ATGATCGCGG GCCGCGGCAG CAAGGATCAG GCCGCCGCGC ACAAGCTGAA GGACACCGAC CGTCCCGCCT TCGAGGCGCT GATGGACCGC GTGACCGAGG CCACCATCGA ATATCTCGCC AAGCAGGTCG AGGCCGGCTG CGAGGTGGTG AAGCTCTTCG ACAGCTGGGC CGGCTCGCTG AAGGGCCAGG ACTTCGAGGA TTTCGCCGTG GCCCCCGCCA AGCGGATCGT CTCGGAGCTG AAGGCCCGCT TCCCGGGACT GCCGGTCATC GCCTTCCCGC GCGAGGCGGG CGAGGGCTAC ATCGGCTTCG CCGAGAAGAC CGGCGCCGAC TGCGTGGCCA TCGACAATTC CGTCAGCCCG GAATGGGCGG CCGAGAAGGT GCAGGCGGGC CGGACCTGCG TGCAGGGCAA CCTCGACCCG AAATACATGG TGACGGGTGG CGAGGAGCTG GTGCAGGCCA CGAAACGCGT GGTCGAGGCG TTCCGGAACG GCCCGCACAT CTTCAACCTC GGCCATGGCA TCACGCCCGA GGCCGATCCG GAGAACGTGA CGCTGCTCGT CGAGACGATC CGCGGCAAGT GA
|
Protein sequence | MTKTMLRALK GETLPTPPIW LMRQAGRYLP EYRATRAQAG DFLSLCYTPD LAAEVTLQPI RRYGFDAAIL FADILLLPQA LGADLWFETG EGPRMSTITD MEGVTALKGR DDIHETLAPV YETCRILARE LPKETTFIGF AGMPWTVATY MIAGRGSKDQ AAAHKLKDTD RPAFEALMDR VTEATIEYLA KQVEAGCEVV KLFDSWAGSL KGQDFEDFAV APAKRIVSEL KARFPGLPVI AFPREAGEGY IGFAEKTGAD CVAIDNSVSP EWAAEKVQAG RTCVQGNLDP KYMVTGGEEL VQATKRVVEA FRNGPHIFNL GHGITPEADP ENVTLLVETI RGK
|
| |