Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3616 |
Symbol | hemE |
ID | 3837072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 4154681 |
End bp | 4155727 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637827740 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_428697 |
Protein GI | 83594945 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00257093 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGTGC CGTCCAAGAC CCAAAAGCCC TTTCTCCAAG CCCTGGCCGG CGAAACCCTG AGTCCGCCGC CGGTATGGTT GATGCGTCAG GCTGGACGCT ACCTTCCCGA GTATCGGGCG ACCCGGGAAG AGGCCGGTGG GTTCCTCGAT CTGTGCTACA CCCCGAAGTT GGCGGTGGAG GTCACCCTGC AGCCGCTGCG CCGTTACGCC TTCGACGCGG CGATCTTGTT TTCCGATATT CTGGTGGTGC CCAACGCCAT CGGTCGGCAG GTCGCTTTCA AGCAAGGCGA GGGACCGGTT CTTGATCCGC TGACCAGCCG TGCCGATGTC GAGGCGCTTG AGCCCGGAAA ACTGCGCGAG CGTCTGGGGC CGGTGTTTGA GACCGTGCGG GGTCTGGCGA GCGCCATTCC GTCGACGACG GCGCTGATCG GTTTCGCCGG CGCGCCGTGG ACGGTGGCGA CCTATATGCT CGAAGGCGGG TCGAGTAAGG ATTTCTCGGT GGCCAAATCG TGGATCTACA GCCGTCCCGA TGATTTCGCC GCCCTGATGG AGGTGCTGAT CAGCGCCACC ACCGATTATC TGATCGCCCA GATCGACGCC GGCGCCGAAG CCATCCAGAT TTTCGACACC TGGGCCGGCG TTCTGCCGGA AACGGAATTC CATCGCTGGG TGATCGAGCC GATCGGCCGG ATCACCCGCG CCCTTCACGA ACAGCGCCCC GGGGTTCCGG TGATCGGTTT TCCCAAAGGC GCCGGGGTTC TTTACGAGAC CTTCATCCGG GAAACCGGCG TGGACGGCGT TGGGCTCGAC GCCTCGGTTC CTTTGGCCTG GGCGGCCAAG ACCCTGCAGC CGCTGTGCAC CGTGCAGGGC AACATGGATC CCTTGCTGCT GGTCGAGGGC GGTCCGCTGA TGGAACAGGC GGTCAAGCGC CTTCTTGATA CCCTTGGCCA TGGACCCTTT ATCTTCAATC TCGGGCATGG CATTGTGCCG CAGACCCCTC CCGAGAATGT TGCTCGCCTG ATCGACTTGG TTCGCGCGCC GCGTTAG
|
Protein sequence | MSVPSKTQKP FLQALAGETL SPPPVWLMRQ AGRYLPEYRA TREEAGGFLD LCYTPKLAVE VTLQPLRRYA FDAAILFSDI LVVPNAIGRQ VAFKQGEGPV LDPLTSRADV EALEPGKLRE RLGPVFETVR GLASAIPSTT ALIGFAGAPW TVATYMLEGG SSKDFSVAKS WIYSRPDDFA ALMEVLISAT TDYLIAQIDA GAEAIQIFDT WAGVLPETEF HRWVIEPIGR ITRALHEQRP GVPVIGFPKG AGVLYETFIR ETGVDGVGLD ASVPLAWAAK TLQPLCTVQG NMDPLLLVEG GPLMEQAVKR LLDTLGHGPF IFNLGHGIVP QTPPENVARL IDLVRAPR
|
| |