Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1258 |
Symbol | hemE |
ID | 3972051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 1374245 |
End bp | 1375222 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637924368 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_531139 |
Protein GI | 90422769 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0175405 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGATGA TGCGTCAGGC CGGCCGTTAT CTACCGGAGT ACCGGGAGGT TCGCGCCAAG GCAGGCGGCT TTCTCGACCT GTGCTTCAAC GCGGAATTCG CCGCCGAAGT GACGCTGCAG CCGATCCGCC GCTTCGGCTT CGACGCCGCG ATCATCTTCT CCGACATTCT GGTGGTGCCC TACGCGCTCG GCCGCGCGGT GCGGTTCGAG GTCGGCGAAG GCCCGCGGCT GGAGCCGCTG GACTCGCCGG ACAAGGTCGG CACGCTGTCC AAGGCGATCG ACCTGTCGAA GCTGCAGCCG GTGTTCGACG CGCTGAAGAT CGTGCGCCGC GAACTCCCCG CCGAGACGGC GCTGATCGGC TTCTGCGGTT CGCCGTTCAC GGTGGCGACC TATATGGTCG CCGGCCACGG CACCCCGGAT CAGGCCCCGG CGCGCAACAT GGCCTATCAG CACCCCGGCG CGTTCGCCAA AATCATCGAC GTGCTGGTCG AGAGCTCGAT CAGCTACCTG TTGGCGCAGC TCGACGCCGG CGCCGAAGTG CTGCAGATCT TCGACACCTG GGCCGGCGTG CTGCCGCCGC GCGAATTCGA GCGCTGGTCG ATCGAACCGA CCCGCCGCAT CGTCGAAGGC GTGCGCAAGG TCAAGCCGGG CGCCAAGATC ATCGGCTTCC CGCGCGGCGC CGGCGCGATG CTGCCGGCGT TCGTCGAACG CACCGGCGTC GACGGCGTGT CGATCGATTG GACCGCCGAG CCGTCCTTCG TTCGCGAGAA GGTGCAAAGC AAGGTCGTGG TGCAGGGCAA TCTCGATCCG CTGGTTCTGA TCGCCGGCGG TGCTGCGCTC GACGAAGCGG TCGACGACGT GCTGAACAGC TATTCCGGCG GCCGCCACAT CTTCAACCTC GGCCACGGCA TCCAGCCGGA AACCCCGATC GCTCACGTCG AGCAGATGAT CAAGCGGGTG CGCGACTACA AGGGCTGA
|
Protein sequence | MWMMRQAGRY LPEYREVRAK AGGFLDLCFN AEFAAEVTLQ PIRRFGFDAA IIFSDILVVP YALGRAVRFE VGEGPRLEPL DSPDKVGTLS KAIDLSKLQP VFDALKIVRR ELPAETALIG FCGSPFTVAT YMVAGHGTPD QAPARNMAYQ HPGAFAKIID VLVESSISYL LAQLDAGAEV LQIFDTWAGV LPPREFERWS IEPTRRIVEG VRKVKPGAKI IGFPRGAGAM LPAFVERTGV DGVSIDWTAE PSFVREKVQS KVVVQGNLDP LVLIAGGAAL DEAVDDVLNS YSGGRHIFNL GHGIQPETPI AHVEQMIKRV RDYKG
|
| |