Gene Information |
Locus tag | A9601_06391 |
Symbol | hemE |
ID | 4717340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 557876 |
End bp | 558916 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640078352 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_001009032 |
Protein GI | 123968174 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
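The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of the product, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(557876, 558916)
print(length)                  # 1041
print(protein_length(length))  # 346
```

Both results match the Gene Length (1041 bp) and Protein Length (346 aa) fields of this record.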
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCAAG ATTTACCGCT ACTACTTTCT GCCGCATTAG GTAAAAAAGT AAATAGGCCT CCAGTATGGA TGATGAGGCA AGCAGGAAGA TATATGAAAA TCTATAGAGA TTTAAGGGAG CGTTACCCAA GCTTTAGAGA GAGGTCTGAA AATCCAGAAC TATCATATGA GATTTCAATG CAGCCTTTTC ATGCTTTCAA ACCGGATGGT GTGATCCTTT TTTCAGATAT TCTCACACCT CTTCCAGGGA TGGGCATAAA TTTTGAAATA ATAGAAAGTA AAGGTCCAAT TATTGAGGAC CCAATAAGAA CTCTTAATCA GGTAGAAAAT TTAAGAGAAT TAAATCCAAG CGAGAGTTTA AGCTTTGTTG GGCAAGTTCT TTCTTCACTA AAAAAAGATG TAAATAACGA GGCAACTATT TTAGGTTTTG TTGGCGCACC TTGGACTCTT GCTGCATATG TAGTTGAAGG TAAAAGCAGT AAGAATTATT CCTTAATAAA ATCAATGGCT TTTAATGAAC CAGATTTACT TCATAAACTT CTTGATCATT TTGCAAAATC TATTGGTGAA TATCTTAAAT ATCAAATAAA ATCTGGAGCG CAAGTAGTAC AAATTTTTGA TTCATGGGCA GGCCAACTAA GCCCACAAGA TTATGATATG TTTGCTGGGC CGTATCAAAA AAAAGTTGTT GAAATTGTAA AAGCGGAATA CCCTGAAACA CCAATAATTC TTTACATTTC AGGAAGTGCT GGGGTACTGG AAAGAATGGC AAAAACTGGA GTAGATATAA TTTCACTAGA CTGGACAGTA GATATTGAAG AGGCTTGTAA AAGAATCCCC AGGGGAATTG GAATTCAAGG TAATGTTGAC CCTGGCATTT TATTCGGAAA CAAAAAATCA ATAAAAGAAA GGATAGATGA TACTTTCAAT AAAATTAAAG ACAGGAAATA TATTCTTAAT TTGGGTCATG GGATTTTACC TGGGACTCCA GAAGAAAATG CTCAAACATT TTTTGAACAT GGGAAAAAAC TCACTTACTA G
|
Protein sequence | MGQDLPLLLS AALGKKVNRP PVWMMRQAGR YMKIYRDLRE RYPSFRERSE NPELSYEISM QPFHAFKPDG VILFSDILTP LPGMGINFEI IESKGPIIED PIRTLNQVEN LRELNPSESL SFVGQVLSSL KKDVNNEATI LGFVGAPWTL AAYVVEGKSS KNYSLIKSMA FNEPDLLHKL LDHFAKSIGE YLKYQIKSGA QVVQIFDSWA GQLSPQDYDM FAGPYQKKVV EIVKAEYPET PIILYISGSA GVLERMAKTG VDIISLDWTV DIEEACKRIP RGIGIQGNVD PGILFGNKKS IKERIDDTFN KIKDRKYILN LGHGILPGTP EENAQTFFEH GKKLTY
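The sequences above are displayed in space-separated blocks of ten characters. Although the gene lies on the minus strand, the gene sequence shown begins with ATG and ends with TAG, so it appears to be given in coding orientation already. The following is a minimal sketch, under that assumption, of how the blocks could be joined, the GC fraction computed, and the CDS translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the NCBI amino acid string shared by translation tables 1 and 11; the alternative start codons that table 11 permits are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon, if any, is ignored.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence from this record.
prefix = "ATGGGTCAAG ATTTACCGCT"
print(translate(prefix))  # MGQDLP
```

The translated prefix agrees with the first six residues of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the reported GC content and protein sequence to be cross-checked in the same way.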