Gene Information |
Locus tag | NATL1_06391 |
Symbol | hemE |
ID | 4779337 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 579581 |
End bp | 580639 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640083917 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_001014466 |
Protein GI | 124025350 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
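The coordinate and length fields above are related by simple arithmetic: with inclusive coordinates the gene length is End bp minus Start bp plus one, and the protein length is the codon count minus the stop codon. A minimal Python sketch of that consistency check follows. The function names are hypothetical, and the inclusive, 1-based coordinate convention is an assumption that the values in this record happen to satisfy.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(579581, 580639)
print(length_bp)                  # 1059
print(protein_length(length_bp))  # 352
```

Both results agree with the Gene Length and Protein Length fields listed above.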
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.401207 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.983268 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGAAA CTACTCCTTT ACTACTCCGT GCAGCTCGCG GAGAACATGT TGAAAGGCCC CCAGTTTGGA TGATGAGACA AGCGGGGAGA TACATGAAGG TATATCGCGA CCTTCGTGAT AATCATCCAA GCTTCAGGGA AAGATCCGAA AACCCCGATC TTTCTTATGA AATTTCAATG CAACCTTTTA CAGCTTTTCA ACCAGATGGA GTGATACTTT TTTCAGATAT CTTGACTCCT CTACCTGGGA TGGGAATTAA CTTTGACATC GTTGAAAGCA AAGGACCCTT AATAAATGAC CCAATAAGAA GCCTCAAACA GGTTAAAGAC CTTAAACCCC TTCAACCAGA AGAAAGCATG TCTTTTGTTG GGGAAGTCCT TGGAAGGCTA AGGGAAAGCG TTGGGAACAA GGCTGCAGTT CTTGGGTTTG TAGGTGCTCC TTGGACTCTT GCTGCATATG TTGTAGAGGG AAAAAGCAGC AAAAATTATG CAGTTATAAA GGCGATGGCA TTCCAAGAGC CAGAACTACT GCATCAACTT TTAAATCACT TTGCAGAATC AATTGCAAAC TATTTATCCT ATCAAATTCA ATCTGGGGCC CAAGTAGTTC AAATGTTTGA TTCATGGGCA GGACAATTAA GTCCACAAGA TTATGACGAG TTTGCTGCGC CTTATCAACA AAAAGTAGTC AATTTAGTAA AAGAAAAACA TCCAGATACA CCTATGATTT TATACATCTC AGGTAGCGCG GGAGTTCTTG AAAGGATGGG ACAAACCGGA GTAGATATAG TCTCTCTAGA TTGGACTGTT GACATGGCAG ATGGACTAAA AAGGCTGCCT CAATCAGTTG GAGTCCAAGG AAATGTTGAT CCAGGACTTT TGTTTGGTAC TCCTGATGCG ATCAGATCAA GAATTGTTGA TGTCGTCAAA AAAGCGAAAG GTAGAAAACA TATTCTTAAC CTTGGTCATG GAATACTTCC AGGGACGCCA GAAGAAAATG CAAGAGTATT TTTCGAGGCT GGTAAAAATG TGAATGAACT CATAAAAGTT TCATCTTGA
|
Protein sequence | MNETTPLLLR AARGEHVERP PVWMMRQAGR YMKVYRDLRD NHPSFRERSE NPDLSYEISM QPFTAFQPDG VILFSDILTP LPGMGINFDI VESKGPLIND PIRSLKQVKD LKPLQPEESM SFVGEVLGRL RESVGNKAAV LGFVGAPWTL AAYVVEGKSS KNYAVIKAMA FQEPELLHQL LNHFAESIAN YLSYQIQSGA QVVQMFDSWA GQLSPQDYDE FAAPYQQKVV NLVKEKHPDT PMILYISGSA GVLERMGQTG VDIVSLDWTV DMADGLKRLP QSVGVQGNVD PGLLFGTPDA IRSRIVDVVK KAKGRKHILN LGHGILPGTP EENARVFFEA GKNVNELIKV SS
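The sequences above are printed in space-separated blocks of ten characters. The following Python sketch, assuming that layout, joins the blocks and computes a GC fraction, the quantity summarized in the GC content field. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the GC content of the whole gene.

```python
def join_blocks(grouped: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two display blocks of the gene sequence in this record.
prefix = join_blocks("ATGAACGAAA CTACTCCTTT")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.35
```

Passing the complete Gene sequence value to the same two functions would give the fraction for the full gene.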
|
| |