Gene Information |
Locus tag | EcolC_4028 |
Symbol | hemE |
ID | 6064627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4434468 |
End bp | 4435532 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641603443 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_001726954 |
Protein GI | 170022000 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
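
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. This is a minimal sketch under two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the reported gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length(4434468, 4435532)
print(length)                  # 1065
print(protein_length(length))  # 354
```

The strand does not affect this calculation: for a gene on the minus strand the coordinates still span the same interval on the replicon.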
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00901601 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGAAC TTAAAAACGA TCGTTATCTG CGGGCGCTGC TGCGCCAGCC CGTTGATGTC ACTCCAGTAT GGATGATGCG CCAGGCGGGT CGCTATCTAC CGGAATATAA AGCCACGCGC GCCCAGGCGG GCGATTTTAT GTCGCTGTGC AAAAACGCCG AGCTGGCGTG CGAAGTGACT TTGCAACCGC TGCGTCGCTA CCCGCTGGAT GCGGCGATCC TCTTTTCCGA TATCCTCACC GTGCCGGACG CGATGGGGTT AGGGCTCTAT TTTGAAGCCG GAGAAGGTCC GCGTTTTACC TCGCCAGTCA CCTGCAAAGC CGACGTCGAT AAACTGCCAA TTCCGGACCC GGAAGATGAG CTGGGTTACG TGATGAACGC GGTGCGTACC ATTCGTCGCG AACTGAAAGG CGAAGTGCCG CTGATTGGTT TTTCCGGCAG CCCGTGGACG CTGGCGACCT ACATGGTGGA AGGCGGCAGC AGCAAAGCGT TCACCGTGAT CAAAAAAATG ATGTATGCCG ATCCGCAGGC GCTGCACGCT CTACTCGATA AACTGGCGAA AAGCGTCACT TTGTATCTGA ATGCGCAGAT TAAAGCCGGT GCTCAGGCAG TGATGATTTT CGACACCTGG GGCGGTGTGC TTACCGGGCG CGATTATCAA CAGTTCTCGC TCTATTACAT GCATAAAATT GTTGATGGTT TACTGCGTGA AAACGACGGT CGCCGCGTAC CGGTCACGCT GTTTACCAAA GGCGGCGGAC AGTGGCTGGA AGCGATGGCA GAAACCGGTT GCGATGCGTT GGGCCTCGAC TGGACAACGG ATATCGCCGA TGCGCGCCGC CGTGTGGGCA ATAAAGTCGC GTTGCAGGGT AATATGGATC CGTCGATGCT GTACGCGCCG CCTGCCCGCA TTGAAGAAGA AGTAGCGACT ATACTTGCAG GTTTCGGTCA CGGCGAAGGT CATGTCTTTA ACCTTGGTCA CGGCATTCAT CAGGATGTGC CGCCAGAACA TGCTGGCGTG TTCGTGGAGG CAGTGCATCG ACTGTCTGAA CAATATCACC GCTAA
|
Protein sequence | MTELKNDRYL RALLRQPVDV TPVWMMRQAG RYLPEYKATR AQAGDFMSLC KNAELACEVT LQPLRRYPLD AAILFSDILT VPDAMGLGLY FEAGEGPRFT SPVTCKADVD KLPIPDPEDE LGYVMNAVRT IRRELKGEVP LIGFSGSPWT LATYMVEGGS SKAFTVIKKM MYADPQALHA LLDKLAKSVT LYLNAQIKAG AQAVMIFDTW GGVLTGRDYQ QFSLYYMHKI VDGLLRENDG RRVPVTLFTK GGGQWLEAMA ETGCDALGLD WTTDIADARR RVGNKVALQG NMDPSMLYAP PARIEEEVAT ILAGFGHGEG HVFNLGHGIH QDVPPEHAGV FVEAVHRLSE QYHR
|
| |
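
The relationship between the gene sequence and the protein sequence listed above can be illustrated with a short translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. The helpers `clean`, `translate`, and `gc_percent` are hypothetical names, and only the first 30 and last 15 bases of the record are used here to keep the example short.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of input."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Excerpts copied from the Gene sequence field above.
gene_start = "ATGACCGAAC TTAAAAACGA TCGTTATCTG"
gene_end = "CAATATCACC GCTAA"

print(translate(gene_start))  # MTELKNDRYL
print(translate(gene_end))    # QYHR
```

Passing the full Gene sequence field to `translate` and `gc_percent` should, under the same assumptions, allow a comparison with the Protein sequence and GC content fields.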