Gene Information |
Locus tag | EcSMS35_4445 |
Symbol | hemE |
ID | 6144888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4542247 |
End bp | 4543311 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641619265 |
Product | uroporphyrinogen decarboxylase |
Protein accession | YP_001746381 |
Protein GI | 170683454 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | [TIGR01464] uroporphyrinogen decarboxylase |
|
|
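The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon, which is not counted in Protein Length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(4542247, 4543311)
    print(length)                           # 1065
    print(expected_protein_length(length))  # 354
```

Under these assumptions, the computed values agree with the Gene Length (1065 bp) and Protein Length (354 aa) fields of this record.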
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid Hitchhiker | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.000531857 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGAAC TTAAAAACGA TCGTTATCTG CGGGCGCTGC TGCGCCAGCC CGTTGATGTC ACTCCAGTAT GGATGATGCG CCAGGCGGGT CGCTATCTAC CGGAATATAA AGCCACGCGC GCCCAGGCGG GCGATTTTAT GTCGCTGTGC AAAAACGCCG AGCTGGCGTG CGAAGTGACT TTGCAACCGC TGCGTCGCTA CCCGCTGGAT GCGGCGATCC TCTTTTCCGA TATCCTCACC GTGCCGGACG CGATGGGGTT AGGGCTCTAT TTTGAAGCCG GAGAAGGTCC GCGTTTTACC TCGCCAGTCA CCTGCAAAGC CGACGTCGAT AAACTGCCAA TTCCGGACCC GGAAGATGAG CTGGGTTACG TGATGAACGC GGTGCGTACC ATTCGTCGCG AACTGAAAGG CGAAGTGCCG CTGATTGGTT TTTCTGGCAG CCCGTGGACG CTGGCGACCT ACATGGTGGA AGGCGGCAGC AGCAAAGCCT TCACCGTGAT CAAAAAAATG ATGTATGCCG ATCCGCAGGC GCTGCACGCG CTACTCGATA AACTGGCGAA AAGCGTCACT TTGTATCTGA ATGCGCAGAT TAAAGCCGGT GCTCAGGCAG TGATGATTTT CGACACCTGG GGCGGCGTGC TTACCGGGCG CGATTATCAA CAGTTCTCGC TCTATTACAT GCATAAAATT GTTGATGGTT TACTGCGTGA AAACGACGGT CGCCGCGTAC CGGTCACGCT GTTTACCAAA GGTGGCGGAC AGTGGCTGGA AGCGATGGCA GAAACCGGTT GCGATGCGCT GGGCCTCGAC TGGACAACGG ACATCGCCGA TGCGCGCCGT CGCGTGGGCA ATAAAGTCGC GCTGCAGGGT AATATGGATC CGTCGATGCT GTACGCGCCA CCTGCCCGCA TTGAAGAAGA AGTTGCGACT ATACTTGCCG GTTTCGGTCA CGGCGAAGGT CATGTCTTTA ACCTTGGTCA CGGCATTCAT CAGGATGTGC CGCCAGAACA TGCTGGCGTG TTTGTGGAGG CAGTGCATCG CTTGTCTGAA CAGTATCACC GCTAA
|
Protein sequence | MTELKNDRYL RALLRQPVDV TPVWMMRQAG RYLPEYKATR AQAGDFMSLC KNAELACEVT LQPLRRYPLD AAILFSDILT VPDAMGLGLY FEAGEGPRFT SPVTCKADVD KLPIPDPEDE LGYVMNAVRT IRRELKGEVP LIGFSGSPWT LATYMVEGGS SKAFTVIKKM MYADPQALHA LLDKLAKSVT LYLNAQIKAG AQAVMIFDTW GGVLTGRDYQ QFSLYYMHKI VDGLLRENDG RRVPVTLFTK GGGQWLEAMA ETGCDALGLD WTTDIADARR RVGNKVALQG NMDPSMLYAP PARIEEEVAT ILAGFGHGEG HVFNLGHGIH QDVPPEHAGV FVEAVHRLSE QYHR
|
| |
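The gene and protein sequences above can be checked against each other, and against the GC content field, with a short script. This is a minimal sketch and not the method used to generate this record. It assumes the sequences are pasted in as strings with the 10-character spacing shown here. It relies on the fact that NCBI translation table 11 assigns the same amino acid to each codon as the standard table (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# NCBI codon ordering: first base varies slowest, third base fastest, over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed in this record.
    print(translate("ATGACCGAAC TTAAAAACGA TCGTTATCTG"))  # MTELKNDRYL
    # Final 15 bases, ending in the TAA stop codon.
    print(translate("CAGTATCACC GCTAA"))                  # QYHR
```

Passing the full Gene sequence value to `translate` and comparing the result with `clean` applied to the Protein sequence value gives a complete consistency check. `gc_percent` on the full gene sequence can be compared with the GC content field in the same way.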