Gene SeD_A4574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4574 
SymbolhemE 
ID6871439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4414581 
End bp4415645 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content57% 
IMG OID642787481 
Producturoporphyrinogen decarboxylase 
Protein accessionYP_002218083 
Protein GI198242852 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0407] Uroporphyrinogen-III decarboxylase 
TIGRFAM ID[TIGR01464] uroporphyrinogen decarboxylase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value0.300698 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAC TCAAAAACGA TCGTTATCTA CGTGCGCTGC TGCGCCAGCC CGTTGATGTT 
ACGCCAGTGT GGATGATGCG CCAGGCGGGC CGTTATCTAC CGGAGTACAA AGCCACTCGC
GCTCAGGCGG GCGATTTTAT GTCGCTGTGC AAAAATGCTG AACTGGCCTG CGAGGTCACT
TTACAGCCGC TGCGCCGTTA TCCGCTTGAT GCGGCGATCC TTTTCTCGGA TATCCTGACG
ATTCCGGACG CGATGGGCCT GGGGCTCTAT TTTGAAGCTG GGGAAGGCCC GCGCTTTACG
GCTCCTGTCA CCTGCAAAGC CGACGTTGAA AAACTGCCGA TTCCGGATCC AGAAGGTGAA
CTGGGCTACG TGATGAATGC GGTACGGACC ATCCGCCGCG AACTAAAAGG CGAGGTTCCG
CTGATCGGCT TTTCCGGCAG CCCGTGGACG CTGGCGACTT ACATGGTGGA AGGCGGCAGT
AGCAAAGCCT TCACGGTGAT TAAAAAGATG ATGTACGCTG ACCCGCAGGC GTTGCATCTG
CTGCTGGATA AGTTGGCGAA AAGCGTCACG CTGTACCTCA ACGCGCAGAT TAAAGCGGGT
GCGCAGTCGG TGATGATTTT CGATACCTGG GGCGGCGTGC TGACTGGCCG CGATTACCAG
CAATTCTCCC TCTACTACAT GCATAAAATC GTCGATGGCC TGCTGCGTGA AAACGACGGT
CGCTGCGTGC CGGTAACGCT GTTCACTAAA GGTGGCGGTC AGTGGCTGGA GGCGATGGCG
GAAACCGGCT GCGACGCGCT GGGTCTCGAC TGGACGACAG ATATCGCTGA TGCGCGCCGT
CGCGTTGGCC ATAAAGTGGC GCTGCAGGGC AATATGGACC CCTCGATGCT GTATGCGCCA
CCGGCACGGA TCGAAGACGA AGTAGCGACT ATACTTGCTG GTTTCGGTCA GGGAGAAGGG
CACGTCTTTA ACCTTGGACA TGGCATCCAT CAGGATGTGC CGCCAGAACA TGCTGGCGCA
TTTGTGGAGG CAGTGCACCG ACTTTCTGCG CAGTATCACA ACTAA
 
Protein sequence
MTELKNDRYL RALLRQPVDV TPVWMMRQAG RYLPEYKATR AQAGDFMSLC KNAELACEVT 
LQPLRRYPLD AAILFSDILT IPDAMGLGLY FEAGEGPRFT APVTCKADVE KLPIPDPEGE
LGYVMNAVRT IRRELKGEVP LIGFSGSPWT LATYMVEGGS SKAFTVIKKM MYADPQALHL
LLDKLAKSVT LYLNAQIKAG AQSVMIFDTW GGVLTGRDYQ QFSLYYMHKI VDGLLRENDG
RCVPVTLFTK GGGQWLEAMA ETGCDALGLD WTTDIADARR RVGHKVALQG NMDPSMLYAP
PARIEDEVAT ILAGFGQGEG HVFNLGHGIH QDVPPEHAGA FVEAVHRLSA QYHN