Gene SeD_A0404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0404 
SymbolhemB 
ID6874958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp426976 
End bp427950 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content55% 
IMG OID642783636 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_002214323 
Protein GI198246038 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.956627 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00000000427813 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAGACC TTATCCATCG CCCTCGCCGC CTGCGCAAAT CCGCTGCGCT GCGCGCTATG 
TTTGAAGAGA CAACACTTAG CCTGAACGAC CTGGTGTTGC CGATCTTTGT TGAAGAAGAA
CTCGACGATT ACAAAGCTAT TGATGCCATG CCCGGCGTAA TGCGTATCCC GGAAAAGCAG
CTGGCGCGAG AGATTGAACG TATCGCAAAT GCTGGCATTC GTTCCGTTAT GACCTTCGGC
ATTTCTCACC ATACTGATGA CACCGGCAGC GATACCTGGA AAGAAGACGG CCTGGTGGCA
AGAATGTCCC GCATCTGTAA GCAAACCGTG CCGGAGATGA TCGTCATGTC CGACACCTGC
TTCTGCGAAT ACACCTCGCA CGGCCACTGC GGCGTGTTGT GCGAACACGG TGTGGATAAC
GATGCGACGC TGGCCAACCT CGGCAAGCAG GCCGTTGTCG CGGCGGCGGC TGGAGCAGAT
TTTATTGCGC CCTCCGCGGC CATGGATGGA CAAGTCCAGG CTATCCGCCA GGCGCTGGAT
GCCGCCGGTT TTACCGATAC GGCAATAATG TCCTACTCCA CCAAGTTCGC CTCTTCTTTC
TACGGCCCCT TCCGTGAAGC AGCGGGTACC GCGTTAAAAG GCGACCGCAA GACGTACCAA
ATGAATCCGA TGAACCGCCG TGAAGCGATT CGCGAATCAC TGCTTGACGA AGCCCAGGGC
GCGGATTGCT TAATGGTGAA ACCGGCCGGC GCGTATCTGG ACGTGCTGCG TGAAATCCGC
GAACGCACAG AGTTGCCGCT TGGCGCTTAC CAGGTGAGCG GTGAATATGC CATGATTAAA
TTTGCTGCTA TGGCTGGCGC CATCGATGAA GAAAAGGTCG TGCTGGAAAG TCTGGGCTCG
ATTAAACGCG CCGGCGCCGA TTTGATTTTC AGTTACTTCG CGCTGGATCT GGCTGAGAAA
AATATTCTGC GTTAA
 
Protein sequence
MTDLIHRPRR LRKSAALRAM FEETTLSLND LVLPIFVEEE LDDYKAIDAM PGVMRIPEKQ 
LAREIERIAN AGIRSVMTFG ISHHTDDTGS DTWKEDGLVA RMSRICKQTV PEMIVMSDTC
FCEYTSHGHC GVLCEHGVDN DATLANLGKQ AVVAAAAGAD FIAPSAAMDG QVQAIRQALD
AAGFTDTAIM SYSTKFASSF YGPFREAAGT ALKGDRKTYQ MNPMNRREAI RESLLDEAQG
ADCLMVKPAG AYLDVLREIR ERTELPLGAY QVSGEYAMIK FAAMAGAIDE EKVVLESLGS
IKRAGADLIF SYFALDLAEK NILR