Gene SNSL254_A0412 details

Gene Information

Locus tag: SNSL254_A0412
Symbol: hemB
ID: 6484735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 428136
End bp: 429110
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 55%
IMG OID: 642735836
Product: delta-aminolevulinic acid dehydratase
Protein accession: YP_002039610
Protein GI: 194442596
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0113] Delta-aminolevulinic acid dehydratase
TIGRFAM ID:
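
The start, end, gene length, and protein length reported for this gene can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes one stop codon (neither is stated on this page). The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(428136, 429110)
print(length)                     # 975
print(protein_length_aa(length))  # 324
```

Under these assumptions, the 975 bp span corresponds to 325 codons, of which the last is the stop codon, giving the 324 aa protein length listed.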


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.00540989
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACAGACC TTATCCATCG CCCTCGCCGC CTGCGCAAAT CCGCTGCGCT GCGCGCTATG 
TTTGAAGAGA CAACACTTAG CCTGAACGAC CTGGTGTTGC CGATCTTTGT TGAAGAAGAA
CTCGACGATT ACAAAGCTAT TGATGCCATG CCCGGCGTAA TGCGTATCCC GGAAAAGCAG
CTGGCGCGAG AGATTGAACG TATCGCAAAT GCTGGTATTC GTTCCGTTAT GACCTTCGGC
ATTTCTCACC ATACTGATGA CACCGGCAGC GATACCTGGA AAGAAGACGG TCTGGTGGCA
AGAATGTCCC GCATCTGTAA GCAAACCGTG CCGGAGATGA TCGTCATGTC CGACACCTGC
TTCTGCGAAT ACACCTCGCA CGGCCACTGC GGCGTGTTGT GCGAACACGG CGTGGATAAC
GATGCGACGC TGGCCAACCT CGGCAAGCAG GCCGTTATCG CGGCGGCGGC TGGAGCAGAT
TTTATTGCGC CCTCCGCGGC AATGGATGGA CAAGTCCAGG CTATCCGCCA GGCGCTGGAT
GCCGCCGGTT TTACCGATAC GGCAATAATG TCCTACTCCA CCAAGTTCGC CTCTTCTTTC
TACGGCCCCT TCCGTGAAGC AGCGGGCACC GCGTTAAAAG GCGACCGCAA GACGTATCAA
ATGAATCCGA TGAACCGCCG TGAAGCGATT CGCGAATCAC TGCTCGACGA AGCCCAGGGC
GCGGATTGCT TAATGGTGAA ACCGGCCGGC GCGTATCTGG ACGTGCTGCG TGAAATCCGC
GAACGCACAG AGTTGCCGCT TGGCGCTTAC CAGGTGAGCG GTGAATACGC CATGATTAAA
TTTGCTGCTA TGGCTGGCGC CATCGATGAA GAAAAGGTCG TGCTGGAAAG TCTGGGCTCG
ATTAAACGCG CCGGCGCCGA TTTGATTTTC AGTTACTTCG CGCTGGATCT GGCTGAGAAA
AATATTCTGC GTTAA
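
The gene sequence is laid out in blocks of ten bases separated by spaces and line breaks, so it needs cleaning before its length or GC content can be recomputed. The following is a rough Python sketch of that step. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the fragment shown is only the first and last lines of the sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Illustration only: the first 20 and last 15 bases of the gene sequence.
fragment = clean_sequence("ATGACAGACC TTATCCATCG\nAATATTCTGC GTTAA")
print(len(fragment), round(gc_percent(fragment), 1))
```

Pasting the full sequence into `clean_sequence` should allow the reported gene length and GC content to be compared against the recomputed values.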
 
Protein sequence
MTDLIHRPRR LRKSAALRAM FEETTLSLND LVLPIFVEEE LDDYKAIDAM PGVMRIPEKQ 
LAREIERIAN AGIRSVMTFG ISHHTDDTGS DTWKEDGLVA RMSRICKQTV PEMIVMSDTC
FCEYTSHGHC GVLCEHGVDN DATLANLGKQ AVIAAAAGAD FIAPSAAMDG QVQAIRQALD
AAGFTDTAIM SYSTKFASSF YGPFREAAGT ALKGDRKTYQ MNPMNRREAI RESLLDEAQG
ADCLMVKPAG AYLDVLREIR ERTELPLGAY QVSGEYAMIK FAAMAGAIDE EKVVLESLGS
IKRAGADLIF SYFALDLAEK NILR
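
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal Python sketch, assuming the amino-acid assignments of NCBI translation table 11 (the table listed for this gene). It does not handle alternative start codons or ambiguous bases, and the `translate` helper is hypothetical rather than part of any library.

```python
BASES = "TCAG"
# Amino acids for translation table 11, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop layout spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGACAGACC TTATCCATCG C"))  # MTDLIHR
print(translate("AATATTCTGC GTTAA"))         # NILR
```

The two fragments are the first seven codons and the last five codons of the gene sequence. Their translations match the start (MTDLIHR) and end (NILR) of the protein sequence shown here.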