Gene SNSL254_A4220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4220 
SymbolhemC 
ID6486297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4111359 
End bp4112321 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content58% 
IMG OID642739473 
Productporphobilinogen deaminase 
Protein accessionYP_002043172 
Protein GI194442426 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0181] Porphobilinogen deaminase 
TIGRFAM ID[TIGR00212] porphobilinogen deaminase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.782422 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones86 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATAATGA CGGTAACAAG CATGTTAGAC AATGTTTTAA GAATTGCCAC ACGCCAAAGT 
CCCCTTGCGC TTTGGCAGGC ACATTATGTC AAAGACGCAT TGATGGCAAC CCATCCGGGA
CTGACGGTAG AACTGGTGCC GATGGTCACA CGCGGCGACG TGATTCTCGA TACTCCCCTG
GCAAAAGTGG GCGGTAAGGG ACTGTTTGTT AAAGAGCTTG AAATCGCGCT GCTGGAAAAG
CGCGCTGATA TCGCCGTGCA CTCTATGAAA GACGTTCCGG TGGCCTTCCC GGACGGTCTC
GGTCTGGTGA CCATTTGCGA GCGCGAAGAT CCGCGCGACG CGTTTGTCTC GAATAAATAT
CACAGTCTGG ACGATCTGCC CGCGGGTAGT ATCGTCGGGA CGTCCAGTTT GCGTCGTCAG
TGTCAACTGG CGGAACGCCG TCCGGACCTC ATTATCCGTT CGTTGCGCGG CAACGTCGGC
ACACGTCTCG GCAAGCTGGA CAACGGCGAC TATGACGCCA TTATCCTGGC CGTGGCCGGT
CTGAAACGCT TAGGTCTGGA GTCGCGCATT CGCACAGCCT TGCCGCCCGA CGTTTCACTT
CCTGCCGTAG GCCAGGGCGC CGTCGGGATT GAGTGTCGTC TTGACGACGC GCGAACGCAG
GCGCTGCTCG CACCGTTGAA TCACTCGCAA ACCGCGCTGC GCGTAACGGC GGAACGCGCT
ATGAACACCC GCCTGGAAGG CGGGTGTCAG GTGCCAATTG GCAGCTATGC AGAAATCATC
AACGGTGAAA TTTGGTTACG CGCGCTGGTT GGCGCACCGG ACGGTTCGGT GATGGTGCGC
GGCGAACGTC GTGGTTCTCC CGAGCAGGCG GAGCAAATGG GCATCTCGCT TGCAGAGGAA
CTGCTGGAAA ACGGCGCACG CGCGATTCTG ACGGAAGTTT ATAACGGCGA GACGCCCGCA
TGA
 
Protein sequence
MIMTVTSMLD NVLRIATRQS PLALWQAHYV KDALMATHPG LTVELVPMVT RGDVILDTPL 
AKVGGKGLFV KELEIALLEK RADIAVHSMK DVPVAFPDGL GLVTICERED PRDAFVSNKY
HSLDDLPAGS IVGTSSLRRQ CQLAERRPDL IIRSLRGNVG TRLGKLDNGD YDAIILAVAG
LKRLGLESRI RTALPPDVSL PAVGQGAVGI ECRLDDARTQ ALLAPLNHSQ TALRVTAERA
MNTRLEGGCQ VPIGSYAEII NGEIWLRALV GAPDGSVMVR GERRGSPEQA EQMGISLAEE
LLENGARAIL TEVYNGETPA