Gene SeHA_C4268 details


Gene Information

Locus tag: SeHA_C4268
Symbol: hemC
ID: 6490768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 4154979
End bp: 4155935
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 59%
IMG OID: 642744360
Product: porphobilinogen deaminase
Protein accession: YP_002047954
Protein GI: 194449507
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0181] Porphobilinogen deaminase
TIGRFAM ID: [TIGR00212] porphobilinogen deaminase
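
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4154979, 4155935)
print(length_bp, "bp")                  # Gene Length field
print(protein_length(length_bp), "aa")  # Protein Length field
```

With the values from this record, the computed lengths are 957 bp and 318 aa, matching the Gene Length and Protein Length fields.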


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 90
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTAA CAAGCATGTT AGACAATGTT TTAAGAATTG CCACACGCCA AAGTCCCCTT 
GCGCTTTGGC AGGCACATTA TGTCAAAGAC GCATTGATGG CAACCCATCC GGGACTGACG
GTAGAACTGG TGCCGATGGT CACACGCGGC GACGTGATTC TCGATACTCC CCTGGCGAAA
GTGGGCGGTA AGGGACTGTT TGTTAAAGAG CTTGAAATCG CGCTGCTGGA AAAACGCGCT
GATATCGCCG TGCACTCTAT GAAAGACGTT CCGGTGGCCT TCCCGGACGG TCTCGGTCTG
GTGACCATTT GCGAGCGCGA AGATCCGCGC GACGCGTTTG TCTCGAATAA ATATCACAGT
CTGGACGATC TGCCCGCGGG TAGTATCGTC GGGACGTCCA GTTTGCGTCG CCAGTGTCAA
CTGGCGGAAC GCCGTCCGGA CCTCATTATC CGTTCGTTGC GCGGCAACGT CGGCACACGT
CTCGGCAAGC TGGACAACGG CGACTATGAC GCCATTATCC TGGCCGTGGC CGGTCTGAAA
CGCTTAGGTC TGGAGTCGCG CATTCGCACA GCCTTGCCGC CCGACGTTTC GCTTCCTGCC
GTAGGCCAGG GCGCCGTCGG GATTGAGTGT CGTCTTGACG ACGCGCGAAC ACAGGCGCTG
CTCGCACCGT TGAATCACTC GCAAACCGCG CTACGCGTAA CGGCGGAACG CGCTATGAAC
ACCCGCCTGG AAGGCGGATG TCAGGTGCCG ATTGGCAGCT ATGCAGAAAT CATCAACGGT
GAAATTTGGC TACGCGCGCT GGTTGGCGCG CCGGACGGTT CGGTGATGGT GCGCGGCGAA
CGTCGTGGTT CTCCCGAGCA GGCGGAGCAA ATGGGCATCT CGCTTGCAGA GGAACTGCTG
GAAAACGGCG CACGCGCGAT TCTGACGGAA GTTTATAACG GCGAGACGCC CGCATGA
 
Protein sequence
MTVTSMLDNV LRIATRQSPL ALWQAHYVKD ALMATHPGLT VELVPMVTRG DVILDTPLAK 
VGGKGLFVKE LEIALLEKRA DIAVHSMKDV PVAFPDGLGL VTICEREDPR DAFVSNKYHS
LDDLPAGSIV GTSSLRRQCQ LAERRPDLII RSLRGNVGTR LGKLDNGDYD AIILAVAGLK
RLGLESRIRT ALPPDVSLPA VGQGAVGIEC RLDDARTQAL LAPLNHSQTA LRVTAERAMN
TRLEGGCQVP IGSYAEIING EIWLRALVGA PDGSVMVRGE RRGSPEQAEQ MGISLAEELL
ENGARAILTE VYNGETPA
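
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the tool used to generate the record: it assumes the sequence is pasted with the space-separated 10-base blocks shown above, and it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping for internal codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record.
first_line = "ATGACGGTAA CAAGCATGTT AGACAATGTT TTAAGAATTG CCACACGCCA AAGTCCCCTT"
print(translate(first_line))  # MTVTSMLDNVLRIATRQSPL
```

The printed residues correspond to the first two blocks of the protein sequence (MTVTSMLDNV LRIATRQSPL). Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.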