Gene SeHA_C1212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1212 
SymbolhpaE 
ID6491465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1194233 
End bp1195699 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content60% 
IMG OID642741451 
Product5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 
Protein accessionYP_002045102 
Protein GI194448648 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA TAAATCATTG GATTAACGGC AAAAACGTTG CAGGTAACGA CTACTTCCAG 
ACCACTAACC CGGCGACCGG TGATGTGCTG GCGGAAGTAG CCTCCGGCGG TGAAGCAGAA
GTGAACCAGG CTGTCGCGGC GGCAAAAGAG GCGTTCCCGA AGTGGGCCAA CCTGCCGATG
AAAGAGCGCG CGCGCCTGAT GCGCCGCCTT GGGGACCTGA TTGACCAGCA TGTGCCGGAA
ATCGCGGCGA TGGAAACCGC CGACACCGGC CTGCCTATTC ACCAGACTAA AAACGTGCTG
ATCCCGCGCG CCTCGCATAA CTTCGAATTC TTCGCCGAAG TGTGCCAGCA GATGAACGGC
AAGACCTACC CGGTTGACGA TAAAATGCTC AATTATACGC TGGTGCAGCC CGTCGGCGTC
TGCGCGCTGG TGTCGCCGTG GAACGTGCCG TTTATGACCG CGACCTGGAA AGTTGCGCCG
TGCCTGGCGC TGGGTAACAC CGCGGTGCTC AAAATGTCCG AGCTGTCGCC GCTGACTGCC
GACAGGCTGG GCGAGCTGGC GCTGGAGGCA GGAATTCCGG CAGGCGTGCT GAACGTGGTG
CAGGGCTACG GCGCGACGGC GGGCGATGCG CTGGTACGCC ACCATGACGT GCGTGCGGTG
TCGTTTACCG GCGGTACCGC CACCGGTCGC AATATCATGA AAAATGCCGG GCTGAAAAAA
TACTCGATGG AGCTGGGCGG CAAATCGCCG GTGCTGATTT TTGAAGACGC CGACATTGAG
CGCGCGCTGG ACGCCGCGCT GTTCACCATC TTCTCGATCA ACGGCGAACG CTGCACCGCC
GGGTCGCGCA TTTTTATCCA GCAGAGCATT TACCCTGAGT TCGTGAAGCG CTTTGCCGAA
CGCGCGAATC GCCTGCGTGT CGGCGATCCG ACGGACCCGA ACACCCAGGT CGGCGCGCTG
ATTAGCCAAC AGCACTGGGA AAAAGTCTCT GGCTATATTC GCCTCGGCAT TGAAGAGGGC
GCAACGCTGC TGGCGGGCGG TGCGGAAAAA CCCACTGACC TGCCTGCGCA TCTGAAAGGC
GGTAACTTCC TGCGCCCAAC CGTGCTGGCC GATGTCGACA ACCGTATGCG CGTTGCGCAG
GAAGAGATCT TTGGGCCGGT CGCCTGCCTG CTGCCATTCA AAGACGAGGC GGAAGGGTTA
CGTTTGGCGA ACGACGTCGA ATACGGTCTG GCCTCTTATA TCTGGACCCA GGACGTGAGC
AAAGTGTTGC GCCTGGCGCG TGGGATTGAA GCCGGCATGG TCTTCGTCAA CACCCAGAAC
GTCCGCGACC TGCGCCAGCC GTTCGGCGGC GTGAAAGCCT CCGGTACCGG GCGCGAAGGC
GGCGAATATA GCTTCGAAGT GTTTGCGGAA ATGAAAAACG TCTGCATCTC AATGGGCGAC
CATCCTATCC CAAAATGGGG AGTTTGA
 
Protein sequence
MKKINHWING KNVAGNDYFQ TTNPATGDVL AEVASGGEAE VNQAVAAAKE AFPKWANLPM 
KERARLMRRL GDLIDQHVPE IAAMETADTG LPIHQTKNVL IPRASHNFEF FAEVCQQMNG
KTYPVDDKML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAVL KMSELSPLTA
DRLGELALEA GIPAGVLNVV QGYGATAGDA LVRHHDVRAV SFTGGTATGR NIMKNAGLKK
YSMELGGKSP VLIFEDADIE RALDAALFTI FSINGERCTA GSRIFIQQSI YPEFVKRFAE
RANRLRVGDP TDPNTQVGAL ISQQHWEKVS GYIRLGIEEG ATLLAGGAEK PTDLPAHLKG
GNFLRPTVLA DVDNRMRVAQ EEIFGPVACL LPFKDEAEGL RLANDVEYGL ASYIWTQDVS
KVLRLARGIE AGMVFVNTQN VRDLRQPFGG VKASGTGREG GEYSFEVFAE MKNVCISMGD
HPIPKWGV