Gene SeD_A1177 details

Gene Information

Locus tag: SeD_A1177
Symbol: hpaE
ID: 6871178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1168005
End bp: 1169471
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 59%
IMG OID: 642784360
Product: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Protein accession: YP_002215033
Protein GI: 198245087
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
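
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length(1168005, 1169471)
print(length)                  # 1467
print(protein_length(length))  # 488
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.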


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.585771
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAGAAAA TAAATCATTG GATTAACGGC AAAAACGTTG CAGGTAACGA CTACTTCCAG 
ACCACTAACC CGGCGACCGG TGACGTGCTG GCGGAAGTAG CCTCCGGCGG TGAAGCAGAA
GTGAACCAGG CTGTCGCGGC GGCCAAAGAG GCGTTCCCGA AATGGGCCAA TCTGCCGATG
AAAGAGCGCG CGCGCCTGAT GCGTCGTCTG GGTGACCTGA TTGACCAGAA CGTACCGGAA
ATCGCGGCGA TGGAAACCGC CGACACCGGT CTGCCTATTC ACCAGACTAA AAACGTGCTG
ATCCCGCGCG CCTCGCATAA CTTCGAATTC TTCGCCGAAG TGTGCCAGCA GATGAACGGC
AAGACCTATC CTGTTGACGA TAAAATGCTC AATTATACGC TGGTGCAGCC CGTCGGCGTC
TGCGCGCTGG TGTCGCCGTG GAACGTGCCG TTTATGACCG CGACCTGGAA AGTCGCGCCG
TGCCTGGCGC TGGGCAACAC CGCGGTGCTC AAAATGTCCG AGCTGTCGCC GTTGACTGCC
GACAGGCTGG GCGAGCTGGC GCTGGAGGCA GGAATTCCGG CAGGGGTGCT GAACGTGGTG
CAGGGCTACG GCGCGACGGC GGGCGATGCG CTGGTACGCC ACCATGACGT ACGTGCGGTG
TCGTTTACCG GCGGTACCGC CACCGGTCGC AATATCATGA AAAATGCCGG CCTGAAAAAA
TACTCGATGG AGCTGGGCGG CAAATCGCCG GTGCTGATTT TTGAAGATGC CGACATTGAG
CGCGCGCTGG ACGCCGCGCT GTTCACCATC TTCTCGATCA ACGGCGAACG CTGCACCGCC
GGGTCGCGCA TCTTTATCCA GCAGAGCATT TACCCTGAGT TCGTGAAGCG CTTTGCCGAA
CGCGCGAATC GCCTGCGTGT CGGCGATCCG ACCGACCCGA ACACCCAGGT CGGCGCGCTG
ATTAGCCAAC AGCACTGGGA AAAAGTCTCC GGTTATATCC GCCTCGGCAT TGAAGAGGGC
GCAACGCTGC TGGCGGGCGG TGCGGAAAAA CCCACTGACC TGCCTGCGCA TCTGAAAGGC
GGTAACTTCC TGCGCCCAAC CGTGCTGGCC GATGTCGACA ACCGTATGCG CGTCGCGCAG
GAAGAGATCT TTGGGCCAGT CGCCTGTCTG CTGCCATTCA AAGACGAAGC GGAAGGGTTA
CGTTTGGCGA ACGACGTGGA ATACGGTCTG GCCTCTTATA TCTGGACCCA GGACGTGAGC
AAAGTGTTGC GCCTGGCGCG TGCGATTGAA GCCGGCATGG TCTTCGTCAA CACCCAGAAC
GTCCGCGACC TGCGCCAGCC GTTCGGCGGC GTGAAAGCCT CCGGTACCGG GCGCGAAGGC
GGCGAATATA GCTTCGAAGT GTTTGCGGAA ATGAAAAACG TCTGCATCTC AATGGGCGAC
CATCCTATCC CAAAATGGGG AGTTTGA
 
Protein sequence
MKKINHWING KNVAGNDYFQ TTNPATGDVL AEVASGGEAE VNQAVAAAKE AFPKWANLPM 
KERARLMRRL GDLIDQNVPE IAAMETADTG LPIHQTKNVL IPRASHNFEF FAEVCQQMNG
KTYPVDDKML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAVL KMSELSPLTA
DRLGELALEA GIPAGVLNVV QGYGATAGDA LVRHHDVRAV SFTGGTATGR NIMKNAGLKK
YSMELGGKSP VLIFEDADIE RALDAALFTI FSINGERCTA GSRIFIQQSI YPEFVKRFAE
RANRLRVGDP TDPNTQVGAL ISQQHWEKVS GYIRLGIEEG ATLLAGGAEK PTDLPAHLKG
GNFLRPTVLA DVDNRMRVAQ EEIFGPVACL LPFKDEAEGL RLANDVEYGL ASYIWTQDVS
KVLRLARAIE AGMVFVNTQN VRDLRQPFGG VKASGTGREG GEYSFEVFAE MKNVCISMGD
HPIPKWGV
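
The relationship between the gene sequence and the protein sequence can be checked by translating the nucleotides codon by codon. The following is a minimal sketch, not the method used to produce this record. It builds the codon table by hand from the standard assignments, which translation table 11 shares; table 11 differs from the standard code in the alternative start codons it allows, and that does not matter here because the gene starts with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses
# the same codon-to-amino-acid mapping ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAGAAAA TAAATCATTG GATTAACGGC AAAAACGTTG CAGGTAACGA CTACTTCCAG"
last_line = "CATCCTATCC CAAAATGGGG AGTTTGA"

print(translate(first_line))  # MKKINHWINGKNVAGNDYFQ
print(translate(last_line))   # HPIPKWGV
```

The two fragments translate to the first 20 and the last 8 residues of the protein sequence shown above. Passing the full gene sequence to `translate` and `gc_fraction` would give the complete check against the Protein Length and GC content fields.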