Gene SeSA_A1166 details

Gene Information

Locus tag: SeSA_A1166
Symbol: hpaE
ID: 6519569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 1148178
End bp: 1149644
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 60%
IMG OID: 642746291
Product: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Protein accession: YP_002114100
Protein GI: 194734436
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
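
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1148178
END_BP = 1149644
GENE_LENGTH_BP = 1467
PROTEIN_LENGTH_AA = 488


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # compare with Gene Length
    print(protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions the computed span equals the listed gene length, and the codon count minus one equals the listed protein length.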


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.608265
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.856893
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAA TAAATCATTG GATTAACGGC AAAAAAGTTG CAGGTAACGA CTACTTCCAA 
ACCACTAACC CGGCGACTGG TGACGTGCTG GCGGAAGTGG CCTCCGGCGG TGAAGCAGAA
GTGAACCAGG CTGTCGCGGC GGCCAAAGAG GCGTTCCCGA AATGGGCCAA CCTGCCGATG
AAAGAGCGTG CGCGCCTGAT GCGCCGCCTT GGCGACCTGA TTGACCAGCA TGTGCCGGAA
ATCGCGGCGA TGGAAACCGC CGACACCGGC CTGCCTATTC ACCAGACTAA AAACGTGCTG
ATCCCGCGCG CCTCGCATAA CTTCGAATTC TTCGCCGAAG TGTGCCAGCA GATGAACGGC
AAGACCTATC CTGTTGACGA TAAAATGCTC AATTATACGC TGGTGCAGCC CGTCGGCGTC
TGCGCGCTGG TGTCGCCGTG GAACGTGCCG TTTATGACCG CGACCTGGAA AGTCGCGCCG
TGCCTGGCGC TGGGCAACAC CGCGGTGCTC AAAATGTCCG AGCTGTCGCC GCTGACTGCC
GACAGGCTGG GCGAGCTGGC GCTGGAGGCA GGAATTCCGG CAGGCGTGCT GAACGTGGTG
CAGGGCTATG GCGCGACGGC GGGCGATTCG CTGGTACGCC ACCATGACGT GCGTGCGGTG
TCGTTTACCG GCGGGACCGC CACCGGTCGC AATATCATGA AAAATGCTGG CCTGAAAAAA
TACTCGATGG AGCTGGGCGG CAAATCGCCG GTGCTGATTT TTGAAGATGC CGACATTGAG
CGCGCGCTGG ACGCCGCGCT GTTCACCATC TTCTCGATCA ACGGCGAACG CTGCACCGCC
GGGTCGCGCA TCTTTATCCA GCAGAGCATT TACCCTGAGT TCGTGAAGCG CTTTGCCGAA
CGCGCGAATC GCCTGCGTGT CGGCGATCCG ACCGACCCGA ATACCCAGGT CGGCGCGCTG
ATTAGCCAAC AGCACTGGGA GAAAGTCTCC GGTTATATTC GCCTCGGCAT TGAAGAGGGC
GCAACGCTGC TGGCGGGCGG TGCGGAAAAA CCCACTGACC TGCCTGCGCA TCTGAAAGGC
GGTAACTTCC TGCGCCCAAC CGTGCTGGCC GATGTCGACA ACCGTATGCG CGTCGCGCAG
GAAGAGATCT TTGGGCCGGT CGCCTGCCTG CTGCCATTCA AAGACGAAGC GGAAGGGTTA
CGTTTGGCGA ACGACGTGGA ATACGGTCTG GCCTCTTATA TCTGGACCCA GGACGTGAGC
AAAGTGTTGC GCCTGGCGCG TGGGATTGAA GCCGGCATGG TCTTCGTCAA CACCCAGAAC
GTCCGCGACC TGCGCCAGCC GTTCGGCGGC GTGAAAGCCT CCGGTACCGG GCGCGAAGGC
GGCGAATATA GCTTCGAAGT GTTTGCGGAA ATGAAAAACG TCTGCATCTC AATGGGCGAC
CATCCTATCC CAAAATGGGG AGTTTGA
 
Protein sequence
MKKINHWING KKVAGNDYFQ TTNPATGDVL AEVASGGEAE VNQAVAAAKE AFPKWANLPM 
KERARLMRRL GDLIDQHVPE IAAMETADTG LPIHQTKNVL IPRASHNFEF FAEVCQQMNG
KTYPVDDKML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAVL KMSELSPLTA
DRLGELALEA GIPAGVLNVV QGYGATAGDS LVRHHDVRAV SFTGGTATGR NIMKNAGLKK
YSMELGGKSP VLIFEDADIE RALDAALFTI FSINGERCTA GSRIFIQQSI YPEFVKRFAE
RANRLRVGDP TDPNTQVGAL ISQQHWEKVS GYIRLGIEEG ATLLAGGAEK PTDLPAHLKG
GNFLRPTVLA DVDNRMRVAQ EEIFGPVACL LPFKDEAEGL RLANDVEYGL ASYIWTQDVS
KVLRLARGIE AGMVFVNTQN VRDLRQPFGG VKASGTGREG GEYSFEVFAE MKNVCISMGD
HPIPKWGV
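
The relationship between the gene sequence and the protein sequence above can be checked by translating codons directly. The following is a minimal sketch, not the database's own method: it builds the standard codon-to-amino-acid mapping (translation table 11 uses the same amino acid assignments as the standard code and differs only in permitted start codons), strips the block spacing used in this listing, and translates in frame. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard codon assignments in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the listing."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1; stop codons are written as '*'."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    first_line = "ATGAAGAAAA TAAATCATTG GATTAACGGC AAAAAAGTTG CAGGTAACGA CTACTTCCAA"
    last_line = "CATCCTATCC CAAAATGGGG AGTTTGA"
    print(translate(first_line))  # MKKINHWINGKKVAGNDYFQ
    print(translate(last_line))   # HPIPKWGV*
```

The two fragments translate to the first twenty and last eight residues of the protein sequence shown above, with the trailing `*` marking the TGA stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.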