Gene SNSL254_A1197 details

Gene Information

Locus tag: SNSL254_A1197
Symbol: hpaE
ID: 6483568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 1191843
End bp: 1193309
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 59%
IMG OID: 642736599
Product: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Protein accession: YP_002040357
Protein GI: 194445601
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value: 0.692139
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAGAAAA TAAATCATTG GATTAACGGC AAAAACGTTG CAGGTAACGA CTACTTCCAG 
ACCACTAACC CGGCGACCGG TGATGTGCTG GCGGAAGTAG CCTCCGGCGG TGAAGCAGAA
GTGAACCAGG CTGTCGCGGC GGCAAAAGAG GCGTTCCCGA AATGGGCCAA CCTGCCGATG
AAAGAGCGCG CGCGCCTGAT GCGTCGCCTT GGCGATCTGA TTGACCAGCA TGTGCCGGAA
ATCGCGGCGA TGGAAACCGC CGACACCGGC TTGCCTATTC ACCAGACTAA AAACGTGCTG
ATCCCGCGCG CCTCGCATAA CTTCGAATTC TTCGCCGAAG TGTGCCAGCA GATGAACGGC
AAGACCTATC CTGTTGACGA TAAAATGCTC AATTATACGC TGGTGCAGCC CGTCGGCGTC
TGCGCGCTGG TGTCGCCGTG GAACGTGCCG TTTATGACCG CGACCTGGAA AGTCGCGCCG
TGCCTGGCGC TGGGTAACAC CGCGGTGCTC AAAATGTCCG AGCTGTCGCC GCTGACTGCT
GACAGGCTGG GCGAGCTGGC GCTGGAGGCA GGAATTCCGG CAGGGGTGCT GAACGTGGTG
CAGGGCTATG GCGCGACGGC GGGCGATGCG CTGGTACGCC ACCATGACGT GCGTGCGGTG
TCGTTTACCG GCGGTACCGC CACCGGTCGC AATATCATGA AAAATGCCGG CCTGAAAAAA
TACTCGATGG AGCTGGGCGG CAAATCGCCG GTGCTGATTT TTGAAGATGC CGACATTGAG
CGCGCGCTGG ACGCCGCGCT GTTCACCATC TTCTCGATCA ACGGCGAACG CTGCACCGCC
GGGTCGCGCA TCTTTATCCA GCAGAGCATT TACCCTGAGT TCGTGAAGCG CTTTGCCGAA
CGTGCGAATC GCCTGCGTGT CGGCGATCCG ACCGACCCGA ACACCCAGGT CGGCGCGCTG
ATTAGCCAAC AGCACTGGGA AAAAGTCTCC GGTTATATCC GCCTCGGCAT TGAAGAGGGC
GCAACGCTGC TGGCGGGCGG TGCGGAAAAA CCCACTGACC TGCCTGCGCA TCTGAAAGGC
GGTAACTTCC TGCGCCCAAC CGTGCTGGCC GATGTCGACA ACCGTATGCG CGTCGCGCAG
GAAGAGATCT TTGGGCCAGT TGCCTGTCTG CTGCCATTCA AAGACGAAGC GGAAGGGTTA
CGCCTGGCGA ACGATGTGGA ATACGGTCTG GCCTCTTATA TCTGGACCCA GGACGTGAGC
AAAGTGTTGC GCCTGGCGCG TGGGATTGAA GCCGGCATGG TCTTCGTCAA CACCCAGAAC
GTCCGCGACC TGCGCCAGCC GTTCGGCGGC GTGAAAGCCT CCGGTACCGG GCGCGAAGGC
GGCGAATATA GCTTCGAAGT GTTTGCGGAA ATGAAAAACG TCTGCATCTC AATGGGCGAC
CATCCTATCC CAAAATGGGG AGTTTGA
 
Protein sequence
MKKINHWING KNVAGNDYFQ TTNPATGDVL AEVASGGEAE VNQAVAAAKE AFPKWANLPM 
KERARLMRRL GDLIDQHVPE IAAMETADTG LPIHQTKNVL IPRASHNFEF FAEVCQQMNG
KTYPVDDKML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAVL KMSELSPLTA
DRLGELALEA GIPAGVLNVV QGYGATAGDA LVRHHDVRAV SFTGGTATGR NIMKNAGLKK
YSMELGGKSP VLIFEDADIE RALDAALFTI FSINGERCTA GSRIFIQQSI YPEFVKRFAE
RANRLRVGDP TDPNTQVGAL ISQQHWEKVS GYIRLGIEEG ATLLAGGAEK PTDLPAHLKG
GNFLRPTVLA DVDNRMRVAQ EEIFGPVACL LPFKDEAEGL RLANDVEYGL ASYIWTQDVS
KVLRLARGIE AGMVFVNTQN VRDLRQPFGG VKASGTGREG GEYSFEVFAE MKNVCISMGD
HPIPKWGV
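
Consistency check (illustrative)

The coordinates, gene length, and protein length in the record above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record: the helper names (clean, gc_percent, expected_protein_length) are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the final codon is assumed to be a stop codon (the gene sequence ends in TGA).

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block a sequence in groups of 10."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a (possibly blocked) DNA string."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in dna)
    return 100.0 * gc / len(dna)


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information fields above.
start_bp, end_bp = 1191843, 1193309

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

print(gene_length)                           # 1467
print(expected_protein_length(gene_length))  # 488
```

Passing the full gene sequence (spaces and line breaks included) to gc_percent gives a value that can be compared against the GC content field.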