Gene SeAg_B1061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B1061 
SymbolhpaE 
ID6797381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp1059497 
End bp1060963 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content60% 
IMG OID642775330 
Product5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 
Protein accessionYP_002145971 
Protein GI197250383 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA TAAATCATTG GATTAACGGC AAAAACGTTG CAGGTAACGA CTACTTCCAG 
ACCACTAACC CGGCGACCGG TGACGTGCTG GCGGAAGTAG CCTCCGGCGG TGAAGCAGAA
GTGAACCAGG CTGTCGCGGC GGCAAAAGAG GCGTTCCCGA AATGGGCCAA CCTGCCGATG
AAAGAGCGCG CGCGCCTGAT GCGCCGCCTT GGGGACCTGA TTGACCAGCA TGTGCCGGAA
ATCGCGGCGA TGGAAACCGC CGACACCGGC CTGCCTATTC ACCAGACTAA AAACGTGCTG
ATCCCGCGCG CCTCGCATAA CTTCGAATTC TTCGCCGAAG TGTGCCAGCA GATGAACGGC
AAGACCTACC CGGTTGACGA TAAAATGCTC AATTATACGC TGGTGCAGCC CGTCGGCGTC
TGCGCGCTGG TGTCGCCGTG GAACGTGCCG TTTATGACCG CGACCTGGAA AGTTGCGCCG
TGCCTGGCGC TGGGTAACAC CGCGGTGCTC AAAATGTCCG AGCTGTCGCC GCTGACTGCC
GACAGGCTGG GCGAGCTGGC GCTGGAGGCA GGAATTCCGG CAGGCGTGCT GAACGTGGTG
CAGGGCTACG GCGCGACGGC GGGCGATGCG CTGGTACGCC ACCATGACGT GCGTGCGGTG
TCGTTTACCG GCGGTACCGC CACCGGTCGC AATATCATGA AAAATGCCGG GCTGAAAAAA
TACTCGATGG AGCTGGGCGG CAAATCGCCG GTGCTGATTT TTGAAGACGC CGACATTGAG
CGCGCGCTGG ACGCCGCGCT GTTCACCATC TTCTCGATCA ACGGCGAACG CTGCACCGCC
GGGTCGCGCA TCTTTATCCA GCAGAGCATT TACCCTGAGT TCGTGAAGCG CTTTGCCGAA
CGCGCGAATC GCCTGCGTGT CGGCGATCCG ACCGACCCGA ACACCCAGGT CGGCGCGCTG
ATTAGCCAAC AGCACTGGGA AAAAGTCTCC GGTTATATCC GCCTCGGCAT TGAAGAGGGC
GCAACGCTGC TGGCGGGCGG TGCAGAAAAA CCCACTGACC TGCCTGCGCA TCTGAAAGGC
GGTAACTTCC TGCGCCCAAC CGTGCTGGCC GATGTCGACA ACCGTATGCG CGTCGCGCAG
GAAGAGATCT TTGGGCCAGT CGCCTGTCTG CTGCCATTCA AAGACGAAGC GGAAGGGTTA
CGTTTGGCGA ACGACGTGGA ATACGGTCTG GCCTCTTATA TCTGGACCCA GGACGTGAGC
AAAGTGTTGC GCCTGGCGCG TGGGATTGAA GCCGGCATGG TCTTCGTCAA CACCCAGAAC
GTCCGCGACC TGCGCCAGCC GTTCGGCGGC GTGAAAGCCT CCGGTACCGG GCGCGAAGGC
GGCGAATATA GCTTCGAAGT GTTTGCGGAA ATGAAAAACG TCTGCATCTC AATGGGCGAC
CATCCTATCC CAAAATGGGG AGTTTGA
 
Protein sequence
MKKINHWING KNVAGNDYFQ TTNPATGDVL AEVASGGEAE VNQAVAAAKE AFPKWANLPM 
KERARLMRRL GDLIDQHVPE IAAMETADTG LPIHQTKNVL IPRASHNFEF FAEVCQQMNG
KTYPVDDKML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAVL KMSELSPLTA
DRLGELALEA GIPAGVLNVV QGYGATAGDA LVRHHDVRAV SFTGGTATGR NIMKNAGLKK
YSMELGGKSP VLIFEDADIE RALDAALFTI FSINGERCTA GSRIFIQQSI YPEFVKRFAE
RANRLRVGDP TDPNTQVGAL ISQQHWEKVS GYIRLGIEEG ATLLAGGAEK PTDLPAHLKG
GNFLRPTVLA DVDNRMRVAQ EEIFGPVACL LPFKDEAEGL RLANDVEYGL ASYIWTQDVS
KVLRLARGIE AGMVFVNTQN VRDLRQPFGG VKASGTGREG GEYSFEVFAE MKNVCISMGD
HPIPKWGV