Gene YPK_2459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_2459 
Symbol 
ID6088042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp2691881 
End bp2693347 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content53% 
IMG OID641597525 
Product5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 
Protein accessionYP_001721189 
Protein GI170024684 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATTG TTAACCATTG GATTGATGGG AAAAATATTA CCAGCAATGA CTATTTCACA 
ACAATCAATC CGGCCACTGG CGAGGTGCTG GCTGACGTGG CAAGCGGGGG GATAAAAGAG
ATCAACCAAG CCGTTGCTGC CGCGAAAAGT GCTTTCCCTC ACTGGGCTAA CCTGCCGATG
AAAGAGCGTG CCCGTCTAAT GCGCCGTCTG GGGGAGTTGA TTGAGCAACA GATCCCAGAA
ATAGCGCAGA TGGAAACGCA GGATACCGGC CTGCCCATTT ATCAAACGCA AAATGCCTTG
ATCCCGCGGG CAGCACATAA CTTCGAATTT TTTGCCGAAA TTTGTCAGCA AATGAATGGC
CAGACGTATC CGGTTGACGA TCAAATGTTG AATTACACCT TGGTGCAACC CGTGGGAGTG
TGTGCGTTGG TCTCCCCTTG GAATGTCCCT TTTATGACGG CGACCTGGAA GGTCGCGCCT
TGTTTGGCGC TGGGTAACAC CGCGATATTG AAAATGTCGG AGCTATCGCC ACTGACCGCA
GACAAACTGG GTGAACTGGC CTTAGAGGCG GGTATACCGG CGGGGGTTCT CAACGTGGTA
CAAGGATATG GGGCCACTGT CGGTGATGCA TTGGTATGTC ATCAGGATGT CCGAGCTATC
TCTTTTACCG GCGGCACCGC GACGGGAAAC CGCATCATGC AACGTGCCGG GTTGAAAAAA
TACTCCATGG AACTCGGTGG TAAATCCCCG GTACTTATCT TCGACGATGC TGATATCGAA
CGGGCTATGG ATGCGGCGCT ATTTTCCATC TTCTCTCTCA ATGGTGAACG TTGCACGGCG
GGTTCGCGCA TTTTTATTCA AGAGAGTCTC TATTCGGCAT TTATTCAACG TTTTGCTGAG
CGGGCCAGCC GTTTACGTGT GGGGGACCCA CAAGATCTCG ACACTCAAGT TGGCGCATTG
ATCAGTAAAC CGCATTGGGA CAAAGTTTCC GGCTATATCC AGTTGGGGAT AGAGGAGGGG
GCCACGTTGT TGGCAGGGGG GCCGGATAAA CCCATCGACC TACCTGCTCA TCTGCGCGGA
GGGCACTTCC TGCGTCCAAC GGTGTTGGCC GATGTTGATA ACCGAATGCG GGTTGCTCAG
GAAGAGATTT TTGGACCGGT CGCTTGCCTG ATCCCCTTTA AGAATGAAGA CGCCGGACTG
CGTTTGGCAA ACAGCGTGCC ATACGGTCTG GCTGCTTATA TCTGGACACA AGACGTCAGC
AAAGTGCTGC GTTTGGCCCG AAGTATTGAA GCCGGCATGG TGTTCGTGAA TACCCAGAAT
GTGCGGGATC TCCGCCAGCC ATTTGGCGGC ATCAAGGCAT CGGGAACCGG GCGTGAAGGG
GGAAAGTACA GTTTTGATGT TTTTGCTGAA GTGAAAAACG TCTGTATTTC CATGGGGGAG
CATCCGATCC CCCGTTGGGG GATGTAA
 
Protein sequence
MKIVNHWIDG KNITSNDYFT TINPATGEVL ADVASGGIKE INQAVAAAKS AFPHWANLPM 
KERARLMRRL GELIEQQIPE IAQMETQDTG LPIYQTQNAL IPRAAHNFEF FAEICQQMNG
QTYPVDDQML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAIL KMSELSPLTA
DKLGELALEA GIPAGVLNVV QGYGATVGDA LVCHQDVRAI SFTGGTATGN RIMQRAGLKK
YSMELGGKSP VLIFDDADIE RAMDAALFSI FSLNGERCTA GSRIFIQESL YSAFIQRFAE
RASRLRVGDP QDLDTQVGAL ISKPHWDKVS GYIQLGIEEG ATLLAGGPDK PIDLPAHLRG
GHFLRPTVLA DVDNRMRVAQ EEIFGPVACL IPFKNEDAGL RLANSVPYGL AAYIWTQDVS
KVLRLARSIE AGMVFVNTQN VRDLRQPFGG IKASGTGREG GKYSFDVFAE VKNVCISMGE
HPIPRWGM