Gene YpsIP31758_2363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2363 
SymbolhpaE 
ID5385655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2662424 
End bp2663890 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content53% 
IMG OID640865352 
Product5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 
Protein accessionYP_001401332 
Protein GI153950198 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value0.702786 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGATCG TTAACCATTG GATTGATGGG AAAAATATTA CCAGTAAGGA CTATTTCACA 
ACAATCAACC CGGCCACTGG CGAGGTGCTG GCTGACGTGG CAAGCGGGGG GATAAAAGAG
ATCAACCAAG CCGTTGCTGC CGCGAAAAGT GCTTTCCCTC ACTGGGCTAA CCTGCCGATG
AAAGTGCGTG CCCGTCTAAT GCGCCGTCTG GGGGAGTTGA TTGAGCAACA GATCCCAGAA
ATAGCGCAGA TGGAAACGCA GGATACCGGC CTGCCCATTT ATCAAACGCA AAATGCCTTG
ATCCCGCGGG CAGCACATAA CTTCGAATTT TTTGCCGAAA TTTGTCAGCA AATGAATGGC
CAGACGTATC CGGTTGACGA TCAAATGTTG AATTACACCT TGGTGCAACC CGTGGGAGTG
TGTGCGTTGG TCTCCCCTTG GAATGTCCCT TTTATGACGG CGACCTGGAA GGTCGCGCCT
TGTTTGGCGC TGGGTAACAC CGCGATATTG AAAATGTCGG AGCTATCGCC ACTGACCGCA
GACAAACTGG GTGAACTGGC CTTAGAGGCG GGTATACCGG CGGGGGTTCT CAACGTGGTA
CAAGGATATG GGGCCACTGT CGGTGATGCA TTGGTATGTC ATCAGGATGT CCGAGCTATC
TCTTTTACCG GCGGCACCGC GACGGGAAAC CGCATCATGC AACGTGCCGG GTTGAAAAAA
TACTCCATGG AACTCGGTGG TAAATCTCCG GTACTTATCT TCGACGATGC TGATATCGAA
CGGGCTATGG ATGCGGCGCT ATTTTCCATC TTCTCTCTCA ATGGTGAACG TTGCACGGCG
GGTTCGCGCA TTTTTATTCA AGAGAGTCTC TATTCGGCAT TTATTCAACG TTTTGCTGAG
CGGGCCAGCC GTTTACGTGT GGGGGACCCA CAAGATCTCG ACACTCAAGT TGGCGCATTG
ATCAATAAAC CGCATTGGGA CAAAGTTTCC GGCTATATCC AGTTGGGGAT AGAGGAGGGG
GCCACGTTGT TGGCAGGGGG GCCGGATAAA CCCATCGACC TACCTGCTCA TCTGCGCGGA
GGGCACTTCC TGCGTCCAAC GGTGTTGGCC GATGTTGATA ACCGAATGCG GGTTGCTCAG
GAAGAGATTT TTGGACCGGT CGCTTGCCTG ATCCCCTTTA AGAATGAAGA CGCCGGACTG
CGTTTGGCAA ACAGCGTGCC ATACGGTCTG GCTGCTTATA TCTGGACACA AGACGTCAGC
AAAGTGCTGC GTTTGGCCCG AAGTATTGAA GCCGGCATGG TGTTCGTGAA TACCCAGAAT
GTGCGGGATC TCCGCCAGCC ATTTGGCGGC ATCAAGGCAT CGGGAACCGG GCGCGAAGGG
GGAAAGTACA GTTTTGATGT TTTTGCTGAA GTGAAAAACG TCTGTATTTC CATGGGGGAG
CATCCGATCC CCCGTTGGGG GATGTAA
 
Protein sequence
MKIVNHWIDG KNITSKDYFT TINPATGEVL ADVASGGIKE INQAVAAAKS AFPHWANLPM 
KVRARLMRRL GELIEQQIPE IAQMETQDTG LPIYQTQNAL IPRAAHNFEF FAEICQQMNG
QTYPVDDQML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAIL KMSELSPLTA
DKLGELALEA GIPAGVLNVV QGYGATVGDA LVCHQDVRAI SFTGGTATGN RIMQRAGLKK
YSMELGGKSP VLIFDDADIE RAMDAALFSI FSLNGERCTA GSRIFIQESL YSAFIQRFAE
RASRLRVGDP QDLDTQVGAL INKPHWDKVS GYIQLGIEEG ATLLAGGPDK PIDLPAHLRG
GHFLRPTVLA DVDNRMRVAQ EEIFGPVACL IPFKNEDAGL RLANSVPYGL AAYIWTQDVS
KVLRLARSIE AGMVFVNTQN VRDLRQPFGG IKASGTGREG GKYSFDVFAE VKNVCISMGE
HPIPRWGM