Gene YpAngola_A1646 details

Gene Information

Locus tag: YpAngola_A1646
Symbol: hpaE
ID: 5800117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 1699703
End bp: 1701100
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 53%
IMG OID: 641339592
Product: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Protein accession: YP_001606149
Protein GI: 162418764
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
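
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS length counts the terminal stop codon. Neither is stated in the record, though both agree with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1699703, 1701100)
print(length)                              # 1398
print(expected_protein_length_aa(length))  # 465
```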


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATTG TTAACCATTG GATTGATGGG AAAAATATTA CCAGCAATGA CTATTTCACA 
ACAATCAATC CGGCCACTGG CGAGGTGCTG GCTGACGTGG CAAGCGGGGG GATAAAAGAG
ATCAACCAAG CCGTTGCTGC CGCGAAAAGT GCTTTCCCTC ACTGGGCTAA CCTGCCGATG
AAAGTGCGTG CCCGTCTAAT GCGCCGTCTG GGGGAGTTGA TTGAGCAACA GATCCCAGAA
ATAGCGCAGA TGGAAACGCA GGATACCGGC CTGCCCATTT ATCAAACGCA AAATGCCTTG
ATCCCGCGGG CAGCACATAA CTTCGAATTT TTTGCCGAAA TTTGTCAGCA AATGAATGGC
CAGACGTATC CGGTTGACGA TCAAATGTTG AATTACACCT TGGTGCAACC CGTGGGAGTG
TGTGCGTTGG TCTCCCCTTG GAATGTCCCT TTTATGACGG CGACCTGGAA GGTCGCGCCT
TGTTTGGCGC TGGGTAACAC CGCGATATTG AAAATGTCGG AGCTATCGCC ACTGACCGCA
GACAAACTGG GTGAACTGGC CTTAGAGGCG GGTATACCGG CGGGGGTTCT CAACGTGGTA
CAAGGATATG GGGCCACTGT CGGTGATGCA TTGGTATGTC ATCAGGATGT CCGAGCTATC
TCTTTTACCG GCGGCACCGC GACGGGAAAC CGCATCATGC AACGTGCCGG GTTGAAAAAA
TACTCCATGG AACTCGGTGG TAAATCCCCG GTACTTATCT TCGACGATGC TGATATCGAA
CGGGCTATGG ATGCGGCGCT ATTTTCCATC TTCTCTCTCA ATGGTGAACG TTGCACGGCG
GGTTCGCGCA TTTTTATTCA AGAGAGTCTC TATTCGGCAT TTATTCAACG TTTTGCTGAG
CGGGCCAGCC GTTTACGTGT GGGGGACCCA CAAGATCTCG ACACTCAAGT TGGCGCATTG
ATCAGTAAAC CGCATTGGGA CAAAGTTTCC GGCTATATCC AGTTGGGGAT AGAGGAGGGG
GCCACGTTGT TGGCAGGGGG GCCGGATAAA CCCATCGACC TACCTGCTCA TCTGCGCGGA
GGGCACTTCC TGCGTCCAAC GGTGTTGGCC GATGTTGATA ACCGAATGCG GGTTGCTCAG
GAAGAGATTT TTGGACCGGT CGCTTGCCTG ATCCCCTTTA AGAATGAAGA CGCCGGACTG
CGTTTGGCCC GAAGTATTGA AGCCGGCATG GTGTTCGTGA ATACCCAGAA TGTGCGGGAT
CTCCGCCAGC CATTTGGCGG CATCAAGGCA TCGGGAACCG GGCGTGAAGG GGGAAAGTAC
AGTTTTGATG TTTTTGCTGA AGTGAAAAAC GTCTGTATTT CCATGGGGGA GCATCCGATC
CCCCGTTGGG GGATGTAA
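
A GC content figure such as the one in the gene information can be recomputed from a sequence printed in this 10-base block layout. This is a minimal sketch and not necessarily how the database derives its value. `gc_percent` is a hypothetical helper that strips the whitespace between blocks before counting.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, used only as a short example.
print(gc_percent("ATGAAGATTG TTAACCATTG"))  # 30.0
```

Passing the full gene sequence instead of the two-block example gives the value for the whole CDS.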
 
Protein sequence
MKIVNHWIDG KNITSNDYFT TINPATGEVL ADVASGGIKE INQAVAAAKS AFPHWANLPM 
KVRARLMRRL GELIEQQIPE IAQMETQDTG LPIYQTQNAL IPRAAHNFEF FAEICQQMNG
QTYPVDDQML NYTLVQPVGV CALVSPWNVP FMTATWKVAP CLALGNTAIL KMSELSPLTA
DKLGELALEA GIPAGVLNVV QGYGATVGDA LVCHQDVRAI SFTGGTATGN RIMQRAGLKK
YSMELGGKSP VLIFDDADIE RAMDAALFSI FSLNGERCTA GSRIFIQESL YSAFIQRFAE
RASRLRVGDP QDLDTQVGAL ISKPHWDKVS GYIQLGIEEG ATLLAGGPDK PIDLPAHLRG
GHFLRPTVLA DVDNRMRVAQ EEIFGPVACL IPFKNEDAGL RLARSIEAGM VFVNTQNVRD
LRQPFGGIKA SGTGREGGKY SFDVFAEVKN VCISMGEHPI PRWGM