Gene YPK_1841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_1841 
Symbol 
ID6088558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp2040805 
End bp2041851 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content51% 
IMG OID641596909 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001720585 
Protein GI170024080 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAAGA CAGATGAACT GCGGACCGCT CGCATCGATA GCTTGATTAC ACCGCAGCAA 
CTGGCTGAAA AGTTACCGAT TTCTGAGGTT ATTGCAGATA ACGTGACAGC GTCACGTAAA
CGAATAGAGA AAATACTTAT TGGTGAAGAC CCACGTCTAC TCGTGGTGAT TGGCCCCTGC
TCTATTCACG ACCTTGATGC AGCCGTTGAT TATGCCACCC GGCTCAAGGT GCTACGAGAA
CGCTATCAAG ACCGGCTGGA AATCGTGATG CGCACCTATT TCGAGAAACC ACGGACTGTA
GTGGGTTGGA AGGGGCTGAT TTCTGATCCG GCACTTGACG GCTCATGCCA GGTGAACTTG
GGTATTGAAC TGGCACGTAA GCTACTGTTA GCCGTGAATG AACTCGGGCT GCCGACCGCT
ACCGAGTTCC TCGATATGGT AACAGGCCAA TATATTGCCG ACCTCATCAG TTGGGGGGCA
ATAGGCGCAC GTACCACCGA AAGCCAGATC CACCGAGAGA TGGCCTCGGC ACTCTCCTGC
CCCGTGGGTT TCAAAAATGG TACAGATGGC AATGTGCGTA TTGCTATTGA TGCCATTCGC
GCCGCACAAG CCAGCCATAT GTTCCTTTCT CCGGATAAAA CCGGCCAAAT GACGATTTAC
CAAACCAGTG GTAACCCCTA TGGGCATATT ATTATGCGGG GTGGCAAGCA ACCTAACTAT
GATGCCTCTG ATATCGCAGC CGCCTGTGAC AGCTTGCGGG AATTTGATTT GCCAGAACAT
CTGGTGGTGG ATTTTAGCCA CGGCAATTGC CAGAAGATGC ATCGCCGCCA GTTGGATGTT
GCCGAAAATA TCGGGCTACA GATCCGTGCG GGTTCAACAG CGATTGTCGG TGTTATGGCT
GAGAGTTTCC TGATTGAGGG CACACAGAAG ATTGTTGCCG GACAGCCCTT AACTTATGGG
CAATCCATCA CTGACCCTTG CCTGAATTGG GATGATACTG AACAACTGTT AAGCCTATTG
GCAGATGCAG TAAACAGCCG GTTTTAA
 
Protein sequence
MYKTDELRTA RIDSLITPQQ LAEKLPISEV IADNVTASRK RIEKILIGED PRLLVVIGPC 
SIHDLDAAVD YATRLKVLRE RYQDRLEIVM RTYFEKPRTV VGWKGLISDP ALDGSCQVNL
GIELARKLLL AVNELGLPTA TEFLDMVTGQ YIADLISWGA IGARTTESQI HREMASALSC
PVGFKNGTDG NVRIAIDAIR AAQASHMFLS PDKTGQMTIY QTSGNPYGHI IMRGGKQPNY
DASDIAAACD SLREFDLPEH LVVDFSHGNC QKMHRRQLDV AENIGLQIRA GSTAIVGVMA
ESFLIEGTQK IVAGQPLTYG QSITDPCLNW DDTEQLLSLL ADAVNSRF