Gene YpsIP31758_3220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3220 
SymbolaroF 
ID5386830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp3627150 
End bp3628226 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content49% 
IMG OID640866229 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001402177 
Protein GI153950605 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCATGC AAAAAGACTC GCTCAATAAC GTCCATATCA GTGCCGAACA AATCCTGATA 
ACCCCGGAAG AACTGAAAAA TCAGTTTCCA CTGAGCGAAA ATGATCAGTA TTCGATAGAG
CGCGCACGTA AAACCATTGC TGACATTATT CAGGGGCGAG ATCCGCGTCT GTTGGTCGTT
TGTGGGCCCT GTTCAATTCA TGATGTGGAT GCGGCACTGG ATTACGCGCG TCGTTTGAAA
AAACTCTCTG TGGAATTGGA TGACAGCTTA TATATCGTTA TGCGTGTCTA TTTTGAGAAG
CCAAGAACTA CCGTGGGTTG GAAAGGCCTG ATCAATGACC CTGCAATGGA TGGTTCATTT
GATGTAGAGG CAGGTTTACA CATTGCCCGT CGTTTATTGC TGGATTTAGT GGGCATGGGG
TTGCCGTTAG CGACTGAAGC TCTGGATCCT AATAGCCCAC AATATTTAGG TGACCTGTTC
AGTTGGTCGG CCATTGGTGC CCGTACAACG GAGTCACAGA CCCACCGTGA AATGGCATCA
GGCTTGTCTA TGCCGGTTGG ATTTAAAAAT GGCACTGACG GTAGCCTAGG CACGGCAATC
AATGCAATGC GCGCCGCTGC CATGCCACAT CGCTTTATGG GGATCAATCA GTCGGGCCAG
GTCTGCCTGT TACAAACTCA GGGTAACCCA CACGGCCATG TCATTCTACG GGGAGGTAAA
ACACCAAACT ACAGTGCACA AGATGTCGCT CAGTGTGAAA AACAGATGCA GGATGCGGGA
CTCATCCCAT CCTTAATGAT AGATTGCAGT CACGGTAATT CAAATAAAGA CTACCGCCGT
CAGGTTGCGG TGGCTGAATC TGTGGTTGAA CAGATCAAGG CGGGCAATCG TTCAATTACA
GGTGTGATGC TGGAAAGCCA CATCCACGAA GGAAATCAGT CATCTGAACA GCCACGTGCT
GATATGCGCT ACGGTGTTTC TGTGACTGAC GCCTGTATTA ACTGGGAAAG CACTGAAACC
CTGTTACGTG GTATGCGCCA AGAATTGCTT GCAGCACTGA CGGCACGGAC TGCATGA
 
Protein sequence
MIMQKDSLNN VHISAEQILI TPEELKNQFP LSENDQYSIE RARKTIADII QGRDPRLLVV 
CGPCSIHDVD AALDYARRLK KLSVELDDSL YIVMRVYFEK PRTTVGWKGL INDPAMDGSF
DVEAGLHIAR RLLLDLVGMG LPLATEALDP NSPQYLGDLF SWSAIGARTT ESQTHREMAS
GLSMPVGFKN GTDGSLGTAI NAMRAAAMPH RFMGINQSGQ VCLLQTQGNP HGHVILRGGK
TPNYSAQDVA QCEKQMQDAG LIPSLMIDCS HGNSNKDYRR QVAVAESVVE QIKAGNRSIT
GVMLESHIHE GNQSSEQPRA DMRYGVSVTD ACINWESTET LLRGMRQELL AALTARTA