Gene YPK_4057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_4057 
Symbol 
ID6090485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp4475754 
End bp4477604 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content53% 
IMG OID641599154 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001722772 
Protein GI170026267 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAAGT ACCGTTCCCA TACCACCACC CATGGCCGCA ATATGGCCGG CGCCCGCGCA 
CTGTGGCGCG CAACCGGTAT GACCGATGAT GACTTCGGCA AACCGATTAT TGCAGTCGTT
AACTCCTTTA CCCAATTTGT ACCGGGCCAC GTACATTTGC GCGATTTAGG CAAGCTGGTT
GCGGAGCAAA TTGTAGCTTC TGGCGGTGTG GCTAAAGAGT TCAACACCAT TGCGGTGGAT
GATGGTATCG CGATGGGCCA CGGTGGCATG CTCTACTCTC TGCCATCGCG TGAATTGATC
GCCGACTCCG TTGAGTACAT GGTTAACGCC CACTGTGCGG ATGCCATGGT GTGTATCTCT
AACTGTGACA AAATTACCCC AGGGATGCTG ATGGCGTCTC TGCGCCTGAA TATTCCGGTG
ATCTTTGTGT CTGGTGGCCC GATGGAAGCC GGTAAGACCA AGCTGTCAGA TAAAATCATC
AAGCTGGATT TGATCGATGC CATGATTCAG GGTGCGAATC CTAATGTGAG CGATGAAGAG
AGCGCCCAGA TTGAGCGTTC TGCTTGCCCG ACCTGTGGTT CTTGCTCCGG TATGTTTACG
GCTAACTCGA TGAACTGCCT GAATGAAGCG CTGGGTCTGG CGTTGCCGGG TAATGGTTCA
TTGTTGGCAA CCCACGCTGA CCGTAAGCAA CTGTTCTTGG ATGCGGGTAA ACACATTGTT
GCCTTGACCA AACGTTATTA TGAACAAGAT GACGTCAGCG CCTTGCCACG CAACATCGCG
AATAAAGCGG CCTTTGAAAA CGCCATGATA TTGGATATCG CCATGGGCGG TTCCACGAAT
ACCGTATTGC ATTTGCTGGC GGCGGCGCAG GAAGGTGAAA TTGATTTCAG CATGACCGAT
ATCGATCACC TGTCCCGTAA AGTCCCACAT TTGTGCAAAG TGGCCCCGAG TACTCAGAAA
TACCACATGG AAGATGTGCA CCGTGCGGGG GGGGTCATTG GTATTTTAGG TGAGTTGGAT
CGCGCTGGTT TGCTTAACCG CGATGTCAGT AACGTGTTGG GGCTGAATCT GACACAAACG
CTGGAAGCCT ATGACGTGAT GCTGACTCAG GATGAAGGCG TGAAGCAGAT GTACGCCGCA
GGCCCAGCCG GTATTCGCAC CACTAAAGCG TTCTCACAGG ATTGCCGTTA TCCGTCACTG
GATACCGATC GCGAAGAGGG TTGTATCCGT ACCCGTGAAC ATGCCTACAG CCAGGATGGT
GGTTTAGCGG TGTTGTACGG CAATATTGCG GCAGACGGCT GTATTGTTAA AACTGCGGGT
GTTGATAAAG ACAGCCTGAC GTTCCGTGGC CCGGCGAAAG TATTTGAGAG CCAGGATGAG
GCGGTAGAGG CGATCCTCGG TGGTAAAGTT GTGGCGGGTG ATGTGGTTGT TATCCGTTAT
GAAGGGCCAA AAGGGGGGCC GGGTATGCAG GAAATGCTCT ATCCGACCAC TTATCTGAAA
TCCATGGGGT TGGGCAAGAG TTGTGCCTTA CTGACCGATG GCCGTTTCTC TGGCGGGACA
TCCGGTTTGT CTATCGGCCA TGTGTCTCCA GAAGCCGCCA GTGGTGGGTT GATTGGTTTG
GTACAAGATG GCGATTTCAT CAATATCGAT ATTCCGAACC GTGGCATTGT CTTGGATGTT
AGCGAAGCTG AACTGGCTGC TCGCCGTGAA ACTGAAGAAG CGCATGGTGA TGCGGCCTGG
TCACCGAAGG GCCGTGAGCG CCAGGTCTCT TATGCCTTAC GCGCTTACGC GATGTTAGCA
ACCAGCGCTG ATAAAGGCGC GGTGCGCGAT AAAAGTAAGC TGGGAGGCTA A
 
Protein sequence
MPKYRSHTTT HGRNMAGARA LWRATGMTDD DFGKPIIAVV NSFTQFVPGH VHLRDLGKLV 
AEQIVASGGV AKEFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MASLRLNIPV IFVSGGPMEA GKTKLSDKII KLDLIDAMIQ GANPNVSDEE
SAQIERSACP TCGSCSGMFT ANSMNCLNEA LGLALPGNGS LLATHADRKQ LFLDAGKHIV
ALTKRYYEQD DVSALPRNIA NKAAFENAMI LDIAMGGSTN TVLHLLAAAQ EGEIDFSMTD
IDHLSRKVPH LCKVAPSTQK YHMEDVHRAG GVIGILGELD RAGLLNRDVS NVLGLNLTQT
LEAYDVMLTQ DEGVKQMYAA GPAGIRTTKA FSQDCRYPSL DTDREEGCIR TREHAYSQDG
GLAVLYGNIA ADGCIVKTAG VDKDSLTFRG PAKVFESQDE AVEAILGGKV VAGDVVVIRY
EGPKGGPGMQ EMLYPTTYLK SMGLGKSCAL LTDGRFSGGT SGLSIGHVSP EAASGGLIGL
VQDGDFINID IPNRGIVLDV SEAELAARRE TEEAHGDAAW SPKGRERQVS YALRAYAMLA
TSADKGAVRD KSKLGG