Gene YPK_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_1031 
Symboltas 
ID6089490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp1159021 
End bp1160061 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content50% 
IMG OID641596094 
Productputative aldo-keto reductase 
Protein accessionYP_001719785 
Protein GI170023280 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0194633 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATATC ATCGTATCCC CCACAGTTCA TTGGAAGTAA GCCTGCTGGG TCTGGGCACC 
ATGACGTTTG GTGAGCAAAA CAGTGAAGCC GATGCCCACG CTCAACTGGA TTATGCCGTT
GCAGCCGGTA TTAACTTGAT TGATACCGCA GAAATGTACC CGGTGCCTCC AAGGCCAGAA
ACTCAGGGAT TAACTGAGCA ATATATTGGT CGCTGGATAA AAGCACGCGG TTGCCGCGAA
AAAATTATTT TAGCCAGTAA AGTCTCCGGG CCATCACGCG GTGATGATCA GCCCATTCGC
CCGAATATGG CATTGGATCG GAAGAATATC CGCATCGCGC TGGAAGAGAG CCTTAAGCGC
CTTAATACCG ATTATCTTGA TATTTATCAG TTACATTGGC CTCAGCGGGA AACAAACTGT
TTCGGTAAGC TGAATTATCG CTATAGCGAG CAAACTGCCG TTGTGACCTT GCTGGAAACA
CTGGAAGCCC TGAACGAGCA AGTGCGGGCC GGTAAAATTC GTTATATCGG GGTATCCAAT
GAAACACCAT GGGGTGTCAT GCGTTATCTG CAACTGGCAG AAAAGCATGA TCTACCGCGT
ATCGTCTCTA TTCAGAACCC TTACAGCCTG TTAAACCGTA GCTTTGAAGT GGGTCTGGCA
GAGATTAGCC AGCACGAAGG CGTTGAGTTA TTAGCTTATT CCAGCCTGGC TTTTGGCACA
CTGAGCGGCA AATACCTTAA TGGCGCGAAA CCTGCCGGTG CACGCAACAC CTTGTTCAGC
CGTTTCACCC GTTACTCTGG GCCACAAACC CAATTAGCGG TGGCTGAATA TGTGTCGCTG
GCAAAACACC ATGGGCTGGA TCCGGCGCAG ATGGCTCTGG CCTTTGTGCG GCAACAGCCG
TTTGTTGCCA GTACGCTACT CGGCGCAACG TCGCTGGAAC AACTGAAAAG TAATATTGAT
AGCCAAAATA TCGTGCTGAG TCAGGAAGTA CTGGATGCAC TGGAAGCGAT CCATACCCGC
TATACCTTCC CCGCACCTTA A
 
Protein sequence
MQYHRIPHSS LEVSLLGLGT MTFGEQNSEA DAHAQLDYAV AAGINLIDTA EMYPVPPRPE 
TQGLTEQYIG RWIKARGCRE KIILASKVSG PSRGDDQPIR PNMALDRKNI RIALEESLKR
LNTDYLDIYQ LHWPQRETNC FGKLNYRYSE QTAVVTLLET LEALNEQVRA GKIRYIGVSN
ETPWGVMRYL QLAEKHDLPR IVSIQNPYSL LNRSFEVGLA EISQHEGVEL LAYSSLAFGT
LSGKYLNGAK PAGARNTLFS RFTRYSGPQT QLAVAEYVSL AKHHGLDPAQ MALAFVRQQP
FVASTLLGAT SLEQLKSNID SQNIVLSQEV LDALEAIHTR YTFPAP