Gene YpsIP31758_0979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_0979 
Symboltas 
ID5386061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp1170465 
End bp1171505 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content50% 
IMG OID640863946 
Productputative aldo-keto reductase 
Protein accessionYP_001399963 
Protein GI153947484 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00000256881 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATATC ATCGTATCCC CCACAGTTCA TTGGAAGTAA GCCTGCTGGG TCTGGGCACC 
ATGACGTTTG GTGAGCAAAA CAGTGAAGCC GATGCCCACG CTCAACTGGA TTATGCCGTT
GCAGCCGGTA TTAACTTGAT TGATACCGCA GAAATGTACC CGGTGCCTCC AAGGCCAGAA
ACTCAGGGAT TAACTGAGCA ATATATTGGT CGCTGGATAA AAGCACGCGG TTGCCGCGAA
AAAATTATTT TAGCCAGTAA AGTCTCCGGG CCATCACGCG GTGATGATCA GCCCATTCGC
CCGAATATGG CATTGGATCG GAAGAATATC CGCATCGCGC TGGAAGAGAG CCTTAAGCGC
CTTAATACCG ATTATCTTGA TATTTATCAG TTACATTGGC CTCAGCGGGA AACAAACTGT
TTCGGTAAGC TGAATTATCG CTATAGCGAG CAAACTGCCG TTGTGACCTT GCTGGAAACA
CTGGAAGCCC TGAACGAGCA AGTGCGGGCC GGTAAAATTC GTTATATCGG GGTATCCAAT
GAAACACCAT GGGGTGTCAT GCGTTATCTG CAACTGGCAG AAAAGCATGA TCTACCGCGT
ATCGTCTCTA TTCAGAACCC TTACAGCCTG TTAAACCGTA GCTTTGAAGT GGGTCTGGCA
GAGATTAGCC AGCACGAAGG CGTTGAGTTA TTAGCTTATT CCAGCCTGGC TTTTGGCACA
CTGAGCGGCA AATACCTTAA TGGCGCGAAA CCTGCCGGTG CACGCAACAC CTTGTTCAGC
CGTTTCACCC GTTACTCTGG GCCACAAACC CAATTAGCGG TGGCTGAATA TGTGTCGCTG
GCAAAACACC ATGGGCTGGA TCCGGCGCAG ATGGCTCTGG CCTTTGTGCG GCAACAGCCG
TTTGTTGCCA GTACGCTACT CGGCGCAACG TCGCTGGAAC AACTGAAAAG TAATATTGAT
AGCCAAAATA TCGTGCTGAG TCAGGAAGTA CTGGATGCAC TGGAAGCGAT CCATACCCGC
TATACCTTCC CCGCACCTTA A
 
Protein sequence
MQYHRIPHSS LEVSLLGLGT MTFGEQNSEA DAHAQLDYAV AAGINLIDTA EMYPVPPRPE 
TQGLTEQYIG RWIKARGCRE KIILASKVSG PSRGDDQPIR PNMALDRKNI RIALEESLKR
LNTDYLDIYQ LHWPQRETNC FGKLNYRYSE QTAVVTLLET LEALNEQVRA GKIRYIGVSN
ETPWGVMRYL QLAEKHDLPR IVSIQNPYSL LNRSFEVGLA EISQHEGVEL LAYSSLAFGT
LSGKYLNGAK PAGARNTLFS RFTRYSGPQT QLAVAEYVSL AKHHGLDPAQ MALAFVRQQP
FVASTLLGAT SLEQLKSNID SQNIVLSQEV LDALEAIHTR YTFPAP