Gene PICST_59568 details

Gene Information

Locus tag: PICST_59568
Symbol: AKR1
ID: 4838756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 951406
End bp: 952365
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 12
GC content: 44%
IMG OID: 640390071
Product: Protein with similarity to aldo-keto reductases
Protein accession: XP_001384488
Protein GI: 150865323
COG category: [R] General function prediction only
COG ID: [COG0656] Aldo/keto reductases, related to diketogulonate reductase
TIGRFAM ID:
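
The coordinates and lengths in this record agree with each other, which a few lines of Python can confirm. This is a minimal sketch under two assumptions that the record does not state explicitly: the start and end positions are 1-based and inclusive, and the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 951406
end_bp = 952365

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 960 bp
print(protein_length, "aa")  # 319 aa
```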


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0630567
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAACC TCACTACCCT GATCCAATTG ACCCAACAAA GCACCTACAA GTTGAATAAC 
GGACAACATA TTCCGGTTGC CGGATATGGT CTTTACTTAT GTCCCGATGA ACAATCGAAA
CACTTGGTCT ACAAAGCTTT GGAAGCCGGT TACAGACACA TTGACAGTGC TGTATACTAC
GGCAACCAAA GACTGGCAGC ACAGGCTATT GCGGAGTTCC TCAAAGATCA CCCGGAAGTT
AAGCGTGAAG ACATCTGGTT CACTACAAAG TTGACTAACG ATGCGCATGG GTACGAAGAA
ACCAAGAAGG AGATTGCTCT TATTGCTAGT GAAATTAAGG AATCGCTTGG CTATCTCGAT
TTGGTCTTGC TCCACTCGCC AAAATCCAAC AAGGAAAGAA GATTGGGCAC TTGGAAAGCG
TTGCAAGAAT TCGTTTTGCA CCCACAGAAC GAAGTGCTAA ACATTCGCTC CATCGGAGTT
TCCAACTTCG GAGTCGACCA TTTGGAAGAA ATCTTGAACT GGGATGGTTT ATTAGTGAAG
CCTGTGCTTA ACCAATTGGA ATTGCACCCA TGGTTGCCGC GCTTGGAATT GCGTGAATAC
TTGTGTAAGC ACGATATACT TGCCGAAGCA TACTCTCCCT TGACTCAAGG TTACATGTTG
AACGATCCAG AATTATTGGA ATTGGAAAAG AAGTCGGGCA TCTCTAAAAT CGAAATCCTC
ATTAAGTGGT CCTATTTACA GGGATTTGTC GTTTTAGTTA AGACTGAGAA AGAGGAAAGA
ATTGCTCAAA ATCTCAACAT CTTGCCGAAG GGAAACAATG ACATACTCGG TGAAACTTCA
AACTTGGGCA AGATCGAGTT GCCACTGTCT GTATTGGAAG CTCTAGACAA GCCGGACTCT
CATGTCGTCT TGACTTGGGA TAATGTCGAT CCTACTCTCT ACAAGGACGG CGACATTTAG
 
Protein sequence
MSNLTTSIQL TQQSTYKLNN GQHIPVAGYG LYLCPDEQSK HLVYKALEAG YRHIDSAVYY 
GNQRSAAQAI AEFLKDHPEV KREDIWFTTK LTNDAHGYEE TKKEIALIAS EIKESLGYLD
LVLLHSPKSN KERRLGTWKA LQEFVLHPQN EVLNIRSIGV SNFGVDHLEE ILNWDGLLVK
PVLNQLELHP WLPRLELREY LCKHDILAEA YSPLTQGYML NDPELLELEK KSGISKIEIL
IKWSYLQGFV VLVKTEKEER IAQNLNILPK GNNDILGETS NLGKIELPSS VLEALDKPDS
HVVLTWDNVD PTLYKDGDI
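
The record lists translation table 12, which is NCBI's Alternative Yeast Nuclear Code. In that code CTG is read as serine rather than leucine, which is why the seventh codon of the gene sequence (CTG) appears as S in the protein sequence. The sketch below shows the effect on the first 30 bases of the gene using only the Python standard library. The helper names (`build_table`, `translate`) are hypothetical, and the table is built from the standard code with the single CTG change applied, which is an assumption that covers sense codons only and ignores differences in start-codon usage.

```python
BASES = "TCAG"
# Standard genetic code in NCBI order (first base varies slowest, third fastest).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas):
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, aas))


# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = build_table(STANDARD_AAS)
TABLE_12["CTG"] = "S"


def translate(dna, table=TABLE_12):
    """Translate a DNA string, ignoring whitespace and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_block = "ATGTCCAACC TCACTACCCT GATCCAATTG"
print(translate(first_block))                             # MSNLTTSIQL
print(translate(first_block, build_table(STANDARD_AAS)))  # MSNLTTLIQL
```

Translating with the standard code instead yields L at position 7, which does not match the protein sequence shown in this record.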