Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_59568 |
Symbol | AKR1 |
ID | 4838756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 951406 |
End bp | 952365 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390071 |
Product | Protein with similarity to aldo-keto reductases |
Protein accession | XP_001384488 |
Protein GI | 150865323 |
COG category | [R] General function prediction only |
COG ID | [COG0656] Aldo/keto reductases, related to diketogulonate reductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0630567 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAACC TCACTACCCT GATCCAATTG ACCCAACAAA GCACCTACAA GTTGAATAAC GGACAACATA TTCCGGTTGC CGGATATGGT CTTTACTTAT GTCCCGATGA ACAATCGAAA CACTTGGTCT ACAAAGCTTT GGAAGCCGGT TACAGACACA TTGACAGTGC TGTATACTAC GGCAACCAAA GACTGGCAGC ACAGGCTATT GCGGAGTTCC TCAAAGATCA CCCGGAAGTT AAGCGTGAAG ACATCTGGTT CACTACAAAG TTGACTAACG ATGCGCATGG GTACGAAGAA ACCAAGAAGG AGATTGCTCT TATTGCTAGT GAAATTAAGG AATCGCTTGG CTATCTCGAT TTGGTCTTGC TCCACTCGCC AAAATCCAAC AAGGAAAGAA GATTGGGCAC TTGGAAAGCG TTGCAAGAAT TCGTTTTGCA CCCACAGAAC GAAGTGCTAA ACATTCGCTC CATCGGAGTT TCCAACTTCG GAGTCGACCA TTTGGAAGAA ATCTTGAACT GGGATGGTTT ATTAGTGAAG CCTGTGCTTA ACCAATTGGA ATTGCACCCA TGGTTGCCGC GCTTGGAATT GCGTGAATAC TTGTGTAAGC ACGATATACT TGCCGAAGCA TACTCTCCCT TGACTCAAGG TTACATGTTG AACGATCCAG AATTATTGGA ATTGGAAAAG AAGTCGGGCA TCTCTAAAAT CGAAATCCTC ATTAAGTGGT CCTATTTACA GGGATTTGTC GTTTTAGTTA AGACTGAGAA AGAGGAAAGA ATTGCTCAAA ATCTCAACAT CTTGCCGAAG GGAAACAATG ACATACTCGG TGAAACTTCA AACTTGGGCA AGATCGAGTT GCCACTGTCT GTATTGGAAG CTCTAGACAA GCCGGACTCT CATGTCGTCT TGACTTGGGA TAATGTCGAT CCTACTCTCT ACAAGGACGG CGACATTTAG
|
Protein sequence | MSNLTTSIQL TQQSTYKLNN GQHIPVAGYG LYLCPDEQSK HLVYKALEAG YRHIDSAVYY GNQRSAAQAI AEFLKDHPEV KREDIWFTTK LTNDAHGYEE TKKEIALIAS EIKESLGYLD LVLLHSPKSN KERRLGTWKA LQEFVLHPQN EVLNIRSIGV SNFGVDHLEE ILNWDGLLVK PVLNQLELHP WLPRLELREY LCKHDILAEA YSPLTQGYML NDPELLELEK KSGISKIEIL IKWSYLQGFV VLVKTEKEER IAQNLNILPK GNNDILGETS NLGKIELPSS VLEALDKPDS HVVLTWDNVD PTLYKDGDI
|
| |