Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_74116 |
Symbol | SOR5 |
ID | 4840939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 481467 |
End bp | 482743 |
Gene Length | 1277 bp |
Protein Length | 378 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640392254 |
Product | Sorbitol dehydrogenase |
Protein accession | XP_001386484 |
Protein GI | 150866775 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.145167 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATTCTCATTA ATTGAACTCG CCATTTTATC GCATAGTATA TATTCATTCC AAGACTGTGC CATTGATAGA TTGAATCTCA CGCAGGCACA ACAATGAGGG CAATAGTTTA CCACGGAAAC AAGGACGTAA GGTACGATCC CCAGTATCCG GAACCCACTA TAAATCATCC CAAAGAGGTC AAAATCAAAA TTGACTACTG CGGAATATGT TGCAGTGATT TAGATGAATA CCGGGATGGT CCAATATTTT TCTCGCCGGA CCACCAGGCC ATTTCCCGGA AACCCTTCCC ACAAGCCATG GGCCATGAGA TGTGTGGAGA AATCGTGGAA TTGGGCTCTG CCGTTAATGC CAATTTGAAG GTTGGTCAAA AGGTTGTCGT CGAATCCACA GGAACCTGTT TGGATCGGGA ATATCTAGAG TTACCTGAAA AGAAGTGTCT TTCATGTGTT CAAGGAGCCT ATAACCATTG CGACCACATA GGATTTTATG GATTGGGCTT CTCCGATGGA GGATTTGCTG ATTTCTGTGT TGTTGGAGAA CATCATGTAA TTCCATATTC CGAGGAAGAT TTGCCAGTTG AGATTGCGGC TCTCACGGAG CCTTTAGCAG TCAGCTGGCA TGGAGTGCGT GTTTCAAATG TCACCAAGGA CGATTCGGCT TTGGTCCTTG GAGCAGGCTC GATTGGGTTG ACCACCATAA TTGCACTCAA AGGTCATGGG GTGAGGAATA TAATAGTTAG TGAGCCAAAA GAATCTAGAA GAGCTTTGGC TGAAAAGTTC CATGTCCAGG TTTTTGATCC CACGCCCTTT AATCACGACG AGAAGCTGTT GACTAAGGAA TTGTTGAAGT TGAGTCCTCT TGGATGGGGT TTCAGTAGAA TTTTCGATTG TTCAGGTAAA AAGGAAACGT TTGACATCTC CTTGAGTGCA TTGAAGACAA CTGGTATTGC CACCAATGTA GCCTTTTGGG CTCACAACAA GCAAACAGTG TTCCCTATGG ATTTGACATT GCACGAAAAG AATCTCACCA GCTCTCTGTG CTACGTTAGA GAAGACTTCG AAGAAGTCAT CCAAGCGTAC AGAGATGGAT TAATCGACCC CGAGGAGGTA AGACAGATAG TCACCAAAGT CGTAGCACTT AAGGATGGTT TTGAAGAAGG TTTTCTCCAA TTAATAAACC ATAAAGGAAA GCACATTAAG GTCCTAGTAT CACCATTGCC AAAAATCTGA ATAGATATTC ATATATGGTA TTTAGATAAG TCATATCATC TGTGGAA
|
Protein sequence | MRAIVYHGNK DVRYDPQYPE PTINHPKEVK IKIDYCGICC SDLDEYRDGP IFFSPDHQAI SRKPFPQAMG HEMCGEIVEL GSAVNANLKV GQKVVVESTG TCLDREYLEL PEKKCLSCVQ GAYNHCDHIG FYGLGFSDGG FADFCVVGEH HVIPYSEEDL PVEIAALTEP LAVSWHGVRV SNVTKDDSAL VLGAGSIGLT TIIALKGHGV RNIIVSEPKE SRRALAEKFH VQVFDPTPFN HDEKSLTKEL LKLSPLGWGF SRIFDCSGKK ETFDISLSAL KTTGIATNVA FWAHNKQTVF PMDLTLHEKN LTSSSCYVRE DFEEVIQAYR DGLIDPEEVR QIVTKVVALK DGFEEGFLQL INHKGKHIKV LVSPLPKI
|
| |