Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_64024 |
Symbol | SGD1 |
ID | 4840907 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | - |
Start bp | 666876 |
End bp | 669890 |
Gene Length | 3015 bp |
Protein Length | 988 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640392222 |
Product | suppressor of glycerol defect |
Protein accession | XP_001386721 |
Protein GI | 150866950 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.109115 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGGA GGAGAAACAA TAACGAAGAA AACGAAGAGG AGAGAGGAAT TCCTTCAAAA CTTTTACAGC AGATACAAAC AAAGGAGGAC AAAGGTGACT ACAACGATGA GGATAGATTT ACAAAGTTCA ACTCCAAGAA GAGAAAGGAA AAGCCTTTGT CAAGAAAAGA CAGAAGACAG CAAGAAAGAC AACTAAAGAA ACAAAAGAAA AACACCAAGC CAATTAATAA CGAAGCCAAA AGGAAAGCAA TCACAAACAA TAATAGTAAA AGCATCAAGA CTAGCCAGAA ACCAGTTAAG TCTCAAACGA AAGAAGATTC TCGCGCAGAC TTTAAGGCAT TGAAGACTAA CAGGGCGCAA ACTTCAACAA AAACTATAGA TGATCCTGTG GAGCAACTCA GATTGTTGAA AGAAAAGAAA AAAGGATCTC AGCCAAATGA ATTCAGAGTT GTCAAAGCTG ACGAACTCCT GGAAGATTCT GACGGAGACG ACAACTTCGA TGATTTCGAC AATCTGGATG ATTTCGACGA TTATGATGAT GAAGTAGAAG AAAATCCATT GGAAGTTTTA AAAAAGTTGA AGGAGGTGAA GAAAGGTTCC AAGAACAAAT CTGAAGTAAG AGTCATCAAG TTGGACGAAT TGCTGGACGA TGAACTTTCG GAAGATGTTT CGGAAGATGA ACTTTCGGAA GATGAAGAAG AAGATGAAAA CGAAGACATA GATGACAATG ATTCTAATGT GAAAGACAAT TTCGATGGCT TTGACGAAGA AGAGAATGAA GACTCTGAGG ATGGCTTCAT CGTAGACGAC GATAACATAG ATGCAGATGA CTTTGATAGC TTTGATGAAG AGGAAATTTT GGAAGAAGAG GAAGATCCTT TAGCCAAATT GAAGGCATTG AAGGAACAAA AGAGAAATGT AAAGAAGGAT AATAAACCAT CTAAAGAAGT CAAGAATTCC AGAGTGATCA GTCAACATGA ATTAGAGCTC ATGAAAAGAG ATGAAAATGA TATGGAATTC TATGCTAAGA AGCTCGGTTT GAAGAACGGT AAAAAATCAA AAATTGCCAA AACAGACGAT GATGACATTA TTGGAGGATT ATTAGACGGA TTGGACTTTG ATTTCGAAGA AGAAATGGAA CCACTGGAAG AAGAAGCTGA TGATTTCGAT GAAGGGGAAG AAGAAAATTA TTCTGGTGAT GATTTTGATG ATGATGAAGA GGACGCACCA AAGAAGGAAA ATCCTTATGT AGCCCCTATT CAAAATAAGA CTTCGGATGA GGTGTCAGAC TCAAGTAATC ATGCTGCTGG TTCATATATC CCACCAGCAT TGCGTAGAAA GATGGCATTA GAAGGGGGTA ATGTTTCTGA AGAGATTCTC AACTTGAGAA AAGCCATCAA GGGTCCTTTG AACAAGTTGT CTGAGTCCAA TATCAGCAGC ATTGTCAATG AAATCAATAC ATTGTATTTA TCGAACTCAA GAAATTCGGT AAATAATGAG TTGACTACTA TCGTGATGGA CAGTATCATT CAACAGGGAA GATTGTTGGA TACCTTTGTA TACTTACACG CAACACTTGT CGTTGCTATC TACCGTTTGC AGGGCGTCGA ATTTGGTGCT CACTTCATTC AGTCAATGAT TGAAAAATTT GAAAGCTATC ATAAGGAAAC CGGAAAAGGA AAAGAAGCTT CTAATATGAT ATCACTCTTA TCTTCGGTTT ATCAATTTCA GTTGGTATCA TCTAAATTAT TGTACGATGT GATCAAGGTT TTGATAAACG AATTAAATGA AAACAACGCT GAACTCTTGC TCAGATTGAT CAGAAATTCA GGTAACCAAA TGAGGTCTGA CGATCCTTCT GCCTTGAAAG ATATTGTAAT TCTTTTGAAT GATGCCAAAT CATCCATTCC ACAGTCCCAG ATGAGTACAA GAACACAATT CTTGGTTGAA ACAATTACTT CACTTAAGAA CAACAAACTT AAGATCAACA ATGAATCCAG TCACCAGCTC GCTATCAGAT TGAAGAAATT CTTAGCAACA ATCAACAACA ACAAATTCAA CGATCCAATA CAAGTTTCTC TCAATGATAT CCACAGCATT GCTTCGAAAG GGAAGTGGTG GTTAGTTGGT GCTGCTTGGA AAGGTAGTGA TGGAGATGAC AAAAATGCAG AGCCTGACCT CAACAGAGAT GCTATCAATG ATATATTAGA CAACTCTGAA CCTAATTGGA TGGACTTAGC TAGATCTCAG CGTATGAACA CTGATATCAG AAGAGCTATT TTCATATCCA TCATGTCAGC AAATGATTAC ATTGATGCAT TAACTAAGTT GGACAAGTTG GCTTTGAAAC GTTCTCAAGA TAGAGAAATT CCTCGTGTAT TGATACATTG TACAAGTGTT GAGCCAGCGT ATAATCCTTA CTATGGAATC TTAGCAAGCA AGCTTTGTGA GGATCACAGG TATAGAAAGA CTTTCCAATT CATGCTCTGG GATCTTATTA AAGAATTCGA AGGTTCCACT TCTGACGAAG ATGAAGATTT TGTTGGGTTC GATCATGGTG ATGATGACGA TGAAACCAAG TTGAAGAGAA TCTTGAATTT GGGTCGATTT TTCGGATTTC TCTTGGCGGA AGGCTCTTTG CCTTTGCACT TGCTTAGAAC TGTTAACTTC TTAACTGCTG CTAGTGACAC TATCTTGTTC ATGGAAGTTG TTTTTGTTAG CTTTTTGGAT AATATCGGGA AGAAGTCACA GATCAATTCA GTGGGAGCTG GTCTCGGTAA AAGATCGAAG AATATGTATG AACAAAAGTT CGATGACAGG TTATTAATAG AGAGGGTGAT CAAAGCCCAG GAACAGATGA CTTTGCTCAG AGGAATTCAA TTCTTCTTAC AGGACAAGGT TAGAAAAAGT GATATTATCT CTGGTAAGAA GCAGACCAGA AGAGTTGAGT GGGGAATAAA TGCTATGGTA GATATCATAG AGGAATTTGT GAGATCTAAC GACGATAAGA ACTAG
|
Protein sequence | MNRRRNNNEE NEEERGIPSK LLQQIQTKED KGDYNDEDRF TKFNSKKRKE KPLSRKDRRQ QERQLKKQKK NTKPINNEAK RKAITNNNKD SRADFKALKT NRAQTSTKTI DDPVEQLRLL KEKKKGSQPN EFRVVKADEL SEDSDGDDNF DDFDNSDDFD DYDDEVEENP LEVLKKLKEV KKGSKNKSEV RVIKLDELSD DELSEDVSED ELSEDEEEDE NEDIDDNDSN VKDNFDGFDE EENEDSEDGF IVDDDNIDAD DFDSFDEEEI LEEEEDPLAK LKALKEQKRN VKKDNKPSKE VKNSRVISQH ELELMKRDEN DMEFYAKKLG LKNGKKSKIA KTDDDDIIGG LLDGLDFDFE EEMEPSEEEA DDFDEGEEEN YSGDDFDDDE EDAPKKENPY VAPIQNKTSD EVSDSSNHAA GSYIPPALRR KMALEGGNVS EEILNLRKAI KGPLNKLSES NISSIVNEIN TLYLSNSRNS VNNELTTIVM DSIIQQGRLL DTFVYLHATL VVAIYRLQGV EFGAHFIQSM IEKFESYHKE TGKGKEASNM ISLLSSVYQF QLVSSKLLYD VIKVLINELN ENNAELLLRL IRNSGNQMRS DDPSALKDIV ILLNDAKSSI PQSQMSTRTQ FLVETITSLK NNKLKINNES SHQLAIRLKK FLATINNNKF NDPIQVSLND IHSIASKGKW WLVGAAWKGS DGDDKNAEPD LNRDAINDIL DNSEPNWMDL ARSQRMNTDI RRAIFISIMS ANDYIDALTK LDKLALKRSQ DREIPRVLIH CTSVEPAYNP YYGILASKLC EDHRYRKTFQ FMLWDLIKEF EGSTSDEDED FVGFDHGDDD DETKLKRILN LGRFFGFLLA EGSLPLHLLR TVNFLTAASD TILFMEVVFV SFLDNIGKKS QINSVGAGLG KRSKNMYEQK FDDRLLIERV IKAQEQMTLL RGIQFFLQDK VRKSDIISGK KQTRRVEWGI NAMVDIIEEF VRSNDDKN
|
| |