Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_71431 |
Symbol | |
ID | 4838284 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 337073 |
End bp | 340172 |
Gene Length | 3100 bp |
Protein Length | 807 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389599 |
Product | predicted protein |
Protein accession | XP_001383345 |
Protein GI | 150864507 |
COG category | [B] Chromatin structure and dynamics [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.71958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AACACAGGGA CCACTCCAGA AGCAGCAACA AATCATGTCG GAGGAACCAA CCGGCGTTGC GCCCGAATCG GCTCCGAAAA AAAGGCTTCT TGAAGACGCC GACACCGATA CTAAGTCTAC GTCTAACACT TCCGTTTCTG GCACTGTTGA AAATCTGACA GACATAAATT TTCATCCAGA CACCAATTCA ACAGAAGCTT CAAAAGGGAA TCCGGACCAG ATACCAGTGT CGAATGGCGA TGACCATAAG AGAATAAAGC TTGAGCCCAA AGTTGGAAAC GAGTCTTCTG TTACTCTTGC GGAGCAAAAT GAGCCTCTAG TTGAAAATAA TGTGGAAACA TACATTGAAC AATTAGATAA GGAAAAGTCG GCAACGGATC AAGATGAACC AGCAAACAAG ATAGCCACAT TCGAGCCAGC AAAGCAGGCA GTAGAACAAG CAAAACCTCA GAGTCAAGAA GATAAGCAAG TAAGTGCAAA TGATGAAGAT GGAAGAGATA AAGAACATAG AAAGGACAAA GAAGAACTGA AAGAAAACGA GGAAATAACT ATAGAAACCC AAGGAATACC AGAAATTGCT AAGCCTGCTG TTGAAGTAAA AATCGAGAAA GGCGTGAAAG ACGGAAATGT AAAATCTGAA ACGAAACTTC TTTCGGAGTC TACAGCTGAT GATAAAATCA AGCAAGAGGA GCCACCAAAA GATAAGAATG CTCACAAAGT GGTTGGTATT ACTGGAGAAA AACTTGCGGT TGCTTCCAAC GGTGTTGGTG TTAAACAAAG CACGCCTCAG ATTGTAGTAG TCCCACCCAC CAAACCTCAG TTGTTCTATA GTCCGTTGAA GACTGGATTG GTGTATGACG TGAGAATGAG ATACCATGCA AAAATTTTCA CTTCTTACTT TGAGTACATT GACCCACATC CCGAAGACCC GCGTCGTATC TACCGAATCT ACAAGAAGCT TGCAGAGGCT GGCCTTATAG TAGATAGCTC GCTATCAGGT GTCGAGGATA TAGGTCCTCT TATGGTGAAA ATCCCCATCA GGGAAGCCAC CGCTGAGGAA ATCTTGGAAG TCCATTCTGA AAGTCATCTC AAATTCATCC AGTCGACAGA GACTATGTCT AGAGAGCGTT TATTAGAGGA GACCGAAAAG GGTGACTCTA TTTATGTGAA TAACGATTCG TACTTCTCAG CTAAACTCTC ATGTGGAGGA ACCATTGAGG CTTGTAAGGC AGTTATTGAA GGCAGAGTGA AAAACTCGTT GGCTATTGTG AGACCTCCGG GCCATCATGC TGAACCAGAA ACTCCAGGTG GGTTCTGTCT TTTCAGCAAT GTTGCTGTTG CAGCCAAGAA CATCCTCAAG GCATACCCTG AGTCTGTACG CAAGATCGTT ATTGTTGATT GGGACATCCA CCACGGAAAT GGAACACAAA AGTCTTTCTA TGACGATCCT AGAGTTCTCT ACATTTCCTT ACATAGATAT GAGAATGGTA GATTCTACCC CGGTACCAAG TACGGAGGAG CAGATCAGGT AGGAGAAAAG GATGGGGAAG GGTATAATCT CAATATTCCG TGGAGAAACC CAGGAATGCA CGACGGAGAC TATATATATG CATTCAACCG GGTAGTTCTT CCTGTTATTC TTGAATTTGA CCCTGATCTC ATTATCGTAA GTTCTGGATT CGACGCTGCT GACGGCGATA TCATTGGTGG ATGTCATGTG ACACCTGCTG GATATGGCTA CATGACCCAT TTGTTGAAGG GCATAGCTAA GGGTAAGTTG GCGGTGATTT TAGAAGGAGG CTACAATTTG GACTCTATCA GCAAGAGTGC TCTAGGTGTA GCAAAAGTTC TTGTAGGAGA ACCTCCAGAA GCCACAGTGT CTATGCAGCC TCATTTGGAG ACTATTGAAG TAATAGATGA AGTCGTTAAG GTTCAATCAA GATATTGGAA GTCTTTGAGG TACGGGGTTC CTACAACTTC ATTTGACGAT GTCTACGACT TGAACGGCAC TGGCTCTAAC TACCAATTGC TCAACATTGG TGAGCCAATA AGAGCCAACC AAGTCAATGA GTTGTTCAAC AAGTACTCGT TTGTCAACCT CCCCATAATT TCCAGTGCTA CTGAAGGAGG AGAAAAGAAT GGCATCTTCA GCACAGACTT GCCATCGCAT TTGGACGATA TCATAATAGC TAGTCCAGAT ATCTATGAAA GCACAGTAGT AGTTCTTACG ATCCACGATC CACCGGAGAT CTGGGCCAAC ATCAACCCAA TCAACGGAAG CATTGAGGGC AATTCTTCTG TAATATTGGA ACATCCCTTG ATGCAGATAA TGGAGAAGAT GAAGAAGGAA ACAGACAAAA GCGATTCCAA AGAAAAAATT GGCTACATAG ATATCAACGT TCCGTCATAC CAGTTGCCCA TTCCCTTTGG AAATTCAAAG CAGACTTCTA CCTATAATCC CACATTTTTC GCCCAGGAGC TCTTGCTTTA CATCTGGGAC AACTATTTGG CCTACTTTTC GCAGTTAAAG AAGTTAGTGT TTGTCGGCTT TGGTGATTCG TACCAAGCTA TAGTACATCT ATATGGTAAA CGACCATCTC AGGATATCAA AGATTTAGTT AAGGGTACGG TAGCCTTCGT GAACAGGTCG AACTTGAAGG CTTTGGTCCC AGTGATGGAT GAGTCAATGG TGGACTGGTA CTATCAGAAC TCTGTGATCT TCACCAGTTG TTCGAATCCA TGTTGGGTTA ATCTGAACGG AACTACTCGT TTAGGAAATG GTTCCACTGA AGCAAACGGA GGTGACGATA GCAACAAGAG ACCAAGGAGA AAATTTGGCA GAGTGTTGAA GGCATCTGTG GATGGCTTGT ACGATATAAT CGCCGAAAGA TTCGACGAAG GTGTTGACTT CATCTTGGAT TCCATCGAAG AGTACTCCAG CAGCGAGAGC AGCAACTGAG TTGCAATATG GCGACTTTGT AATTGCTCTC TAATGCTGGC AGCTTTTAGT TCTCCTTATG TATTATACGA ATAGATGATA AATCATTATC TATTTTACAG TAGTCATTAT GTATTATGCG AATAGATGAA TAATTTTCAT
|
Protein sequence | MSEEPTGVAP ESAPKKRLLE DADTDTKSTS NTSVSGTVEN STDINFHPDT NSTEASKGNP DQIPVSNGDD HKRIKLEPKV GNESSIVVVP PTKPQLFYSP LKTGLVYDVR MRYHAKIFTS YFEYIDPHPE DPRRIYRIYK KLAEAGLIVD SSLSGVEDIG PLMVKIPIRE ATAEEILEVH SESHLKFIQS TETMSRERLL EETEKGDSIY VNNDSYFSAK LSCGGTIEAC KAVIEGRVKN SLAIVRPPGH HAEPETPGGF CLFSNVAVAA KNILKAYPES VRKIVIVDWD IHHGNGTQKS FYDDPRVLYI SLHRYENGRF YPGTKYGGAD QVGEKDGEGY NLNIPWRNPG MHDGDYIYAF NRVVLPVILE FDPDLIIVSS GFDAADGDII GGCHVTPAGY GYMTHLLKGI AKGKLAVILE GGYNLDSISK SALGVAKVLV GEPPEATVSM QPHLETIEVI DEVVKVQSRY WKSLRYGVPT TSFDDVYDLN GTGSNYQLLN IGEPIRANQV NELFNKYSFV NLPIISSATE GGEKNGIFST DLPSHLDDII IASPDIYEST VVVLTIHDPP EIWANINPIN GSIEGNSSVI LEHPLMQIME KMKKETDKSD SKEKIGYIDI NVPSYQLPIP FGNSKQTSTY NPTFFAQELL LYIWDNYLAY FSQLKKLVFV GFGDSYQAIV HLYGKRPSQD IKDLVKGTVA FVNRSNLKAL VPVMDESMVD WYYQNSVIFT SCSNPCWVNS NGTTRLGNGS TEANGGDDSN KRPRRKFGRV LKASVDGLYD IIAERFDEGV DFILDSIEEY SSSESSN
|
| |