Gene PICST_71431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_71431 
Symbol 
ID4838284 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp337073 
End bp340172 
Gene Length3100 bp 
Protein Length807 aa 
Translation table12 
GC content43% 
IMG OID640389599 
Productpredicted protein 
Protein accessionXP_001383345 
Protein GI150864507 
COG category[B] Chromatin structure and dynamics
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.71958 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AACACAGGGA CCACTCCAGA AGCAGCAACA AATCATGTCG GAGGAACCAA CCGGCGTTGC 
GCCCGAATCG GCTCCGAAAA AAAGGCTTCT TGAAGACGCC GACACCGATA CTAAGTCTAC
GTCTAACACT TCCGTTTCTG GCACTGTTGA AAATCTGACA GACATAAATT TTCATCCAGA
CACCAATTCA ACAGAAGCTT CAAAAGGGAA TCCGGACCAG ATACCAGTGT CGAATGGCGA
TGACCATAAG AGAATAAAGC TTGAGCCCAA AGTTGGAAAC GAGTCTTCTG TTACTCTTGC
GGAGCAAAAT GAGCCTCTAG TTGAAAATAA TGTGGAAACA TACATTGAAC AATTAGATAA
GGAAAAGTCG GCAACGGATC AAGATGAACC AGCAAACAAG ATAGCCACAT TCGAGCCAGC
AAAGCAGGCA GTAGAACAAG CAAAACCTCA GAGTCAAGAA GATAAGCAAG TAAGTGCAAA
TGATGAAGAT GGAAGAGATA AAGAACATAG AAAGGACAAA GAAGAACTGA AAGAAAACGA
GGAAATAACT ATAGAAACCC AAGGAATACC AGAAATTGCT AAGCCTGCTG TTGAAGTAAA
AATCGAGAAA GGCGTGAAAG ACGGAAATGT AAAATCTGAA ACGAAACTTC TTTCGGAGTC
TACAGCTGAT GATAAAATCA AGCAAGAGGA GCCACCAAAA GATAAGAATG CTCACAAAGT
GGTTGGTATT ACTGGAGAAA AACTTGCGGT TGCTTCCAAC GGTGTTGGTG TTAAACAAAG
CACGCCTCAG ATTGTAGTAG TCCCACCCAC CAAACCTCAG TTGTTCTATA GTCCGTTGAA
GACTGGATTG GTGTATGACG TGAGAATGAG ATACCATGCA AAAATTTTCA CTTCTTACTT
TGAGTACATT GACCCACATC CCGAAGACCC GCGTCGTATC TACCGAATCT ACAAGAAGCT
TGCAGAGGCT GGCCTTATAG TAGATAGCTC GCTATCAGGT GTCGAGGATA TAGGTCCTCT
TATGGTGAAA ATCCCCATCA GGGAAGCCAC CGCTGAGGAA ATCTTGGAAG TCCATTCTGA
AAGTCATCTC AAATTCATCC AGTCGACAGA GACTATGTCT AGAGAGCGTT TATTAGAGGA
GACCGAAAAG GGTGACTCTA TTTATGTGAA TAACGATTCG TACTTCTCAG CTAAACTCTC
ATGTGGAGGA ACCATTGAGG CTTGTAAGGC AGTTATTGAA GGCAGAGTGA AAAACTCGTT
GGCTATTGTG AGACCTCCGG GCCATCATGC TGAACCAGAA ACTCCAGGTG GGTTCTGTCT
TTTCAGCAAT GTTGCTGTTG CAGCCAAGAA CATCCTCAAG GCATACCCTG AGTCTGTACG
CAAGATCGTT ATTGTTGATT GGGACATCCA CCACGGAAAT GGAACACAAA AGTCTTTCTA
TGACGATCCT AGAGTTCTCT ACATTTCCTT ACATAGATAT GAGAATGGTA GATTCTACCC
CGGTACCAAG TACGGAGGAG CAGATCAGGT AGGAGAAAAG GATGGGGAAG GGTATAATCT
CAATATTCCG TGGAGAAACC CAGGAATGCA CGACGGAGAC TATATATATG CATTCAACCG
GGTAGTTCTT CCTGTTATTC TTGAATTTGA CCCTGATCTC ATTATCGTAA GTTCTGGATT
CGACGCTGCT GACGGCGATA TCATTGGTGG ATGTCATGTG ACACCTGCTG GATATGGCTA
CATGACCCAT TTGTTGAAGG GCATAGCTAA GGGTAAGTTG GCGGTGATTT TAGAAGGAGG
CTACAATTTG GACTCTATCA GCAAGAGTGC TCTAGGTGTA GCAAAAGTTC TTGTAGGAGA
ACCTCCAGAA GCCACAGTGT CTATGCAGCC TCATTTGGAG ACTATTGAAG TAATAGATGA
AGTCGTTAAG GTTCAATCAA GATATTGGAA GTCTTTGAGG TACGGGGTTC CTACAACTTC
ATTTGACGAT GTCTACGACT TGAACGGCAC TGGCTCTAAC TACCAATTGC TCAACATTGG
TGAGCCAATA AGAGCCAACC AAGTCAATGA GTTGTTCAAC AAGTACTCGT TTGTCAACCT
CCCCATAATT TCCAGTGCTA CTGAAGGAGG AGAAAAGAAT GGCATCTTCA GCACAGACTT
GCCATCGCAT TTGGACGATA TCATAATAGC TAGTCCAGAT ATCTATGAAA GCACAGTAGT
AGTTCTTACG ATCCACGATC CACCGGAGAT CTGGGCCAAC ATCAACCCAA TCAACGGAAG
CATTGAGGGC AATTCTTCTG TAATATTGGA ACATCCCTTG ATGCAGATAA TGGAGAAGAT
GAAGAAGGAA ACAGACAAAA GCGATTCCAA AGAAAAAATT GGCTACATAG ATATCAACGT
TCCGTCATAC CAGTTGCCCA TTCCCTTTGG AAATTCAAAG CAGACTTCTA CCTATAATCC
CACATTTTTC GCCCAGGAGC TCTTGCTTTA CATCTGGGAC AACTATTTGG CCTACTTTTC
GCAGTTAAAG AAGTTAGTGT TTGTCGGCTT TGGTGATTCG TACCAAGCTA TAGTACATCT
ATATGGTAAA CGACCATCTC AGGATATCAA AGATTTAGTT AAGGGTACGG TAGCCTTCGT
GAACAGGTCG AACTTGAAGG CTTTGGTCCC AGTGATGGAT GAGTCAATGG TGGACTGGTA
CTATCAGAAC TCTGTGATCT TCACCAGTTG TTCGAATCCA TGTTGGGTTA ATCTGAACGG
AACTACTCGT TTAGGAAATG GTTCCACTGA AGCAAACGGA GGTGACGATA GCAACAAGAG
ACCAAGGAGA AAATTTGGCA GAGTGTTGAA GGCATCTGTG GATGGCTTGT ACGATATAAT
CGCCGAAAGA TTCGACGAAG GTGTTGACTT CATCTTGGAT TCCATCGAAG AGTACTCCAG
CAGCGAGAGC AGCAACTGAG TTGCAATATG GCGACTTTGT AATTGCTCTC TAATGCTGGC
AGCTTTTAGT TCTCCTTATG TATTATACGA ATAGATGATA AATCATTATC TATTTTACAG
TAGTCATTAT GTATTATGCG AATAGATGAA TAATTTTCAT
 
Protein sequence
MSEEPTGVAP ESAPKKRLLE DADTDTKSTS NTSVSGTVEN STDINFHPDT NSTEASKGNP 
DQIPVSNGDD HKRIKLEPKV GNESSIVVVP PTKPQLFYSP LKTGLVYDVR MRYHAKIFTS
YFEYIDPHPE DPRRIYRIYK KLAEAGLIVD SSLSGVEDIG PLMVKIPIRE ATAEEILEVH
SESHLKFIQS TETMSRERLL EETEKGDSIY VNNDSYFSAK LSCGGTIEAC KAVIEGRVKN
SLAIVRPPGH HAEPETPGGF CLFSNVAVAA KNILKAYPES VRKIVIVDWD IHHGNGTQKS
FYDDPRVLYI SLHRYENGRF YPGTKYGGAD QVGEKDGEGY NLNIPWRNPG MHDGDYIYAF
NRVVLPVILE FDPDLIIVSS GFDAADGDII GGCHVTPAGY GYMTHLLKGI AKGKLAVILE
GGYNLDSISK SALGVAKVLV GEPPEATVSM QPHLETIEVI DEVVKVQSRY WKSLRYGVPT
TSFDDVYDLN GTGSNYQLLN IGEPIRANQV NELFNKYSFV NLPIISSATE GGEKNGIFST
DLPSHLDDII IASPDIYEST VVVLTIHDPP EIWANINPIN GSIEGNSSVI LEHPLMQIME
KMKKETDKSD SKEKIGYIDI NVPSYQLPIP FGNSKQTSTY NPTFFAQELL LYIWDNYLAY
FSQLKKLVFV GFGDSYQAIV HLYGKRPSQD IKDLVKGTVA FVNRSNLKAL VPVMDESMVD
WYYQNSVIFT SCSNPCWVNS NGTTRLGNGS TEANGGDDSN KRPRRKFGRV LKASVDGLYD
IIAERFDEGV DFILDSIEEY SSSESSN