Gene PICST_86017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_86017 
Symbol 
ID4851372 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1629207 
End bp1631408 
Gene Length2202 bp 
Protein Length654 aa 
Translation table 
GC content43% 
IMG OID640393080 
Productpredicted protein 
Protein accessionXP_001387545 
Protein GI126274413 
COG category[R] General function prediction only 
COG ID[COG5141] PHD zinc finger-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTCACCCAGA TCTTCGTATA CACCAAGATG TCGTATGTGA ACGACCCAGA TCTCGTGGTC 
ACGCCTCTGT ACTACTCTGG TGGAGTTCCT GTGTTTGCAC CCACTATGAA GCAGTTCGAA
GACTTCTACA AGTTTAACAA GGCCATCAAC AAGTATGGAA TGGAACTGGG AATCGTCAAG
GTAATTCCGC CACGCGAATG GGGTGCTCTG CTTCCACGAG CATACACTCG TAAGAACCTT
CTGAGGGCCA ACATCAAGAA CCCCATAGTG CAGCAGATAT CCATCACAGG TGCTGGAGTG
TACTCTATCC AGAACATCGA GAAACAGAAG AAGTACAACA TTTTCCAATG GAAAGAACTC
TCAGAAAGAT CCAATTACCA GCCTCCAACA CACAGACGTA GATCCTCTCC GGACAAAAGT
GAGAAGGAGG CCGAGAAGCC AATTGCAAAT AATTCTATAA ATTCATCATC TGATTCTAAT
TCTAATCTAG TCTCAGATTT AATTCCAAAT TCAACTCCAA ATTCAACTCC AAGTTCAAAT
GCAAATTACA GTACAAATTT CAATACAAGC ACTAACGATG TCAACGATTC ACCCTCCAAA
CGATTACGAC ACCATCATCA CTATGACCCC TTGTACAACA TCGACTCCTC GCAATTTACA
CCAGAACGAT GCGAAGAACT AGAGAAAACC TACTGGCGTA CTTTGACTTA CGCGGAGCCA
ATGTATGGCG CCGATATGTT AGGATCGATA TTTCCCTCCA ACTTCAAGAG TTGGAACGTG
GACCATTTGC CCAATATTCT TGATCTTATG GATACTCGTT TGCCTGGGGT CAACGATGCC
TATCTATATG CTGGACTCTG GAAGGCCACC TTTGCTTGGC ACTTGGAGGA TCAGGACTTG
TACTCTATCA ACTATTTGCA TTTCGGTTCT CCGAAACAAT GGTACCTGAT TCCGCAGGCG
GAACTGGGCC GTTTTTTCAG CTTGATGAAG GAAACTTTCA CTGATGATTA CAACCATTGT
CATGAGTTTC TTCGTCACAA GACGTTTTTA GTATCTCCGA GTTTTTTGGA CAAACACGGA
ATCAAATATA ACAAGATCGT TCACAACGAA GGCGAGTTCA TGATCACATA TCCATTTGGA
TACCATGCCG GCTTCAATTA TGGCTACAAT TTAGCCGAAT CCGTCAACTT TGCTCTTGAC
GATTGGTTTC CCTATGGTGA AGTAACCAAG AAATGCGAGT GTATCTCCGA TTCTGTTGGC
ATCAATGTCA AGCAACTCTA TTGTAAGTTT CAAGGAATTC CCTACAAGTC AGAGCTGGAC
GGTATAGACA CCGAAGCAAC GGAAACCGAA GCAGAAACGT CACAGGAACC AGACGATATC
AACATCAAGA TCGAGACCAT AGACACAATC ATGGCAATCC ACTCAAAAAA ACGGAAGAAT
TCAGAAAACT CTAAAGAAAC TACAAGAAAA AAGCACAGGA AAGTTTCACC CGCAGAAAAC
ACAGCTTCAA TAACCCCCAA AGAGACTAAA CAGACCAAGC AAACAATCAA GAACGCAAAT
TTTGAATGCT ATTTGTGTCC CAATATATTC AATTCAGCGC AAATATCTCT GAATCCATTA
TTTGAGTTGG TTAGCACAGA TTTGCACGAC TCGAAGACCC ATCAGTACTA CAAAGTGCAC
AGAATATGTG CTGGCATGTT TCCCAAAGAG TTGAAGACAT CTACAAATAG CGCTGGGGAC
GAAATTGTGT TGGGATTGAC TAATATATCC AAGAACCAGA TGAAGCTCAA ATGCAAGGTC
TGTAACAAGG GAGCATCGAA AAAAGAAGCA GCAGTTCACG GAGCCTGCTT TCAATGCTCG
TATCCTAAAT GCACCCGAGG ATTCCACGGC ACATGTGGAG TTTCTGACGG AGTGATATAC
TCATTAGAAA ACCACAAGCC TCTTTGTAAA TATCACCGAA GCAAGAAGGT TTCTTCAGAT
AGATATGATA AGCTCTCCAA GATTTCTGGC GACAACGGCT CCTGGGTACA ATTCTCGTTC
AACACAAATC AGTACTTTGG TTTGGTTCTC ACCAACAACC TTGACGAACA TACGCTTACG
CTTCTAGTTT ATCCAAACTT GAACGATCGT ATTGAGGTGC CATATGATTT TGTCTTAACA
GGCGCTACGT TAAACCAGCT CGATAATCTG GCATTCATCA AC
 
Protein sequence
MSYVNDPDLV VTPLYYSGGV PVFAPTMKQF EDFYKFNKAI NKYGMELGIV KVIPPREWGA 
LLPRAYTRKN LLRANIKNPI VQQISITGAG VYSIQNIEKQ KKYNIFQWKE LSERSNYQPP
THRLSDLIPN STPNSTPSSN ANYSTNFNTS TNDVNDSPSK RLRHHHHYDP LYNIDSSQFT
PERCEELEKT YWRTLTYAEP MYGADMLGSI FPSNFKSWNV DHLPNILDLM DTRLPGVNDA
YLYAGLWKAT FAWHLEDQDL YSINYLHFGS PKQWYLIPQA ELGRFFSLMK ETFTDDYNHC
HEFLRHKTFL VSPSFLDKHG IKYNKIVHNE GEFMITYPFG YHAGFNYGYN LAESVNFALD
DWFPYGEVTK KCECISDSVG INVKQLYCKF QGIPYKSELD GIDTEATETE AETKVSPAEN
TASITPKETK QTKQTIKNAN FECYLCPNIF NSAQISLNPL FELVSTDLHD SKTHQYYKVH
RICAGMFPKE LKTSTNSAGD EIVLGLTNIS KNQMKLKCKV CNKGASKKEA AVHGACFQCS
YPKCTRGFHG TCGVSDGVIY SLENHKPLCK YHRSKKVSSD RYDKLSKISG DNGSWVQFSF
NTNQYFGLVL TNNLDEHTLT LLVYPNLNDR IEVPYDFVLT GATLNQLDNL AFIN