Gene PICST_68096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_68096 
Symbol 
ID4840057 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp823937 
End bp825898 
Gene Length1962 bp 
Protein Length578 aa 
Translation table12 
GC content44% 
IMG OID640391372 
Productpredicted protein 
Protein accessionXP_001385858 
Protein GI150866309 
COG category[A] RNA processing and modification
[D] Cell cycle control, cell division, chromosome partitioning
[K] Transcription 
COG ID[COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0298848 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CAACCATCTA CCATAACGGC TGCAAAAAAT ACCAATTGGC TGCAACATGG ACGAGCAGCA 
GGAAATCGGA GGAGCACATG CTCTCCTCCA ACTAGGCTCA AAGCAGAAGG ATGATGGAAT
AGACGACAGC ATTGATGCAA TCGGCAATAA TGAAAACAGC AATGTCAATG AAGATGTAGA
TGACGAAGGA GACCTTTCAG ACTCGAAGAA CCAGCAGATT AATGATGCAG TAGAGGCTGC
TGTCATGAGG TATGTGGGAG GAACTTTGGA CTCTGCAGAG CATGAGAGCA AAAGAAGCAA
ACGGAAGATC CACGATGAGA TCATCAACAA TATCCACGAG TTCAACCAAT GGACTGGATT
CTTAGAAGAA AATATCAGCG ACCATGGCAA CGAGGACTAC GATGCTTATA CGTCTCAACA
GCAGTCTCAG CCGCTCCATC TGAACAAAAG AAGCAAAGGA AAGAAAAGAA GAACTCAACT
GGGAACAAGC GACATCGATC CAGAACTCGA GGCTTTAGGC ACAACTGAAC ACGACCAATT
AGTAGAGGCA GCAATAATTG ATGCCAGAGA ACTTGCCAGA CACATCAATG AACAGGGAGC
CGGAACTGGG AACCTAAATC TCAGCCACCA CCAACAGCAC GCTGGATCTG ACTCAATCAA
TGCGATTACT CAACTTGCTC AGGCTGCTAC ATCGTTATCC GAAACAAAGA AAGCAAAACT
CAAGAGAAAA GATGGGGAGT CATACCAGAT GAAGAACATT GCCTTACGAC CCAAGTTCAA
CAACTTGACC AGTGTAGAAA CTCTCATAGA GGAAGCCTCG GCGCAAGCCT GTGAATGGTT
CAATTCCCTA CCTGACACTA CGGGTAAAGG TCCACGTATG TTTTCGGCAG AAGAGATGAG
TGCAGTAGAC CATTTCGTAG CAGGTTATTG CCATTTGAAC AAATGGACAA GAGAAGATGT
GTGTAACAGA GTGTGGTCCA ACGAGAGAAA GAAAGACAAT TTCTGGGAGT CTTTAGTACG
AGTACTACCA TACAGATCCA GAGCTTCTGT GTACAAGCAT GTGAGGAGAA TCTACCATGT
GTTTGACGTC AGAGCAAAGT GGACAGAGGA AGATGATGCC TTGCTTAAGA AACTTGCACT
TACACATGAG GGTAAATGGA AACAAATTGG AGAAGCTATG GGTAGAATGC CGGAAGATTG
CCGTGACAGG TGGAGAAATT ACGTCAAGTG TGGAGACAAC AGAACATCAA ACCAATGGTC
ACAAGATGAA GAGAACGCCC TTAAGCAGAT CGTCACTGAT ATGTTTCAAC AATCTGGAAA
CAAAGAGTAT GCGTCTATTA ATTGGACCGT TGTTAGTGAA AGAATGAATG GAACCAGATC
TCGTATTCAG TGTCGTTACA AATGGAACAA GTTGGTAAAG AGGGAAACTG CTCTTCGTGC
AACGTACATG AATTCCGACA CAAAACTCTG GATGCTCAGA AAACTCCAGA GCTCTGGCTG
GGATTCTGTT GATAGTGTCG ATTGGACTGA AGTAGCCCGT TTGCATAGGG AAGAGAATGT
TAAGCAGGAT GAGAATGGCT ACCAGTGGGA TGCGCCAGAT TTCAAGGCTA GTTTTGAGAA
GATGAGGTCA GAAGTGAGAG ACCACAAGAG GCTTTCATTT GTTACCATTT TGATGCGTTT
GATTGAGGAT TTGGAAGGAC ATCCAAAGCT TATAGCTCAA CATTTGAGAG AAAACAAGGA
CAACAGTAAC AAGCTCTACT ACGACCGCCA AAACAAGACC AAGAACGACA AAATAGTTGA
TCCAAACGAT CCTGAGTCAA TAGCAACAGC AGCTGTTGCT GCGGTCTCGT CAGGTGTAGA
CGGTGTTGAT GCCCAGCAAC AGGCATATAG CTTATGGCGA TAGAGGCTAA TTTCTTAATC
TAGTTGTACG TATTGATATA CTTGTAAATA TAGAGTTACA TT
 
Protein sequence
MDEQQEIGGA HALLQLGSKQ KDDGIDDSID AIGNNENSNV NEDVDDEGDL SDSKNQQIND 
AVEAAVMRYV GGTLDSAEHE SKRSKRKIHD EIINNIHEFN QWTGFLEENI SDHGNEDYDA
YTSQQQSQPL HSNKRSKGKK RRTQSGTSDI DPELEALGTT EHDQLVEAAI IDARELARHI
NEQGAGTGNL NLSHHQQHAG SDSINAITQL AQAATSLSET KKAKLKRKDG ESYQMKNIAL
RPKFNNLTSV ETLIEEASAQ ACEWFNSLPD TTGKGPRMFS AEEMSAVDHF VAGYCHLNKW
TREDVCNRVW SNERKKDNFW ESLVRVLPYR SRASVYKHVR RIYHVFDVRA KWTEEDDALL
KKLALTHEGK WKQIGEAMGR MPEDCRDRWR NYVKCGDNRT SNQWSQDEEN ALKQIVTDMF
QQSGNKEYAS INWTVVSERM NGTRSRIQCR YKWNKLVKRE TALRATYMNS DTKLWMLRKL
QSSGWDSVDS VDWTEVARLH REENVKQDEN GYQWDAPDFK ASFEKMRSEV RDHKRLSFVT
ILMLDPNDPE SIATAAVAAV SSGVDGVDAQ QQAYSLWR