Gene PICST_67483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_67483 
Symbol 
ID4838695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp336703 
End bp339886 
Gene Length3184 bp 
Protein Length657 aa 
Translation table12 
GC content47% 
IMG OID640390010 
Productpredicted protein 
Protein accessionXP_001384019 
Protein GI150864981 
COG category[A] RNA processing and modification
[D] Cell cycle control, cell division, chromosome partitioning
[K] Transcription 
COG ID[COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CACCAATAAG TTATAAGAGC AAGCAACGAG CTGTCCGGCC CGACGCATCT AAAGCACGAC 
ATCTCTCGAG GTACGACATT CCCCTCGACG CCCCAAAACC GCAACCCACA CCCAAGAGGC
CCACATCCAC TGGCTAGCTT AACTAGACGT CGCACGACAT TGTACCGCGA CATCCCACGA
CACGACAACC CCCACGGCCG CCTATCACTG ACCCAGCGTT TCTGTCAGTG ACTTGGTCCA
GAATCAGATT ATTATCTGCT CCATATAGTA ATCGTCCTTC CTAGGCTTCA AAAGTCAAGG
AGTAAACGAC GACATTGGAG GCCCCAGAAA ACCCTTCCAA ATATAAGGGG CACCAGCACA
ACAAAGTGGG ACCCACCCAG ACAAGTTTTA GACTCCACCA AAGGTCTAGC CGTGACAATT
TGTAGATCTC CAGTCACTTT ACGACATTAC TTTGACTAGG CTCTAGCACC AACTCTTTCT
CGTCCCCACA TTGCCACCAA TCTCATTGAT AGCCACTAGC ATATTCAGTC ATCTGGTTTG
AAAATTTCTC CACTACGTCA AATTTTCCAG CGAAGCAGAC TAAAGTCACC CTATCCAGAA
GTCTAGCAGG GAAAATATAT AAGGAGCCCT CTTTATATTA TCGAGACATC CGTAATTTCC
ATACCCCGTT CTTGACACTA TAGGAGAGTT TGTTCTTGAT CCTTTTGACA TTTATTGATA
AATCTTACTT TTCCTTGATC TTCCAGATCT TTATCCATAC TTGGTAGTCT CATTTATATA
GTAGTCCATT TGGCTCGCAT ACCGTTTTAC GCTACCTTTC CTCATACAAA ATTAATCCCT
GTTTTGGACC CAAAAATTAA CTAAAAAGCA TCTGCCATTC CACCATTTAG CAAGTAGCTA
ATCCAAATTA CCCCCTACAG CGATATCCTT CAGTGATATC TGCAGAGCCC CCACAAGAGC
CCCCCATCAT TCAGTCCCAG ATATATATTC GCACACCGGT GCTCACCATG AATTACCATA
ACTCAAACCC CGGCGGGCCC GACGCTACGT CGGCTTCCTC AGCACCAGCA TCCACAGCGT
CGTCAGCATC GGCAGCCGCT TCTGCATCCT ATTACTACGC TCCGGTACAA CAGTACCAGC
CGCAACCGCT CCACCACACC ATCCAGCCTA ATGGCCCGAA CTCGGTTCCA ACCACCCCAG
GAGCCCCCAC AAGAAGAGGT CCATGGTCTC CTATGGAAGA CAAGAAGCTT CTTGATCTCA
TCAACATCTT TGGTCCTACC AATTGGGTGC GCATCTCTAA TAGCATTGGA ACGAGAACGC
CGAAGCAATG CCGCGAGCGC TACCACCAGA ACTTGAAGCC GCTGCTAAAC CGTCTGCCAA
TTACAGTAGA AGAAGGCGAG TTGATCGAGC TGTTGGTAGC AAAATACGGT AAGAAATGGG
CCGAAATTTC TCGTCACTTA AATGGCCGTT CCGACAACGC CATCAAGAAT TGGTGGAACG
GAGGAGCCAA CCGTAGAAGA AGAGCCTCGT TGGTGCACGA ACCGAATGTG GCTGGTAACA
GCAACAGCAA TAATAATAAC AACTCGATGA GCAGCTCTAA CGGTAGTAAC GTCAATGCCA
ATGGCCTCAA TAGTAGCACT AGCATCTCGA CGATGTCGGC ATCCACTTCT GCATCCACCA
ACTCCACACG CTCGCAAAAC GGGTCGCTCT CCAGTCCTTC AGGACTTCCC ACGCTTACCC
AGAACAAGAG TTCTGCTAGC TTGCCTGAAG CAGTGTTGTC TGTCAGCTCG GCACAGGTCC
ACCAGTCAAA TTCGCTGCGC TCATCACTCA ACGAGCCTTC GCTCTCGGCA AACTCATCTG
CACTCAACTT ATCTGCTGCA AACCCGAACA GTCTTTCATC CTACGCCATC ACCAATAACC
ATAACACCAA CATCAACAAT ACCAGTAACA ACTCCACAAT ATTACCACCT CCTATCGGAG
CTTCGCAGCC TTCGACTTTC CCCCAGATTC CTCAGCTTCC CCAGATTTCA TTTAACACAT
CCATGTTTGG TAAGCCTGAT GCGCTGTTCA AAGCCCATAC ACCTCCTCCT GGGTCTATGG
CTGCAGCCGT CCCTCATACC ATGACTTCAC CTGTAAAAGC GACTTCACTT AGATCTGCTA
GTTTTGACGT AACCTCGGCT ACTGGAGCCA CGAACGCATC CACTTCAACT CTTACATCCA
CCACTCTTCC TCCAATTTCT TCATCTAACA AGAGAAGACT CTTGGACGAC CCCATCAGTA
GAAGACATTC CACTGCAAAC TACCACTATG CCCATCCCAA CGGAAACACA AATAACAATA
ATAATAATAA TAATAATTTC GCAGTTCCGA CTTCTTCGGC TCCTGGTTCG GCATCTGCTG
CTACAGGTTC AATTATCGGA GGTGCTGGCA CCGTTTCTCC CTCGTACTAT GGCTCGCCAC
TACTTCTCAG TACTCAAGTA TCGAGAAACA ACTCGATCTC ACACTTTGAG TTTCTGACGT
TGAACTCAAC CTCGCATTCT CTGAGAAGAT CGAGTTCGAT AGCACCAGAC TTCTTTCCAA
ATCCATTAAA GGAGCTACAG GCAGCCGCAT CTTCCTTGAA CAAGGAGGGA AACGTGAACC
ACAAACGTAA CATGTCGCAG AACTCGTCGT TCAACTCTCC TTCTTTGACT CCTTCTACCC
GTTTCTCCAT CTCATCAACT ACCTCCCTTT TAAATAACAC TTCTACCAAC TTGACAATGC
CATCAGCCAC AACCCTGCCT TCCAGCAATT ACAACGGTCT CAAGAACGAT CATTCTTCTA
GCAGTGGGTC CATTCCAGCA CTCAAAGAAG AAGTCGAGTT GAAGTTGAAG CACAAGAACG
ACTTGGACGA TGTAGACATG GACGACTCCC ATAACCACTT GCAAAATCCC AGGACAACCA
TGGTGAAAAC CAAGATCTCG GTTCTGAGCC TCATTGATTG AATGAGCGGT GGATCCGTTC
TCTCGGACTG TTTTCGTACT TCCATCTGTA ACAATACTTG TTTTCCTTAT AGACCTCCAA
AAACAGTTTC TTCAATTCTT GGTTATCTAT TTTTCAATTC TCGGGAGTTT ACTCTGTAAC
TAATAATTTG ACTATTACTT CAAGAAAATG AAATACATAT TTAAGAATAC ATGCATCGTT
CTAC
 
Protein sequence
MNYHNSNPGG PDATSASSAP ASTASSASAA ASASYYYAPV QQYQPQPLHH TIQPNGPNSV 
PTTPGAPTRR GPWSPMEDKK LLDLINIFGP TNWVRISNSI GTRTPKQCRE RYHQNLKPSL
NRSPITVEEG ELIESLVAKY GKKWAEISRH LNGRSDNAIK NWWNGGANRR RRASLVHEPN
VAGNSNSNNN NNSMSSSNGS NVNANGLNSS TSISTMSAST SASTNSTRSQ NGSLSSPSGL
PTLTQNKSSA SLPEAVLSVS SAQVHQSNSS RSSLNEPSLS ANSSALNLSA ANPNSLSSYA
ITNNHNTNIN NTSNNSTILP PPIGASQPST FPQIPQLPQI SFNTSMFGKP DASFKAHTPP
PGSMAAAVPH TMTSPVKATS LRSASFDVTS ATGATNASTS TLTSTTLPPI SSSNKRRLLD
DPISRRHSTA NYHYAHPNGN TNNNNNNNNN FAVPTSSAPG SASAATGSII GGAGTVSPSY
YGSPLLLSTQ VSRNNSISHF EFSTLNSTSH SSRRSSSIAP DFFPNPLKEL QAAASSLNKE
GNVNHKRNMS QNSSFNSPSL TPSTRFSISS TTSLLNNTST NLTMPSATTS PSSNYNGLKN
DHSSSSGSIP ALKEEVELKL KHKNDLDDVD MDDSHNHLQN PRTTMVKTKI SVSSLID