Gene PICST_61383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_61383 
Symbol 
ID4839864 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1262851 
End bp1264001 
Gene Length1151 bp 
Protein Length250 aa 
Translation table12 
GC content42% 
IMG OID640391179 
Productpredicted protein 
Protein accessionXP_001385936 
Protein GI126138826 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0638] 20S proteasome, alpha and beta subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.235012 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GGCACTGGTT ACGATTTATC CAACAGTGTT TTCTCTCCAG GTAAGTATAG GAAATTCTTG 
GTGGCATGAT ACGAATGGCA ATCTATGATA TGGTGATATG ATATTGTGAT ATTATGATAC
TTTACGGTGT TGATAGGTTG ATAAGATGTG ATAATACGAT GGAAGCTATC GCAAATTATG
TGGTGATGAA ATTGAAAGTA GTGTGATGGA TATCGATATA TCTATTGGAA TTGATTGAGA
CTTGTCCCTT GTACTTCTTT GATGTTGAAA TAATGTTCAT TTTACAATTC CTGATGTTTC
TTGACATTGT AATCCTACTG GTATCCACAT TGACATTATT CCTTCTTTCA TATCGTGTTG
ATATCGATAC CATATTTCAT TTCATTGACG TCATCAATAC TTCGTATTCT TCAAAGTTCG
GTTTACTAAC ACATTTAGAT GGAAGAAACT TCCAGGTCGA ATACGCCATG AAGGCAGTGG
AAAACGGCGG AACATCTATC GGAATCAAAT GCAAAGACGG AATAGTGTTG GCAGTAGAAA
AGATCATAAA CTCGAAGTTG TTGGTCCCTG GCAAGAATAA AAGAATCCAG ACCATTGACA
GACACATTGG TGTAGTGTAC TCGGGGCTTT TACCCGACGG CCGTCATTTT GTGAATAGAG
GCAGAGACGA AGCTCAGTCG TTCAAGTCCA TATATAAGAC TCCAGTTTCG GTGCCTCACC
TCATGGACAG ATTGGGCATC TACGTGCAAA ACTACACTTG CTACAACTCC GTTAGACCGT
TTGGTGTTGT TTCTATTGTA GGAGGAGTAG ATGAAAACGG ACCCCATTTG TACATGATTG
AGCCCAGTGG CTCATGCTGG GGCTACTCTG GCGCTGCAAC TGGTAAAGGT AGACAGACAG
CCAAGTCTGA GTTGGAAAAG TTGAACTACG ATGAGTTGAC TGCGAGGGAA GCCGTCAAGT
CTGCTGCCAA GATTATCCAT CTCGCTCACG AAGACAACAA GGACAAGGAC TACGAGTTGG
AAATCTCCTG GATTTCTATT GAAGAAACTA AGGGAAGGCA TGAATTCGTT CCTGATGACT
TGTTCGAAGA AGCTAAGAAG TATGCTGAAG AGGACGATGA AGAGGACGAG GACGAGGAGA
TGGAATCGTA G
 
Protein sequence
GTGYDLSNSV FSPDGRNFQV EYAMKAVENG GTSIGIKCKD GIVLAVEKII NSKLLVPGKN 
KRIQTIDRHI GVVYSGLLPD GRHFVNRGRD EAQSFKSIYK TPVSVPHLMD RLGIYVQNYT
CYNSVRPFGV VSIVGGVDEN GPHLYMIEPS GSCWGYSGAA TGKGRQTAKS ELEKLNYDEL
TAREAVKSAA KIIHLAHEDN KDKDYELEIS WISIEETKGR HEFVPDDLFE EAKKYAEEDD
EEDEDEEMES