Gene PICST_71233 details


Gene Information

Locus tag: PICST_71233
Symbol:
ID: 4837845
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 430019
End bp: 431143
Gene Length: 1125 bp
Protein Length: 344 aa
Translation table: 12
GC content: 42%
IMG OID: 640389160
Product: predicted protein
Protein accession: XP_001383360
Protein GI: 150864516
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only
COG ID: [COG0500] SAM-dependent methyltransferases
TIGRFAM ID:
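
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the variable names are ours, the coordinates are assumed to be inclusive, and a single stop codon is assumed to follow the last residue. Under those assumptions the gene span is longer than the coding length, and reading the difference as untranslated sequence is our inference, not something the record states.

```python
# Values copied from the record above; variable names are illustrative.
start_bp, end_bp = 430019, 431143
gene_length_bp = 1125
protein_length_aa = 344

# Assuming inclusive coordinates, the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Coding length: one codon per residue plus one stop codon (assumed).
coding_bp = (protein_length_aa + 1) * 3

# Remainder of the gene span that is not translated (an inference).
noncoding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, noncoding_bp)  # 1125 1035 90
```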


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.532537
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AAAACTTACT AAGCGGGTAG AAATGACAGT AGTAACACCT GTGGAGTCAG ACAGTGAAAA 
TTTGGCCTTC TCTGAGTTGA AAATCGAAGA CCTGAAGCTG CAAATCCAGG CTGAGGCTCC
AGAAAAAGAA TCCAAACCAC CTTTAGAGTC TAGAATAGGC AAGGATTCTC CCTTCACCTT
CGGACAAAGA TACTTGAAAA GTGACGAGGA TGTTTTCAAC CATAATGCAT GGGACCATGT
GGAATGGGGT GAAGAACAGA TCGAAGAGGC CAAGTCTATG ATAGCTAAAC AGTACGATCA
TCCTGTAAAG GACTTTGACA AGAAGCTCTA CAATTCCAAC CCAGCCAAGT ATTGGGACAT
TTTCTACAGA CATAACAGAG AGAACTTTTT CAAAGACAGA AAGTGGCTTC AAATCGAGTT
CCCATCTTTG TATCAGGTTA CGGCTGAAGA CTACCAGGAA AAATGTACAA TTTTGGAAAT
CGGATGCGGT GCTGGAAATA CATTTTTTCC AGTATTGAGT CAGAACAAGA ACGAAAACTT
GAAGATTGTG GGCTGTGACT ATTCGAAAGT GGCCGTAGAT TTGGTTCGCT CTAATGAACA
GTTTGCTCCT AACCATGAGA AGGGTGTAGC ATTCTCGTCA GTTTGGGATT TGGCTAATCC
TGAAGGACAG CTTCCTGAAG ATGTAGAAGA AAACTCGGTG GACATAGTCA TTATGGTTTT
TGTGTTTCTG GCGCTTTCAC CTGACCAATG GAAGCAGGCT GTCTCCAACT TGGCCAAGAT
TTTGAAGCCC GGTGGAGAGA TTCTCTTCAG AGACTATGGC AGATACGACT TGGCCCAAGT
CAGATTCAAG AAGGGAAGAC TCTTGGACGA TAACTTCTAT ATTAGAGGAG ATGGTACTAG
AGTGTATTTC TTTACGGAAG AGGAGTTGAG ACAGATATTT TGCATAGACG GTCCTTTCAC
CGAAGAGAGA ATTGCCACCG ACAGAAGATT GTTGGTGAAT AGAAAGAAAC AGTTGAAGAT
GTACCGTAAC TGGTTGCAGG CTGTGTTCAG AGGATAACGG TAATTGTAAA TTAGAGCTAA
GGAAATTAGA ATTATTGTAA ACTAGAACTA TTAGAATGAA CTTTT
 
Protein sequence
MTVVTPVESD SENLAFSELK IEDSKSQIQA EAPEKESKPP LESRIGKDSP FTFGQRYLKS 
DEDVFNHNAW DHVEWGEEQI EEAKSMIAKQ YDHPVKDFDK KLYNSNPAKY WDIFYRHNRE
NFFKDRKWLQ IEFPSLYQVT AEDYQEKCTI LEIGCGAGNT FFPVLSQNKN ENLKIVGCDY
SKVAVDLVRS NEQFAPNHEK GVAFSSVWDL ANPEGQLPED VEENSVDIVI MVFVFSALSP
DQWKQAVSNL AKILKPGGEI LFRDYGRYDL AQVRFKKGRL LDDNFYIRGD GTRVYFFTEE
ELRQIFCIDG PFTEERIATD RRLLVNRKKQ LKMYRNWLQA VFRG
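
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below is a minimal illustration of how the start of the protein sequence follows from the gene sequence under that table. It makes several assumptions of ours: only the first two lines of the gene sequence are used, the coding region is taken to begin at the first ATG, and the helper names (`CODON_TO_AA`, `translate`) are hypothetical, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 12, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# It differs from the standard code at one codon: CTG is S (Ser) instead of L (Leu).
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), TABLE_12)}

def translate(dna):
    """Translate complete codons from the start of dna; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, usable, 3))

# First two lines of the gene sequence from the record, spaces removed.
gene_start = (
    "AAAACTTACTAAGCGGGTAGAAATGACAGTAGTAACACCTGTGGAGTCAGACAGTGAAAA"
    "TTTGGCCTTCTCTGAGTTGAAAATCGAAGACCTGAAGCTGCAAATCCAGGCTGAGGCTCC"
)

# Assumption: translation starts at the first ATG in the gene sequence.
orf_offset = gene_start.find("ATG")
peptide = translate(gene_start[orf_offset:])

print(orf_offset)  # 22
print(peptide)     # MTVVTPVESDSENLAFSELKIEDSKSQIQAEA
```

Residues 24 and 26 of this peptide (the two S in IEDSKSQ) come from CTG codons. Under the standard code they would be read as leucine, which would not match the protein sequence listed above.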