Gene PICST_33854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33854 
Symbol 
ID4841161 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp626487 
End bp627827 
Gene Length1341 bp 
Protein Length435 aa 
Translation table12 
GC content40% 
IMG OID640392476 
Productpredicted protein 
Protein accessionXP_001386521 
Protein GI150866802 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.893021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.424834 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCTGGC CCAAGCATAC ATTTCCAAAG TCGGTTGCTG CTATCCTTAG GAAAGGGCTT 
TGGGCGGAAA GTGAAAAGGG CGAAAATGAC TACCAATTAG CTCTAAAGTA CTATTTAGAA
GCTCTTGAGC ATTGTAATGA AATTGGCATG GATACTCTTT CTGATGAATA CACGGGAATC
CAATTGAAAG TAGGAGAAAT GTTTGAACGT CTCAACATGC CCCAGGACGC AGCATTTGTA
TATAACGAGA TTGCCACGTT ATATTTGATG GTTTTGACAG CAACACCAGA GTCAGAACAA
GGTAGAAGAA TCAACGACAG AGAGCATAGA CGTCATCTCA TCCAGAAAGA TTTGAGAATC
GCAACCAAGT TGGTAGAGTT GAACCGAGAC AATCCACAAT TGTGTCGAGC AATCTTGATT
ACCCACTTGA TTATTGCTCA GGACGAAGTT AGAAAACAGT CTCCATCGTC TGCTAGCCAA
TTGGCTAAGT TAACCCTGCC TGATGAGGTG CACACAACAG ACAACTATAA GGCTACTGTA
CATGACGACT CAATAGTAAT TAGGAATGGA GATGTAGTCA CTAGTTTCAA AAAGAGTCCT
GAACTATGGG AGCCTTTTGC AGAGGAGTAC TTCAATGCCA TGGACTTGTT GGGTGCTTTC
TGTATTTCAC TTGGAGACTT ATCCATGGCT TCTAAAGTGA AGATATCCAT GACTGAGTCC
ATGTTATTGG CAGATGTTGA ACCACACAAG ATTCTTCTTT CCCAGTGTAA TCTAGGATCC
TTGCTTTACT TGCAGGCTGA AGAGTTTCAA GCGCAAGAAA TTGCATGGAG AAGAAAATTC
TCCCAACAAT CGGGTATTGA ATACGAGAAG ATCAAGAGTG AAGAGTTGTT AAATAATCTT
TCCAACTCAG AACAGGTTCA GAAGGAGTTG GAAAAAGCTA TCCCTGCTGC TGATAAAATA
AAATATGAAG AATCCATTGC TTCAAAAGAT AAATGTCTCC AATTATCAAT CAAATCCTAC
GAATCAGTAC TTGAGTTTGC CAAAGGTTTA CCTCAGGAAA TCGTCAAGGG CAATACCGCA
GTCGGTGAAG GAGTAGCATT AGCCACTTAT GGGTTAGGTG TTGTATATCT TCATCTTTCG
CAATATGATA AAGCTGAGAG ATTGTTGAGA GAGTCGAGAG TTAGGTCGAA GAACTGTGGC
TACGATGAAT TGATCACTCA AATTGAACGT GAATTGAATA AGTTATTCAA AGAAAAGAAG
AATTTGAAGA TTGCAGATCC TAAGAATCCA GCCCCTACTG ATGAAGACAT TGAGATAGAT
ATCCTCTTGA AGAAAACATA A
 
Protein sequence
MFWPKHTFPK SVAAILRKGL WAESEKGEND YQLALKYYLE ALEHCNEIGM DTLSDEYTGI 
QLKVGEMFER LNMPQDAAFV YNEIATLYLM VLTATPESEQ GRRINDREHR RHLIQKDLRI
ATKLVELNRD NPQLCRAILI THLIIAQDEV RKQSPSSASQ LAKLTSPDEV HTTDNYKATV
HDDSIVIRNG DVVTSFKKSP ELWEPFAEEY FNAMDLLGAF CISLGDLSMA SKVKISMTES
MLLADVEPHK ILLSQCNLGS LLYLQAEEFQ AQEIAWRRKF SQQSGIEYEK IKKQVQKELE
KAIPAADKIK YEESIASKDK CLQLSIKSYE SVLEFAKGLP QEIVKGNTAV GEGVALATYG
LGVVYLHLSQ YDKAERLLRE SRVRSKNCGY DELITQIERE LNKLFKEKKN LKIADPKNPA
PTDEDIEIDI LLKKT