Gene PICST_90967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_90967 
Symbol 
ID4840684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp443587 
End bp444830 
Gene Length1244 bp 
Protein Length395 aa 
Translation table12 
GC content44% 
IMG OID640391999 
ProductPutative trehalase 
Protein accessionXP_001386103 
Protein GI150866481 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.146206 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.851818 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AATCCACCCT AAAAGAAAAA TTTCATGAGC CAAGACGCTG AGGAGTTCCG CGCGTTAACA 
TCAACGCTTC TGGCTTTCTA TAACTACCAC CGGTGGGAAA CAGAGCAGAT AGTCAAACCC
CGGCGAATCA AATACGATTC GCTTTCAGCA GACGAGAAGC TCTTGGTTCC ATGGTACGAA
AAGCACACCG AGCACTTGAT GATGTGTATC GAGATGAACA TGCAATTCTG TCAGATGTTG
GCGACAAATA TTGCCACAGA TTGGGGCGTT TCAGCCGATC CAAATGACTG GGAGCCCGCA
ACTGCCAATG AGTACGATAA GGTGAGATCA ACTCTATTGC AATTATCGAA GGAATGGAGT
GACGATGGAC AAAATGAGCG ACAGGTGAGC TACCGCAAGA TTGTGGATGA GTTGGAAGCG
ATGTTTCCTG ACGAAGAGAA ACGGCAGAAT ATCAAAATTC TCAATCCGGG GTGTGGATTA
GGACGGTTGG TGATGGATTT GATCGTGAAG GGTTTCTGGT GCCAGGGCAA TGAGTTCAGC
TACCATATGT TGTTGACATC GAACTTTGTA TTGAACCATT GCAAATTTGC CCACAACTTC
CTGATCTTTC CATATTTGCA CAAATCGTCG CATATGGTCA AGAGGTTAAA TCAGATTCGG
CCAGTGAGCT TACCAGATCT CAATCCTACT TCTATAAGCG AATTGAGCCT GAAGAATCCG
AGTATTCCGT ATGATGAACT CATGTCTATG ACAGCTGGTT CGTTCACCGA CTTGTATGGA
CCCGAAGACT TGGTTATCTC AGAGACTTAC ACCCAGGATA CCATTGCCAA CGAGTTTCGA
TCCACCAACA AGGACCATTT CGACGTGCTC GTGTCGTGCT TCTTCATCGA TACAGCCAGC
AATATCATTG ACTATTTGAA GTCTATCCAT TACTGTTTGA AGACTGGCGG GGTGTGGATC
AACTTTGGCC CGTTGTTGTG GCATTTCGAA GACGATTTCT CGACCAAAAT CATATCCAGA
GATAATACAA AAGTACAGAC TATCATGAAG GGATTGGAGT TGTCGAGAGA GGACTTAGTT
GAATTGGTGG AGAAGATTGG ATTCAAGTTC GAGAAACGTG AGTCGGACAT TGAGACTACC
TACTGTGGAG ATATCAAGGC GTTGGGATCG TTTGTGTATA AATGTGAATA CTGGGTGTGT
CGTAAGTTGT AAAGGTATAG ATTAAAATGA AGTAATTTAA AGGC
 
Protein sequence
MSQDAEEFRA LTSTLSAFYN YHRWETEQIV KPRRIKYDSL SADEKLLVPW YEKHTEHLMM 
CIEMNMQFCQ MLATNIATDW GVSADPNDWE PATANEYDKV RSTLLQLSKE WSDDGQNERQ
VSYRKIVDEL EAMFPDEEKR QNIKILNPGC GLGRLVMDLI VKGFWCQGNE FSYHMLLTSN
FVLNHCKFAH NFSIFPYLHK SSHMVKRLNQ IRPVSLPDLN PTSISELSSK NPSIPYDELM
SMTAGSFTDL YGPEDLVISE TYTQDTIANE FRSTNKDHFD VLVSCFFIDT ASNIIDYLKS
IHYCLKTGGV WINFGPLLWH FEDDFSTKII SRDNTKVQTI MKGLELSRED LVELVEKIGF
KFEKRESDIE TTYCGDIKAL GSFVYKCEYW VCRKL