Gene PICST_33244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33244 
Symbol 
ID4840570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp107211 
End bp108296 
Gene Length1086 bp 
Protein Length361 aa 
Translation table12 
GC content43% 
IMG OID640391885 
Productpredicted protein 
Protein accessionXP_001386036 
Protein GI150866435 
COG category[R] General function prediction only 
COG ID[COG4784] Putative Zn-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.500627 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCCAA TATCGAGAGC CTTTGGACGG CAATTGGGTC GACCATCAAC GTTTTCTTCC 
CGTTTTCAAA GTGTTTCTCC AGCTTCTTTT GCTCTTCGTA GTAGAGTTTT TCAGTCGGCC
CCAGCTAGAC AATATGCGAC CTACAACCGT TTCAATGGTT CTTCGTCGTC TTCTTCATGG
AATACGACTA CTTTCATAAA TCTATTAACC AGCAGAAGAA CCATCTACTT TGGTGTAGGT
TTCTTGGGCT TTTATGTCTA CAATCTCAAT GAAGCACCTT TTACTCACAG GCGTAGACTC
ATCTGGATTC CCTACTGGCT CGAAACTAAA ATCGGAGATT TTTCCTATAG ACAAATAATG
TACCAATACG GTGATAAGTT GGTCTCCAGC CAAGATCCCT TGTATGGGCG AATCTCCAAG
ATCATGAATA GATTGCTCTC AGTAGCCCTT GAGAATAACG AGAATCAGGC ACAAAGACAC
CATCTCGAAA GCTTGAAATG GACCATCCAT ATCATCAAAG TAGATCCCAG AGAGTATCCG
CCTAATGCTT TCATTTTGCC CAACGGTAAG ATCTTCATTT TCAGCTCGAT TTTGCCCATC
TGCAAAAACG ACGATGGCTT GGCAACCGTG TTATCACATG AGTTATCGCA TCAGTTAGCT
CATCATTCGT CAGAGCAGTT GTCCAAACAG CCCTTCTACA TCATGTTGTC AACGCTTTTG
TATACAGTAA CAGGAATCAG CTGGTTCAAC GACTTGATGA TTAAGGGTTT ACTTGAAATG
CCTGCTTCAC GTGAAATGGA ATCGGAAGCA GATCACATAG GCTGTGAACT TCTAGCCAGA
TCTTGTTTCA ACATCGGTGA AGCAGTCCAA TTCTGGAAAA GAATGGCTCA AGCAGAAGAA
GGCTTTCAAG CTAGAACTGG ATCTCTGAGA CTACAAGAAT TCTTCTCGAC CCATCCAGCC
ACAGACAGAA GAATAAATGA TATACAACAT TGGACTCCAG GTTTGGAAAT TATAAAGGAA
TCGTCCGGAT GCTACGAACA CCAATTCGGT CTCTTTCAAG AAGTTTCCCG CAACTTCTTT
AGATAA
 
Protein sequence
MFPISRAFGR QLGRPSTFSS RFQSVSPASF ALRSRVFQSA PARQYATYNR FNGSSSSSSW 
NTTTFINLLT SRRTIYFGVG FLGFYVYNLN EAPFTHRRRL IWIPYWLETK IGDFSYRQIM
YQYGDKLVSS QDPLYGRISK IMNRLLSVAL ENNENQAQRH HLESLKWTIH IIKVDPREYP
PNAFILPNGK IFIFSSILPI CKNDDGLATV LSHELSHQLA HHSSEQLSKQ PFYIMLSTLL
YTVTGISWFN DLMIKGLLEM PASREMESEA DHIGCELLAR SCFNIGEAVQ FWKRMAQAEE
GFQARTGSSR LQEFFSTHPA TDRRINDIQH WTPGLEIIKE SSGCYEHQFG LFQEVSRNFF
R