Gene PICST_47614 details

Gene Information

Locus tag: PICST_47614
Symbol:
ID: 4839823
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 1185240
End bp: 1186571
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 12
GC content: 45%
IMG OID: 640391138
Product: predicted protein
Protein accession: XP_001385579
Protein GI: 150866096
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID:
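
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; the record states neither, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1185240
end_bp = 1186571

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should contain a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is the stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under those assumptions the result agrees with the reported Gene Length (1332 bp) and Protein Length (443 aa).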


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.80881
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCTC CTCCTCCAAT AGACTTCGAC AAGATCAGAG AGAACTCAGA TGGGTCTCTT 
GGGTCCATGA TAAAGCTGGG CTTCACAGAA GTGATTCTGA ATAACCCCTA CTTCGCAGCT
GGTGGTGGTC TAATGGTATT AGGAACAGGT TTGGCGCTTG CCCGTCAGGG TATTGTCAAG
AGTTCGGGCT TCATCTATCG TCAGTTGCTT GTAGACCTAG AGATTCCTTC TAAGGACAAG
TCGTACCTCT GGTTTCTCGA ATGGATGTCT CAGTATAAAC ACAGAACGCT GCGTCACTTA
TCTGTGGAAA CTAACTTCGT TCAGCACGAT AACGGTTCTG TTTCGACTCG GTTCTCTTTG
GTTCCTGGTC CAGGTAAGCA TTTAATCAAG TACAAAGGTG CCTACATGTT GGTTAATCGT
GAAAGGTCTG GAAAGTTGCT TGATATGACC AGTGGAACAC CGTTTGAAAC AGTGACCTTA
ACCACATTGT ACAGCGACAG AAAGTTGTTC AGCGATTTGT TAGGTGAGGC CAAGCAGCTA
GCTTTGAAAG CTAGAGAGGG CAAGACTGTT TTATACACTT CGTGGGGTCC AGAATGGCGG
CCCTTCGGTC AGCCTAGAAA GAAAAGAATG ATCGGATCGG TTATTCTCGA CAAAAGCATT
GCCGAAGGCA TCATTTCAGA CGTCAAAGAT TTCTTGGACA GTGGAGAATG GTACCATAAA
CGAGGCATAC CCTACAGAAG AGGTTATTTG TTGTACGGAC CACCTGGAAG TGGTAAGACT
TCTTTTATTC AGGCTTTGGC TGGAGAGTTA GACTACAATA TCTGCATTTT GAATTTGTCG
GAAAGCAACT TGACCGACGA CCGGTTGAAC CACTTGATGA ACCACATTCC AGAAAGATCT
ATATTGTTAC TTGAAGATAT CGATGCTGCC TTCAACAAAA GAGCTCAGAC GGAAGACAAG
GGCTACACTT CAGGGGTTAC CTTTTCAGGT TTGTTAAATG CGCTAGATGG AGTTGCTAGT
GCGGAAGAAT GCATTACCTT CATGACAACT AATCATCCCG AAAAGCTTGA CCCAGCCCTC
ATGCGTCCTG GCAGAGTCGA TTATAAGGTT CTAGTGGACA ATGCTACTGA ATACCAGGTC
AGACAGATGT TCTTACGATT CTACGAAAAC GAGAACGAGC TCTGTGAAGT GTTCATGAAC
AAATACAGAC ACCTCCAATT GACAAAGGTC AGCACAGCTC AACTACAGGG ATTGTTTGTC
TACAATAAAA GCAACCCACA GCTGGCCATT GACATGATCG AGACATTGCA GAACCCAAAT
ACCGTGTTCT AG
 
Protein sequence
MASPPPIDFD KIRENSDGSL GSMIKSGFTE VISNNPYFAA GGGLMVLGTG LALARQGIVK 
SSGFIYRQLL VDLEIPSKDK SYLWFLEWMS QYKHRTSRHL SVETNFVQHD NGSVSTRFSL
VPGPGKHLIK YKGAYMLVNR ERSGKLLDMT SGTPFETVTL TTLYSDRKLF SDLLGEAKQL
ALKAREGKTV LYTSWGPEWR PFGQPRKKRM IGSVILDKSI AEGIISDVKD FLDSGEWYHK
RGIPYRRGYL LYGPPGSGKT SFIQALAGEL DYNICILNLS ESNLTDDRLN HLMNHIPERS
ILLLEDIDAA FNKRAQTEDK GYTSGVTFSG LLNALDGVAS AEECITFMTT NHPEKLDPAL
MRPGRVDYKV LVDNATEYQV RQMFLRFYEN ENELCEVFMN KYRHLQLTKV STAQLQGLFV
YNKSNPQSAI DMIETLQNPN TVF
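
The record lists translation table 12, the alternative yeast nuclear code, in which CTG encodes serine rather than leucine. The sketch below shows how the start of the gene sequence maps to the start of the protein sequence under that table. It is a minimal illustration and not the database's own pipeline: the function name `translate` and the variable names are hypothetical, and the codon table is written out by hand in the usual TCAG ordering instead of being loaded from a library.

```python
from itertools import product

BASES = "TCAG"
# Genetic code 12 in TCAG order (first base varies slowest, third base fastest).
# It differs from the standard code only at CTG, which reads as S (Ser) instead of L (Leu).
AMINO_ACIDS_TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS_TABLE_12)
}


def translate(dna: str) -> str:
    """Translate a DNA string with table 12, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 78 bases of the gene sequence above (26 codons, ending with a CTG codon).
gene_start = (
    "ATGGCTTCTC CTCCTCCAAT AGACTTCGAC AAGATCAGAG AGAACTCAGA TGGGTCTCTT "
    "GGGTCCATGA TAAAGCTG"
)
print(translate(gene_start))  # MASPPPIDFDKIRENSDGSLGSMIKS
```

The final residue comes from the CTG codon: it reads as S here, matching the listed protein sequence (GSMIKS...), whereas the standard code would give L at that position.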