Gene PICST_78783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_78783 
SymbolPRB2 
ID4839803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1293857 
End bp1295883 
Gene Length2027 bp 
Protein Length544 aa 
Translation table12 
GC content48% 
IMG OID640391118 
Productvacuolar protease B 
Protein accessionXP_001385945 
Protein GI126138844 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0089514 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ACGGTTTTGG CTGTATCTCG GTAAGCACTG GTTCCTATAT CTGCCATCTG TTACTGCTGT 
TGTTGCTGCT CATATCTACT GCTTTTCTGA ACTTTTCTTC TTTGAAACTC GTTAAATTTG
GACTACTTCT GTTCATTTTC TGCTTAGTAT ATCTGGTCTA TACTATCTGT CACTCAATTA
GTCTAGTCTG TCTACTCCCG TATTCGAGAA CTTCTCCGGA TACAATCTGT TAACTATTAA
CTACATAAAA ATCTATCACT ATGTTGCTTT CCAAGTCGGT TGCTGTTTCC ATCCTCGTTG
CCATGGGCGT TGAGGCTTTG GTTCTTCCCT CTTTTGATGA TATCGTCGAC GTGTTTGGCG
TCGAGAAGGC CGTTCCAAAA GTCAACGAAA AGACCCAGAA CGTCTTGGGC TTGAAGGAAG
CCGTCGACGG CCTCAAGGAC GCTGTTGATG GAGCCAAGAA AGCTGTGGCT CCCTTCCTTG
CCGGTGCCAG AGACCTCATT CCCCACAGAT ACATCGTTGT CTTGAAGGAG TCTGCCTCTG
CCGACGAAAC TGCCTTCCAC AAGGAATGGG TTGCCTTGAA GCACACCGAG TCGTTGGCTG
GTCTCGACGA AAGTCACGAT TTTTTCGCTT CCACCAAGGA CTTCAAGACT GAGGGCGGTA
TTGTTGACTC GTTTGACATT GGCTCCATCG TCAAGGGTTA CTCCGGTTTC TTCCTTGAGT
CCACCATCGA CTTGATTCGT CAGAACCCAT TGGTTGCTTT CGTTGAACAA GACTCCATGG
TCTATGCCTC GGAATTCGAA GTTGAAAAGG GTGCTCCATG GGGTTTGGCC AGAGTCTCTC
ACAGAGAGCC ATTGACCTTA AGCTCGTTCA ATCAGTACTT GTACGACAAC AACGCTGGTA
AGGGTGTGAC TTCTTACGTC ATCGACACCG GTGTTAACGT CAACCACAAG GAATTCGGTG
GAAGAGCCAA GTGGGGTGCC ACCATTCCTT CTGGTGATGC TGATGTTGAT GGTAATGGTC
ACGGTACCCA CTGTGCTGGT ACCATTGCCT CTTCCGCCTA TGGTGTGGCC AAGGGTGCTG
AAGTTGTCGC TGTCAAGGTG TTGAGATCCA ACGGTTCCGG TTCCATGTCT GATGTAGTTA
AAGGTGTTGA ATTCGCTGCC AATGCCCATT CTGCTGCTGC CAAGGAGGCC AAGAAGGGCT
TCAAGGGTTC CACTGCCAAC ATGTCGTTGG GTGGAGGCAA GTCTCCAGCT TTGGATTTGG
CTGTTAACGC TGCTGTCAAG GCCGGTATCC ATTTCGCTGT TGCTGCTGGT AACGAAAACC
AAGATGCATG TAACACTTCT CCTGCTGCTG CTGAGAACGC CATCACTGTC GGGGCTTCGA
CTCTTGATGA CTCCAGAGCT TACTTCTCTA ACTACGGTAA GTGTGTTGAC ATCTTTGCTC
CAGGTTTGAA TATTGTTTCT ACCTACATTG GTTCCGACAC TGCCACAGCA ACCTTGTCTG
GTACTTCGAT GGCTTCTCCA CACATTGCTG GTTTGTTGTC GTACTTTGTT TCGTTGCAAC
CAGGTGCAGA CTCTGAGTTC TTTGTGGCAG CTAACGGTGT TTCTCCATCC CAGTTGAAGA
AGAACTTGAT TGCCTACGGT TCCACAGGTT TGTTGAGTGA CATCCCTGAA GATGGAACTC
CTAACATCTT GGCTTACAAT GGTGGAGGAC ACAACATATC CGAATTCTGG GGTAAGGATG
CTGGTGCTGA GCTCAAGTCG GCTAAGGTTG ACGCCAGAAT CGCCGACATC GAAGGCAAGA
TTGGTTCCTT GTTATCAAAG GTTGACTCCA AGCAGATTCT TGACGATGTC AAGGCTTTGG
TAGACGTTGC CTATGAAAAG TTGCAGGAAA ACTAGAATAG ACATAGATGG CAGAGATGCT
GTTATGGGTT TTAATTCTGA TTCACCGATT TTTTTCATTT GCGACTGGTT TATGTACGGA
TATGTCGCTA GAGCCTCAGA GCTCTGTAGT ATCAATGTTA TAATAAG
 
Protein sequence
MLLSKSVAVS ILVAMGVEAL VLPSFDDIVD VFGVEKAVPK VNEKTQNVLG LKEAVDGLKD 
AVDGAKKAVA PFLAGARDLI PHRYIVVLKE SASADETAFH KEWVALKHTE SLAGLDESHD
FFASTKDFKT EGGIVDSFDI GSIVKGYSGF FLESTIDLIR QNPLVAFVEQ DSMVYASEFE
VEKGAPWGLA RVSHREPLTL SSFNQYLYDN NAGKGVTSYV IDTGVNVNHK EFGGRAKWGA
TIPSGDADVD GNGHGTHCAG TIASSAYGVA KGAEVVAVKV LRSNGSGSMS DVVKGVEFAA
NAHSAAAKEA KKGFKGSTAN MSLGGGKSPA LDLAVNAAVK AGIHFAVAAG NENQDACNTS
PAAAENAITV GASTLDDSRA YFSNYGKCVD IFAPGLNIVS TYIGSDTATA TLSGTSMASP
HIAGLLSYFV SLQPGADSEF FVAANGVSPS QLKKNLIAYG STGLLSDIPE DGTPNILAYN
GGGHNISEFW GKDAGAELKS AKVDARIADI EGKIGSLLSK VDSKQILDDV KALVDVAYEK
LQEN