Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_78783 |
Symbol | PRB2 |
ID | 4839803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 1293857 |
End bp | 1295883 |
Gene Length | 2027 bp |
Protein Length | 544 aa |
Translation table | 12 |
GC content | 48% |
IMG OID | 640391118 |
Product | vacuolar protease B |
Protein accession | XP_001385945 |
Protein GI | 126138844 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0089514 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ACGGTTTTGG CTGTATCTCG GTAAGCACTG GTTCCTATAT CTGCCATCTG TTACTGCTGT TGTTGCTGCT CATATCTACT GCTTTTCTGA ACTTTTCTTC TTTGAAACTC GTTAAATTTG GACTACTTCT GTTCATTTTC TGCTTAGTAT ATCTGGTCTA TACTATCTGT CACTCAATTA GTCTAGTCTG TCTACTCCCG TATTCGAGAA CTTCTCCGGA TACAATCTGT TAACTATTAA CTACATAAAA ATCTATCACT ATGTTGCTTT CCAAGTCGGT TGCTGTTTCC ATCCTCGTTG CCATGGGCGT TGAGGCTTTG GTTCTTCCCT CTTTTGATGA TATCGTCGAC GTGTTTGGCG TCGAGAAGGC CGTTCCAAAA GTCAACGAAA AGACCCAGAA CGTCTTGGGC TTGAAGGAAG CCGTCGACGG CCTCAAGGAC GCTGTTGATG GAGCCAAGAA AGCTGTGGCT CCCTTCCTTG CCGGTGCCAG AGACCTCATT CCCCACAGAT ACATCGTTGT CTTGAAGGAG TCTGCCTCTG CCGACGAAAC TGCCTTCCAC AAGGAATGGG TTGCCTTGAA GCACACCGAG TCGTTGGCTG GTCTCGACGA AAGTCACGAT TTTTTCGCTT CCACCAAGGA CTTCAAGACT GAGGGCGGTA TTGTTGACTC GTTTGACATT GGCTCCATCG TCAAGGGTTA CTCCGGTTTC TTCCTTGAGT CCACCATCGA CTTGATTCGT CAGAACCCAT TGGTTGCTTT CGTTGAACAA GACTCCATGG TCTATGCCTC GGAATTCGAA GTTGAAAAGG GTGCTCCATG GGGTTTGGCC AGAGTCTCTC ACAGAGAGCC ATTGACCTTA AGCTCGTTCA ATCAGTACTT GTACGACAAC AACGCTGGTA AGGGTGTGAC TTCTTACGTC ATCGACACCG GTGTTAACGT CAACCACAAG GAATTCGGTG GAAGAGCCAA GTGGGGTGCC ACCATTCCTT CTGGTGATGC TGATGTTGAT GGTAATGGTC ACGGTACCCA CTGTGCTGGT ACCATTGCCT CTTCCGCCTA TGGTGTGGCC AAGGGTGCTG AAGTTGTCGC TGTCAAGGTG TTGAGATCCA ACGGTTCCGG TTCCATGTCT GATGTAGTTA AAGGTGTTGA ATTCGCTGCC AATGCCCATT CTGCTGCTGC CAAGGAGGCC AAGAAGGGCT TCAAGGGTTC CACTGCCAAC ATGTCGTTGG GTGGAGGCAA GTCTCCAGCT TTGGATTTGG CTGTTAACGC TGCTGTCAAG GCCGGTATCC ATTTCGCTGT TGCTGCTGGT AACGAAAACC AAGATGCATG TAACACTTCT CCTGCTGCTG CTGAGAACGC CATCACTGTC GGGGCTTCGA CTCTTGATGA CTCCAGAGCT TACTTCTCTA ACTACGGTAA GTGTGTTGAC ATCTTTGCTC CAGGTTTGAA TATTGTTTCT ACCTACATTG GTTCCGACAC TGCCACAGCA ACCTTGTCTG GTACTTCGAT GGCTTCTCCA CACATTGCTG GTTTGTTGTC GTACTTTGTT TCGTTGCAAC CAGGTGCAGA CTCTGAGTTC TTTGTGGCAG CTAACGGTGT TTCTCCATCC CAGTTGAAGA AGAACTTGAT TGCCTACGGT TCCACAGGTT TGTTGAGTGA CATCCCTGAA GATGGAACTC CTAACATCTT GGCTTACAAT GGTGGAGGAC ACAACATATC CGAATTCTGG GGTAAGGATG CTGGTGCTGA GCTCAAGTCG GCTAAGGTTG ACGCCAGAAT CGCCGACATC GAAGGCAAGA TTGGTTCCTT GTTATCAAAG GTTGACTCCA AGCAGATTCT TGACGATGTC AAGGCTTTGG TAGACGTTGC CTATGAAAAG TTGCAGGAAA ACTAGAATAG ACATAGATGG CAGAGATGCT GTTATGGGTT TTAATTCTGA TTCACCGATT TTTTTCATTT GCGACTGGTT TATGTACGGA TATGTCGCTA GAGCCTCAGA GCTCTGTAGT ATCAATGTTA TAATAAG
|
Protein sequence | MLLSKSVAVS ILVAMGVEAL VLPSFDDIVD VFGVEKAVPK VNEKTQNVLG LKEAVDGLKD AVDGAKKAVA PFLAGARDLI PHRYIVVLKE SASADETAFH KEWVALKHTE SLAGLDESHD FFASTKDFKT EGGIVDSFDI GSIVKGYSGF FLESTIDLIR QNPLVAFVEQ DSMVYASEFE VEKGAPWGLA RVSHREPLTL SSFNQYLYDN NAGKGVTSYV IDTGVNVNHK EFGGRAKWGA TIPSGDADVD GNGHGTHCAG TIASSAYGVA KGAEVVAVKV LRSNGSGSMS DVVKGVEFAA NAHSAAAKEA KKGFKGSTAN MSLGGGKSPA LDLAVNAAVK AGIHFAVAAG NENQDACNTS PAAAENAITV GASTLDDSRA YFSNYGKCVD IFAPGLNIVS TYIGSDTATA TLSGTSMASP HIAGLLSYFV SLQPGADSEF FVAANGVSPS QLKKNLIAYG STGLLSDIPE DGTPNILAYN GGGHNISEFW GKDAGAELKS AKVDARIADI EGKIGSLLSK VDSKQILDDV KALVDVAYEK LQEN
|
| |